Open Data Blend Docs
  • Introduction
  • Open Data Blend Datasets
    • Datasets
    • Dataset Versions
    • Dataset UI
    • Dataset API
    • Frictionless Data Compatibility
    • Modelling Conventions
    • Loading Data Files in Excel
    • Loading Data Files in Power BI Desktop
    • Loading Data Files in Tableau Desktop
    • Loading Data Files in Python
    • Loading Data Files in R
    • Loading Data Files in Other Tools
  • Open Data Blend Analytics
    • Analytics
    • Analytics Queries
    • Analytics Users
    • Connecting from Excel
    • Connecting from Power BI Desktop
    • Connecting from Tableau Desktop
    • Connecting from Other Tools
  • Open Data Blend Insights
    • Insights
    • Report Drill Throughs
    • Report Drill Downs
  • Subscription Management
    • Subscription Portal
    • Managing Analytics Users
    • Managing Access Keys
    • Updating Payment Details
Powered by GitBook
On this page
  • Star Schemas
  • Naming Conventions
  • Column Prefixes
  • Column Names
  • Acronym Handling
  • Foreign Key References

Was this helpful?

  1. Open Data Blend Datasets

Modelling Conventions

PreviousFrictionless Data CompatibilityNextLoading Data Files in Excel

Last updated 1 year ago

Was this helpful?

Star Schemas

All Open Data Blend Datasets are created by using to optimise the data for data analysis. We carefully transform the source data into facts (i.e. the tables with columns containing measurable values such as the number of prescription items) and dimensions (i.e. tables with the columns containing descriptive values such as the name of a prescribed medicine).

There are a vast number of resources that explain what these modelling practices are and why they exist. Rather than reinventing the wheel, we recommend looking through this comprehensive by the Kimball Group.

Naming Conventions

Column Prefixes

We use the following column prefixes in our data files:

Prefix

Description

nlm_

Metadata column*

nll_

Licence information column*

src_

Source column

srk_

Source column that is also a natural key

drv_

Derived column*

drk_

Derived column that is also a natural key*

*The values in these columns either originate from Open Data Blend or are derived from source columns by Open Data Blend.

Column Names

All column names are in lower-case, alpha-numeric (no special characters), and spaced using underscores. This is to ensure the broadest level of compatibility across all tools and platforms.

Acronym Handling

Acronyms are expanded to their corresponding word parts except for when the acronym is very well known and non-ambiguous. Even then, it may still be expanded.

Foreign Key References

Foreign key references are implied. It's safe to assume that whenever two tables each have a column name that starts with drv_ and ends with _key (e.g. drv_prescribing_practice_key), these columns are intended to be joined.

dimensional modelling techniques
glossary