Daniel Linstedt - Building a Scalable Data Warehouse with Data Vault 2.0
- What is Big Data?
- Veracity (optional)
- Value (optional)
What is Data Vault?
Methodology for building maintaining and expanding DWH.
- “Data pyramid”
Data Vault 2.0 - Overview
Source system(s) –hard business rules–> Staging Area –soft business rules–> Information Marts
Linstedt proposes using “information marts” instead of “data marts” as those are objects following data operations (e.g. aggregation, consolidation) - i.e. at a higher pyraimd level than raw pieces of data.
- Auditability limited to 4 pieces of information:
- Where from
- Where to
- In addition to the columns from the source system, each table in the stage area includes:
- sequence number
- record source
- hash key computations for all business keys and their combinations