delta-lake
- Layers
- Consumption: BI tools, Visualization and other API
- Compute: where we compute queries
- Storage: where data is stored
Storage
-
Datalake
- Openness
- Flexibility
- Because of the flexibility we have in Datalake this leads to a lot of Data quality issues
- Datalake operates in file level
-
Data-warehouse
- Closed, propriety format
- Offers a good data quality
- Cons
- Limited to structured data
- Performance issues
-
Deltalake
- Get version data