A Data Lakehouse open data management architecture that combines the flexibility, cost-efficiency, and scale of Data Lakes with the data management and ACID transactions of Data Warehouse with Data Lake Table Formats (Delta Lake, Apache Iceberg & Hudi) that enable Business Intelligence (BI) and Machine Learning (ML) on all data.
The initial concept was created by Databricks in the CIDR Paper in 2017.
Start breaking your data siloes with Airbyte