What is a Data Lakehouse?

A Data Lakehouse is an open data management architecture that combines the flexibility, cost-efficiency, and scale of Data Lakes with the data management and ACID transactions of Data Warehouses. It does so through Data Lake Table Formats (Delta Lake, Apache Iceberg, and Apache Hudi), which enable Business Intelligence (BI) and Machine Learning (ML) on all data.
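To make the idea of "ACID transactions on top of plain files" more concrete, here is a minimal sketch of the general mechanism that table formats rely on: data files become visible only once an entry in an ordered commit log references them. The class `ToyTable`, the `_log` directory, and the JSON layout are hypothetical names chosen for illustration; this is not the on-disk format of Delta Lake, Iceberg, or Hudi.

```python
import json
import os
import tempfile
import uuid


class ToyTable:
    """Toy table: data files plus an ordered commit log (illustrative only)."""

    def __init__(self, root):
        self.root = root
        self.log_dir = os.path.join(root, "_log")  # hypothetical log location
        os.makedirs(self.log_dir, exist_ok=True)

    def _versions(self):
        # Committed versions are the numeric prefixes of the log file names.
        return sorted(int(name.split(".")[0]) for name in os.listdir(self.log_dir))

    def append(self, rows):
        version = (self._versions() or [-1])[-1] + 1
        data_name = f"part-{uuid.uuid4().hex}.json"
        # Step 1: write the data file. Readers cannot see it yet.
        with open(os.path.join(self.root, data_name), "w") as f:
            json.dump(rows, f)
        # Step 2: commit by creating the log entry in exclusive mode ("x"),
        # which raises FileExistsError if this version was already claimed.
        with open(os.path.join(self.log_dir, f"{version:05d}.json"), "x") as f:
            json.dump({"add": data_name}, f)
        return version

    def read(self, version=None):
        # Readers follow the log, so only committed files are returned.
        rows = []
        for v in self._versions():
            if version is not None and v > version:
                break
            with open(os.path.join(self.log_dir, f"{v:05d}.json")) as f:
                entry = json.load(f)
            with open(os.path.join(self.root, entry["add"])) as f:
                rows.extend(json.load(f))
        return rows


with tempfile.TemporaryDirectory() as root:
    table = ToyTable(root)
    table.append([{"id": 1}])
    table.append([{"id": 2}])
    # A stray file that was never committed is ignored by readers.
    with open(os.path.join(root, "part-orphan.json"), "w") as f:
        json.dump([{"id": 99}], f)
    latest = table.read()
    first = table.read(version=0)
```

Because reads go through the log rather than a directory listing, a half-finished write never shows up in query results, and reading up to an earlier version gives a simple form of time travel. Real table formats add schema tracking, file statistics, deletes, and conflict resolution on top of this basic idea.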

The concept was introduced by Databricks in a CIDR paper in 2021.

