Data glossary

What is a Data Lakehouse?

A Data Lakehouse is an open data management architecture that combines the flexibility, cost-efficiency, and scale of Data Lakes with the data management and ACID transactions of Data Warehouses. It relies on Data Lake Table Formats (Delta Lake, Apache Iceberg, and Apache Hudi) to enable Business Intelligence (BI) and Machine Learning (ML) on all data.
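
To make the table-format idea concrete, here is a minimal Python sketch of how a commit log layered over plain files can give atomic, versioned writes. It is a toy illustration only, not how Delta Lake, Apache Iceberg, or Hudi are implemented: the `ToyTable` class, the `_log` directory, and the file naming are all hypothetical, and it assumes a local filesystem where exclusive file creation is atomic.

```python
import json
import os
import tempfile
import uuid


class ToyTable:
    """Toy table: immutable JSON data files plus an ordered commit log.

    Hypothetical illustration; real table formats add schemas,
    deletes, statistics, and much more.
    """

    def __init__(self, root):
        self.root = root
        self.log_dir = os.path.join(root, "_log")
        os.makedirs(self.log_dir, exist_ok=True)

    def _versions(self):
        # Log entries are named 00000.json, 00001.json, ...
        return sorted(int(name.split(".")[0]) for name in os.listdir(self.log_dir))

    def commit(self, rows):
        """Write a data file, then publish it with a single log entry."""
        version = len(self._versions())
        # A unique name means a failed commit only leaves an orphan file
        # that no log entry points to, so readers never see it.
        data_file = f"part-{uuid.uuid4().hex}.json"
        with open(os.path.join(self.root, data_file), "w") as f:
            json.dump(rows, f)
        # Exclusive-create mode ("x") raises FileExistsError if this version
        # was already claimed, so two writers cannot both commit it.
        with open(os.path.join(self.log_dir, f"{version:05d}.json"), "x") as f:
            json.dump({"add": [data_file]}, f)
        return version

    def read(self, version=None):
        """Return the rows visible at `version` (latest if None)."""
        versions = self._versions()
        if version is not None:
            versions = [v for v in versions if v <= version]
        rows = []
        for v in versions:
            with open(os.path.join(self.log_dir, f"{v:05d}.json")) as f:
                entry = json.load(f)
            for name in entry["add"]:
                with open(os.path.join(self.root, name)) as f:
                    rows.extend(json.load(f))
        return rows


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        table = ToyTable(root)
        table.commit([{"id": 1}])
        table.commit([{"id": 2}])
        print(table.read())           # rows from both commits
        print(table.read(version=0))  # rows from the first commit only
```

Readers only ever see data files that a log entry points to, which is the property that lets BI queries and ML jobs share the same files without observing half-finished writes. Reading at an older version is a simple form of the "time travel" that the real table formats offer.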

Databricks formalized the concept in a paper presented at CIDR in 2021.
