Data Glossary 🧠

Search

Search IconIcon to open search

What is Apache Hudi?

Last updated Sep 7, 2022 - Edit Source

Apache Hudi is a  Data Lake Table Format and was originally developed at Uber in 2016 (code-named and pronounced “Hoodie”), open-sourced end of 2016 ( first commit in 2016-12-16), and submitted to the Apache Incubator in January 2019. More about the back story on  The Apache Software Foundation Announces Apache® Hudi™ as a Top-Level Project.

Read more about how to build a Data Lake on top of it on our  Data Lake and Lakehouse Guide.