Data Glossary 🧠

What is Apache Hive?

Last updated Sep 7, 2022

Apache Hive is a Data Warehouse software project built on top of Apache Hadoop that provides data querying and analysis. Hive offers an SQL-like interface for querying data stored in the various databases and file systems that integrate with Hadoop. Without Hive, SQL-style queries over distributed data would have to be hand-coded against the MapReduce Java API. Hive provides the SQL abstraction that lets such queries (HiveQL) run on the underlying Java framework without anyone having to implement them in the low-level Java API.
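
To make the contrast concrete, here is a minimal sketch in plain Python of the map, shuffle, and reduce steps that a single aggregate query stands in for. It is only an illustration and does not use Hadoop or Hive: the sample rows, the `page_views` table, the column names, and every function name are hypothetical, and the HiveQL string is shown for comparison rather than executed.

```python
from collections import defaultdict

# Hypothetical sample data: (user, page) rows of an imaginary page_views table.
ROWS = [("ann", "/home"), ("bob", "/home"), ("ann", "/docs")]

# The declarative query a Hive user would write (table and columns are assumptions).
HIVEQL = "SELECT page, COUNT(*) AS hits FROM page_views GROUP BY page"


def map_phase(rows):
    """Emit a (key, 1) pair for every row, keyed by the grouping column."""
    for _user, page in rows:
        yield page, 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Collapse each key's values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def count_hits(rows):
    """Hand-written equivalent of the GROUP BY / COUNT(*) query above."""
    return reduce_phase(shuffle(map_phase(rows)))


if __name__ == "__main__":
    print(HIVEQL)
    print(count_hits(ROWS))
```

The one-line query expresses what the three hand-written phases compute; in Hive, translating the former into distributed jobs is the engine's responsibility rather than the query author's.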