What is HIVE?
Apache Hive is data warehousing infrastructure based on Hadoop. Hadoop provied massive scale out and fault tolerance capabilities for data storage and processing on commodity hardware.
Hive is designed to enable data summarization, ad-hoc querying and analysis of large volume of data. At the same time, Hive’s SQL gives users multiple palces to integrate their own functionality to do custom analysis like UDFs
Architecture of HIVE
Here CLI -Command Line Interface, JDBC- JavaDataBase Connector and Web GUI(Graphical User Interface). When user comes with CLI then directly connected with Drivers, user comes with JDBC at that time by using API it connected to Hive driver. When Hive Driver receives the tasks queries from user and send to Hadoop architecture then architecture uses name node, data node ,job tracker , task tracker for receiving data.