Welcome to the segment on ‘Basic Hive Query’.
So far, you have learnt about the Hive architecture and its salient features. You should now understand how Hive works on top of HDFS, and how you can use an SQL-like language to query data stored in HDFS and read it in a structured format.
Now, you will learn about the practical side of Hive: how to write queries, how to create a schema in the metastore, and how to read data according to that schema. You will also learn about DDL and DML statements and query optimisation techniques.
In this session, you will learn about the following concepts:
- Creation of a database
- Internal and external tables
- ORDER BY and SORT BY clauses
- DISTRIBUTE BY and CLUSTER BY clauses
- Indexing