Indexing Is Removed since 3.0
There are alternative options that might work similarly to indexing:
- Materialized views with automatic rewriting can produce very similar results. Hive 2.3.0 adds support for materialized views.
- Using columnar file formats (Parquet, ORC): they can do selective scanning, and they may even skip entire files or blocks.
Indexing has been removed in version 3.0 (HIVE-18448).
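The block skipping mentioned for columnar formats can be pictured with a small Python sketch. This is only a conceptual illustration, not ORC or Parquet code: the `Block` class, `make_blocks`, and `scan_equals` are hypothetical names, and the sketch assumes each block keeps the minimum and maximum value of a column so that a reader can rule out blocks without opening them.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Block:
    """Hypothetical block of one column plus its precomputed min/max statistics."""
    values: List[int]
    min_value: int
    max_value: int


def make_blocks(column: List[int], block_size: int) -> List[Block]:
    """Split a column into fixed-size blocks and record min/max for each block."""
    blocks = []
    for start in range(0, len(column), block_size):
        chunk = column[start:start + block_size]
        blocks.append(Block(chunk, min(chunk), max(chunk)))
    return blocks


def scan_equals(blocks: List[Block], target: int) -> Tuple[int, int]:
    """Count values equal to target; return (match count, blocks actually read)."""
    matches = 0
    blocks_read = 0
    for block in blocks:
        # The statistics prove this block cannot contain the target, so skip it.
        if target < block.min_value or target > block.max_value:
            continue
        blocks_read += 1
        matches += sum(1 for v in block.values if v == target)
    return matches, blocks_read
```

With sorted or clustered data, most blocks fall outside the predicate's range and are never read, which is why this approach can stand in for an index on some workloads. On randomly ordered data the min/max ranges overlap and far fewer blocks can be skipped.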
Overview of Hive Indexes
The goal of Hive indexing is to improve the speed of query lookup on certain columns of a table. Without an index, queries with predicates like 'WHERE tab1.col1 = 10' load the entire table or partition and process all the rows. But if an index exists for col1, then only a portion of the file needs to be loaded and processed.
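The difference between a full scan and an index lookup can be sketched in a few lines of Python. This is a conceptual illustration under simplifying assumptions, not Hive's implementation: the table is an in-memory list of dictionaries, and `build_index`, `full_scan`, and `index_lookup` are hypothetical helper names introduced here.

```python
from collections import defaultdict
from typing import Any, Dict, List

Row = Dict[str, Any]


def build_index(rows: List[Row], column: str) -> Dict[Any, List[int]]:
    """Map each distinct value of `column` to the positions of the rows holding it."""
    index: Dict[Any, List[int]] = defaultdict(list)
    for position, row in enumerate(rows):
        index[row[column]].append(position)
    return index


def full_scan(rows: List[Row], column: str, value: Any) -> List[Row]:
    """Without an index: every row is loaded and checked against the predicate."""
    return [row for row in rows if row[column] == value]


def index_lookup(rows: List[Row], index: Dict[Any, List[int]], value: Any) -> List[Row]:
    """With an index: only the rows at the recorded positions are touched."""
    return [rows[position] for position in index.get(value, [])]


# Usage: both paths return the same rows for a predicate like col1 = 10.
table = [{"col1": i % 20, "col2": f"row-{i}"} for i in range(100)]
col1_index = build_index(table, "col1")
assert index_lookup(table, col1_index, 10) == full_scan(table, "col1", 10)
```

The trade-off is the usual one for indexes: the lookup touches fewer rows, but the index itself must be built, stored, and rebuilt when the underlying data changes.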