...
Column level top K statistics are still pending; see HIVE-3421.
Quick overview
Description | Stored in | Collected by | Since |
---|---|---|---|
Number of partition the dataset consists of | Fictional metastore property: numPartitions | computed during displaying the properties of a partitioned table | Hive 2.3 |
Number of files the dataset consists of | Metastore table property: numFiles | Automatically during Metastore operations | |
Total size of the dataset as its seen at the filesystem level | Metastore table property: totalSize | ||
Uncompressed size of the dataset | Metastore table property: rawDataSize | Computed, these are the basic statistics. Calculated automatically when Configuration Properties#hive.stats.autogather is enabled. | Hive 0.8 |
Number of rows the dataset consist of | Metastore table property: numRows | ||
Column level statistics | Metastore; TAB_COL_STATS table | Computed, Calculated automatically when Configuration Properties#hive.stats.column.autogather is enabled. Can be collected manually by: ANALYZE TABLE ... COMPUTE STATISTICS FOR COLUMNS |
Implementation
The way the statistics are calculated is similar for both newly created and existing tables.
...