Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: More TODOs

...

HCatalog storage handlers

TODO: This This is well documented on the wiki.

HCat CLI

TODO: This This is well documented on the wiki.

Note
iconfalse
titleTODO

Describe differences with hive CLI.

Metastore

A Thrift based API with Java bindings, described by the IMetaStoreClient interface. The API decouples the metastore storage layer from other Hive internals. Because Hive itself uses this internally, it is required to implement a comprehensive feature set which makes it attractive developers who might find the other APIs lacking. It was not originally intended to be a public API although it became public in version 1.0.0 (HIVE-3280) and it has been proposed that it be documented more fully (HIVE-9363). Anecdotally, its use outside of Hive project is not currently recommended.

Note
iconfalse
titleTODO: API usage

There are numerous ways of instantiating the metastore API including: HCatUtil.getHiveMetastoreClient(), new HiveMetaStoreClient.HiveMetaStoreClient(...). It may be useful to make some recommendations on the preferred approach.

Hive

...

iconfalse
titleTODO

...

Refers to the org.apache.hadoop.hive.ql.metadata.Hive class. Appears to be a distinct concrete implementation of a variation of the metastore API. Delegates to the metastore API but does not directly extend/implement it

...

.

Driver

Note
iconfalse
titleTODO

I refer here to suspect its use is not encouraged. Seeking clarification on the motivations behind this class and thoughts on its use outside of Hive.

Driver

Refers to the org.apache.hadoop.hive.ql.Driver class.

Note
iconfalse
titleTODO

Describe the role of Driver, when to use it, etc.

WebHCat

TODO: WebHCat is a REST API for HCatalog. This is well documented on the wiki.

Note
iconfalse
titleTODO

Is this API actively used? Is its use encouraged? When should one use it?

Streaming Data Ingest

A Java API focused on the writing of continuous streams of data into transactional tables using Hive’s ACID feature. New data is inserted into tables using small batches and short-lived transactions. Documented on the wiki and has package level Javadoc. Introduced in Hive version 0.13.0 (HIVE-5687).

...

A Java API focused on mutating (insert/update/delete) records into transactional tables using Hive’s ACID feature. Large volumes of mutations are applied atomically in a single long-lived transaction. Documented with package level Javadoc. Scheduled for release in Hive version 2.0.0 (HIVE-10165).

JDBC

Note
iconfalse
titleTODO

Seeking documentation.