
Hadoop-compatible Input-Output Format for Hive


...

This is a proposal for adding an API to Hive that allows reading and writing using a Hadoop-compatible API. Specifically, the interfaces being implemented are:

...

The classes will be named HiveApiInputFormat and HiveApiOutputFormat.

See HIVE-3752 for discussion of this proposal.

InputFormat (reading from Hive)

...

  1. Create a HiveInputDescription object.
  2. Fill it with information about the table to read from (with database, partition, columns).
  3. Initialize HiveApiInputFormat with the information.
  4. Go to town using HiveApiInputFormat with your Hadoop-compatible reading system.
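
The read steps above can be sketched roughly as follows. This is only an illustration of the flow, not the proposed implementation: the proposal does not spell out fields or method signatures, so the nested HiveInputDescription and HiveApiInputFormat classes below are hypothetical stand-ins, the `initialize` method, the `hive.api.input.` key prefix, and the table and column names are all assumptions, and a plain `Map` stands in for a Hadoop job configuration so the example runs on its own.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class HiveReadSketch {

    /** Hypothetical stand-in for HiveInputDescription: describes what to read. */
    static final class HiveInputDescription {
        String database = "default";
        String table;
        String partitionFilter = "";                     // assumption: empty means all partitions
        final List<String> columns = new ArrayList<>();  // assumption: empty means all columns
    }

    /** Hypothetical stand-in; the real class would implement Hadoop's InputFormat. */
    static final class HiveApiInputFormat {
        static final String PREFIX = "hive.api.input.";  // assumed configuration key prefix

        /** Step 3: copy the description into the job configuration. */
        static void initialize(Map<String, String> conf, HiveInputDescription desc) {
            if (desc.table == null || desc.table.isEmpty()) {
                throw new IllegalArgumentException("table name is required");
            }
            conf.put(PREFIX + "database", desc.database);
            conf.put(PREFIX + "table", desc.table);
            conf.put(PREFIX + "partitionFilter", desc.partitionFilter);
            conf.put(PREFIX + "columns", String.join(",", desc.columns));
        }
    }

    static Map<String, String> configureRead() {
        // Step 1: create the description object.
        HiveInputDescription desc = new HiveInputDescription();

        // Step 2: fill in database, table, partition, and columns (example values).
        desc.database = "default";
        desc.table = "page_views";
        desc.partitionFilter = "ds='2012-12-01'";
        desc.columns.addAll(Arrays.asList("user_id", "url"));

        // Step 3: initialize the input format with the description.
        Map<String, String> conf = new LinkedHashMap<>();
        HiveApiInputFormat.initialize(conf, desc);

        // Step 4 would hand conf and the input format to the Hadoop-compatible reader.
        return conf;
    }

    public static void main(String[] args) {
        configureRead().forEach((k, v) -> System.out.println(k + " = " + v));
    }
}
```

Keeping the description separate from the format means the caller only states which table to read, while the format decides how that request is carried in the configuration.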

...

  1. Create a HiveOutputDescription object.
  2. Fill it with information about the table to write to (with database and partition).
  3. Initialize HiveApiOutputFormat with the information.
  4. Go to town using HiveApiOutputFormat with your Hadoop-compatible writing system.
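
The write steps can be sketched in the same spirit. Again this is a hedged illustration rather than the proposed API: the nested HiveOutputDescription and HiveApiOutputFormat classes are hypothetical stand-ins, the `initialize` method, the `hive.api.output.` key prefix, the `key=value` partition encoding, and the example table are assumptions, and a plain `Map` replaces the Hadoop job configuration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

public class HiveWriteSketch {

    /** Hypothetical stand-in for HiveOutputDescription: describes where to write. */
    static final class HiveOutputDescription {
        String database = "default";
        String table;
        // Assumption: the target partition is given as column -> value pairs.
        final Map<String, String> partitionValues = new LinkedHashMap<>();
    }

    /** Hypothetical stand-in; the real class would implement Hadoop's OutputFormat. */
    static final class HiveApiOutputFormat {
        static final String PREFIX = "hive.api.output.";  // assumed configuration key prefix

        /** Step 3: copy the description into the job configuration. */
        static void initialize(Map<String, String> conf, HiveOutputDescription desc) {
            if (desc.table == null || desc.table.isEmpty()) {
                throw new IllegalArgumentException("table name is required");
            }
            // Encode the partition as key=value pairs joined by '/'.
            StringJoiner spec = new StringJoiner("/");
            desc.partitionValues.forEach((k, v) -> spec.add(k + "=" + v));

            conf.put(PREFIX + "database", desc.database);
            conf.put(PREFIX + "table", desc.table);
            conf.put(PREFIX + "partition", spec.toString());
        }
    }

    static Map<String, String> configureWrite() {
        // Step 1: create the description object.
        HiveOutputDescription desc = new HiveOutputDescription();

        // Step 2: fill in database, table, and partition (example values).
        desc.database = "default";
        desc.table = "page_views_daily";
        desc.partitionValues.put("ds", "2012-12-01");

        // Step 3: initialize the output format with the description.
        Map<String, String> conf = new LinkedHashMap<>();
        HiveApiOutputFormat.initialize(conf, desc);

        // Step 4 would hand conf and the output format to the Hadoop-compatible writer.
        return conf;
    }

    public static void main(String[] args) {
        configureWrite().forEach((k, v) -> System.out.println(k + " = " + v));
    }
}
```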