...
- !SerDe is a short name for "Serializer and Deserializer."
- Hive uses SerDe (and !FileFormat) to read and write table rows.
- HDFS files -!InputFileFormat)--> <key, value> --(Deserializer-> Row object
- Row object -Serializer)--> <key, value> --(!OutputFileFormat-> HDFS files
...
A helpful overview of the Hive query processor can be found in this Hive Anatomy slide deck.
Compiling and Running Hive
h3 Hive now uses maven for build, see updated hive maven build instructions
Hive can be made to compile against different versions of Hadoop.
...