Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 4.0

...

1. Datatype support: we discussed the datatype support for Sqoop2. Among the points raised were the following:

  • Sqoop 1 datatypes are different for Java, Hive and Avro. Most of the time the connector does the datatype mapping.
  • For Sqoop ,2 we would like to standardize on the datatype support so that all connectors can conform for consistency.
  • Some datatypes discussed today were:
    • String, char: UTF 8 encoded
    • Timestamp: Unix time representation to millisecond level
    • Binary: byte array
    • Numeric: byte, short, int, long, float, double, byte
    • Null type

2. Intermediate data format

...