Tajo System Directory Hierarchy

The default system directory hierarchy is as follows:

[HDFS ROOT]
     |
     |-- tajo
     |     |
     |     |-- system
     |     |     |-- resource
     |     |             |-- system_conf.xml
     |     |-- warehouse
     |           |- [database  A]
     |           |        |- [table 1]
     |           |        |- [table 2]
     |           |        |- [table N]
     |           |
     |           |- [database ..]
     |           |
     |           |- [database  N]
     |                    |- [table x]
     |                    |- [table z]
     |
     |-- tmp
          |-- staging
                 |- q_1378468137418_0001
                 |           |- RESULT
                 |
                 |- q_1378468137418_0002
                 |           |- RESULT
                 |
                 |- q_1378468137418_000N

Root directory

  • Tajo has a root directory (config key: 'tajo.rootdir').
  • tajo.rootdir is configurable.
  • tajo.rootdir contains a subdirectory 'system'.

...

System directory

  • The location of the system directory is not configurable.
  • The system/resource directory contains 'system_conf.xml', which holds the global configuration shared by the workers.

...

Warehouse directory

  • It contains a subdirectory per database, and each database directory contains a subdirectory per table.
  • When you execute a CREATE TABLE statement, the table is stored here as a subdirectory.
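
The warehouse layout in the diagram above can be sketched as a small path helper. This is only an illustration in Python, not Tajo code: the function name `table_path`, the example root '/tajo', and the database and table names are assumptions, and the sketch simply mirrors the default hierarchy shown in the diagram.

```python
import posixpath


def table_path(rootdir: str, database: str, table: str) -> str:
    """Return where a table directory would sit under the default layout.

    Hypothetical helper: it follows the diagram's
    <rootdir>/warehouse/<database>/<table> structure and nothing more.
    """
    return posixpath.join(rootdir, "warehouse", database, table)


# Example with made-up names; '/tajo' stands in for the value of tajo.rootdir.
print(table_path("/tajo", "database_a", "table_1"))
```

posixpath is used instead of os.path so that the separators stay HDFS-style ('/') regardless of the platform the sketch runs on.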

Staging directory

  • It contains a temporary directory for each query.
  • Each temporary directory is named after its query ID.
  • The results of each query are stored here temporarily and are removed after a while.
  • If the query is a CREATE TABLE or INSERT statement, its result is moved to the warehouse directory.
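
As a rough illustration of the staging layout described above, the following Python sketch builds the path of a query's temporary result directory. The helper name `staging_result_path` and the staging location '/tmp/staging' are assumptions taken from the default hierarchy diagram, not from Tajo's API.

```python
import posixpath


def staging_result_path(staging_dir: str, query_id: str) -> str:
    """Return the temporary RESULT directory for a query.

    Hypothetical helper: each query gets a directory named after its
    query ID, with a RESULT subdirectory, as in the diagram.
    """
    return posixpath.join(staging_dir, query_id, "RESULT")


# The query ID below is the one used in the diagram.
print(staging_result_path("/tmp/staging", "q_1378468137418_0001"))
```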

...
