Apache Tajo is a relational, distributed data warehouse system for Hadoop. Tajo is designed for low-latency, scalable ad-hoc queries, online aggregation, and ETL on large data sets by leveraging advanced database techniques. It supports standard SQL. Tajo uses HDFS as its primary storage layer and has its own query engine, which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have native columnar execution and its optimizer.

General Information

Official Apache Tajo Website: source code, bug-tracking, mailing-lists, etc.
Overview of Tajo
Powered By
Presentations
Architecture of Tajo
Logos of Tajo

Developer Documentation

Roadmap
Tajo Internal
How To Contribute
How To Setup Your Development Environment
TPC-H Benchmark
How to update Apache Tajo website
Coding Style
UnitTests
MajorReleaseAnnouncementTemplate
How to write user documentation

User Documentation

User documentation is located at http://tajo.incubator.apache.org/docs/0.8.0/index.html.

Others

Google Summer of Code 2013
