Apache Tajo is a robust big data relational and distributed data warehouse system for Apache Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation, and ETL (extract-transform-load process) on large-data sets stored on HDFS (Hadoop Distributed File System) and other data sources. By supporting SQL standards and leveraging advanced database techniques, Tajo allows direct control of distributed execution and data flow across a variety of query evaluation strategies and optimization opportunities.

General Information

Developer Documentation

...

Child pages

Versions Compared

Old Version 10

New Version Current

Key

General Information

Developer Documentation

Child pages

Page History

Versions Compared

Old Version 10

New Version Current

Key

General Information

Developer Documentation