Apache Kylin : Analytical Data Warehouse for Big Data

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This article mainly uses the standard SSB data set and TPC-H data set to obtain the performance data about the build engines and query engines of Kylin on Parquet and Kylin 3.0 respectively, and then conducts comparative analysis to allow users to understand the advantages and disadvantages of Kylin on Parquet compared to Kylin 3.0 (which still using Kylin on HBase).

  • SSB (Star Schema Benchmark) is a set of benchmark test specifications used to test the performance of database products in star mode, and is also a data set often used in the OLAP field.
  • TPC (Transaction Processing Performance Council) has a variety of benchmark test systems, and here we use the TPC-H data set. The main purpose of using TPC-H is to test the response time of complex queries of the database system, in order to evaluate the decision support ability of specific queries.

...