Apache Kylin : Analytical Data Warehouse for Big Data

Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

ModulePropertyDefault ValueOptional ValueDescription

kylin.env

DEVQA, PROD, LOCALThe environment of Kylin. DEV will turn on some dev features, QA and PROD has no difference in terms of functions.

kylin.env.hdfs-working-dir

/kylin


Working folder in HDFS, better be qualified absolute path, make sure user has the right permission to this directory


kylin.env.zookeeper-base-path

/kylin

kylin zk base path


kylin.env.zookeeper-is-local
false
Run a TestingServer for curator locally

kylin.env.zookeeper-connect-string
sandbox.hortonworks.com

Connect to a remote zookeeper with the url, should set kylin.env.zookeeper-is-local to false

kylin.env.hadoop-conf-dir
/etc/hadoop/conf

Hadoop conf folder, will export this as "HADOOP_CONF_DIR" to run spark-submit. This must contain site xmls of core, yarn, hive, and hbase in one folder

kylin.env.hdfs-metastore-bigcell-dir
kylin.env.hdfs-metastore-bigcell-dir



kylin.env.zookeeper-base-sleep-time
3000


kylin.env.zookeeper.zk-auth
digest:ADMIN:KYLIN



kylin.env.zookeeper-acl-enabled
false







kylin.job.zookeeper-monitor-interval
30


kylin.job.log-dir
/tmp/kylin/logs


kylin.job.use-remote-cli




kylin.job.remote-cli-port
22


kylin.job.remote-cli-hostname




kylin.job.remote-cli-username




kylin.job.remote-cli-password




kylin.job.remote-cli-working-dir




kylin.job.allow-empty-segment
true


kylin.job.max-concurrent-jobs
10


kylin.job.sampling-percentage
100


kylin.job.dependency-filter-list
"[^,]*hive-exec[^,]*?\\.jar" + "|"
+ "[^,]*hive-metastore[^,]*?\\.jar" + "|" + "[^,]*hive-hcatalog-core[^,]*?\\.jar"



kylin.job.notification-enabled
false


kylin.job.notification-mail-enable-starttls
false


kylin.job.notification-mail-port
25


kylin.job.notification-mail-host




kylin.job.notification-mail-username




kylin.job.notification-mail-password




kylin.job.notification-mail-sender




kylin.job.notification-admin-emails
null


kylin.job.retry
0


kylin.job.retry-interval
30000


kylin.job.retry-exception-classes




kylin.job.sampling-hll-precision
14


kylin.job.scheduler.default
0


kylin.job.scheduler.priority-considered
false


kylin.job.scheduler.priority-bar-fetch-from-queue
20


kylin.job.scheduler.poll-interval-second
30


kylin.job.scheduler.safemode
false


kylin.job.scheduler.safemode.runnable-projects




kylin.job.error-record-threshold
0


kylin.job.use-advanced-flat-table
false


kylin.job.advanced-flat-table.class




kylin.job.tracking-url-pattern




kylin.job.metadata-persist-retry
5


kylin.job.cube-auto-ready-enabled
true


kylin.job.cube-inmem-builder-class
org.apache.kylin.cube.inmemcubing.DoggedCubeBuilder



kylin.job.execute-output.max-size
10485760








kylin.job.log-print-enabled
true
kylin.metadata.url
kylin_metadata@hbase

The metadata store in hbase

kylin.metadata.sync-retries
3
metadata cache sync retry times

kylin.metadata.sync-error-handler

MockupErrHandler







kylin.server.mode
allquery, job
Kylin server mode, valid value [all, query, job]

kylin.server.cluster-servers
localhost:7070
List of web servers in use, this enables one web server instance to sync up with other servers.

kylin.server.query-metrics-percentiles-intervals
60, 360, 3600

kylin query metrics percentiles intervals default=60, 300, 3600

kylin.server.host-address
localhost:7070
get server rest address

kylin.server.cluster-name




kylin.server.init-tasks




kylin.server.sequence-sql.workers-per-server
1


kylin.server.sequence-sql.expire-time
86400000


kylin.server.query-metrics-enabled
false


kylin.server.query-metrics2-enabled
false


kylin.server.auth-user-cache.expire-seconds
300


kylin.server.auth-user-cache.max-entries
100


kylin.server.external-acl-provider




kylin.server.cluster-servers-with-mode









kylin.storage.hbase.owner-tag
whoami@kylin.apache.org

Optional information for the owner of kylin platform, it can be your team's email, Currently it will be attached to each kylin's htable attribute

kylin.storage.url
hbase
The storage for final cube file in hbase

kylin.storage.hbase.cluster-fs


HBase Cluster FileSystem, which serving hbase, format as hdfs://hbase-cluster:8020, Leave empty if hbase running on same cluster with hive and mapreduce

kylin.storage.hbase.compression-codec
gzip
default compression codec for htable,snappy,lzo,gzip,lz4

kylin.storage.hbase.namespacedefault


kylin.storage.hbase.cluster-hdfs-config-file




kylin.storage.hbase.region-cut-gb
0.1
The cut size for hbase region, in GB. E.g, for cube whose capacity be marked as "SMALL", split region per 10GB by default

kylin.storage.hbase.min-region-count
1


kylin.storage.hbase.max-region-count
500


kylin.storage.hbase.hfile-size-gb
2
The hfile size of GB, smaller hfile leading to the converting hfile MR has more reducers and be faster. set 0 to disable this optimization

kylin.storage.hbase.run-local-coprocessor
false


kylin.storage.hbase.coprocessor-mem-gb
3.0


kylin.storage.partition.aggr-spill-enabled
true


kylin.storage.partition.max-scan-bytes
3L * 1024 * 1024 * 1024



kylin.storage.hbase.coprocessor-timeout-seconds
0


kylin.storage.hbase.max-fuzzykey-scan
200


kylin.storage.hbase.max-fuzzykey-scan-split
1


kylin.storage.hbase.max-visit-scanrange
1000000



kylin.storage.hbase.gtstorage
org.apache.kylin.storage.hbase.cube.v2.CubeHBaseEndpointRPC



kylin.storage.hbase.scan-cache-rows
1024


kylin.storage.hbase.region-cut-gb
5.0


kylin.storage.hbase.max-scan-result-bytes
5 * 1024 * 1024



kylin.storage.hbase.compression-codec
none


kylin.storage.hbase.rowkey-encoding
FAST_DIFF


kylin.storage.hbase.block-size-bytes
1048576


kylin.storage.hbase.small-family-block-size-bytes
65536


kylin.storage.hbase.endpoint-compress-result
true


kylin.storage.hbase.max-hconnection-threads
2048


kylin.storage.hbase.core-hconnection-threads
2048


kylin.storage.hbase.hconnection-threads-alive-seconds
60


kylin.storage.hbase.replication-scope
0


kylin.storage.clean-after-delete-operation
false


kylin.storage.project-isolation-enable
true


kylin.storage.limit-push-down-enabled
true


kylin.storage.providerorg.apache.kylin.common.storage.DefaultStorageProviderdefault
2


kylin.storage.columnarhbase.shardtable-sizename-mbprefix
128KYLIN_







kylin.storage.columnar.shard-rowcount
2500000
kylin.storage.columnar.shard-countdistinct-rowcount
1000000
kylin.storage.columnar.repartition-threshold-size-mb
128
kylin.storage.columnar.shard-min
1
kylin.storage.columnar.shard-max
1000
kylin.storage.columnar.hdfs-blocksize-bytes
5 * shard_size
kylin.storage.columnar.shard-expand-factor
10
kylin.storage.columnar.dfs-replication
3
kylin.storage.columnar.jdbc.file-system
kylin.storage.default
2
kylin.storage.hbase.table-name-prefix
KYLIN_
kylin.web.timezone
GMT+8
Display timezone on UI,format like[GMT+N or GMT-N]
kylin.web.help.length
4
kylin.web.help.0
start|Getting Started|http://kylin.apache.org/docs/tutorial/kylin_sample.html
kylin.web.help.1
odbc|ODBC Driver|http://kylin.apache.org/docs/tutorial/odbc.html
kylin.web.help.2
tableau|Tableau Guide|http://kylin.apache.org/docs/tutorial/tableau_91.html
kylin.web.help.3
onboard|Cube Design Tutorial|http://kylin.apache.org/docs/howto/howto_optimize_cubes.html
web.timezone
GMT+8
Display timezone on UI,format like[GMT+N or GMT-N]

kylin.web.help.length
4


kylin.web.help.0
start|Getting Started|http://kylin.apache.org/docs/tutorial/kylin_sample.html



kylin.web.help.1
odbc|ODBC Driver|http://kylin.apache.org/docs/tutorial/odbc.html



kylin.web.help.2
tableau|Tableau Guide|http://kylin.apache.org/docs/tutorial/tableau_91.html



kylin.web.help.3
onboard|Cube Design Tutorial|http://kylin.apache.org/docs/howto/howto_optimize_cubes.html



kylin.web.export-allow-admin
true


kylin.web.export-allow-other
true


kylin.web.hide-measures
RAW


kylin.web.default-time-filter
0


kylin.web.cross-domain-enabled
true


kylin.web.dashboard-enabled
false







kylin.engine.mr.config-override.




kylin.engine.mr.mem-hungry-config-override.




kylin.engine.mr.uhc-config-override.




kylin.engine.mr.base-cuboid-config-override.




kylin.engine.mr.reduce-input-mb
500


kylin.engine.mr.reduce-count-ratio
1.0


kylin.engine.mr.min-reducer-number
1


kylin.engine.mr.max-reducer-number
500


kylin.engine.mr.mapper-input-rows
1000000



kylin.engine.mr.max-cuboid-stats-calculator-number
1
set 1 to disable multi-thread statistics calculation

kylin.engine.mr.cuboid-number-per-stats-calculator
100


kylin.engine.mr.per-reducer-hll-cuboid-number
100


kylin.engine.mr.hll-max-reducer-number
1


kylin.engine.mr.build-dict-in-reducer
true


kylin.engine.mr.yarn-check-status-url
null


kylin.engine.mr.yarn-check-interval-seconds
10


kylin.engine.mr.use-local-classpath
true


kylin.engine.mr.uhc-config-override.mapreduce.reduce.memory.
kylin.web.export-allow-admin
true
kylin.web.export-allow-other
true
kylin.web.hide-measures
RAW
kylin.web.default-time-filter
0
kylin.web.cross-domain-enabled
true
kylin.web.dashboard-enabled
false
kylin.engine.mr.config-override.
kylin.engine.mr.mem-hungry-config-override.
kylin.engine.mr.uhc-config-override.
kylin.engine.mr.base-cuboid-config-override.
kylin.engine.mr.reduce-input-mb
500


kylin.engine.mr.reduceuhc-count-ratio
1.0
kylin.engine.mr.min-reducer-number
1
config-override.mapred.reduce.child.java.opts
-Xmx400M


kylin.engine.mr.maxyarn-check-reducerinterval-numberseconds
10
Time interval to check hadoop job status
500

kylin.engine.mr.mappermax-inputreducer-rowsnumber1000000
5


kylin.engine.mr.maxuhc-cuboid-stats-calculator-number
1
reducer-count
3
for testset 1 to disable multi-thread statistics calculation

kylin.engine.mr.cuboid-number-per-stats-calculatorlib-dir









kylin.engine.spark-conf.


getSparkConfigOverride
100

kylin.engine.mrspark.per-reducer-hll-cuboid-number
100
additional-jars




kylin.engine.mrspark.hllrdd-maxpartition-reducercut-numbermb
110.0


kylin.engine.mrspark.build-dict-in-reducermin-partition
1true


kylin.engine.mr.yarn-check-status-urlspark.max-partition
5000null


kylin.engine.mrspark.yarn-check-interval-secondsstorage-level
MEMORY_AND_DISK_SER10


kylin.engine.mrspark.usesanity-localcheck-classpathenabled
truefalse


kylin.engine.mr.uhcspark-config-override.mapreduce.reduce.memory.mbfact-distinct
false500


kylin.engine.mr.uhc-config-override.mapred.reduce.child.java.optsspark-uhc-dictionary
false-Xmx400M


kylin.engine.mr.yarn-check-interval-seconds
10
Time interval to check hadoop job status
spark-cardinality
false


kylin.engine.mrspark.output.max-reducer-numbersize
510485760


kylin.engine.mr.uhcspark-reducerdimension-count
3
dictionary
falsefor test


kylin.engine.mr.lib-dirspark-create-table-enabled
false







kylin.engine.sparkflink-conf.
getSparkConfigOverride




kylin.engine.sparkflink.additional-jars




kylin.engine.sparkflink.rdd-partition-cut-mb
10.0


kylin.engine.sparkflink.min-partition
1


kylin.engine.sparkflink.max-partition
5000


kylin.engine.flink.sanity-check-enabled
false







kylin.engine.livy-conf.livy-enabled
false


kylin.engine.livy.backtick.quote




kylin.engine.livy-conf.livy-url
5000




kylin.engine.sparklivy-conf.storage-level
MEMORY_AND_DISK_SER
livy-key.




kylin.engine.sparklivy-conf.sanity-check-enabled
false
livy-arr.




kylin.engine.sparklivy-fact-distinct
false
conf.livy-map.









kylin.engine.spark-uhc-dictionarydefault
2false







kylin.enginequery.sparkenable-dict-cardinalityenumerator
false


kylin.enginequery.spark.output.max-size
10485760
calcite.enumerable-rules-enabled
false


kylin.query.calcite.reduce-rules-enabled
true


kylin.query.convert
kylin.engine.spark-dimension-dictionary
false
kylin.engine.spark-create-table-enabledto-with
false


kylin.enginequery.sparkcalcite.build-class-name
org.apache.kylin.engine.spark.job.CubeBuildJob
extras-props.




kylin.query.calcite.add-rule




kylin.query.calcite.remove-rule




kylin.query.enable-dynamic-column
false


kylin.query.skip-empty-segmentskylin.engine.spark.task-impact-instance-enabled
true


kylin.engine.spark.task-core-factorquery.disable-cube-noagg-sql
false3


kylin.enginequery.spark.clusterstream-info-fetcher-class-nameorg.apache.kylin.cluster.YarnInfoFetcheraggregate-enabled
true


kylin.storage.project-isolation-enable
true


kylin.engine.spark.merge-class-name
org.apache.kylin.engine.spark.job.CubeMergeJob
kylin.engine.flink-conf.
kylin.engine.flink.additional-jars
kylin.engine.flink.partition-cut-mb
10.0
kylin.engine.flink.min-partition
1
kylin.engine.flink.max-partition
5000
kylin.engine.flink.sanity-check-enabled
false
kylin.engine.livy-conf.livy-enabled
false
kylin.engine.livy.backtick.quote
kylin.engine.livy-conf.livy-url
kylin.engine.livy-conf.livy-key.
kylin.engine.livy-conf.livy-arr.
kylin.engine.livy-conf.livy-map.
kylin.engine.driver-memory-base
1024
kylin.engine.driver-memory-strategy
{2, 20, 200}
kylin.engine.driver-memory-maximum
4096
kylin.engine.persist-flattable-threshold
1
kylin.engine.submit-hadoop-conf-dir
kylin.engine.max-retry-time
3
kylin.engine.retry-memory-gradient
1.5
kylin.engine.retry-overheadMemory-gradient
0.2
kylin.engine.max-allocation-proportion
0.9
kylin.engine.base-executor-instance
5
kylin.engine.executor-instance-strategy
100,2,500,3,1000,4
query.max-limit-pushdown
10000


kylin.query.force-limit
-1


kylin.query.scan-threshold
10000000



kylin.query.lazy-query-enabled
false


kylin.query.lazy-query-waiting-timeout-milliseconds
60000


kylin.query.project-concurrent-running-threshold
0


kylin.query.max-scan-bytes
0


kylin.query.max-return-rows
5000000



kylin.query.translated-in-clause-max-size
1024 * 1024


kylin.query.derived-filter-translation-threshold
20


kylin.query.badquery-stacktrace-depth
10


kylin.query.badquery-history-number
50


kylin.query.badquery-alerting-seconds
90


kylin.query.timeout-seconds-coefficient
0.5


kylin.query.badquery-persistent-enabled
true


kylin.query.transformers




kylin.query.transformers




kylin.query.cache-threshold-duration
2000


kylin.query.cache-threshold-scan-count
10240


kylin.query.cache-threshold-scan-bytes
1024 * 1024


kylin.query.security-enabled
true


kylin.query.cache-enabled
true


kylin.query.ignore-unknown-function
false


kylin.cache.memcached.hosts




kylin.query.segment-cache-enabled
false


kylin.query.segment-cache-timeout
2000


kylin.query.segment-cache-max-size
200


kylin.query.access-controller
null


kylin.query.statement-cache-max-num
50000


kylin.query.statement-cache-max-num-per-key
50


kylin.query.statement-cache-enabled
true


kylin.query.max-dimension-count-distinct
5000000



kylin.query.timeout-seconds
0


kylin.query.pushdown.enabled
false


kylin.query.pushdown.update-enabled
false


kylin.query.schema-factory
org.apache.kylin.query.schema.OLAPSchemaFactory



kylin.query.pushdown.runner-class-name




kylin.query.pushdown.runner.ids




kylin.query.pushdown.converter-class-names
{org.apache.kylin.source.adhocquery.HivePushDownConverter}


kylin.query.pushdown.cache-enabled
false


kylin.query.pushdown.jdbc.url




kylin.query.pushdown.jdbc.driver




kylin.query.pushdown.jdbc.username




kylin.query.pushdown.jdbc.password




kylin.query.pushdown.jdbc.pool-max-total
0


kylin.query.pushdown.jdbc.pool-max-idle
8


kylin.query.pushdown.jdbc.pool-min-idle
0


kylin.query.security.table-acl
kylin.engine.default
2
kylin.query.enable-dict-enumerator
false
kylin.query.calcite.enumerable-rules-enabled
false
kylin.query.calcite.reduce-rules-enabled
true
kylin.query.convert-create-table-to-with
false
kylin.query.calcite.extras-props.
kylin.query.calcite.add-rule
kylin.query.calcite.remove-rule
kylin.query.enable-dynamic-column
false
kylin.query.skip-empty-segments
true
kylin.query.disable-cube-noagg-sql
false
kylin.query.stream-aggregate-enabled
true
kylin.storage.project-isolation-enable
true
kylin.query.max-limit-pushdown
10000
kylin.query.force-limit
-1
kylin.query.scan-threshold
10000000
kylin.query.lazy-query-enabled
false
kylin.query.lazy-query-waiting-timeout-milliseconds
60000
kylin.query.project-concurrent-running-threshold
0
kylin.query.max-scan-bytes
0
kylin.query.max-return-rows
5000000
kylin.query.translated-in-clause-max-size
1024 * 1024
kylin.query.derived-filter-translation-threshold
20
kylin.query.badquery-stacktrace-depth
10
kylin.query.badquery-history-number
50
kylin.query.badquery-alerting-seconds
90
kylin.query.timeout-seconds-coefficient
0.5
kylin.query.badquery-persistent-enabled
true


kylin.query.transformersescape-default-keyword
false


kylin.query.transformersrealization-filter
null


kylin.query.cachesignature-threshold-durationclass
org.apache.kylin.rest.signature.FactTableRealizationSetCalculator
2000



kylin.query.cache-thresholdsignature-scan-countenabled
10240false


kylin.query.cacheflat-thresholdfilter-scan-bytes
1024 * 1024
kylin.query.security-enabled
true
kylin.query.cache-enabled
true
kylin.query.ignore-unknown-function
false
kylin.cache.memcached.hosts
kylin.query.segment-cache-enabled
false
kylin.query.segment-cache-timeout
2000
kylin.query.segment-cache-max-size
200
kylin.query.access-controller
null
kylin.query.statement-cache-max-num
50000
kylin.query.statement-cache-max-num-per-key
50
kylin.query.statement-cache-enabled
true
kylin.query.max-dimension-count-distinct
5000000
kylin.query.timeout-seconds
0
kylin.query.pushdown.enabled
false
kylin.query.pushdown.update-enabled
false
kylin.query.schema-factory
org.apache.kylin.query.schema.OLAPSchemaFactory
kylin.query.pushdown.runner-class-name
kylin.query.pushdown.runner.ids
kylin.query.pushdown.converter-class-names
{org.apache.kylin.source.adhocquery.HivePushDownConverter}
kylin.query.pushdown.cache-enabled
false
kylin.query.pushdown.jdbc.url
kylin.query.pushdown.jdbc.driver
kylin.query.pushdown.jdbc.username
kylin.query.pushdown.jdbc.password
kylin.query.pushdown.jdbc.pool-max-total
0
kylin.query.pushdown.jdbc.pool-max-idle
8
kylin.query.pushdown.jdbc.pool-min-idle
0
max-children
500000






Configurations new added 



ModulePropertyDefault ValueOptional ValueDescription

kylin.engine.spark.build-class-name
org.apache.kylin.engine.spark.job.CubeBuildJob



kylin.engine.spark.task-impact-instance-enabled
true


kylin.engine.spark.task-core-factor
3


kylin.engine.driver-memory-base
1024


kylin.engine.driver-memory-strategy
 {"2", "20", "100" }

Auto adjust the memory of driver

kylin.engine.driver-memory-maximum
4096


kylin.engine.persist-flattable-threshold
1


kylin.engine.spark.cluster-info-fetcher-class-name
org.apache.kylin.cluster.YarnInfoFetcher



kylin.engine.spark.merge-class-name
org.apache.kylin.engine.spark.job.CubeMergeJob



kylin.engine.max-retry-time
3


kylin.engine.retry-memory-gradient
1.5


kylin.engine.retry-overheadMemory-gradient
0.2


kylin.engine.max-allocation-proportion
0.9


kylin.engine.base-executor-instance
5


kylin.engine.executor-instance-strategy
100,2,500,3,1000,4



kylin.engine.submit-hadoop-conf-dir









kylin.snapshot.parallel-build-enabled
true


kylin.snapshot.parallel-build-timeout-seconds
3600


kylin.snapshot.shard-size-mb
128







kylin.storage.provider
org.apache.kylin.common.storage.DefaultStorageProvider



kylin.storage.columnar.shard-size-mb
128


kylin.storage.columnar.shard-rowcount
2500000



kylin.storage.columnar.shard-countdistinct-rowcount
1000000



kylin.storage.columnar.repartition-threshold-size-mb
128


kylin.storage.columnar.shard-min
1


kylin.storage.columnar.shard-max
1000


kylin.storage.columnar.hdfs-blocksize-bytes
5 * shard_size



kylin.storage.columnar.shard-expand-factor
10


kylin.storage.columnar.dfs-replication
3


kylin.spark-conf.auto.prior
true







kylin.job.log-print-enabled
true
kylin.query.security.table-acl-enabled
true
kylin.query.escape-default-keyword
false
kylin.query.realization-filter
null
kylin.query.signature-class
org.apache.kylin.rest.signature.FactTableRealizationSetCalculator
kylin.query.cache-signature-enabled
false
kylin.query.flat-filter-max-children
500000







kylin.query.spark-engine.join-memory-fraction
0.3
driver memory that can be used by join(mostly BHJ)

kylin.query.spark-engine.enabled
true



kylin.query.spark-engine.partition-split-size-mb
64


kylin.query.spark-engine.expose-sharding-trait
true


kylin.query.spark-engine.spark-sql-shuffle-partitions
-1


kylin.query.spark-conf.




kylin.query.engine.sparder-additional-files




kylin.query.engine.sparder-additional-jars




kylin.query.pushdown.auto-set-shuffle-partitions-enabled
true


kylin.query.pushdown.base-shuffle-partition-size
48


kylin.query.intersect.separator
|







kylin.kerberos.enabled
false


kylin.kerberos.keytab




kylin.kerberos.zookeeper.server.principal
zookeeeperzookeeper/hadoop


kylin.kerberos.ticket.refresh.interval.minutes
720


kylin.kerberos.monitor.interval.minutes
10


kylin.kerberos.platform




kylin.platform.zk.kerberos.enable




kylin.kerberos.krb5.conf
krb5.conf



kylin.kerberos.jaas.conf
jaas.conf



kylin.kerberos.principal
kylin.platform.zk.kerberos.enable
false

Configurations new added 

Description
ModulePropertyDefault ValueOptional Value








Configurations remained from Kylin3.x

...