Apache Impala is the open source, native analytic database for Apache Hadoop.
If you would like write access to this wiki, please send an e-mail to dev@impala.apache.org with your CWiki username. Note that a CWiki account is separate from an ASF JIRA account, so please sign up for a CWiki account if you have not already done so. We'll grant you access as soon as possible.
User resources
To learn more about Impala as a business user, please visit the Impala homepage. Detailed documentation for administrators and users is available in the Apache Impala documentation.
To try running queries and processing data sets in Impala on a single machine without installing dependencies, check out the Impala Docker quickstart: https://github.com/apache/impala/blob/master/docker/README.md#docker-quickstart-with-docker-compose. It can automatically load test data sets into Apache Kudu and Apache Parquet formats, so you can start exploring Apache Impala via SQL within minutes.
An alternative quickstart Docker container for Impala is also available from the Apache Kudu project: https://github.com/apache/kudu/blob/master/examples/quickstart/impala/README.adoc
Developer resources
Getting Started
- Read first: Contributing to Impala
- Shortcut: Bootstrapping an Impala Development Environment From Scratch
- Useful Tips for New Impala Developers
Impala Development Workflow and Processes
- Building Native-Toolchain from scratch and using with Impala
- How to switch to Apache-hosted git
- Impala Style Guide and Effective Coding Practices
- Committer Criteria
- Impala Docker Image
- Impala Development Environment inside Docker
- Build and Test for Daemon Docker Containers
- Eclipse Setup for Impala Development
- IntelliJ and CLion Setup for Impala Development
- Pushing Impala docs to impala.apache.org
- Impala Test Data
- How to start Hive beeline
- How to start a Ranger-enabled Impala minicluster
Debugging and Analyzing Impala
- Collecting Impala CPU and Heap Profiles
- Impala Debugging Tips
- Debugging Impala Core Dumps on Another System
- Debugging Impala Minidumps
- Using Address Sanitizer to Debug Impala
Additional Technical Documentation
- Runtime Code Generation with LLVM in Impala
- Impala Row Batches
- Resource Management Best Practices
- Understanding Impala Query States
- Kerberos in Impala - Explainer for Developers