This page is the Apache Arrow developer wiki. If you are involved in building or maintaining the project, this is a good page to have bookmarked. If you are a prospective user of the project, check out user-facing library and API documentation linked to from http://arrow.apache.org/.


Developer Resources

Language-specific Development Resources

Roadmap and Initiatives

Columnar Format

Columnar Format JIRA Dashboard

The "Arrow columnar format" is an open standard, language-independent binary in-memory format for columnar datasets. It can be used to create data frame libraries, build analytical query engines, and address many other use cases.  

Columnar Computational Libraries

C++ Libraries

C++ JIRA Dashboard

Projects:

C GLib Libraries

C GLib JIRA Dashboard

Feather File Format

GPU Support

GPU JIRA Dashboard

Go Libraries

Go JIRA Dashboard

Java Libraries

Java JIRA Dashboard

JavaScript Libraries

JavaScript JIRA Dashboard

Julia Libraries

We have been discussing involving the Julia community in Apache Arrow

Machine Learning Framework Integrations

Modern machine learning frameworks can leverage technologies we are developing in Apache Arrow, and vice versa. 

Parquet File Support

Plasma Shared Memory Store

Plasma JIRA Dashboard

Python Libraries

Python JIRA Dashboard

Projects:

RPC System (Arrow Flight)

R Libraries

R JIRA Dashboard

Ruby Libraries

Ruby JIRA Dashboard

Rust Libraries

Rust JIRA Dashboard


Release Planning

Top-level Releases

JavaScript Releases

Future Work and Ideas

Developing an open standard for in-memory records

  • No labels