Apache CarbonData community is pleased to announce the release of the Version 1.5.4 in The Apache Software Foundation (ASF).
CarbonData is a high-performance data solution that supports various data analytic scenarios, including BI analysis, ad-hoc SQL query, fast filter lookup on detail record, streaming analytics, and so on. CarbonData has been deployed in many enterprise production environments, in one of the largest scenarios, it supports queries on a single table with 3PB data (more than 5 trillion records) with response time less than 3 seconds!
We encourage you to use the release https://archive.apache.org/dist/carbondata/1.5.4/, and feedback through the CarbonData user mailing lists!
This release note provides information on the new features, improvements, and bug fixes of this release.
CarbonData 1.5.4 intention was to move closer to unified analytics. We have added new binary datatype to store binary objects like images. We have also allowed users to change sort columns of an existing table for better flexibility as per user needs. we are now compacting the segments which are loaded using range sort.
In this version of CarbonData, around 13 JIRA tickets related to new features, improvements, and bugs have been resolved. Following are the summary.
Previously, the user can configure the sort columns during table creation only, it restricts the user to load the data with same sort columns even though his query scenarios are changed.
From this version, we support altering the sort columns even after the table is created.
This version allows the user to configure the page size, it gives the control of memory utilization during reading and loading data especially for complex, varchar, and binary datatypes.
It is useful to store big objects and binary objects like images.
The segments which are loaded with range sort scope will now be compacted using the range compaction.
None
Please find the detailed JIRA list: https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12320220&version=12345388