Report 3

Project Name	Spark Backend Support for Gora (GORA-386)
Project URL	GORA-386
Report #	3
Report compiled by	Furkan KAMACI
Report date	24 June 2015

Project Description

The Apache Gora open source framework provides an in-memory data model and persistence for big data. Gora supports persisting to column stores, key-value stores, document stores and RDBMSs, and analyzing the data with extensive Apache Hadoop MapReduce support.
Although Spark is considerably more powerful than MapReduce, which Gora currently supports, there is no Spark backend for Gora. This proposal aims to develop a solution for Gora to support Spark.

Checklist

GoraInputFormat is analyzed.

Spark's newAPIHadoopRDD method is analyzed.

Data is read from HBase via Spark.
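
The HBase read noted in the checklist can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: it assumes the Spark and HBase client libraries are on the classpath and that an HBase instance is reachable through the default configuration. The class name HBaseSparkReadSketch, the helper buildConf and the table name example_table are hypothetical.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableInputFormat;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaSparkContext;

// Hypothetical sketch: reads an HBase table into a Spark RDD through
// the Hadoop "new API" input format mechanism.
public class HBaseSparkReadSketch {

    // Builds a Hadoop configuration that tells TableInputFormat which table to scan.
    static Configuration buildConf(String tableName) {
        Configuration conf = HBaseConfiguration.create();
        conf.set(TableInputFormat.INPUT_TABLE, tableName);
        return conf;
    }

    public static void main(String[] args) {
        // "example_table" is a placeholder; pass a real table name as the first argument.
        String tableName = args.length > 0 ? args[0] : "example_table";

        SparkConf sparkConf = new SparkConf()
                .setAppName("HBaseSparkReadSketch")
                .setIfMissing("spark.master", "local[*]"); // local mode unless set elsewhere

        JavaSparkContext sc = new JavaSparkContext(sparkConf);
        try {
            // newAPIHadoopRDD takes the configuration, the InputFormat class,
            // and the key and value classes that the InputFormat produces.
            JavaPairRDD<ImmutableBytesWritable, Result> rows = sc.newAPIHadoopRDD(
                    buildConf(tableName),
                    TableInputFormat.class,
                    ImmutableBytesWritable.class,
                    Result.class);

            System.out.println("Row count: " + rows.count());
        } finally {
            sc.stop();
        }
    }
}
```

Because newAPIHadoopRDD accepts any Hadoop InputFormat class, the same call shape is presumably what a GoraInputFormat-based read would use, with the key and value classes replaced by those of the Gora data store.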

Next Steps

Read data over GoraInputFormat via Spark.

Develop an engine for Gora - Spark integration.

Implement LogAnalytics example with Gora - Spark style.

Supervisor's Comments
