Project Name | Spark Backend Support for Gora (GORA-386) |
Project URL | |
Report # | 5 |
Report compiled by | Furkan KAMACI |
Report date | 30 June 2015 |
Project Description
The Apache Gora open source framework provides an in-memory data model and persistence for big data. Gora supports persisting to column stores, key-value stores, document stores and RDBMSs, and analyzing the data with extensive Apache Hadoop MapReduce support.
Even though Spark is considerably more powerful than MapReduce, which Gora currently supports, there is no Spark backend for Gora. This proposal aims to develop a solution that lets Gora support Spark.
Checklist
Midterm report is prepared:
Next Steps
Enable RDD output to be written to different data stores (HBase and Solr) via Apache Gora.
Supervisors Comments
Midterm report is attached: