...
The aim of the project is to integrate graph databases as inputs for Giraph, along with the current ones (e.g. HBase, Accumulo etc.). In particular, use Tinkerpop with Giraph as a way to inject data into Giraph for graph analytics. Precisely, to leverage Blueprints and Rexter, that build a vendor-agnostic API over the vendor-specific ones, and export it via a REST API.
Project Scope
The project scope is divided in three main milestones:
- Integrate Rexter into Giraph inputs, e.g. leveraging existing classes in Faunus for MapReduce
- Define a mapping between the flexible Property Graph data model to Giraph's data model
- Allow to specify the input graph from the graph database to Giraph via a query (not only support injecting the whole graph)
For very successful students, there are possibilities to extend the work with more deliverables:
- Support Titan as an input
- Support GraphSON as an InputFormat
References
Tinkerpop: http://www.tinkerpop.com/
Blueprints: https://github.com/tinkerpop/blueprints/wiki
Rexter: https://github.com/tinkerpop/rexster/wiki
Faunus: http://thinkaurelius.github.io/faunus/
Titan: http://thinkaurelius.github.io/titan/
GraphSON: https://github.com/thinkaurelius/faunus/wiki/GraphSON-Format
Neo4j: http://www.neo4j.org/
OrientDB: http://www.orientdb.org/
...