The Graph Package (Angrapa)
The graph package, called Angrapa, is a large-scale graph data management framework for analytical processing. It is still an ongoing project. It will employ massive parallelism on Hadoop, and it aims to scale to terabytes or petabytes of graph data. Angrapa is intended for the many scientific and industrial areas that need to process large-scale graph data, such as data mining, machine learning, information retrieval, bioinformatics, and social networks.
The graph package is a new programming framework for graph processing.
The Main Goals
- Easy-to-use APIs that map naturally onto graph features
- A storage structure suited to graph data, designed around the connectivity of the graph
- A data communication method (i.e., BSP) that does not degrade graph data locality
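
To make the goals above more concrete, here is a minimal single-process sketch of the BSP (bulk synchronous parallel) style of graph computation. It is not Angrapa's API; the function name `bsp_connected_components` and the dictionary-based adjacency list are assumptions made purely for illustration. The sketch labels every vertex with the smallest vertex id in its connected component, assuming an undirected graph in which every vertex appears as a key.

```python
from collections import defaultdict


def bsp_connected_components(adjacency):
    """Label each vertex with the smallest vertex id in its component.

    `adjacency` is a hypothetical storage layout: a dict mapping each
    vertex id to the list of its neighbours. The graph is assumed to be
    undirected, so every edge is listed in both directions.
    """
    # Each vertex starts with its own id as its label.
    labels = {v: v for v in adjacency}

    # Superstep 0: every vertex sends its label to all of its neighbours.
    inbox = defaultdict(list)
    for v, neighbours in adjacency.items():
        for n in neighbours:
            inbox[n].append(labels[v])

    # Later supersteps: a vertex only does work if it received messages.
    while inbox:
        outbox = defaultdict(list)
        for v, messages in inbox.items():
            smallest = min(messages)
            if smallest < labels[v]:
                # The label improved, so tell the neighbours about it.
                labels[v] = smallest
                for n in adjacency[v]:
                    outbox[n].append(smallest)
        # Barrier: messages sent in this superstep become visible
        # only in the next one.
        inbox = outbox

    return labels


if __name__ == "__main__":
    graph = {1: [2], 2: [1, 3], 3: [2], 4: [5], 5: [4]}
    print(bsp_connected_components(graph))
```

Each vertex reads only its own adjacency list and its own incoming messages, which is what allows a real system to partition vertices across machines and keep connected data close together. In a distributed setting the barrier would be a synchronization across all workers rather than a simple variable swap.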