\ Wiki Markup
[b\] [https://github.com/AGMLab/giraph/tree/trunk/giraph-examples/src/main/java/org/apache/giraph/examples/LinkRank|https://github.com/AGMLab/giraph/tree/trunk/giraph-examples/src/main/java/org/apache/giraph/examples/LinkRank] \[c\] [
\[d\] https://issues.apache.org/jira/browse/GIRAPH-584 Wiki Markup
LinkRank Scoring mechanism in Apache Nutch 1.x currently works in pure map-reduce pattern. Moreover, Apache Nutch is not optimized for graph-processing operations. Due to this nature of Nutch, scoring calculation could have been more efficient if done by a graph-processing library that runs with Bulk Synchronous Parallel model. Moreover, Apache Nutch 2.x which has slightly different architecture than 1.x, lacks LinkRank scoring. So rather than porting it to the new architecture, a cross-version solution would be nice to have.
18.03.2013 - 27.03.2013
Practice crawling with Nutch 1.x and 2.x and Searching through SolrCloud.
28.03.2013 - 01.04.2013
Practice Hadoop and Mapreduce
02.04.2013 - 06.04.2013
Read on WebGraph, LinkRank and ScoringFilter mechanism
07.04.2013 - 15.04.2013
Write sample scoring plugins for Nutch 1.x and 2.x and debugging
16.04.2013 - 24.04.2013
Practice with Giraph. Write sample PageRank code from scratch and modify it
06.05.2013 - 01.06.2013
Run PageRank on sample graphs, practice more with Giraph
Discovering & Learning
03.06.2013 - 07.06.2013
Design Graph Metadata Design
10.06.2013 - 14.06.2013
Duplicate Link Removal
17.06.2013 - 21.06.2013
Design input/output pipeline and serialization
24.06.2013 - 28.06.2013
Write Tests to make sure it's working properly.
Generic LinkRank with Giraph
01.07.2013 - 05.07.2013
Read more on Nutch 1.x plugin mechanism
08.07.2013 - 11.07.2013
Write Nutch 1.x proxy plugin
15.07.2013 - 19.07.2013
Test Nutch 1.x - Giraph Integration
22.07.2013 - 26.07.2013
Make sure original LinkRank and mine produces the same results
Nutch 1.x Integration
28.07.2013 - 02.08.2013
Learn how to use Gora for accessing the scores in Nutch 2.x
05.08.2013 - 09.08.2013
Read more on Nutch 2.x plugin mechanism
12.08.2013 - 16.08.2013
Write Nutch 2.x proxy plugin
19.08.2013 - 30.08.2013
Test Nutch 2.x - Giraph Integration
Nutch 2.x Integration
02.09.2013 - 06.09.2013
09.09.2013 - 13.09.2013
Testing Loop Elimination
16.09.2013 - 20.09.2013
Community Testing & Review & Writing Report
Improvements on LinkRank: similar and better versions of LinkRank.
I'm Ahmet Emre Aladağ, a 4th semester PhD Student in Boğaziçi University, Istanbul, Turkey. My research interests are Complex Network Analysis (Ranking algorithms, Influence networks, Information Spread, Finding the most influential person/page), Information Retrieval (Crawling, search engines, ranking the web pages via graph-theoretic measures and pattern recognition methods given implicit feedback.). I have taken Complex Networks, Information Retriveal, Aritficial Intelligence, Machine Learning courses that could be related to this project.
In the Masters (GPA 4.00), I had 1 conference publication \ [1\] on Visualization of Protein Interaction Networks and 2 journal publications on highly reputable Oxford Bioinformatics Journal on the topics Clustering, Aligning and Visualizing Protein Interaction Networks \ [2\] \ [3\]. I have also taken Advanced Algorithms on graphs and Parallel Programming courses. Wiki Markup
I had my (non-GSoC) internship in the Pardus Linux project (which was also involved in GSoC) and developed a Package-Content Search Engine and Multi-System Installation system for Pardus Linux. I have been a Linux user and Free Software Contributor since 2006. I contributed several Django applications and developed open source projects on github/bitbucket. I used mostly Python, Java and some C for my projects.
Currently, I'm working for a R&D company where I'm given the position for developing an efficient and precise ranking algorithm. We will be using Nutch 2.x and it's to-be-implemented LinkRank scoring so they support me in contributing Nutch community. I will be working on this project in my working hours at the office and also at home. My company and our partners have been contributors to the Nutch project for some years. Moreover, my research area in my PhD studies is detecting the most important person/page on a network. So it will be very convenient and joyful for me to work on this project. Contributing to a project of Apache foundation is an honour for me.
\[1\] A.E. Aladag, C. Erten, M. Sozdinler, ”An integrated model for visualizing biclusters from gene expression data and PPI networks”, Proc. International Symposium on Biocomputing, no.24, 2010.
\ Wiki Markup
[2\] A.E. Aladag, C. Erten, M. Sozdinler, ”Reliability Oriented Bioinformatic Networks Visualization”, Bioinformatics, vol 27, pp. 1583-1584, 2011. \
[3\] A.E. Aladag, C. Erten, ”SPINAL: Scalable Protein Interaction Network Alignment”, Bioinformatics, vol 29, pp. 917-924, 2013.