October 2007 Board reports (see ReportingSchedule).
These reports are due to the Incubator PMC by 10 October 2007
Project name - Apache CXF
Description - SOA enabling framework, web services toolkit
Date of entry - August 2006
Items to resolve before graduation:
Project name - Apache FtpServer
Description - Java based FtpServer
Date of entry - March, 2003
Progress since last report
Voted in a new commiter, Clint Foster. He is currently in progress of setting up his account and get going with the practical details.
Top three items to resolve
2. Getting snapshot builds published automatically by the build server (minor details left to finish) 3. Getting s stable release out, need more active members to have good review
Getting a build server in place so that we can generate snapshots, they have been requested by several people.
Description - Ivy is a dependencies management tool mostly used in combination with Apache Ant
Date of entry - October 23rd, 2006
Remaining item to resolve:
RCF is a rich component set (ajax-style) for JSF. The RCF Wiki has been created and all committers are now set. We started discussion on the OpenSource efforts, how to manage it, since we need to provide a migration path for current users. We are waiting for an import of a current code drop into the repository.
Project name - Apache Lucene.Net
Description - Lucene.Net is a source code, class-per-class, API-per-API and algorithmtic port of the Java Lucene search engine to the C# and .NET platform utilizing Microsoft .NET Framework.
Date of entry - April, 2006
Progress since last report - We have seen good activities in the past three months on the mailing list (both the dev and user). Unlike in the past, where I was taking care of all fixes in the code in preparation to a release, I have seen considerable patches submitted by the community to resolve open issues in Lucene.Net 2.1 work. This is very encouraging and I would like to suggest to the community to vote on adding one of the active patch submitter as a committer.
Code aspects - Code is very stable with current "beta" version of 2.1 which we expect to release in the next few weeks.
Community aspects - We have good followers and the past three months have shown good commitments for submitting patches.
Top three items to resolve -
2. Release Apache Lucene.Net 2.1 and start working on 2.2 / 2.3.
3. Vote in a new committer.
The Apache Qpid Project provides an open and interoperable, multiple language implementations of the Advanced Messaged Queuing Protocol (AMQP) specification
Date of entry to the Incubator : 2006-09
Top three items to resolve before graduation
We aborted the M2 release vote mid way due to some key bugs. We are building new RC's and will re-vote. Once M2 is out we will seek graduation.
From last report:
The whole project has not gone through release review and the license files and notices need to be checked for all languages and components.
Done - Legal review was done prior to M2 vote for the full code base. M2 vote was aborted for key bug, and is being restarted Oct 8th.
During the next period, once we have M2 out I believe we will see if we can graduate.
iPMC questions / comments:
Sling is a framework to develop content centric web applications based on the idea of modularizing the rendering of HTTP resources.
Sling entered incubation on September 5th, 2007.
Issues before graduation
Licensing and other issues
Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser Libraries. Tika entered incubation on March 22nd, 2007.
There have been a number of positive items within Tika during the last few months. The traffic on the Tika mailing list has increased significantly (with typically 2, 3 questions, and 1 or 2 commits every day, or every other day), and there have been a lot of recent inquiries from external projects wanting to collaborate with Tika (including Aperture, PDFBox and a fellow developing a JSon library currently hosted at Google code). In addition, Tika's architecture has become a recent discussion of interest (as we'll see below).
We recently elected Keith Bennett as a new committer to Tika. Keith has been spearheading many of the new patches committed to Tika, as well as participating in discussions about the architecture, and future direction of the project.
Tika will be represented at the "Fast Feather" track at Apache Con US by Jukka Zitting. The rest of the community is helping to create the content for the presentation. The abstract is listed below:
Tika is a new content analysis framework borne from the desire to factor our commonality from the Apache Nutch search engine framework. Tika provides a mime detection framework, an extensible parsing framework and metadata environment for content analysis. Though in its nascent stages, progress on Tika has recently taken shape and the project is nearing a stable 0.1 release. In this talk, we'll describe the core APIs of Tika and discuss its use in several distinct domains including search engines, scientific data dissemination and an industrial setting.
There have been a flurry of JIRA issues and code activity \[1\] including 47 issues currently in JIRA, with 32 resolved issues, 14 closed issues, and 2 open major/minor issues in progress).
Tika's Parser interface (one of its key components) has just undergone a major overhaul led by Jukka Zitting, and Chris Mattmann has recently contributed a MimeType system (with help from fellow Apache Nutch committer Jerome Charron) to Tika. We also cleaned up and refactored large parts of the rest of the code (removing references to LuisLite and branding the project wherever possible with the Tika name), in preparation for an upcoming 0.1 release.
Chris Mattmann has led an effort to carve out the existing [MimeType] detection system in Apache Nutch \[2\] and replace it with Tika's improved [MimeType] detection system. There is a patch sitting in JIRA right now \[3\], and barring objections, Nutch will rely on Tika for its [MimeType] detection abilities.
Also active recently were committers Bertrand Delacretaz, Sami Siren and Rida Benjelloun, committing patches and improvements wherever needed.
No changes since our last report: the Tika project is still at an early stage of incubation. We need to continue bringing in the initial codebases and are targeting an initial incubating release (0.1) probably within the next month. We also need to work on growing the community and figuring out how to best interact with external parser projects.
TripleSoup is intended to provide an RDF store, tooling to work with that database, and a REST web interface to talk to that database using SPARQL, implemented as an apache webserver module.
TripleSoup has voted itself into dormant mode. The main reason the project did not quite materialize seems to be that we don't have enough people with sufficient interest, need, and time to make this really take off.
UIMA is a component framework for the analysis of unstructured content such as text, audio and video. UIMA entered incubation on October 3, 2006.
Some recent activity:
Items to complete before graduation: