This page has been superseded by the Solr Ref Guide section Uploading Data with Solr Cell using Apache Tika.
If you'd like to share information about how you use this feature, you can edit this page to add tips and tricks.
/* cwikimigrated */
a.k.a the SolrCell project!
Community Tips
Committer Notes
Upgrading Tika
- Run
ant clean clean-jars
from project root or/solr
- Update
solr/CHANGES.txt
andsolr/NOTICE.txt
as well as relevant LICENSE/NOTICE files insolr/licenses/
- Edit
solr/contrib/extraction/ivy.xml
according to instructions inline the file ant jar-checksums
- to generate new sha1 files for the jarsant test
- see that Ivy downloads the new dependencies- Commit
Additional Resources
What's in a Name
Grant was writing the javadocs for the code and needed an entry for the <title> tag and wrote out "Solr Content Extraction Library", since the contrib directory is named "extraction". This then lead to an "acronym": Solr CEL which then gets mashed to: Solr Cell. Hence, the project name is "Solr Cell". It's also appropriate because a Solar Cell's job is to convert the raw energy of the Sun to electricity, and this contrib's module is responsible for converting the "raw" content of a document to something usable by Solr. http://en.wikipedia.org/wiki/Solar_cell