General Information

Contributing to the wiki

To help avoid spam, the Tika wiki, like many other ASF wikis, is editable only by known accounts.

If you would like to help out with the Tika wiki, whether by adding a new page or working on an existing one, please first create a wiki account. Then email the user list or the dev list with your wiki username and ask for edit access. Access is generally granted within a few hours.

Committer Info

  • UsingGit - Information on Tika's configuration management using Git.
  • Release Process - Information on releasing Tika.
  • ThirdPartySonaType - A guide to staging and deploying third-party jars on Sonatype OSSRH (OSS Repository Hosting) for subsequent use within Tika parser wrappers.
  • VirtualMachine - A virtual machine hosted by Rackspace that runs an instance of Tika Server for public testing. Set up by Tim Allison et al.

User Notes

MIME identification design/implementation

Advanced Content Extraction with Tika - Integration

Entity Recognition Support

Named Entity Recognition (NER) support

Object Recognition (Computer Vision) support

Images

Video

Language Translation

Statistical Machine Translation

Design

Regression Testing On the Rackspace VM

How to run tika-eval on the VM

