!!! This plugin is based on old nutch classes and not running with the current nutch version !!!
The plugin enables German-language stemming during indexing and searching. Unnecessary German stop words are removed from content and query.
The package contains:
- A German BasicIndexingFilter to replace the standard BasicIndexingFilter?.
- A German BasicQueryFilter to replace the standard BasicQueryFilter?.
- A stop-list "german-stopword.txt" used by both.
Download at http://nutch.eventax.com/
Config File Options
Default filename: german-stopword.txt
german-stopword.txt has to be placed into CLASSPATH/conf directory.
#List of stopwords:
The German Analyzer from the Lucene package is used.
The GermanBasicIndexingFilter works approximately 10
It is possible to use stop words in the query. They are ignored, but emphasized like normal hits.
– HammoudaBouyedda - 28 Sep 2004