...
You may not need any models other than those provided with Apache cTAKES. However, those models were trained on a specific set of text (a corpus) whose characteristics might not match those of your text. If you want to build or train your own models, please read the cTAKES 3.1 Component Use Guide, particularly:
- Training a sentence detector model
- Training a Part of Speech (POS) tagger model: Building a model - Obtaining training data
- Creating a Part of Speech (POS) tag dictionary: Building a tag dictionary
- Training a chunker model: Building a model - Prepare GENIA training data
- Training a dependency parser: Training a model - Training data or Training a model in Eclipse