The YTEX SentenceAnnotator is identical to the standard cTAKES Sentence Annotator with the exception of newline handling.  The cTAKES sentence annotator forces all newlines to be sentence endings.  However, in many documents, newlines are not sentence endings.  The YTEX SentenceAnnotator does not break sentences on newlines.

To use the YTEX SentenceAnnotator, use the desc\ctakes-ytex-uima\desc\analysis_engine\SentenceDetectorAnnotator.xml, or (for UIMAFit) org.apache.ctakes.ytex.uima.annotators.SentenceDetector.

  • No labels