org.carrot2.text.linguistic
Interface IStemmer

All Known Implementing Classes:
ArabicStemmerAdapter, IdentityStemmer, MorfologikStemmerAdapter

public interface IStemmer

Simple lemmatization engine transforming an inflected form of a word to its base form or some other unique token.


Method Summary
 CharSequence stem(CharSequence word)
          Returns the base form of the provided word or null if the base form cannot be determined.
 

Method Detail

stem

CharSequence stem(CharSequence word)
Returns the base form of the provided word or null if the base form cannot be determined. In the latter case, the base form will be assumed to be the same as the original word.



Copyright (c) Dawid Weiss, Stanislaw Osinski