org.carrot2.text.linguistic
Interface ILexicalData


public interface ILexicalData

Additional lexical information for a given language.


Method Summary
 boolean isCommonWord(MutableCharArray word)
           
 boolean isStopLabel(CharSequence formattedLabel)
           
 

Method Detail

isCommonWord

boolean isCommonWord(MutableCharArray word)
Returns:
Returns true if word is common (meaningless) in this language. Such words are referred to as "stop words" and are usually ignored in information retrieval tasks. Depending on the implementation, word may be lower-cased internally.

isStopLabel

boolean isStopLabel(CharSequence formattedLabel)
Returns:
Returns true if the formattedLabel has been declared a stop label (meaningless) in this language. This is a very low-level tuning method.


Copyright (c) Dawid Weiss, Stanislaw Osinski