Package org.carrot2.text.preprocessing.filter

Text feature filtering utilities.

See:
          Description

Interface Summary
ILabelFilter Defines the contract for label filtering components, which mark words and phrases that should not be considered as candidates for cluster labels
 

Class Summary
CompleteLabelFilter A filter that removes "incomplete" labels.
GenitiveLabelFilter Accepts labels that do not end in words in the Saxon Genitive form (e.g.
MinLengthLabelFilter Accepts labels whose length in characters is greater or equal to the provided value.
NumericLabelFilter Accepts labels that do not consist only of numeric tokens and start with a non-numeric token.
QueryLabelFilter Accepts labels that do not consist only of query words.
SingleLabelFilterBase A base for ILabelFilter implementations that handle each label independently.
StopLabelFilter Accepts labels that are not declared as stop labels in the stoplabels.<lang> files.
StopWordLabelFilter Accepts words that are not stop words and phrases that do not start nor end in a stop word.
 

Package org.carrot2.text.preprocessing.filter Description

Text feature filtering utilities.



Copyright (c) Dawid Weiss, Stanislaw Osinski