Package org.carrot2.text.preprocessing.filter

Text feature filtering utilities.

See:
          Description

Interface Summary
ILabelFilter Defines the contract for label filtering components, which mark words and phrases that should not be considered as candidates for cluster labels
 

Class Summary
CompleteLabelFilter A filter that removes "incomplete" labels.
CompleteLabelFilterDescriptor Metadata and attributes of the CompleteLabelFilter component.
CompleteLabelFilterDescriptor.AttributeBuilder Attribute map builder for the CompleteLabelFilter component.
CompleteLabelFilterDescriptor.Attributes All attributes of the CompleteLabelFilter component.
CompleteLabelFilterDescriptor.Keys Constants for all attribute keys of the CompleteLabelFilter component.
GenitiveLabelFilter Accepts labels that do not end in words in the Saxon Genitive form (e.g.
GenitiveLabelFilterDescriptor Metadata and attributes of the GenitiveLabelFilter component.
GenitiveLabelFilterDescriptor.AttributeBuilder Attribute map builder for the GenitiveLabelFilter component.
GenitiveLabelFilterDescriptor.Attributes All attributes of the GenitiveLabelFilter component.
GenitiveLabelFilterDescriptor.Keys Constants for all attribute keys of the GenitiveLabelFilter component.
MinLengthLabelFilter Accepts labels whose length in characters is greater or equal to the provided value.
MinLengthLabelFilterDescriptor Metadata and attributes of the MinLengthLabelFilter component.
MinLengthLabelFilterDescriptor.AttributeBuilder Attribute map builder for the MinLengthLabelFilter component.
MinLengthLabelFilterDescriptor.Attributes All attributes of the MinLengthLabelFilter component.
MinLengthLabelFilterDescriptor.Keys Constants for all attribute keys of the MinLengthLabelFilter component.
NumericLabelFilter Accepts labels that do not consist only of numeric tokens and start with a non-numeric token.
NumericLabelFilterDescriptor Metadata and attributes of the NumericLabelFilter component.
NumericLabelFilterDescriptor.AttributeBuilder Attribute map builder for the NumericLabelFilter component.
NumericLabelFilterDescriptor.Attributes All attributes of the NumericLabelFilter component.
NumericLabelFilterDescriptor.Keys Constants for all attribute keys of the NumericLabelFilter component.
QueryLabelFilter Accepts labels that do not consist only of query words.
QueryLabelFilterDescriptor Metadata and attributes of the QueryLabelFilter component.
QueryLabelFilterDescriptor.AttributeBuilder Attribute map builder for the QueryLabelFilter component.
QueryLabelFilterDescriptor.Attributes All attributes of the QueryLabelFilter component.
QueryLabelFilterDescriptor.Keys Constants for all attribute keys of the QueryLabelFilter component.
SingleLabelFilterBase A base for ILabelFilter implementations that handle each label independently.
StopLabelFilter Accepts labels that are not declared as stop labels in the stoplabels.<lang> files.
StopLabelFilterDescriptor Metadata and attributes of the StopLabelFilter component.
StopLabelFilterDescriptor.AttributeBuilder Attribute map builder for the StopLabelFilter component.
StopLabelFilterDescriptor.Attributes All attributes of the StopLabelFilter component.
StopLabelFilterDescriptor.Keys Constants for all attribute keys of the StopLabelFilter component.
StopWordLabelFilter Accepts words that are not stop words and phrases that do not start nor end in a stop word.
StopWordLabelFilterDescriptor Metadata and attributes of the StopWordLabelFilter component.
StopWordLabelFilterDescriptor.AttributeBuilder Attribute map builder for the StopWordLabelFilter component.
StopWordLabelFilterDescriptor.Attributes All attributes of the StopWordLabelFilter component.
StopWordLabelFilterDescriptor.Keys Constants for all attribute keys of the StopWordLabelFilter component.
 

Package org.carrot2.text.preprocessing.filter Description

Text feature filtering utilities.



Copyright (c) Dawid Weiss, Stanislaw Osinski