org.carrot2.text.preprocessing
Class LabelFilterProcessor

java.lang.Object
  extended by org.carrot2.text.preprocessing.LabelFilterProcessor

public class LabelFilterProcessor
extends Object

Applies basic filtering to words and phrases to produce candidates for cluster labels. Filtering is applied to PreprocessingContext.AllWords and PreprocessingContext.AllPhrases, the results are saved to PreprocessingContext.AllLabels. Currently, the following filters are applied:

  1. StopWordLabelFilter
  2. CompleteLabelFilter
This class saves the following results to the PreprocessingContext:

This class requires that Tokenizer, CaseNormalizer, StopListMarker and PhraseExtractor be invoked first.


Field Summary
 CompleteLabelFilter completeLabelFilter
          Truncated phrase filter for this processor.
 GenitiveLabelFilter genitiveLabelFilter
          Genitive length label filter.
 MinLengthLabelFilter minLengthLabelFilter
          Min length label filter.
 NumericLabelFilter numericLabelFilter
          Numeric label filter for this processor.
 QueryLabelFilter queryLabelFilter
          Query word label filter for this processor.
 StopLabelFilter stopLabelFilter
          Stop label filter.
 StopWordLabelFilter stopWordLabelFilter
          Stop word label filter for this processor.
 
Constructor Summary
LabelFilterProcessor()
           
 
Method Summary
 void process(PreprocessingContext context)
          Processes all filters declared as fields of this class.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

queryLabelFilter

public QueryLabelFilter queryLabelFilter
Query word label filter for this processor.


stopWordLabelFilter

public StopWordLabelFilter stopWordLabelFilter
Stop word label filter for this processor.


numericLabelFilter

public NumericLabelFilter numericLabelFilter
Numeric label filter for this processor.


completeLabelFilter

public CompleteLabelFilter completeLabelFilter
Truncated phrase filter for this processor.


minLengthLabelFilter

public MinLengthLabelFilter minLengthLabelFilter
Min length label filter.


genitiveLabelFilter

public GenitiveLabelFilter genitiveLabelFilter
Genitive length label filter.


stopLabelFilter

public StopLabelFilter stopLabelFilter
Stop label filter.

Constructor Detail

LabelFilterProcessor

public LabelFilterProcessor()
Method Detail

process

public void process(PreprocessingContext context)
Processes all filters declared as fields of this class.



Copyright (c) Dawid Weiss, Stanislaw Osinski