org.carrot2.text.preprocessing.pipeline
Class BasicPreprocessingPipelineDescriptor.AttributeBuilder

java.lang.Object
  extended by org.carrot2.text.preprocessing.pipeline.BasicPreprocessingPipelineDescriptor.AttributeBuilder
Direct Known Subclasses:
CompletePreprocessingPipelineDescriptor.AttributeBuilder
Enclosing class:
BasicPreprocessingPipelineDescriptor

public static class BasicPreprocessingPipelineDescriptor.AttributeBuilder
extends Object

Attribute map builder for the BasicPreprocessingPipeline component. You can use this builder as a type-safe alternative to populating the attribute map using attribute keys.


Field Summary
 Map<String,Object> map
          The attribute map populated by this builder.
 
Constructor Summary
protected BasicPreprocessingPipelineDescriptor.AttributeBuilder(Map<String,Object> map)
          Creates a builder backed by the provided map.
 
Method Summary
 CaseNormalizerDescriptor.AttributeBuilder caseNormalizer()
          Returns an attribute builder for the nested CaseNormalizer component, backed by the same attribute map as the current builder.
 LanguageModelStemmerDescriptor.AttributeBuilder languageModelStemmer()
          Returns an attribute builder for the nested LanguageModelStemmer component, backed by the same attribute map as the current builder.
 BasicPreprocessingPipelineDescriptor.AttributeBuilder lexicalDataFactory(Class<? extends ILexicalDataFactory> clazz)
          Lexical data factory.
 BasicPreprocessingPipelineDescriptor.AttributeBuilder lexicalDataFactory(ILexicalDataFactory value)
          Lexical data factory.
 BasicPreprocessingPipelineDescriptor.AttributeBuilder stemmerFactory(Class<? extends IStemmerFactory> clazz)
          Stemmer factory.
 BasicPreprocessingPipelineDescriptor.AttributeBuilder stemmerFactory(IStemmerFactory value)
          Stemmer factory.
 StopListMarkerDescriptor.AttributeBuilder stopListMarker()
          Returns an attribute builder for the nested StopListMarker component, backed by the same attribute map as the current builder.
 TokenizerDescriptor.AttributeBuilder tokenizer()
          Returns an attribute builder for the nested Tokenizer component, backed by the same attribute map as the current builder.
 BasicPreprocessingPipelineDescriptor.AttributeBuilder tokenizerFactory(Class<? extends ITokenizerFactory> clazz)
          Tokenizer factory.
 BasicPreprocessingPipelineDescriptor.AttributeBuilder tokenizerFactory(ITokenizerFactory value)
          Tokenizer factory.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

map

public final Map<String,Object> map
The attribute map populated by this builder.

Constructor Detail

BasicPreprocessingPipelineDescriptor.AttributeBuilder

protected BasicPreprocessingPipelineDescriptor.AttributeBuilder(Map<String,Object> map)
Creates a builder backed by the provided map.

Method Detail

tokenizerFactory

public BasicPreprocessingPipelineDescriptor.AttributeBuilder tokenizerFactory(ITokenizerFactory value)
Tokenizer factory. Creates the tokenizers to be used by the clustering algorithm.

See Also:
BasicPreprocessingPipeline.tokenizerFactory

tokenizerFactory

public BasicPreprocessingPipelineDescriptor.AttributeBuilder tokenizerFactory(Class<? extends ITokenizerFactory> clazz)
Tokenizer factory. Creates the tokenizers to be used by the clustering algorithm.

See Also:
BasicPreprocessingPipeline.tokenizerFactory

stemmerFactory

public BasicPreprocessingPipelineDescriptor.AttributeBuilder stemmerFactory(IStemmerFactory value)
Stemmer factory. Creates the stemmers to be used by the clustering algorithm.

See Also:
BasicPreprocessingPipeline.stemmerFactory

stemmerFactory

public BasicPreprocessingPipelineDescriptor.AttributeBuilder stemmerFactory(Class<? extends IStemmerFactory> clazz)
Stemmer factory. Creates the stemmers to be used by the clustering algorithm.

See Also:
BasicPreprocessingPipeline.stemmerFactory

lexicalDataFactory

public BasicPreprocessingPipelineDescriptor.AttributeBuilder lexicalDataFactory(ILexicalDataFactory value)
Lexical data factory. Creates the lexical data to be used by the clustering algorithm, including stop word and stop label dictionaries.

See Also:
BasicPreprocessingPipeline.lexicalDataFactory

lexicalDataFactory

public BasicPreprocessingPipelineDescriptor.AttributeBuilder lexicalDataFactory(Class<? extends ILexicalDataFactory> clazz)
Lexical data factory. Creates the lexical data to be used by the clustering algorithm, including stop word and stop label dictionaries.

See Also:
BasicPreprocessingPipeline.lexicalDataFactory

tokenizer

public TokenizerDescriptor.AttributeBuilder tokenizer()
Returns an attribute builder for the nested Tokenizer component, backed by the same attribute map as the current builder.


caseNormalizer

public CaseNormalizerDescriptor.AttributeBuilder caseNormalizer()
Returns an attribute builder for the nested CaseNormalizer component, backed by the same attribute map as the current builder.


languageModelStemmer

public LanguageModelStemmerDescriptor.AttributeBuilder languageModelStemmer()
Returns an attribute builder for the nested LanguageModelStemmer component, backed by the same attribute map as the current builder.


stopListMarker

public StopListMarkerDescriptor.AttributeBuilder stopListMarker()
Returns an attribute builder for the nested StopListMarker component, backed by the same attribute map as the current builder.



Copyright (c) Dawid Weiss, Stanislaw Osinski