Package org.carrot2.text.linguistic

Shallow linguistic processing utilities.

See:
          Description

Interface Summary
ILexicalData Additional lexical information for a given language.
ILexicalDataFactory Provides instances of ILexicalData for each language.
IStemmer Simple lemmatization engine transforming an inflected form of a word to its base form or some other unique token.
IStemmerFactory Provides instances of IStemmer for each language.
ITokenizerFactory Provides instances of ITokenizer for each language.
 

Class Summary
DefaultLexicalDataFactory The default management of lexical resources.
DefaultLexicalDataFactoryDescriptor Metadata and attributes of the DefaultLexicalDataFactory component.
DefaultLexicalDataFactoryDescriptor.AttributeBuilder Attribute map builder for the DefaultLexicalDataFactory component.
DefaultLexicalDataFactoryDescriptor.Attributes All attributes of the DefaultLexicalDataFactory component.
DefaultLexicalDataFactoryDescriptor.Keys Constants for all attribute keys of the DefaultLexicalDataFactory component.
DefaultStemmerFactory  
DefaultStemmerFactoryDescriptor Metadata and attributes of the DefaultStemmerFactory component.
DefaultStemmerFactoryDescriptor.AttributeBuilder Attribute map builder for the DefaultStemmerFactory component.
DefaultStemmerFactoryDescriptor.Attributes All attributes of the DefaultStemmerFactory component.
DefaultStemmerFactoryDescriptor.Keys Constants for all attribute keys of the DefaultStemmerFactory component.
DefaultTokenizerFactory  
DefaultTokenizerFactoryDescriptor Metadata and attributes of the DefaultTokenizerFactory component.
DefaultTokenizerFactoryDescriptor.AttributeBuilder Attribute map builder for the DefaultTokenizerFactory component.
DefaultTokenizerFactoryDescriptor.Attributes All attributes of the DefaultTokenizerFactory component.
DefaultTokenizerFactoryDescriptor.Keys Constants for all attribute keys of the DefaultTokenizerFactory component.
IdentityStemmer An implementation of IStemmer that always returns null from IdentityStemmer.stem(CharSequence).
IdentityStemmerFactory An IStemmerFactory implementation that returns IdentityStemmers for all supported languages.
LanguageModel A holder for all elements of a language model for a single language used internally by content preprocessing components.
LexicalDataLoader Common attributes related to loading and caching of lexical resources.
LexicalDataLoaderDescriptor Metadata and attributes of the LexicalDataLoader component.
LexicalDataLoaderDescriptor.AttributeBuilder Attribute map builder for the LexicalDataLoader component.
LexicalDataLoaderDescriptor.Attributes All attributes of the LexicalDataLoader component.
LexicalDataLoaderDescriptor.Keys Constants for all attribute keys of the LexicalDataLoader component.
 

Package org.carrot2.text.linguistic Description

Shallow linguistic processing utilities.



Copyright (c) Dawid Weiss, Stanislaw Osinski