Package org.carrot2.text.vsm

Vector Space Model utilities.

See:
          Description

Interface Summary
ITermWeighting Calculates term weights for the term-document matrix.
 

Class Summary
LinearTfIdfTermWeighting Calculates term-document matrix element values based on Linear Inverse Term Frequency.
LogTfIdfTermWeighting Calculates term-document matrix element values based on Term Frequency.
ReducedVectorSpaceModelContext Stores data related a Vector Space Model with reduced dimensionality.
TermDocumentMatrixBuilder Builds a term document matrix based on the provided PreprocessingContext.
TermDocumentMatrixBuilderDescriptor Metadata and attributes of the TermDocumentMatrixBuilder component.
TermDocumentMatrixBuilderDescriptor.AttributeBuilder Attribute map builder for the TermDocumentMatrixBuilder component.
TermDocumentMatrixBuilderDescriptor.Attributes All attributes of the TermDocumentMatrixBuilder component.
TermDocumentMatrixBuilderDescriptor.Keys Constants for all attribute keys of the TermDocumentMatrixBuilder component.
TermDocumentMatrixReducer Reduces the dimensionality of a term-document matrix using a matrix factorization algorithm.
TermDocumentMatrixReducerDescriptor Metadata and attributes of the TermDocumentMatrixReducer component.
TermDocumentMatrixReducerDescriptor.AttributeBuilder Attribute map builder for the TermDocumentMatrixReducer component.
TermDocumentMatrixReducerDescriptor.Attributes All attributes of the TermDocumentMatrixReducer component.
TermDocumentMatrixReducerDescriptor.Keys Constants for all attribute keys of the TermDocumentMatrixReducer component.
TfTermWeighting Calculates term-document matrix element values based on Log Inverse Term Frequency.
VectorSpaceModelContext Stores data related to the Vector Space Model of the processed documents.
 

Package org.carrot2.text.vsm Description

Vector Space Model utilities.



Copyright (c) Dawid Weiss, Stanislaw Osinski