|
Carrot2 v3.6.0-SNAPSHOT
API Documentation |
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.carrot2.text.preprocessing.Tokenizer
public final class Tokenizer
Performs tokenization of documents.
This class saves the following results to the PreprocessingContext:
PreprocessingContext.AllTokens.imagePreprocessingContext.AllTokens.documentIndexPreprocessingContext.AllTokens.fieldIndexPreprocessingContext.AllTokens.type
| Field Summary | |
|---|---|
Collection<String> |
documentFields
Textual fields of documents that should be tokenized and parsed for clustering. |
| Constructor Summary | |
|---|---|
Tokenizer()
|
|
| Method Summary | |
|---|---|
void |
tokenize(PreprocessingContext context)
Performs tokenization and saves the results to the context. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public Collection<String> documentFields
| Constructor Detail |
|---|
public Tokenizer()
| Method Detail |
|---|
public void tokenize(PreprocessingContext context)
context.
|
Please refer to project documentation at
http://project.carrot2.org |
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||