|
Carrot2 v3.6.0-SNAPSHOT
API Documentation |
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.carrot2.text.preprocessing.PreprocessedDocumentScanner
public class PreprocessedDocumentScanner
Iterates over tokenized documents in PreprocessingContext.
| Field Summary | |
|---|---|
static com.carrotsearch.hppc.predicates.ShortPredicate |
ON_DOCUMENT_SEPARATOR
Predicate for splitting on document separator. |
static com.carrotsearch.hppc.predicates.ShortPredicate |
ON_FIELD_SEPARATOR
Predicate for splitting on field separator. |
static com.carrotsearch.hppc.predicates.ShortPredicate |
ON_SENTENCE_SEPARATOR
Predicate for splitting on sentence separator. |
| Constructor Summary | |
|---|---|
PreprocessedDocumentScanner()
|
|
| Method Summary | |
|---|---|
protected void |
document(PreprocessingContext context,
int start,
int length)
Invoked for each document. |
static com.carrotsearch.hppc.predicates.ShortPredicate |
equalTo(short t)
Return a new ShortPredicate returning true
if the argument equals a given value. |
protected void |
field(PreprocessingContext context,
int start,
int length)
Invoked for each document's field. |
void |
iterate(PreprocessingContext context)
Iterate over all documents, fields and sentences in PreprocessingContext.allTokens. |
protected void |
sentence(PreprocessingContext context,
int start,
int length)
Invoked for each document's sentence. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final com.carrotsearch.hppc.predicates.ShortPredicate ON_DOCUMENT_SEPARATOR
public static final com.carrotsearch.hppc.predicates.ShortPredicate ON_FIELD_SEPARATOR
public static final com.carrotsearch.hppc.predicates.ShortPredicate ON_SENTENCE_SEPARATOR
| Constructor Detail |
|---|
public PreprocessedDocumentScanner()
| Method Detail |
|---|
public static final com.carrotsearch.hppc.predicates.ShortPredicate equalTo(short t)
ShortPredicate returning true
if the argument equals a given value.
public final void iterate(PreprocessingContext context)
PreprocessingContext.allTokens.
protected void document(PreprocessingContext context,
int start,
int length)
protected void field(PreprocessingContext context,
int start,
int length)
protected void sentence(PreprocessingContext context,
int start,
int length)
|
Please refer to project documentation at
http://project.carrot2.org |
||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||