Carrot2 v3.6.0-SNAPSHOT API Documentation
java.lang.Object
  org.carrot2.text.preprocessing.DocumentAssigner
public class DocumentAssigner
Assigns documents to label candidates. For each label candidate from
PreprocessingContext.AllLabels.featureIndex, a BitSet with the assigned documents is
constructed. The assignment algorithm is rather simple: to be assigned to a
label, a document must contain at least one occurrence of each non-stop word from the
label.
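The assignment rule described above can be illustrated with a minimal, self-contained sketch. It does not use the Carrot2 API: the class `DocumentAssignmentSketch`, the method `assign`, whitespace tokenization, and lower-casing are all assumptions made for illustration, and the actual implementation works on the preprocessed data held in PreprocessingContext.

```java
import java.util.Arrays;
import java.util.BitSet;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not part of Carrot2.
class DocumentAssignmentSketch {

    /**
     * Returns a BitSet in which bit d is set when document d contains
     * every non-stop word of the label at least once.
     * Label words and stop words are assumed to be lower-case already.
     */
    static BitSet assign(List<String> labelWords, Set<String> stopWords, List<String> documents) {
        BitSet assigned = new BitSet(documents.size());
        for (int d = 0; d < documents.size(); d++) {
            // Naive tokenization (an assumption): lower-case, split on whitespace.
            String[] tokens = documents.get(d).toLowerCase(Locale.ROOT).split("\\s+");
            Set<String> documentWords = new HashSet<>(Arrays.asList(tokens));

            boolean containsAll = true;
            for (String word : labelWords) {
                if (stopWords.contains(word)) {
                    continue; // stop words in the label are ignored
                }
                if (!documentWords.contains(word)) {
                    containsAll = false;
                    break;
                }
            }
            if (containsAll) {
                assigned.set(d);
            }
        }
        return assigned;
    }

    public static void main(String[] args) {
        List<String> documents = Arrays.asList(
            "Data mining of the web",
            "Mining for gold",
            "Web data and mining tools");
        Set<String> stopWords = new HashSet<>(Arrays.asList("of", "the", "for", "and"));
        List<String> label = Arrays.asList("mining", "of", "data");

        // Documents 0 and 2 contain both "mining" and "data"; document 1 lacks "data".
        System.out.println(assign(label, stopWords, documents)); // prints {0, 2}
    }
}
```

In this sketch a label made up only of stop words matches every document; whether the library behaves the same way in that case is not stated here.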
This class saves the following results to the PreprocessingContext :
This class requires that Tokenizer, CaseNormalizer,
StopListMarker, PhraseExtractor and LabelFilterProcessor be
invoked first.
| Field Summary | |
|---|---|
| boolean | exactPhraseAssignment: Only exact phrase assignments. |
| int | minClusterSize: Determines the minimum number of documents in each cluster. |
| Constructor Summary |
|---|
| DocumentAssigner() |
| Method Summary | |
|---|---|
| void | assign(PreprocessingContext context): Assigns documents to label candidates. |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public boolean exactPhraseAssignment
public int minClusterSize
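
One plausible reading of minClusterSize is as a lower bound on the number of documents assigned to a label. The sketch below shows that reading only. How DocumentAssigner actually applies the threshold is not stated on this page, and the class `MinClusterSizeSketch` and the method `labelsWithEnoughDocuments` are hypothetical names, not part of Carrot2.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

// Hypothetical illustration only; not part of Carrot2.
class MinClusterSizeSketch {

    /**
     * Returns the indices of labels whose document BitSet has at least
     * minClusterSize bits set. This is an assumed interpretation of the field.
     */
    static List<Integer> labelsWithEnoughDocuments(List<BitSet> documentsPerLabel, int minClusterSize) {
        List<Integer> kept = new ArrayList<>();
        for (int label = 0; label < documentsPerLabel.size(); label++) {
            // cardinality() counts the documents assigned to this label.
            if (documentsPerLabel.get(label).cardinality() >= minClusterSize) {
                kept.add(label);
            }
        }
        return kept;
    }
}
```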
| Constructor Detail |
|---|
public DocumentAssigner()
| Method Detail |
|---|
public void assign(PreprocessingContext context)
Please refer to project documentation at http://project.carrot2.org