org.carrot2.text.preprocessing
Class DocumentAssignerDescriptor.AttributeBuilder

java.lang.Object
  extended by org.carrot2.text.preprocessing.DocumentAssignerDescriptor.AttributeBuilder
Enclosing class:
DocumentAssignerDescriptor

public static class DocumentAssignerDescriptor.AttributeBuilder
extends Object

Attribute map builder for the DocumentAssigner component. You can use this builder as a type-safe alternative to populating the attribute map using attribute keys.


Field Summary
 Map<String,Object> map
          The attribute map populated by this builder.
 
Constructor Summary
protected DocumentAssignerDescriptor.AttributeBuilder(Map<String,Object> map)
          Creates a builder backed by the provided map.
 
Method Summary
 DocumentAssignerDescriptor.AttributeBuilder exactPhraseAssignment(boolean value)
          Only exact phrase assignments.
 DocumentAssignerDescriptor.AttributeBuilder minClusterSize(int value)
          Determines the minimum number of documents in each cluster.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

map

public final Map<String,Object> map
The attribute map populated by this builder.

Constructor Detail

DocumentAssignerDescriptor.AttributeBuilder

protected DocumentAssignerDescriptor.AttributeBuilder(Map<String,Object> map)
Creates a builder backed by the provided map.

Method Detail

exactPhraseAssignment

public DocumentAssignerDescriptor.AttributeBuilder exactPhraseAssignment(boolean value)
Only exact phrase assignments. Assign only documents that contain the label in its original form, including the order of words. Enabling this option will cause less documents to be put in clusters, which result in higher precision of assignment, but also a larger "Other Topics" group. Disabling this option will cause more documents to be put in clusters, which will make the "Other Topics" cluster smaller, but also lower the precision of cluster-document assignments.

See Also:
DocumentAssigner.exactPhraseAssignment

minClusterSize

public DocumentAssignerDescriptor.AttributeBuilder minClusterSize(int value)
Determines the minimum number of documents in each cluster.

See Also:
DocumentAssigner.minClusterSize


Copyright (c) Dawid Weiss, Stanislaw Osinski