Carrot2 v3.5.2 API Documentation
java.lang.Object
  org.carrot2.core.ProcessingComponentBase
    org.carrot2.clustering.stc.STCClusteringAlgorithm
public final class STCClusteringAlgorithm
Suffix Tree Clustering (STC) algorithm, largely as described in: Oren Zamir, Oren Etzioni, Grouper: A Dynamic Clustering Interface to Web Search Results, 1999. The implementation departs from the paper where STC's description was not clear enough or where an improvement seemed possible.
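To make the base cluster merging step concrete, here is a minimal sketch of the merge rule as the published STC description states it: two base clusters are merged when their shared documents make up more than half of each cluster. The class `StcMergeSketch`, the method `shouldMerge`, and the 0.5 threshold used in `main` are illustrative assumptions, not Carrot2 code or Carrot2 defaults. Since this class notes that liberties were taken with the paper, the way it applies `mergeThreshold` may differ.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration, not part of Carrot2.
public class StcMergeSketch {

    /**
     * Returns true when the documents shared by the two base clusters
     * cover more than `threshold` of EACH cluster (documents are given by id).
     */
    public static boolean shouldMerge(Set<Integer> a, Set<Integer> b, double threshold) {
        if (a.isEmpty() || b.isEmpty()) {
            return false; // nothing to compare
        }
        Set<Integer> common = new HashSet<>(a);
        common.retainAll(b); // intersection of the two document sets
        double overlap = common.size();
        return overlap / a.size() > threshold && overlap / b.size() > threshold;
    }

    public static void main(String[] args) {
        Set<Integer> a = new HashSet<>(Arrays.asList(1, 2, 3, 4));
        Set<Integer> b = new HashSet<>(Arrays.asList(2, 3, 4, 5));
        Set<Integer> c = new HashSet<>(Arrays.asList(4, 9, 10, 11));

        System.out.println(shouldMerge(a, b, 0.5)); // prints true  (3 of 4 shared on both sides)
        System.out.println(shouldMerge(a, c, 0.5)); // prints false (1 of 4 shared)
    }
}
```

The rule is symmetric, so a small cluster fully contained in a much larger one is not merged unless it also covers more than the threshold share of the larger cluster.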
| Field Summary | |
|---|---|
| List<Cluster> | clusters: Clusters created by the algorithm. |
| double | documentCountBoost: Document count boost. |
| List<Document> | documents: Documents to cluster. |
| int | ignoreWordIfInFewerDocs: Minimum word-document recurrences. |
| double | ignoreWordIfInHigherDocsPercent: Maximum word-document ratio. |
| int | maxBaseClusters: Maximum base clusters count. |
| int | maxClusters: Maximum final clusters. |
| int | maxDescPhraseLength: Maximum words per label. |
| double | maxPhraseOverlap: Maximum cluster phrase overlap. |
| int | maxPhrases: Maximum phrases per label. |
| double | mergeThreshold: Base cluster merge threshold. |
| double | minBaseClusterScore: Minimum base cluster score. |
| int | minBaseClusterSize: Minimum documents per base cluster. |
| double | mostGeneralPhraseCoverage: Minimum general phrase coverage. |
| MultilingualClustering | multilingualClustering: A helper for performing multilingual clustering. |
| int | optimalPhraseLength: Optimal label length. |
| double | optimalPhraseLengthDev: Phrase length tolerance. |
| BasicPreprocessingPipeline | preprocessingPipeline: Common preprocessing tasks handler. |
| String | query: Query that produced the documents. |
| double | singleTermBoost: Single term boost. |
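The two word-filtering fields can be read as a document-frequency filter: a word is ignored when it occurs in too few documents or in too large a share of them. The sketch below illustrates that reading only. The class `WordDfFilterSketch`, the method `keptWords`, and the exact comparison operators are assumptions inferred from the field names and one-line descriptions, not taken from Carrot2's source.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration, not part of Carrot2.
public class WordDfFilterSketch {

    /**
     * Keeps words that occur in at least `minDocs` documents and in at most
     * `maxDocRatio` (0..1) of all documents. The parameters play the role
     * assumed here for ignoreWordIfInFewerDocs and ignoreWordIfInHigherDocsPercent.
     */
    public static Set<String> keptWords(List<List<String>> docs, int minDocs, double maxDocRatio) {
        // Document frequency: in how many documents each word occurs.
        Map<String, Integer> df = new HashMap<>();
        for (List<String> doc : docs) {
            for (String word : new HashSet<>(doc)) { // count each word once per document
                df.merge(word, 1, Integer::sum);
            }
        }
        Set<String> kept = new TreeSet<>(); // sorted for stable output
        for (Map.Entry<String, Integer> e : df.entrySet()) {
            int count = e.getValue();
            boolean tooRare = count < minDocs;
            boolean tooCommon = (double) count / docs.size() > maxDocRatio;
            if (!tooRare && !tooCommon) {
                kept.add(e.getKey());
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<List<String>> docs = Arrays.asList(
                Arrays.asList("data", "mining", "tools"),
                Arrays.asList("data", "mining", "book"),
                Arrays.asList("data", "clustering"),
                Arrays.asList("data", "web"));

        // "data" is in 4 of 4 documents (too common), the rest except "mining" are in 1 (too rare).
        System.out.println(keptWords(docs, 2, 0.9)); // prints [mining]
    }
}
```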
| Constructor Summary | |
|---|---|
| STCClusteringAlgorithm() | |
| Method Summary | |
|---|---|
| void | afterProcessing(): Memory cleanups. |
| void | process(): Performs STC clustering of documents. |
| Methods inherited from class org.carrot2.core.ProcessingComponentBase |
|---|
| beforeProcessing, dispose, getContext, getSharedExecutor, init |

| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |

| Methods inherited from interface org.carrot2.core.IProcessingComponent |
|---|
| beforeProcessing, dispose, init |
| Field Detail |
|---|
public String query
public List<Document> documents
public List<Cluster> clusters
public int ignoreWordIfInFewerDocs
public double ignoreWordIfInHigherDocsPercent
public double minBaseClusterScore
public int maxBaseClusters
public int minBaseClusterSize
public int maxClusters
public double mergeThreshold
public double maxPhraseOverlap
public double mostGeneralPhraseCoverage
public int maxDescPhraseLength
public int maxPhrases
public double singleTermBoost
public int optimalPhraseLength
public double optimalPhraseLengthDev
public double documentCountBoost
public final BasicPreprocessingPipeline preprocessingPipeline
public final MultilingualClustering multilingualClustering
| Constructor Detail |
|---|
public STCClusteringAlgorithm()
| Method Detail |
|---|
public void process() throws ProcessingException
Performs STC clustering of documents.
Specified by: process in interface IProcessingComponent
Overrides: process in class ProcessingComponentBase
Throws: ProcessingException - when processing failed. If thrown, the IProcessingComponent.afterProcessing() method will be called and the component will be ready to accept further requests or to be disposed of. Finally, the exception will be rethrown from the controller method that caused the component to perform processing.

public void afterProcessing()
Memory cleanups.
Specified by: afterProcessing in interface IProcessingComponent
Overrides: afterProcessing in class ProcessingComponentBase
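The failure contract described for process() can be sketched with stand-in types: the caller runs process(), always follows it with afterProcessing(), and lets the exception propagate. Everything below (`LifecycleSketch`, `SketchProcessingException`, `Component`, `runOnce`, `trace`) is hypothetical and only mirrors the behaviour stated above. It is not the Carrot2 controller, and whether the real ProcessingException is checked or unchecked is not stated here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration, not part of Carrot2.
public class LifecycleSketch {

    /** Stand-in for the exception thrown when processing fails. */
    public static class SketchProcessingException extends Exception {
        public SketchProcessingException(String message) {
            super(message);
        }
    }

    /** Stand-in for the two lifecycle methods documented above. */
    public interface Component {
        void process() throws SketchProcessingException;

        void afterProcessing();
    }

    /** Runs one request: afterProcessing() is called even if process() throws. */
    public static void runOnce(Component component) throws SketchProcessingException {
        try {
            component.process();
        } finally {
            component.afterProcessing(); // cleanup, then any exception propagates to the caller
        }
    }

    /** Records the order of lifecycle events for a succeeding or failing component. */
    public static List<String> trace(final boolean fail) {
        final List<String> events = new ArrayList<>();
        Component component = new Component() {
            @Override
            public void process() throws SketchProcessingException {
                events.add("process");
                if (fail) {
                    throw new SketchProcessingException("processing failed");
                }
            }

            @Override
            public void afterProcessing() {
                events.add("afterProcessing");
            }
        };
        try {
            runOnce(component);
        } catch (SketchProcessingException e) {
            events.add("rethrown: " + e.getMessage());
        }
        return events;
    }

    public static void main(String[] args) {
        System.out.println(trace(false)); // prints [process, afterProcessing]
        System.out.println(trace(true));  // prints [process, afterProcessing, rethrown: processing failed]
    }
}
```

Placing the cleanup in a finally block is one way to keep the component reusable after a failure, which matches the statement that it remains ready to accept further requests.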
Please refer to project documentation at http://project.carrot2.org