Text-Document Clustering Algorithm

From GM-RKB
(Redirected from Text Clustering Algorithm)
Jump to: navigation, search

A Text-Document Clustering Algorithm is a domain-specific clustering algorithm that can be applied by a text-document clustering system to solve the text-document clustering task.



References

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000

  • (Steinbach, 2000) ⇒ Michael Steinbach, George Karypis, and Vipin Kumar. (2000). “A Comparison of Document Clustering Techniques.” In: Proceedings of Workshop at KDD-2000 on Text Mining.
    • We use two metrics for evaluating cluster quality: entropy, which provides a measure of “goodness” for un-nested clusters or for the clusters at one level of a hierarchical clustering, and the F-measure, which measures the effectiveness of a hierarchical clustering. (The F measure was recently extended to document hierarchies in [5].)

1999

1997

1992