Text Document Clustering Task


A Text Document Clustering Task is a text-item clustering task in which the items to be clustered are text documents.
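
As a rough illustration of what such a task takes in and produces (a minimal sketch, not a method drawn from the references below), the following Python example groups a handful of short documents by bag-of-words cosine similarity using a simple single-pass "leader" clustering. The function names (`tokenize`, `cosine`, `cluster_documents`), the sample documents, and the `threshold` value of 0.3 are all assumptions chosen for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a document and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_documents(docs, threshold=0.3):
    """Single-pass (leader) clustering.

    Each document joins the most similar existing cluster if that similarity
    reaches `threshold` (an illustrative value); otherwise it starts a new
    cluster. Returns a list of clusters, each a list of document indices.
    """
    clusters = []   # document indices per cluster
    centroids = []  # summed term counts per cluster
    for i, doc in enumerate(docs):
        vec = Counter(tokenize(doc))
        best, best_sim = None, threshold
        for j, centroid in enumerate(centroids):
            sim = cosine(vec, centroid)
            if sim >= best_sim:
                best, best_sim = j, sim
        if best is None:
            clusters.append([i])
            centroids.append(vec)
        else:
            clusters[best].append(i)
            centroids[best] = centroids[best] + vec
    return clusters


# Hypothetical sample input: four short text documents.
docs = [
    "gene expression in cancer cells",
    "stock market prices fall",
    "cancer gene mutation in tumor cells",
    "market prices rise as stock demand grows",
]

print(cluster_documents(docs))  # → [[0, 2], [1, 3]]
```

The input is a collection of text documents and the output is a partition of their indices into groups. Practical systems typically replace the raw term counts and the single-pass assignment with weighted representations and iterative or hierarchical algorithms, but the task shape stays the same.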



References

2006

  • (Yoo et al., 2006) ⇒ Illhoi Yoo, Xiaohua Hu, and Il-Yeol Song. (2006). “Integration of Semantic-based Bipartite Graph Representation and Mutual Refinement Strategy for Biomedical Literature Clustering.” In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2006).
    • Document clustering was initially investigated for improving information retrieval (IR) performance because similar documents grouped by document clustering tend to be relevant to the same user queries [20] [21]. Document clustering, however, has not been widely used in IR systems [7] because document clustering algorithms were too slow or infeasible for very large document sets in the early days. As faster clustering algorithms have been introduced, they have been adopted in document clustering. Document clustering has been recently used to facilitate the nearest-neighbor search [3], to support an interactive document browsing paradigm [7] [10] [26] and to construct hierarchical topic structures [14]. Thus, as information grows exponentially, document clustering plays an important role for IR and text mining.
