- (Baker & McCallum, 1998) ⇒ L. Douglas Baker, and Andrew Kachites McCallum. (1998). “Distributional Clustering of Words for Text Classification.” In: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval. ISBN:1-58113-015-5 doi:10.1145/290941.290970
This paper describes the application of Distributional Clustering  to document classification. This approach clusters words into groups based on the distribution of class labels associated with each word. Thus, unlike some other unsupervised dimensionality-reduction techniques, such as Latent Semantic Indexing, we are able to compress the feature space much more aggressively, while still maintaining high document classification accuracy.
Experimental results obtained on three real-world data sets show that we can reduce the feature dimensionality by three orders of magnitude and lose only 2% accuracy - significantly better than Latent Semantic Indexing , class-based clustering , feature selection by mutual information  or Markov-blanket-based feature selection . We also show that less aggressive clustering sometimes results in improved classification accuracy over classification without clustering.
|1998 DistributionalClusteringofWords||L. Douglas Baker|
|Distributional Clustering of Words for Text Classification||10.1145/290941.290970||1998|