2003 MiningTopicSpecificConceptsAndDefsOnTheWeb

Jump to: navigation, search

Subject Headings: Topic Analysis Task, Web Search Algorithm.


Cited By


  • (Chuang & Chien, 2004) ⇒ Shui-Lung Chuang, and Lee-Feng Chien. (2004). “A Practical Web-based Approach to Generating Topic Hierarchy for Text Segments.” In: Proceedings of the thirteenth ACM International Conference on Information and knowledge management (CIKM 2004). doi:10.1145/1031171.1031193
    • QUOTE: It is crucial in many information systems to organize short text segments, such as keywords in documents and queries from users, into a well-formed topic hierarchy. … Other such related works include topic finding, e.g., finding the definitions or subtopic terms for a given topic [15, 6]. Different from their work, the text segments we consider may not be associated with a set of documents. Instead, we uses the Web as a global corpus to discover the similarity relationships between text segments. ...


Author Keywords

Web content mining, domain concept mining, definition mining, knowledge compilation, information integration.


Traditionally, when one wants to learn about a particular topic, one reads a book or a survey paper. With the rapid expansion of the Web, learning in-depth knowledge about a topic from the Web is becoming increasingly important and popular. This is also due to the Web's convenience and its richness of information. In many cases, learning from the Web may even be essential because in our fast changing world, emerging topics appear constantly and rapidly. There is often not enough time for someone to write a book on such topics. To learn such emerging topics, one can resort to research papers. However, research papers are often hard to understand by non-researchers, and few research papers cover every aspect of the topic. In contrast, many Web pages often contain intuitive descriptions of the topic. To find such Web pages, one typically uses a search engine. However, current search techniques are not designed for in-depth learning. Top ranking pages from a search engine may not contain any description of the topic. Even if they do, the description is usually incomplete since it is unlikely that the owner of the page has good knowledge of every aspect of the topic. In this paper, we attempt a novel and challenging task, mining topic-specific knowledge on the Web. Our goal is to help people learn in-depth knowledge of a topic systematically on the Web. The proposed techniques first identify those sub-topics or salient concepts of the topic, and then find and organize those informative pages, containing definitions and descriptions of the topic and sub-topics, just like those in a book. Experimental results using 28 topics show that the proposed techniques are highly effective.



 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2003 MiningTopicSpecificConceptsAndDefsOnTheWebBing Liu
Chee Wee Chin
Hwee Tou Ng
Mining Topic-Specific Concepts and Definitions on the WebProceedings of the 12th International Conference on World Wide Webhttp://www.cs.uic.edu/~liub/publications/WWW-2003.pdf10.1145/775152.7751882003