2009 CuratingAndSearchTheAnnotWeb

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Wikipedia.

Notes

Cited By

Quotes

Abstract

  • We demonstrate CSAW, a system for Curating and Searching the Annotated Web. CSAW annotates named entities and quantities in Web-scale text corpora, and, where confident, connects these annotations with entries in an entity and type catalog such as Wikipedia. The semi-structured catalog, together with the unstructured corpus, forms a composite database that CSAW can then search using powerful reachability, proximity and aggregation primitives. Specifically, we can look for snippets with mentions of specific entities, entities of a specified type, quantities with specified types or units, find unions and intersections of snippet sets, and then aggregate evidence from snippet sets into ranked responses. Responses are not page URLs as in standard Web search, but ranked tables where the cells can be entity references, quantities, or token snippets. We will show a subset of CSAW’s capabilities, and describe the beginnings of a next-generation Web search API that significantly extends the capabilities of APIs provided by popular search engines today.

,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2009 CuratingAndSearchTheAnnotWebSoumen Chakrabarti
Ganesh Ramakrishnan
Amit Singh
Sayali Kulkarni
Somnath Banerjee
Curating and Searching the Annotated Webhttp://www.cse.iitb.ac.in/~soumen/doc/sigkdd2009d/CSAWDemo.pdf