2014 OpenDomainQuantityQueriesonWebT

From GM-RKB

Jump to navigation Jump to search

(Sarawagi & Chakrabarti, 2014) ⇒ Sunita Sarawagi, and Soumen Chakrabarti. (2014). “Open-domain Quantity Queries on Web Tables: Annotation, Response, and Consensus Models.” In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2014) Journal. ISBN:978-1-4503-2956-9 doi:10.1145/2623330.2623749

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Collective inference; number and unit extraction; retrieval models; [[selection process; web tables

Abstract

Over 40% of columns in hundreds of millions of Web tables contain numeric quantities. Tables are a richer source of structured knowledge than free text. We harness Web tables to answer queries whose target is a quantity with natural variation, such as net worth of zuckerburg, battery life of ipad, half life of plutonium, and calories in pizza. Our goal is to respond to such queries with a ranked list of quantity distributions, suitably represented. Apart from the challenges of informal schema and noisy extractions, which have been known since tables were used for non-quantity information extraction, we face additional problems of noisy number formats, as well as unit specifications that are often contextual and ambiguous.

Early “hardening” of extraction decisions at a table level leads to poor accuracy. Instead, we use a probabilistic context free grammar (PCFG) based unit extractor on the tables, and retain several top-scoring extractions of quantity and numerals. Then we inject these into a new collective inference framework that makes global decisions about the relevance of candidate table snippets, the interpretation of the query's target quantity type, the value distributions to be ranked and presented, and the degree of consensus that can be built to support the proposed quantity distributions. Experiments with over 25 million Web tables and 350 diverse queries show robust, large benefits from our quantity catalog, unit extractor, and collective inference.

References

;

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2014 OpenDomainQuantityQueriesonWebT	Soumen Chakrabarti Sunita Sarawagi			Open-domain Quantity Queries on Web Tables: Annotation, Response, and Consensus Models				10.1145/2623330.2623749		2014

Retrieved from "http://www.gabormelli.com/RKB/index.php?title=2014_OpenDomainQuantityQueriesonWebT&oldid=850454"

Facts

... more about "2014 OpenDomainQuantityQueriesonWebT"

Sunita Sarawagi + and Soumen Chakrabarti +

10.1145/2623330.2623749 +

Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining +

Open-domain Quantity Queries on Web Tables: Annotation, Response, and Consensus Models +

2014 +