WebTables System
Jump to navigation
Jump to search
A WebTables System is an Automated Information Extraction System for Structured Data developed at Google by Michael J. Cafarella and others.
- Context:
- It can crawl the Web to find small Relational Databases expressed using the HTML Table Tag.
- It can data mine the resulting extracted information
- It can introduce new data-centric applications (such as Schema Completion and Synonym Finding).
- See: Google Deep Web Crawler.
References
2009
- (Cafarella, 2009) ⇒ Michael J. Cafarella. (2009). “Extracting and Managing Structured Web Data." PhD Thesis, University of Washington.
2008
- (Cafarella et al., 2008) ⇒ Michael J. Cafarella, A. Halevy, D. Wang, E. Wu, and Y. Zhang. (2008). “Webtables: Exploring the power of tables on the web.” In: ProceedingsVery Large Data Base Endowment (VLDB 2008).