2006 OnDemandInformationExtraction

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Open Information Extraction Task, TF-IDF, On-demand Information Extraction Task.

Notes

Cited By

2007

Quotes

Abstract

At present, adapting an Information Extraction system to new topics is an expensive and slow process, requiring some knowledge engineering for each new topic. We propose a new paradigm of Information Extraction which operates 'on demand' in response to a user's query. On-demand Information Extraction (ODIE) aims to completely eliminate the customization effort. Given a user’s query, the system will automatically create patterns to extract salient relations in the text of the topic, and build tables from the extracted information using paraphrase discovery technology. It relies on recent advances in pattern discovery, paraphrase discovery, and extended named entity tagging. We report on experimental results in which the system created useful tables for many topics, demonstrating the feasibility of this approach.


References

  • ACE Home Page: http://www.ldc.upenn
  • DUC Home Page: http://duc.nist.gov
  • WordNet Home Page: http://wordnet.princeton.edu/
  • Kenji Abe, Shinji Kawasone, Tatsuya Asai, Hiroki Arimura and Setsuo Arikawa. (2002). “Optimized Substructure Discovery for Semi-structured Data”. In: Proceedings of the 6th European Conference on Principles an (PKDD-02)
  • Chinatsu Aone; Mila Ramos-Santacruz. (2000). “REES: A Large-Scale Relation and Event Extraction System” In: Proceedings of the 6th Applied Natural Language Processing Conference (ANLP-00)
  • Eugene Agichtein and L. Gravano. (2000). “Snowball: Extracting Relations from Large Plaintext Collections”. In: Proceedings of the 5th ACM International Conference on Digital Libraries (DL-00)
  • Regina Barzilay, and Kathleen R. McKeown. (2001). “Extracting Paraphrases from a Parallel Corpus.” In: Proceedings of the Annual Meeting of Association of Computational Linguistics/ and European Chapter of Association of Computational Linguistics (ACL/EACL-01)
  • Ralph Grishman and Beth Sundheim. (1996). “Message Understanding Conference - 6: A Brief History”, in: Proceedings of the 16th International Conference on Computational Linguistics (COLING-96)
  • Takaaki Hasegawa, Satoshi Sekine and Ralph Grishman. (2004). “Discovering Relations among Named Entities from Large Corpora”, In: Proceedings of the Annual Meeting of the Association of Computational Linguistics (ACL-04)
  • Ellen Riloff. (1996). “Automatically Generating Extraction Patterns from Untagged Text”. In: Proceedings of Thirteen National Conference on Artificial Intelligence (AAAI-96)
  • Satoshi Sekine, Kiyoshi Sudo and Chikashi Nobata. 2002 “Extended Named Entity Hierarchy” In: Proceedingseefings of the third International Conference on Language Resources and Evaluation (LREC-02)
  • Satoshi Sekine and Chikashi Nobata. (2003). “A survey for Multi-Document Summarization” In the proceedings of Text Summarization Workshop.
  • Satoshi Sekine. (2005). “Automatic Paraphrase Discovery based on Context and Keywords between NE Pairs”. In: Proceedings of International Workshop on Paraphrase (IWP-05)
  • Yusuke Shinyama, Satoshi Sekine and Kiyoshi Sudo. (2002). “Automatic Paraphrase Acquisition from News Articles”. In: Proceedings of the Human Language Technology Conference (HLT-02)
  • Kiyoshi Sudo, Satoshi Sekine, and Ralph Grishman. (2003). “An Improved Extraction Pattern Representation Model for Automatic IE Pattern Acquisition”. In: Proceedings of the Annual Meeting of Association of Computational Linguistics (ACL-03)
  • Roman Yangarber, Ralph Grishman, Pasi Tapanainen and Silja Huttunen. (2000). “Unsupervised Discovery of Scenario-level Patterns for Information Extraction”. In: Proceedings of 18th International Conference on Computational Linguistics (COLING-00),


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2006 OnDemandInformationExtractionSatoshi SekineOn-Demand Information ExtractionProceedings of the 44th Annual Meeting of the Association for Computational Linguisticshttp://acl.ldc.upenn.edu/P/P06/P06-2094.pdf2006