2005 InformationExtractionAutomatic


Subject Headings: Information Extraction, Relation Recognition, Natural Language Analysis, Human Language Technology, Natural Language Engineering





This article describes Information Extraction (IE), the process of deriving disambiguated quantifiable data from natural language texts in service of some pre-specified precise information need. The article: covers the origins of IE and the factors relevant to its deployment in applications contexts; presents scenarios in which the technology has been applied; breaks down the task into five subtasks and defines them; looks at recent developments in the field.

Preprint submitted to Elsevier Science, 18th November 2004.


  • ACE, Feb (2004). Annotation Guidelines for Entity Detection and Tracking (EDT). Available at http://www.ldc.upenn.edu/Projects/ACE/.
  • Douglas E. Appelt, (1999). An Introduction to Information Extraction. Artificial Intelligence Communications 12 (3), 161–172.
  • ARPA, (1995). Proceedings of the Sixth Message Understanding Conference (MUC-6). Defense Advanced Research Projects Agency, Morgan Kaufmann, California.
  • Sean Bechhofer, van Harmelen, F., Hendler, J., Ian Horrocks, McGuinness, D. L., Patel-Schneider, P. F., Stein, L. A., (2003). OWL Web Ontology Language Reference. Tech. rep., W3C Proposed Recommendation 15 December 2003, http://www.w3.org/TR/2003/PR-owl-ref-20031215/.
  • Berners-Lee, T., (1999). Weaving the Web. Orion Business Books.
  • Boguraev, B., Garigliano, R., Tait, J., (1995). Editorial. Natural Language Engineering. 1, Part 1.
  • Bontcheva, K., (2004). Open-source Tools for Creation, Maintenance, and Storage of Lexical Resources for Language Generation from Ontologies. In: Proceedings of 4th Language Resources and Evaluation Conference (LREC’04).
  • Cardie, C., (1997). Empirical Methods in Information Extraction. AI Magazine 18 (4).
  • Chinchor, N. A., April (1998). Overview of Proceedings of the Seventh Message Understanding Conference (MUC-7)/MET-2. In: Proceedings of the Seventh Message Understanding Conference (MUC-7). Fairfax, VA, 5 pages, http://www.itl.nist.gov/iaui/894.02/related_projects/muc/.
  • Cowie, J., Lehnert, W., (1996). Information Extraction. Communications of the ACM 39 (1), 80–91.
  • Hamish Cunningham, May (1999). Information Extraction: a User Guide (revised version). Research Memorandum CS–99–07, Department of Computer Science, University of Sheffield.
  • Hamish Cunningham, (2002). GATE, a General Architecture for Text Engineering. Computers and the Humanities 36, 223–254.
  • Hamish Cunningham, Scott, D. (Eds.), (2004). Special Issue of Natural Language Engineering on Software Architecture for Language Engineering. Cambridge University Press.
  • Daelemans, W., Osborne, M. (Eds.), May (2003). CoNLL-2003, 7th Conference on Computational Natural Language Learning. Edmonton, Canada.
  • Davies, J., Fensel, D., van Harmelen, F. (Eds.), (2002). Towards the Semantic Web: Ontology-driven Knowledge Management. Wiley.
  • Day, D., Aberdeen, J., Lynette Hirschman, Kozierok, R., Robinson, P., Vilain, M., (1997). Mixed-Initiative Development of Language Processing Systems. In: Proceedings of the 5th Conference on Applied Natural Language Processing (ANLP-97).
  • Dill, S., Eiron, N., Gibson, D., Gruhl, D., Guha, R., Jhingran, A., Kanungo, T., Rajagopalan, S., Tomkins, A., Tomlin, J. A., Zien, J. Y., (2003). SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation. In: Proceedings of WWW’03.
  • Domingue, J., Dzbor, M., Motta, E., (2004). Magpie: Supporting Browsing and Navigation on the Semantic Web. In: Nunes, N., Rich, C. (Eds.), Proceedings ACM Conference on Intelligent User Interfaces (IUI). pp. 191–197.
  • Fensel, D., Hendler, J., Wahlster, W., Lieberman, H. (Eds.), (2002). Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential. MIT Press.
  • Gaizauskas, R., Wilks, Y., (1998). Information Extraction: Beyond Document Retrieval. Journal of Documentation 54 (1), 70–105.
  • Grishman, R., (2001). Adaptive Information Extraction and Sublanguage Analysis. In: Proceedings of Workshop on Adaptive Text Extraction and Mining at Seventeenth International Joint Conference on Artificial Intelligence. Seattle, USA.
  • Grishman, R., Sundheim, B., Jun. (1996). Message understanding conference - 6: A brief history. In: Proceedings of the 16th International Conference on Computational Linguistics. Copenhagen.
  • Diana Maynard, Bontcheva, K., Hamish Cunningham, (2003). Towards a semantic extraction of Named Entities. In: Recent Advances in Natural Language Processing. Bulgaria.
  • Pazienza, M. T. (Ed.), (2003). Information Extraction in the Web Era. Springer-Verlag.
  • Popov, B., Kiryakov, A., Kirilov, A., Manov, D., Ognyanoff, D., Goranov, M., (2004). KIM – Semantic Annotation Platform. Natural Language Engineering.
  • SAIC, (1998). Proceedings of the Seventh Message Understanding Conference (MUC-7), http://www.itl.nist.gov/iaui/894.02/related_projects/muc/index.html.
  • Sundheim, B. (Ed.), (1995). Proceedings of the Sixth Message Understanding Conference (MUC-6). ARPA, Morgan Kaufmann, Columbia, MD.


2005 InformationExtractionAutomatic
Author: Hamish Cunningham
Title: Information Extraction, Automatic
Journal: Encyclopedia of Language and Linguistics
Title URL: http://gate.ac.uk/sale/ell2/ie/main.pdf
Year: 2005