1999 LearningDictsForIEbyBootstrapping

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Semi-Supervised Named Entity Recognition Algorithm.

Notes

  • Presentation at http://www.cs.cmu.edu/~wcohen/10-707/ppts/Yi-Chia.ppt
  • Presents an unsupervised learning algorithm for named entity (and nominal) detection & classification
  • Uses a wordlist for each category that contains Words and Phrases that belong to a given category.
  • Uses a list of Lexical Patterns to specify contexts typically associated with a given category. E.g. "operates in x."
  • AutoSlog is used to generate the patterns.
  • Commences with a "seed list" of words that are known to be in-category.
  • The algorithm then uses bootstrapping to improve their pattern identification.
    • Finds the pattern that best matches the current wordlist.
    • "Best match" is determined by a combination of precision and coverage.
    • Update the pattern list.
    • Update the worlist of the words and phrases extracted by the new pattern.

Cited By

~443 http://scholar.google.com/scholar?cites=11190526739252407918

2005

2000

Quotes

Abstract



References

  • Wordnet: An On-line Lexical Database (context) - Miller - 1990
  • Combining Labeled and Unlabeled Data with Co-Training - Blum, Mitchell - 1998
  • Learning to Extract Symbolic Knowledge from the World Wide W.. - Craven, DiPasquo et al. - 1998
  • Learning Information Extraction Rules for Semi-structured an.. - Soderland - 1999
  • CRYSTAL: Inducing a conceptual dictionary - Soderland, Fisher et al. - 1995
  • Automatically Constructing a Dictionary for Information Extr.. - Riloff - 1993
  • Automatically Generating Extraction Patterns from Untagged T.. - Riloff
  • Learning information extraction patterns from examples - Huffman - 1996
  • An Empirical Study of Automated Dictionary Construction for .. - Riloff
  • Relational Learning Techniques for Natural Language Informat.. - Califf - 1998
  • Toward General-Purpose Learning for Information Extraction - Freitag - 1998
  • Noun-phrase Cooccurrence Statistics for Semi-automatic Seman.. - Roark, Charniak - 1998
  • A Corpus-based Approach for Building Semantic Lexicons - Riloff, Shepherd - 1997
  • Acquisition of Semantic Patterns for Information Extraction .. (context) - Berlin, Kim et al. - 1993
  • MUC-4 Proceedings (context) - of - 1992,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
1999 LearningDictsForIEbyBootstrappingEllen Riloff
Rosie Jones
Learning Dictionaries for Information Extraction by Multi-level BootstrappingProceedings of AAAI Conferencehttp://www.cs.utah.edu/~riloff/pdfs/aaai99.pdf1999