2006 CombiningLinguisticExtractRelations

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Relation Recognition from Text Algorithm, Dependency Grammar-based Relation Recognition Algorithm, LEILA System

Notes

Contributions

  1. Applies Syntactic Parsing information to Supervised Relation Recognition with high F-score.
  2. First known application of kNN and SVM to Pattern selection. Past approaches use Heuristics or [[.
  3. Proposes a Text Graph representation that is expressive and robust.
  4. The patterns reported in the empirical studies are interesting and reasonable. (see Comments)
  5. Achieves good empirical results on several datasets.
  6. Their system is available to test.

Input/Output

Input

  • positive examples
  • (relationship direction? one-to-one, one-to-many, many-to-one?)
  • annotated data (POS, Link, NER(e.g. person,date,...))

= Output

  • relation instances (& duplicates)
  • and their associated patterns

Patterns

Questions, Patterns

  • How are the boundaries of multi-word entities detected?

Algorithm

= Discovery Phase (Train Phase)

  • patterns are constructed from all sentences with both entities.

= Assessment Phase (Train Phase)

  • negative examples (counterexamples) are gathered
  • negative patterns are constructed
  • discriminative model induced

= Harvesting Phase (Test Phase)

= Questions, Algorithm

  • How does the counterexample selection know which entity is required to be fixed. E.g. in "Chopin"/"1957", would the algorithm know that the relationship is one-to-many and that the 'many' is associated with the year?

General Questions

  • What is the relationship of their approach with [Bunescu and Mooney, 2005] and with [Chiang and Yu, 2005]?
    • E.g. what is the difference in their definition of "shortest-path"?
  • It would be interesting to see all competitors evaluated.
  • The patterns reported in the empirical studies do not demonstrate the benefit of shortest path or of replacement nouns and adjectives. For example, the pattern used in figure 3 "<X> was <ADJECTIVE> among <Y>", is not one of the patterns reported in 4.2.1.
  • Why the discrepancy between the Snowball F-score performance of ~30% they report and the 80%+ figures reported in [Agichtein and Gravano, 2000]. This especially disconcerting given that the main pattern reported is the same one that Snowball (and DIPRE) have been shown to discover: "Y-based X".
  • The statements around the contribution of anaphora resolution are unconvincing. Clearly Snowball would also benefit directly from accurate tagging of anaphoras as entities.
  • There is no support for text that occurs before or after the entities, as in Snowball. This is a minor issue given that the value of prefiller and postfiller patterns.
  • The size of the test sets are very small; in the hundreds.
  • The size of the train sets is not reported

Cited By

Quotes

Abstract

  • The World Wide Web provides a nearly endless source of knowledge, which is mostly given in natural language. A first step towards exploiting this data automatically could be to extract pairs of a given semantic relation from text documents - for example all pairs of a person and her birthdate. One strategy for this task is to find text patterns that express the semantic relation, to generalize these patterns, and to apply them to a corpus to find new pairs. In this paper, we show that this approach profits significantly when deep linguistic structures are used instead of surface text patterns. We demonstrate how linguistic structures can be represented for machine learning, and we provide a theoretical analysis of the pattern matching approach. We show the benefits of our approach by extensive experiments with our prototype system LEILA.

References

  • [1] Eugene Agichtein, L. Gravano. Snowball: extracting relations from large plain-text collections. ACM 2000.
  • [2] J. Aslam, S. Decatur. On the sample complexity of noise-tolerant learning. Information Processing Letters 1996.
  • [3] S. Brin. Extracting patterns and relations from the world wide web. WWW 1999.
  • [4] Paul Buitelaar, D. Olejnik, M. Sintek. A protege plug-in for ontology extraction from text based on linguistic analysis. ESWS 2004.
  • [5] Paul Buitelaar, S. Ramaka. Unsupervised ontology-based semantic tagging for knowledge markup. Workshop on Learning in Web Search at ICML 2005.
  • [6] Razvan C. Bunescu, Raymond Mooney. A Shortest Path Dependency Kernel for Relation Extraction. EMNLP 2005.
  • [7] M. Cali®, Raymond Mooney. Relational learning of pattern-match rules for information extraction. ACL-97 Workshop in Natural Language Learning 1997.
  • [8] V. Cherkassky, M. Yunqian. Practical selection of SVM parameters and noise estimation for SVM regression. Neural Networks 2004
  • [9] Philipp Cimiano, G. Ladwig, Steffen Staab. Gimme the context: Contextdriven automatic semantic annotation with cpankow. WWW 2005.
  • [10] Philipp Cimiano, J. VÄolker. Text2onto - a framework for ontology learning and data-driven change discovery. NLDB 2005.
  • [11] Philipp Cimiano, J. VÄolker. Towards large-scale, open-domain and ontology-based named entity classi¯cation. RANLP 2005.
  • [12] Oren Etzioni, Michael J. Cafarella, D. Downey, S. Kok, A. Popescu, T. Shaked, S. Soderland, D. S. Weld, A. Yates. Web-scale information extraction in knowitall (preliminary results). WWW 2004.
  • [13] A. Finn, N. Kushmerick. Multi-level boundary classi¯cation for information extraction. ECML 2004.
  • [14] Dayne Freitag, N. Kushmerick. Boosted wrapper induction. American Nat. Conference on AI 2000.
  • [15] J. Graupmann. Concept-based search on semi-structured data exploiting mined semantic relations. EDBT Workshops 2004.
  • [16] A. Hearst. Automatic acquisition of hyponyms from large text corpora. ICCL 1992.
  • [17] F. C. J. Iria. Relation extraction for mining the semantic web, 2005.
  • [18] Thorsten Joachims. Learning to Classify Text Using Support Vector Machines. PhD thesis, Dortmund, 2002.
  • [19] M. J. Kearns, Robert E. Schapire. E±cient distribution-free learning of probabilistic concepts. Computational Learning Theory and Natural Learning Systems, 1994.
  • [20] D. Lin, Patrick Pantel. Dirt: Discovery of inference rules from text. KDD 2001.
  • [21] Alexander Maedche, Steffen Staab. Discovering conceptual relations from text. ECAI 2000.
  • [22] R. Montague. Universal grammar. Formal Philosophy. Selected Papers of Richard Montague. Yale University Press, 1974.
  • [23] D. Ravichandran, Eduard Hovy. Learning surface text patterns for a question answering system. ACL 2002.
  • [24] E. Rilo®. Automatically generating extraction patterns from untagged text. Annual Conference on AI 1996.
  • [25] H. U. Simon. General bounds on the number of examples needed for learning probabilistic concepts. COLT 1993.
  • [26] D. Sleator, D. Temperley. Parsing english with a link grammar. International Workshop on Parsing Technologies 1993.
  • [27] R. Snow, Daniel Jurafsky, A. Y. Ng. Learning syntactic patterns for automatic hypernym discovery. NIPS 2005.
  • [28] S. Soderland. Learning information extraction rules for semi-structured and free text. Machine Learning, pages 233{272, 1999.
  • [29] F. M. Suchanek, G. Ifrim, G. Weikum. Combining Linguistic and Statistical Analysis to Extract Relations from Web Documents. Technical Report MPI-I-2006-5-004 at the Max-Planck-Institute for Computer Science SaarbrÄucken/Germany.
  • [30] F. M. Suchanek, G. Ifrim, G. Weikum. LEILA: Learning to Extract Information by Linguistic Analysis. OLP Workshop at COLING/ACL 2006
  • [31] S. Soderland, D. Fisher, J. Aseltine, W. Lehnert. Crystal: Inducing a conceptual dictionary. IJCAI 1995.
  • [32] F. Xu, H. U. Krieger. Integrating shallow and deep NLP for information extraction. RANLP 2003.
  • [33] F. Xu, D. Kurz, J. Piskorski, S. Schmeier. Term extraction and mining term relations from free-text documents in the ¯nancial domain. BIS 2002.
  • [34] R. Yangarber, Ralph Grishman, P. Tapanainen, S. Huttunen. Automatic acquisition of domain knowledge for information extraction. ICCL 2000.
  • [35] R. Yangarber, W. Lin, Ralph Grishman. Unsupervised learning of generalized names. ICCL 2002.
  • [36] L. Zhang, Y. Yu. Learning to generate CGs from domain speci¯c sentences. LNCS, 2120:44{57, 2001.

BibTeX

@InProceedings{conf/kdd/SuchanekIW06,

 title =	"Combining linguistic and statistical analysis to

extract relations from web documents",

 author =	"Fabian M. Suchanek and Georgiana Ifrim and Gerhard

Weikum",

 bibdate =	"2006-10-05",
 bibsource =	"DBLP,

http://dblp.uni-trier.de/db/conf/kdd/kdd2006.html#SuchanekIW06",

 booktitle =	"KDD",
 booktitle =	"Proceedings of the Twelfth {ACM} {SIGKDD}

International Conference on Knowledge Discovery and Data Mining, Philadelphia, {PA}, {USA}, August 20-23, 2006",

 publisher =	"ACM",
 year = 	"2006",
 editor =	"Tina Eliassi-Rad and Lyle H. Ungar and Mark Craven and

Dimitrios Gunopulos",

 ISBN = 	"1-59593-339-5",
 pages =	"712--717",
 URL =  	"http://dx.doi.org/10.1145/1150402.1150492",

} ,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2006 CombiningLinguisticExtractRelationsFabian M. Suchanek
Gerhard Weikum
Georgiana Ifrim
Combining Linguistic and Statistical Analysis to Extract Relations from Web DocumentsProceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mininghttp://www.mpi-inf.mpg.de/~ifrim/publications/kdd2006.pdf10.1145/1150402.11504922006