2006 EspressoAutoHarvestingSemanticRelations

(Pantel & Pennacchiotti, 2006) ⇒ Patrick Pantel, and Marco Pennacchiotti. (2006). “Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations.” In: Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics (ACL 2006).

Subject Headings: Relation Mention Recognition Algorithm, Semi-Supervised Learning Algorithm, Espresso Algorithm.

Notes

It shows Performance Analysis on several types of Semantic Relations: IsA Relation, PartOf Relation, Succession Relation, Reaction Relation, and Production Relation.
It makes use of Surface Patterns generated from a Pattern Learning Algorithm (they test on patterns generated by (Ravichandran and Hovy, 2002)).
It does not reference Snowball. Unclear how their Patterns and approach differ.
It might be related to their other paper at ACL 2006: (Pennacchiotti & Pantel, 2006)

Cited By

~230 http://scholar.google.com/scholar?q=%22Espresso%3A+Leveraging+Generic+Patterns+for+Automatically+Harvesting+Semantic+Relations%22+2006

2007

(Suchanek et al., 2007) ⇒ Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. (2007). “YAGO: A Core of Semantic Knowledge Unifying WordNet and Wikipedia.” In: Proceedings of the 16th International Conference on World Wide Web (WWW 2007) [doi:10.1145/1242572.1242667]

Quotes

Abstract

In this paper, we present Espresso, a weakly-supervised, general-purpose, and accurate algorithm for harvesting semantic relations. The main contributions are: i) a method for exploiting generic patterns by filtering incorrect instances using the Web; and ii) a principled measure of pattern and instance reliability enabling the filtering algorithm. We present an empirical comparison of Espresso with various state of the art systems, on different size and genre corpora, on extracting various general and specific relations. Experimental results show that our exploitation of generic patterns substantially increases system recall with small effect on overall precision.

References

1. Matthew Berland, Eugene Charniak, Finding parts in very large corpora, Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, p.57-64, June 20-26, 1999, College Park, Maryland doi:10.3115/1034678.1034697
2. Brown, T. L.; LeMay, H. E.; Bursten, B. E.; and Burdge, J. R. (2003). Chemistry: The Central Science, Ninth Edition. Prentice Hall.
3. Sharon A. Caraballo, Automatic construction of a hypernym-labeled noun hierarchy from text, Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, p.120-126, June 20-26, 1999, College Park, Maryland doi:10.3115/1034678.1034705
4. Thomas M. Cover, Joy A. Thomas, Elements of information theory, Wiley-Interscience, New York, NY, 1991
5. David Day, John Aberdeen, Lynette Hirschman, Robyn Kozierok, Patricia Robinson, Marc Vilain, Mixed-initiative development of language processing systems, Proceedings of the fifth Conference on Applied Natural Language Processing, p.348-355, March 31-April 03, 1997, Washington, DC doi:10.3115/974557.974608
6. Downey, D.; Oren Etzioni; and Soderland, S. (2005). A Probabilistic model of redundancy in information extraction. In: Proceedings of IJCAI-05. pp. 1034--1041. Edinburgh, Scotland.
7. Oren Etzioni, Michael J. Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, Alexander Yates, Unsupervised named-entity extraction from the web: an experimental study, Artificial Intelligence, v.165 n.1, p.91-134, June 2005 doi:10.1016/j.artint.2005.03.001
8. C. Fellbaum. (1998). WordNet: An Electronic Lexical Database. MIT Press.
9. Maayan Geffet, Ido Dagan, The distributional inclusion hypotheses and lexical entailment, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, p.107-114, June 25-30, 2005, Ann Arbor, Michigan doi:10.3115/1219840.1219854
10. Roxana Girju, Adriana Badulescu, Dan Moldovan, Automatic Discovery of Part-Whole Relations, Computational Linguistics, v.32 n.1, p.83-135, March 2006 doi:10.1162/coli.2006.32.1.83
11. Marti A. Hearst, Automatic acquisition of hyponyms from large text corpora, Proceedings of the 14th conference on Computational linguistics, August 23-28, 1992, Nantes, France doi:10.3115/992133.992154
12. Donald Hindle, Noun classification from predicate-argument structures, Proceedings of the 28th annual meeting on Association for Computational Linguistics, p.268-275, June 06-09, 1990, Pittsburgh, Pennsylvania doi:10.3115/981823.981857
13. Justeson J. S. and Katz S. M. (1995). Technical Terminology: some linguistic properties and algorithms for identification in text. In: Proceedings of ICCL-95. pp.539--545. Nantes, France.
14. Chin-Yew Lin, Eduard Hovy, The automated acquisition of topic signatures for text summarization, Proceedings of the 18th conference on Computational linguistics, p.495-501, July 31-August 04, 2000, Saarbrücken, Germany doi:10.3115/990820.990892
15. Dekang Lin, Patrick Pantel, Concept discovery from text, Proceedings of the 19th International Conference on Computational linguistics, p.1-7, August 24-September 01, 2002, Taipei, Taiwan doi:10.3115/1072228.1072372
16. Gideon S. Mann, Fine-grained proper noun ontologies for question answering, COLING-02 on SEMANET: building and using semantic networks, p.1-7, September 01, 2002 doi:10.3115/1118735.1118746
17. Patrick Pantel, and Ravichandran, D. (2004). Automatically labeling semantic classes. In: Proceedings of HLT/NAACL-04. pp. 321--328. Boston, MA.
18. Patrick Pantel, Deepak Ravichandran, Eduard Hovy, Towards terascale knowledge acquisition, Proceedings of the 20th International Conference on Computational Linguistics, p.771-es, August 23-27, 2004, Geneva, Switzerland doi:10.3115/1220355.1220466
19. Marius Paşca, and Sanda M. Harabagiu (2001). The informative role of WordNet in Open-Domain Question Answering. In: Proceedings of NAACL-01 Workshop on WordNet and Other Lexical Resources. pp. 138--143. Pittsburgh, PA.
20. * (Ravichandran and Hovy, 2002) ⇒ Deepak Ravichandran, and Eduard Hovy. (2002). “Learning surface text patterns for a Question Answering system./ In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, July 07-12, 2002, Philadelphia, Pennsylvania.
21. R. Riloff, and Shepherd, J. (1997). A corpus-based approach for building semantic lexicons. In: Proceedings of EMNLP-97.
22. Siegel, S. and Castellan Jr., N. J. (1988). Nonparametric Statistics for the Behavioral Sciences. McGraw-Hill.
23. Szpektor, I.; Tanev, H.; I. Dagan; and Coppola, B. (2004). Scaling web-based acquisition of entailment relations. In: Proceedings of EMNLP-04. Barcelona, Spain.

,

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2006 EspressoAutoHarvestingSemanticRelations	Patrick Pantel Marco Pennacchiotti			Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations		Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics	http://acl.ldc.upenn.edu/P/P06/P06-1015.pdf			2006