2006 AutoDiscovOfPartWholeRelations

From GM-RKB

Subject Headings: Semantic Relation Recognition Algorithm, PartOf Relation

Notes

Cited By

Quotes

Abstract

  • An important problem in knowledge discovery from text is the automatic extraction of semantic relations. This paper presents a supervised, semantically intensive, domain independent approach for the automatic detection of part-whole relations in text. First an algorithm is described that identifies lexico-syntactic patterns that encode part-whole relations. A difficulty is that these patterns also encode other semantic relations, and a learning method is necessary to discriminate whether or not a pattern contains a part-whole relation. A large set of training examples have been annotated and fed into a specialized learning system that learns classification rules. The rules are learned through an iterative semantic specialization (ISS) method applied to noun phrase constituents. Classification rules have been generated this way for different patterns such as genitives, noun compounds, and noun phrases containing prepositional phrases to extract part-whole relations from them. The applicability of these rules has been tested on a test corpus obtaining an overall average precision of 80.95% and recall of 75.91%. The results demonstrate the importance of word sense disambiguation for this task. They also demonstrate that different lexico-syntactic patterns encode different semantic information and should be treated separately in the sense that different clarification rules apply to different patterns.
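
The ambiguity the abstract describes can be illustrated with a minimal sketch. The regular expressions, the function name `candidate_pairs`, and the example phrases below are hypothetical simplifications chosen for illustration; they are not the paper's algorithm, which operates on parsed noun phrases and learns classification rules with the ISS method. The sketch only shows that the same surface pattern yields both part-whole and non-part-whole candidates, which is why a later classification step is needed.

```python
import re

# Hypothetical, simplified surface patterns (an assumption for illustration;
# the paper itself works on syntactically parsed noun phrases).
PATTERNS = {
    "s-genitive": re.compile(r"\b(?P<whole>\w+)'s (?P<part>\w+)"),
    "of-genitive": re.compile(r"\b(?P<part>\w+) of (?:the |a |an )?(?P<whole>\w+)"),
}


def candidate_pairs(text):
    """Return (pattern_name, part, whole) candidates found in text.

    These are unclassified candidates only: a match does not mean the
    pair really stands in a part-whole relation.
    """
    found = []
    for name, regex in PATTERNS.items():
        for match in regex.finditer(text):
            found.append((name, match.group("part"), match.group("whole")))
    return found


# Made-up example phrases: the first and third are plausible part-whole
# pairs, the second and fourth are not, yet all four match a pattern.
examples = ["the car's engine", "Mary's toy", "the door of the car", "a cup of tea"]
for phrase in examples:
    print(phrase, "->", candidate_pairs(phrase))
```

Every phrase produces a candidate, so pattern matching alone cannot separate "car's engine" from "Mary's toy"; some form of semantic classification over the two nouns has to make that decision.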

References

  • Berland, Matthew and Eugene Charniak. (1999). Finding parts in very large corpora. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL 1999), pages 57–64, University of Maryland.
  • Brill, Eric. (1995). Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging. Computational Linguistics, 21(4):543–566.
  • Charniak, Eugene. (2000). A maximum-entropy-inspired parser. In: Proceedings of the 1st Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2000), pages 132–139, Seattle, WA.
  • Downing, Pamela. (1977). On the creation and use of English compound nouns. Language, 53(4):810–842.
  • Dunning, Ted. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19:61–74.
  • Evens, Martha W., Bonnie C. Litowitz, Judith A. Markowitz, Raoul N. Smith, and Oswald Werner. (1980). Lexical-semantic relations: A comparative survey. Linguistic Research, pages 187–219.
  • Fellbaum, Christiane. (1998). WordNet — An Electronic Lexical Database. MIT Press, Cambridge, MA.
  • Finin, Timothy W. (1980). The Semantic Interpretation of Compound Nominals. Ph.D. thesis, University of Illinois at Urbana-Champaign.
  • Freeze, Ray. (1992). Existentials and other locatives. Language, 68:553–595.
  • Gildea, Daniel and Daniel Jurafsky. (2002). Automatic labeling of semantic roles. Computational Linguistics, 28(3):245–288.
  • (Girju et al., 2003) ⇒ Roxana Girju, and Dan Moldovan. (2003). “Learning semantic constraints for the automatic discovery of part-whole relations.” In: Proceedings of the 3rd Human Language Technology Conference / 4th Meeting of the North American Chapter of the Association for Computational Linguistics Conference (HLT-NAACL 2003).
  • (Girju et al., 2001) ⇒ Roxana Girju. (2001). “Answer Fusion with On-line Ontology Development.” In: Proceedings of the 2nd Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL 2001) - Student Research Workshop.
  • (Girju et al., 2005) ⇒ Roxana Girju, Dan Moldovan, Marta Tatu, and Daniel Antohe. (2005). “On the Semantics of Noun Compounds.” In: Computer Speech and Language — Special Issue on Multiword Expressions (in press).
  • Hearst, Marti. (1992). Acquisition of hyponyms from large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics (COLING-92), pages 539–545, Nantes, France.
  • Hearst, Marti. (1998). Automated discovery of WordNet relations. In: Christiane Fellbaum, editor, An Electronic Lexical Database and Some of Its Applications. MIT Press, Cambridge, MA, pages 131–151.
  • Iris, Madelyn, Bonnie Litowitz, and Martha Evens. (1988). Problems with part-whole relation. In: M. W. Evens, editor, Relational Models of the Lexicon: Representing Knowledge in Semantic Networks. Cambridge University Press, Cambridge, pages 261–288.
  • Jensen, Per Anker and Carl Vikner. (1996). The double nature of the verb have. LAMBDA, 21:25–37.
  • Marcus. (2002). Adding semantic annotation to the Penn Treebank. In: Proceedings of the 2nd Human Language Technology Conference (HLT 2002), pages 252–256, San Diego, CA.
  • Lapata, Mirella. (2002). The disambiguation of nominalisations. Computational Linguistics, 28(3):357–388.
  • Pragmatics and word meaning. Journal of Linguistics, 34(2):387–414.
  • Lauer, Mark and Mark Dras. (1994). A probabilistic model of compound nouns. In: Proceedings of the 7th Australian Joint Conference on Artificial Intelligence, pages 474–481, Armidale, Australia.
  • Levi, Judith. (1978). The Syntax and Semantics of Complex Nominals. Academic Press, New York.
  • Marcus, Mitchell P., Beatrice Santorini, and Mary Ann Marcinkiewicz. (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330.
  • Moldovan, Dan and Adriana Badulescu. (2005). A semantic scattering model for the automatic interpretation of genitives. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), pages 891–898, Vancouver, BC, Canada.
  • Moldovan, Dan, Adriana Badulescu, Marta Tatu, Daniel Antohe, and Roxana Girju. (2004). Models for the semantic classification of noun phrases. In: Proceedings of the Human Language Technology Conference (HLT-NAACL) 2004, Computational Lexical Semantics Workshop, Boston, MA.
  • Moldovan, Dan and Roxana Girju. (2001). An interactive tool for the rapid development of knowledge bases. International Journal on Artificial Intelligence Tools, 10(1–2):65–86.
  • Moldovan, Dan, Sanda M. Harabagiu, Roxana Girju, Paul Morarescu, Finley Lacatusu, Adrian Novischi, Adriana Badulescu, and Orest Bolohan. (2002). LCC tools for question answering. In: Proceedings of the 11th Meeting of the Text Retrieval Conference (TREC 2002), pages 388–397, Gaithersburg, MD.
  • Morris, Jane and Graeme Hirst. (2004). Non-classical lexical semantic relations. In: Proceedings of the 4th Human Language Technology Conference / 5th Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2004) - Workshop on Computational Lexical Semantics, pages 46–51, Boston, MA.
  • Novischi, Adrian, Dan Moldovan, Paul Parker, Adriana Badulescu, and Bob Hauser. (2004). LCC’s WSD systems for Senseval 3. In: Proceedings of Senseval 3 (ACL 2004), Barcelona, Spain.
  • Pustejovsky, James, Sabine Bergler, and Peter Anick. (1993). Lexical semantic techniques for corpus analysis. Computational Linguistics, 19(2):331–358.
  • Quinlan, J. Ross. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco, CA.
  • Resnik, Philip. (1996). Selectional constraints: An information-theoretic model and its computational realization. Cognition, 61:127–159.
  • Resnik, Philip and Marti Hearst. (1993). Structural ambiguity and conceptual relations. In: Proceedings of the 31st Meeting of the Association for Computational Linguistics (ACL 1993) - 1st Workshop on Very Large Corpora: Academic and Industrial Perspectives, pages 58–64, Ohio State University, Columbus, OH.
  • Rosario, Barbara and Marti Hearst. (2001). Classifying the semantic relations in noun compounds via a domain-specific lexical hierarchy. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2001), pages 82–90, Pittsburgh, PA.
  • Rosario, Barbara, Marti Hearst, and Charles Fillmore. (2002). The descent of hierarchy, and selection in relational semantics. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 247–254, University of Pennsylvania.
  • Schafer, Robin. (1995). The SLP/ILP distinction in have-predication. In: M. Simons and T. Galloway, editors, Proceedings from Semantics and Linguistic Theory V. Cornell University Department of Linguistics, pages 292–309, Ithaca.
  • Siegel, Sidney and John Castellan. (1988). Nonparametric Statistics for the Behavioral Sciences. McGraw-Hill, New York.
  • Simons, Peter. (1987). Parts. A Study in Ontology. Clarendon Press, Oxford.
  • Simons, Peter. (1991). Part/whole II: Mereology since 1900. In: H. Burkhardt and B. Smith, editors, Handbook of Metaphysics and Ontology. Philosophia, Munich, pages 672–675.
  • Spärck Jones, K. (1983). Compound noun interpretation problems. In: F. Fallside and W. A. Woods, editors, Computer Speech Processing. Prentice-Hall, Englewood Cliffs, NJ, pages 363–380.
  • Tatu, Marta and Dan Moldovan. (2005). A semantic approach to recognizing textual entailment. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), pages 371–378, Vancouver, BC, Canada.
  • Thompson, Cynthia A., Roger Levy, and Christopher D. Manning. (2003). A generative model for Framenet semantic role labeling. In: Proceedings of the 14th

BibTeX

@article{DBLP:journals/coling/GirjuBM06,
 author    = {Roxana Girju and
              Adriana Badulescu and
              Dan I. Moldovan},
 title     = {Automatic Discovery of Part-Whole Relations.},
 journal   = {Computational Linguistics},
 volume    = {32},
 number    = {1},
 year      = {2006},
 pages     = {83-135},
 ee        = {http://dx.doi.org/10.1162/coli.2006.32.1.83},
 bibsource = {DBLP, http://dblp.uni-trier.de}
}


2006 AutoDiscovOfPartWholeRelations

  • Author: Adriana Badulescu, Roxana Girju, Dan I. Moldovan
  • Title: Automatic Discovery of Part-Whole Relations
  • Title URL: https://netfiles.uiuc.edu/girju/publications/papers/cl-2006.pdf
  • DOI: 10.1162/coli.2006.32.1.83