2008 NounPhraseBracketingGuidelinesV1


Subject Headings: NP Bracketing Task.

Notes

Quotes

1. Introduction

  • This document describes guidelines for bracketing noun phrase (NP) structure in the Penn Treebank. These guidelines supplement the Treebank II Guidelines (Bies et al., 1995). They are also based on, and overlap with, the Addendum for BioMedical Annotation (Warner et al., 2004).

An earlier version (0.9) of these guidelines was used in the annotation described in Vadas and Curran (2007), while this version was used in a subsequent pass over the data.

2. Bracketing NPs

  • The goal of our annotation is to identify and bracket multi-token premodifiers in NPs. Quirk et al. (1985, p. 1321) describe such premodifiers, which include adjectives, participles, nouns, genitives and adverbs. All of these items are themselves modifiable, and this is precisely the behaviour that we have annotated.


  • dark red car
    • current ⇒ (NP (JJ dark) (JJ red) (NN car) )
    • proposed ⇒ (NP (JJP (JJ dark) (JJ red) ) (NN car) )


  • the fastest developing trends
    • current ⇒ (NP (DT the) (JJS fastest) (VBG developing) (NNS trends))
    • proposed ⇒ (NP (DT the) (JJP (JJS fastest) (VBG developing) ) (NNS trends))


  • relatively higher rates
    • current ⇒ (NP (RB relatively) (JJR higher) (NNS rates))
    • proposed ⇒ (NP (JJP (RB relatively) (JJR higher) ) (NNS rates))
  • Coordinated Adjectival

When two or three adjectives of the same class are used, they are called "coordinated adjectives".

   (EX) Her [simple but expensive] long silk dress
   (EX) The [red, white and blue] French flag.

bright and incredibly sharp
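
The current ⇒ proposed rewrites shown earlier in this section can be sketched in code. The following is a minimal illustration, not the annotation tooling used by the guideline authors: the helper names `parse`, `to_string` and `bracket_premodifier` are hypothetical, and the child span to wrap is supplied by hand, since choosing it is the annotator's judgement. The printed form is normalised, so it omits the space before a closing parenthesis that some of the examples above contain.

```python
def parse(text):
    """Parse one Penn Treebank bracketed string into nested lists.

    A node is [label, child, ...]; a leaf child is a plain token string.
    Minimal sketch: malformed input is not diagnosed in detail.
    """
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read_node():
        nonlocal pos
        if tokens[pos] != "(":
            raise ValueError("expected '('")
        pos += 1
        node = [tokens[pos]]          # constituent or POS label
        pos += 1
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                node.append(read_node())
            else:
                node.append(tokens[pos])
                pos += 1
        pos += 1                      # consume ")"
        return node

    tree = read_node()
    if pos != len(tokens):
        raise ValueError("trailing material after the tree")
    return tree


def to_string(node):
    """Serialise a nested-list tree back to normalised bracket notation."""
    if isinstance(node, str):
        return node
    return "(" + " ".join(to_string(child) for child in node) + ")"


def bracket_premodifier(np, start, end, label="JJP"):
    """Wrap children start..end-1 of a flat NP under a new node.

    Indices count the NP's children from 0 (the NP label is excluded).
    The span is chosen by the annotator; this function only rewrites.
    """
    children = np[1:]
    if not 0 <= start < end <= len(children):
        raise ValueError("span outside the NP's children")
    wrapped = [label] + children[start:end]
    return [np[0]] + children[:start] + [wrapped] + children[end:]


flat = parse("(NP (DT the) (JJS fastest) (VBG developing) (NNS trends))")
print(to_string(bracket_premodifier(flat, 1, 3)))
```

Returning a new tree rather than editing in place keeps the flat (current) analysis available for comparison with the proposed one.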

References

  • Ann Bies, Mark Ferguson, Karen Katz, and Robert MacIntyre. (1995). Bracketing guidelines for Treebank II style: Penn Treebank project. Technical report.
  • Michael Collins. (1999). Head-Driven Statistical Models for Natural Language Parsing. Ph.D. thesis, University of Pennsylvania.
  • Randolph Quirk, Sidney Greenbaum, Geoffrey Leech, and Jan Svartvik. (1985). A Comprehensive Grammar of the English Language. Longman, London.
  • David Vadas and James R. Curran. (2007). Adding noun phrase structure to the Penn Treebank. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL-07). (to appear).
  • Colin Warner, Ann Bies, Christine Brisson, and Justin Mott. (2004). Addendum to the Penn Treebank II style bracketing guidelines: BioMedical Treebank annotation. Technical report.


2008 NounPhraseBracketingGuidelinesV1
  • Author: David Vadas
  • Title: Noun Phrase Bracketing Guidelines, Version 1.0
  • URL: http://www.cs.usyd.edu.au/~dvadas1/?download=guidelines1.0.pdf
  • Year: 2008