ACE-2003

From GM-RKB
Jump to navigation Jump to search

See: ACE Benchmark Task, ACE Program, ACE-2004.



References


  • (Zhang et al., 2006)
    • "In the ACE 2003 data, the training set consists of 674 documents and 9683 relation instances while the test set consists of 97 documents and 1386 relation instances. The ACE 2003 data defines 5 entity types, 5 major relation types and 24 relation subtypes.


  • (Kambhatla, 2004) ⇒ Nanda Kambhatla. (2004). Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations. Poster. In: Proceedings of [[ACL 2004]
    • "Automatic Content Extraction (ACE, 2004) is an evaluation conducted by NIST to measure Entity Detection and Tracking (EDT) and relation detection and characterization (RDC). The EDT task entails the detection of mentions of entities and chaining them together by identifying their coreference. In ACE vocabulary, entities are objects, mentions are references to them, and relations are explicitly or implicitly stated relationships among entities. Entities can be of five types: persons, organizations, locations, facilities, and geo-political entities (geographically defined regions that define a political boundary, e.g. countries, cities, etc.). Mentions have levels: they can be names, nominal expressions or pronouns.
    • "The RDC task detects implicit and explicit relations between entities identified by the EDT task. Explict relations occur in text with explicit evidence suggesting the relationship. Implicit relations need not have explicit supporting evidence in text, though they should be evident from a reading of the document.
    • "Here is an example:
      • "The American Medical Association voted yesterday to install the heir apparent as its president-elect, rejecting a strong, upstart challenge by a District doctor who argued that the nation’s largest physiciansgroup needs stronger ethics and new leadership.
      • "In electing Thomas R. Reardon, an Oregon general practitioner who had been the chairman of its board, ...
    • "In this fragment, all the underlined phrases are mentions referring to the American Medical Association, or to Thomas R. Reardon or the board (an organization) of the American Medical Association. Moreover, there is an explicit management relation between chairman and board, which are references to Thomas R. Reardon and the board of the American Medical Association respectively. Relation extraction is hard, since successful extraction implies correctly detecting both the argument mentions, correctly chaining these mentions to their respective entities, and correctly determining the type of relation that holds between them.


Reported Results

PaperMethod5Mt-P5Mt-R5Mt-F24St-P24St-R24St-F
ZZJZ07Composite kernel (cntxt. sens.) 80.8 68.4 74.1 65.254.959.6
ZZJZ07Conv. tree kernel (cntxt.sense.) 80.1 63.8 71.0 63.451.957.1
ZZSZ06Composite Kernel 2 (poly exp) 77.365.6 70.9 64.951.257.2
ZZSZ06Composite Kernel 2 (linear comb) 76.363.069.0
ZZS06ConvTreeKernel(PT+EI+Sem feat.) 76.363.069.064.650.7656.83
ZZS06ConvTreeKernel(PT+Entity information(EI)) 76.162.968.9
ZSZZ05Feature-based SVM 77.260.768.063.149.555.5
HBM05 contig+bag-o-words kernel plus all features 72.244.555.1
ZZS06ConvTreeKernel only (Parse tree) 72.853.861.9
N04Feature-based MaxEnt 63.545.252.8
ZZSZ06Entity kernel only (Parse tree) 79.534.648.2
CS04Tree kernel 67.135.045.8
HBM05 contig+bag-o-words kernel non SRL features 60.520.330.4