1998 ExploitingDiverseKnowSourcesViaMEinNER

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Supervised NER, Sequence Tagging, Maximum Entropy Models.

Notes

Cited by

2006

2000

Quotes

Abstract

This paper describes a novel statistical named-entity (i.e. "proper name") recognition system built around a maximum entity framework. By working within the framework of maximum entropy theory and utilizing a flexible object-based architecture, the system is able to make use of an extraordinarily diverse range of knowledge sources in making its tagging decisions. These knowledge sources include capitalization features, lexical features, features indicating the current section of text (i.e. headline or main body), and dictionaries of single or multi-word terms. The purely statistical system contains no hand-generated patterns and achieves a result comparable with the best statistical systems. However, when combined with other hand-coded systems, the system achieves scores that exceed the highest comparable scores thus-far published.,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
1998 ExploitingDiverseKnowSourcesViaMEinNERRalph Grishman
Eugene Agichtein
Andrew Borthwick
John Sterling
Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity RecognitionProceedings of the Sixth Workshop on Very Large Corporahttp://acl.ldc.upenn.edu/W/W98/W98-1118.pdf1998