1998 ExploitingDiverseKnowSourcesViaMEinNER

Jump to: navigation, search

Subject Headings: Supervised NER, Sequence Tagging, Maximum Entropy Models.


Cited by





This paper describes a novel statistical named-entity (i.e. "proper name") recognition system built around a maximum entity framework. By working within the framework of maximum entropy theory and utilizing a flexible object-based architecture, the system is able to make use of an extraordinarily diverse range of knowledge sources in making its tagging decisions. These knowledge sources include capitalization features, lexical features, features indicating the current section of text (i.e. headline or main body), and dictionaries of single or multi-word terms. The purely statistical system contains no hand-generated patterns and achieves a result comparable with the best statistical systems. However, when combined with other hand-coded systems, the system achieves scores that exceed the highest comparable scores thus-far published.,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
1998 ExploitingDiverseKnowSourcesViaMEinNERAndrew Borthwick
John Sterling
Eugene Agichtein
Ralph Grishman
Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity RecognitionProceedings of the Sixth Workshop on Very Large Corporahttp://acl.ldc.upenn.edu/W/W98/W98-1118.pdf1998