- (Borthwick et al., 1998) ⇒ Andrew Borthwick, John Sterling, Eugene Agichtein, Ralph Grishman. (1998). “Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition.” In: Proceedings of the Sixth Workshop on Very Large Corpora.
- It is one of the Seminal Papers on Supervised Sequence Segmentation
- It used a Sequence Tagger.
- It uses the Maximum Entropy Modeling Toolkit: Eric Sven Ristad. (1998). “Maximum Entropy Modeling Toolkit, release 1.6 beta.” http://www.mnemonic.com/software/memt
- It can be contrasted to a Maximum Entropy Markov Model (McCallum et al., 2000a), which, unlike MENE, conditions each tagging decision on the previous label.
- ~158 http://scholar.google.com/scholar?hl=en&q=%22Exploiting+Diverse+Knowledge+Sources+via+Maximum+Entropy+in+Named+Entity+Recognition%22+1998
- (Sarawagi, 2006) ⇒ Sunita Sarawagi. (2006). “Efficient Inference on Sequence Segmentation Models.” In: Proceedings of the 23rd International Conference on Machine Learning (ICML 2006). doi:10.1145/1143844.1143944
- (McCallum et al., 2000a) ⇒ Andrew McCallum, Dayne Freitag, and Fernando Pereira. (2000). “Maximum Entropy Markov Models for Information Extraction and Segmentation.” In: Proceedings of ICML-2000.
- However, we know of no previous general method that combines the rich state representation of Markov models with the flexible feature combination of exponential models. The MENE named-entity recognizer (Borthwick, Sterling, Agichtein, & Grishman, 1998) uses an exponential model to label each word with a label indicating the position of the word in a labeled-entity class (start, inside, end or singleton), but the conditioning information does not include the previous label, unlike our model. Therefore, it is closer to our ME-Stateless model. It is possible that its inferior performance compared to an HMM-based named-entity recognizer (Bikel et al., 1999) may have similar causes to the corresponding weakness of ME-Stateless relative to FeatureHMM in our experiments — the lack of representation of sequential dependencies.
This paper describes a novel statistical named-entity (i.e. "proper name") recognition system built around a maximum entropy framework. By working within the framework of maximum entropy theory and utilizing a flexible object-based architecture, the system is able to make use of an extraordinarily diverse range of knowledge sources in making its tagging decisions. These knowledge sources include capitalization features, lexical features, features indicating the current section of text (i.e. headline or main body), and dictionaries of single or multi-word terms. The purely statistical system contains no hand-generated patterns and achieves a result comparable with the best statistical systems. However, when combined with other hand-coded systems, the system achieves scores that exceed the highest comparable scores thus far published.
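The abstract above describes per-token maximum entropy classification over diverse feature types. A minimal sketch of that idea (not the authors' MENE code): a multinomial logistic regression — equivalent to a maximum-entropy classifier — combining capitalization, lexical, position, and dictionary features per token. The toy corpus, feature names, and `PERSON_DICT` below are illustrative assumptions.

```python
# Maximum-entropy token tagger sketch combining diverse knowledge sources,
# in the spirit of MENE (Borthwick et al., 1998). Illustrative only.
from sklearn.feature_extraction import DictVectorizer
from sklearn.linear_model import LogisticRegression

PERSON_DICT = {"smith", "jones"}  # toy single-word dictionary knowledge source

def token_features(tokens, i):
    tok = tokens[i]
    return {
        "word=" + tok.lower(): 1,                     # lexical feature
        "is_cap": int(tok[0].isupper()),              # capitalization feature
        "in_person_dict": int(tok.lower() in PERSON_DICT),
        "first_in_sentence": int(i == 0),             # position feature
    }

# Toy training data: each token carries an entity tag.
sentences = [
    (["John", "Smith", "visited", "Paris", "."],
     ["B-PER", "I-PER", "O", "B-LOC", "O"]),
    (["Mary", "Jones", "lives", "in", "London", "."],
     ["B-PER", "I-PER", "O", "O", "B-LOC", "O"]),
]

X, y = [], []
for toks, tags in sentences:
    for i in range(len(toks)):
        X.append(token_features(toks, i))
        y.append(tags[i])

vec = DictVectorizer()
# Multinomial logistic regression == maximum-entropy classification;
# each token is tagged independently (no previous-label conditioning,
# matching the "stateless" character noted in the related-work quote).
clf = LogisticRegression(max_iter=1000)
clf.fit(vec.fit_transform(X), y)

test_tokens = ["Alice", "Smith", "went", "home"]
pred = [clf.predict(vec.transform([token_features(test_tokens, i)]))[0]
        for i in range(len(test_tokens))]
print(pred)
```

Because each decision ignores the previous label, this corresponds to the ME-Stateless setup discussed in the McCallum et al. excerpt below, not to an MEMM.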
- Online PDF: http://acl.ldc.upenn.edu/W/W98/W98-1118.pdf