1996 AMaximumEntropyModelForPOS

Subject Headings: Part-of-Speech Tagging Algorithm, Maximum-Entropy Model.

Notes

This paper presents a statistical model which trains from a corpus annotated with Part-Of-Speech tags and assigns them to previously unseen text with state-of-the-art accuracy (96.6%). The model can be classified as a Maximum Entropy model and simultaneously uses many contextual "features" to predict the POS tag. Furthermore, this paper demonstrates the use of specialized features to model difficult tagging decisions, discusses the corpus consistency problems discovered during the implementation of these features, and proposes a training strategy that mitigates these problems.

…
(Darroch & Ratcliff, 1972) ⇒ John N. Darroch, and Douglas Ratcliff. (1972). “Generalized Iterative Scaling for Log-Linear Models.” In: The Annals of Mathematical Statistics, 43(5).
…,

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
1996 AMaximumEntropyModelForPOS	Adwait Ratnaparkhi			A Maximum Entropy Model for Part-of-Speech Tagging		Proceedings of the Conference on Empirical Methods in Natural Language Processing	http://acl.ldc.upenn.edu/W/W96/W96-0213.pdf			1996