- (Brill, 1995) ⇒ Eric D. Brill. (1995). “Transformation-based Error-Driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging.” In: Computational Linguistics, 21(4).
Subject Headings: Part-of-Speech Tagging Algorithm, Transformation-based Learning Algorithm.
~1288 papers http://scholar.google.com/scholar?cites=6669039697562179777
- (Sha & Pereira, 2003a) ⇒ Fei Sha, and Fernando Pereira. (2003). “Shallow Parsing with Conditional Random Fields.” In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (HLT-NAACL 2003). doi:10.3115/1073445.1073473
- (Collins, 2003) ⇒ Michael Collins. (2003). “Head-Driven Statistical Models for Natural Language Parsing.” In: Computational Linguistics, 29(4). doi:10.1162/089120103322753356.
- (Lafferty et al., 2001) ⇒ John D. Lafferty, Andrew McCallum, and Fernando Pereira. (2001). “Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data.” In: Proceedings of ICML 2001.
- Andrew McCallum, Dayne Freitag, Fernando Pereira. (2000). “Maximum entropy Markov models for information extraction and segmentation.” In: Proceedings of the 17th International Conference on Machine Learning (ICML 2000).
- Recently, there has been a rebirth of empiricism in the field of natural language processing. Manual encoding of linguistic information is being challenged by automated corpus-based learning as a method of providing a natural language processing system with linguistic knowledge. Although corpus-based approaches have been successful in many different areas of natural language processing, it is often the case that these methods capture the linguistic information they are modelling indirectly in large opaque tables of statistics. This can make it difficult to analyze, understand and improve the ability of these approaches to model underlying linguistic behavior. In this paper, we will describe a simple rule-based approach to automated learning of linguistic knowledge. This approach has been shown for a number of tasks to capture information in a clearer and more direct fashion without a compromise in performance. We present a detailed case study of this learning method applied to part-of-speech tagging.
- Ezra Black, Fred Jelinek, John D. Lafferty, David M. Magerman, Robert Mercer, Salim Roukos, Towards history-based grammars: using richer models for probabilistic parsing, Proceedings of the 31st annual meeting on Association for Computational Linguistics, p.31-37, June 22-26, 1993, Columbus, Ohio doi:10.3115/981574.981579
- Ezra Black, Fred Jelinek, John D. Lafferty, Robert Mercer, Salim Roukos, Decision tree models applied to the labeling of text with parts-of-speech, Proceedings of the workshop on Speech and Natural Language, February 23-26, 1992, Harriman, New York doi:10.3115/1075527.1075554
- Leo Breiman; Jerome H. Friedman; Olshen, Richard; and Stone, Charles (1984). Classification and regression trees. Wadsworth and Brooks.
- Eric D. Brill, A simple rule-based part of speech tagger, Proceedings of the third Conference on Applied Natural Language Processing, March 31-April 03, 1992, Trento, Italy doi:10.3115/974499.974526
- Eric D. Brill, Automatic grammar induction and parsing free text: a transformation-based approach, Proceedings of the 31st annual meeting on Association for Computational Linguistics, p.259-265, June 22-26, 1993, Columbus, Ohio doi:10.3115/981574.981609
- Eric D. Brill, A corpus-based approach to language learning, University of Pennsylvania, Philadelphia, PA, 1993
- Brill, Eric (1993c). “Transformation-based error-driven parsing.” In: Proceedings, Third International Workshop on Parsing Technologies, Tilburg, The Netherlands.
- Eric D. Brill, Some advances in transformation-based part of speech tagging, Proceedings of the twelfth national conference on Artificial intelligence (vol. 1), p.722-727, October 1994, Seattle, Washington, United States
- Eric D. Brill, Philip Resnik, A rule-based approach to prepositional phrase attachment disambiguation, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan doi:10.3115/991250.991346
- Peter F. Brown, John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Fredrick Jelinek, John D. Lafferty, Robert L. Mercer, Paul S. Roossin, A statistical approach to machine translation, Computational Linguistics, v.16 n.2, p.79-85, June 1990
- Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, Robert L. Mercer, Word-sense disambiguation using statistical methods, Proceedings of the 29th annual meeting on Association for Computational Linguistics, p.264-270, June 18-21, 1991, Berkeley, California doi:10.3115/981344.981378
- Rebecca Bruce, Janyce M. Wiebe, Word-sense disambiguation using decomposable models, Proceedings of the 32nd annual meeting on Association for Computational Linguistics, p.139-146, June 27-30, 1994, Las Cruces, New Mexico doi:10.3115/981732.981752
- Eugene Charniak; Hendrickson, Curtis; Jacobson, Neil; and Perkowitz, Michael (1993). “Equations for part of speech tagging.” In: Proceedings, Conference of the American Association for Artificial Intelligence (AAAI-93), Washington, DC.
- Kenneth W. Church, A stochastic parts program and noun phrase parser for unrestricted text, Proceedings of the second Conference on Applied Natural Language Processing, February 09-12, 1988, Austin, Texas doi:10.3115/974235.974260
- Doug Cutting, Julian Kupiec, Jan Pedersen, Penelope Sibun, A practical part-of-speech tagger, Proceedings of the third Conference on Applied Natural Language Processing, March 31-April 03, 1992, Trento, Italy doi:10.3115/974499.974523
- Carl G. de Marcken, Parsing the LOB corpus, Proceedings of the 28th annual meeting on Association for Computational Linguistics, p.243-251, June 06-09, 1990, Pittsburgh, Pennsylvania doi:10.3115/981823.981854
- Steven J. DeRose, Grammatical category disambiguation by statistical optimization, Computational Linguistics, v.14 n.1, p.31-39, Winter 1988
- Francis, Winthrop Nelson and Kucera, Henry (1982). Frequency analysis of English usage: Lexicon and grammar. Houghton Mifflin, Boston.
- Fujisaki, Tetsu; Jelinek, Fred; Cocke, John; and Black, Ezra (1989). “Probabilistic parsing method for sentence disambiguation.” In: Proceedings, International Workshop on Parsing Technologies, Carnegie Mellon University, Pittsburgh, PA.
- William A. Gale, Kenneth W. Church, A program for aligning sentences in bilingual corpora, Proceedings of the 29th annual meeting on Association for Computational Linguistics, p.177-184, June 18-21, 1991, Berkeley, California doi:10.3115/981344.981367
- Gale, William; Kenneth W. Church, and David Yarowsky (1992). “A method for disambiguating word senses in a large corpus.” Computers and the Humanities.
- Geoffrey Leech, Roger Garside, Michael Bryant, CLAWS4: the tagging of the British National Corpus, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan doi:10.3115/991886.991996
- Harris, Zellig (1962). String Analysis of Language Structure. Mouton and Co., The Hague.
- Donald Hindle, Acquiring disambiguation rules from text, Proceedings of the 27th annual meeting on Association for Computational Linguistics, p.118-125, June 26-29, 1989, Vancouver, British Columbia, Canada doi:10.3115/981623.981638
- Donald Hindle, Mats Rooth, Structural ambiguity and lexical relations, Computational Linguistics, v.19 n.1, March 1993
- Huang, Caroline; Son-Bell, Mark; and Baggett, David (1994). “Generation of pronunciations from orthographies using transformation-based error-driven learning.” In: Proceedings of the International Conference on Speech and Language Processing (ICSLP), Yokohama, Japan.
- F. Jelinek, Self-organized language modeling for speech recognition, Readings in speech recognition, Morgan Kaufmann Publishers Inc., San Francisco, CA, 1990
- Aravind K. Joshi, B. Srinivas, Disambiguation of super parts of speech (or supertags): almost parsing, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan doi:10.3115/991886.991912
- Sheldon Klein, Robert F. Simmons, A Computational Approach to Grammatical Coding of English Words, Journal of the ACM (JACM), v.10 n.3, p.334-347, July 1963 doi:10.1145/321172.321180
- Kupiec, Julian (1992). “Robust part-of-speech tagging using a hidden Markov model.” Computer Speech and Language, 6.
- Mitchell P. Marcus, Mary Ann Marcinkiewicz, Beatrice Santorini, Building a large annotated corpus of English: the Penn Treebank, Computational Linguistics, v.19 n.2, June 1993
- Bernard Merialdo, Tagging English text with a probabilistic model, Computational Linguistics, v.20 n.2, p.155-171, June 1994
- Miller, George (1990). “WordNet: an on-line lexical database.” International Journal of Lexicography. 3(4).
- J. R. Quinlan, Induction of Decision Trees, Machine Learning, v.1 n.1, p.81-106 doi:10.1023/A:1022643204877
- J. R. Quinlan, R. L. Rivest, Inferring decision trees using the minimum description length principle, Information and Computation, v.80 n.3, p.227-248, Mar. 1989 doi:10.1016/0890-5401(89)90010-2
- Ramshaw, Lance and Marcus, Mitchell (1994). “Exploring the statistical derivation of transformational rule sequences for part-of-speech tagging.” In: The Balancing Act: Proceedings of the ACL Workshop on Combining Symbolic and Statistical Approaches to Language, New Mexico State University, July.
- Emmanuel Roche, Yves Schabes, Deterministic part-of-speech tagging with finite-state transducers, Computational Linguistics, v.21 n.2, p.227-253, June 1995
- Hinrich Schütze, Yoram Singer, Part-of-speech tagging using a Variable Memory Markov model, Proceedings of the 32nd annual meeting on Association for Computational Linguistics, p.181-187, June 27-30, 1994, Las Cruces, New Mexico doi:10.3115/981732.981757
- R. A. Sharman, F. Jelinek, R. Mercer, Generating a grammar for statistical training, Proceedings of the workshop on Speech and Natural Language, p.267-274, June 24-27, 1990, Hidden Valley, Pennsylvania doi:10.3115/116580.116667
- Ralph Weischedel, Richard Schwartz, Jeff Palmucci, Marie Meteer, Lance Ramshaw, Coping with ambiguity and unknown words through probabilistic models, Computational Linguistics, v.19 n.2, June 1993
- David Yarowsky, Word-sense disambiguation using statistical models of Roget's categories trained on large corpora, Proceedings of the 14th conference on Computational linguistics, August 23-28, 1992, Nantes, France doi:10.3115/992133.992140
|1995 TrasformationBasedErrorDrivenPOSTagging||Eric D. Brill||Transformation-based Error-Driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging||Computational Linguistics (CL) Research Area||http://www.cs.mu.oz.au/acl/J/J95/J95-4004.pdf||1995|