- (Jacquemin & Bourigault, 2003) ⇒ Christian Jacquemin, Didier Bourigault. (2003). “Term Extraction and Automatic Indexing.” In: (Mitkov, 2003) ⇒ Ruslan Mitkov, editor. (2003). “The Oxford Handbook of Computational Linguistics.” In: Oxford University Press.
- An alternative document location: http://perso.limsi.fr/jacquemi/FTP/JacBourHandbookCL.pdf
- Terms are pervasive in scientific and technical documents; their identification is a crucial issue for any application dealing with the analysis, understanding, generation, or translation of such documents. In particular, the ever-growing mass of specialized documentation available on-line, in industrial and governmental archives or in digital libraries, calls for advances in terminology processing for such purposes as information retrieval, cross-language querying, indexing of multimedia documents, translation aids, document routing and summarization, etc.
- This chapter introduces the basic linguistic characteristics of terms. It presents the main methods in NLP for recognizing or discovering terms and their interrelationships in large corpora.
- In a definition of term that is better suited to corpus-based terminology, a term must be stated as the output of a procedure of terminological analysis. A single word, such as cell, or a multi-word unit, such as blood cell is a term because it has been decided that it would be so. The decision process can involve a community of researchers or practitioners, a normalization institution, or even a single engineer or terminologist in charge of building a terminological resource for a specific purpose.
|2003 TermExtractionAndAutomaticIndexing||Christian Jacquemin|
|Term Extraction and Automatic Indexing||http://books.google.com/books?id=OaClhre-vW4C&oi=fnd&pg=PA599||2003|