2003 TermExtractionAndAutomaticIndexing

Subject Headings: Computational Terminology, Term Extraction Task, Term Indexing Task, TERMINO System, LEXTER System, CLARIT System, FASTR System.

Notes

Terms are pervasive in scientific and technical documents; their identification is a crucial issue for any application dealing with the analysis, understanding, generation, or translation of such documents. In particular, the ever-growing mass of specialized documentation available on-line, in industrial and governmental archives or in digital libraries, calls for advances in terminology processing for such purposes as information retrieval, cross-language querying, indexing of multimedia documents, translation aids, document routing and summarization, etc.
This chapter introduces the basic linguistic characteristics of terms. It presents the main methods in NLP for recognizing or discovering terms and their interrelationships in large corpora.

In a definition of term that is better suited to corpus-based terminology, a term must be stated as the output of a procedure of terminological analysis. A single word, such as "cell", or a multi-word unit, such as "blood cell", is a term because it has been decided that it would be so. The decision process can involve a community of researchers or practitioners, a normalization institution, or even a single engineer or terminologist in charge of building a terminological resource for a specific purpose.


	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2003 TermExtractionAndAutomaticIndexing	Didier Bourigault, Christian Jacquemin			Term Extraction and Automatic Indexing			http://books.google.com/books?id=OaClhre-vW4C&oi=fnd&pg=PA599			2003