2002 TheGENIAcorpus


Subject Headings: GENIA Corpus.

Notes

Cited By

Quotes

Abstract

With the information overload in genome-related field, there is an increasing need for natural language processing technology to extract information from literature and various attempts of information extraction using NLP has been being made. We are developing the necessary resources including domain ontology and annotated corpus from research abstracts in MEDLINE database (GENIA corpus). We are building the ontology and the corpus simultaneously, using each other. In this paper we report on our new corpus, its ontological basis, annotation scheme, and statistics of annotated objects. We also describe the tools used for corpus annotation and management.


Page: 2002 TheGENIAcorpus
Author: Tomoko Ohta, Jin-Dong Kim, Yuka Tateisi
Title: The GENIA corpus: an annotated research abstract corpus in molecular biology domain
Title URL: http://www-tsujii.is.s.u-tokyo.ac.jp/~genia/paper/hlt2002GENIA.pdf