AQUAINT Corpus


The AQUAINT Corpus is a Corpus of News Articles.



References

2002

  • http://www.ldc.upenn.edu/Catalog/docs/LDC2002T31/
    • This file contains documentation on the AQUAINT Corpus, Linguistic Data Consortium (LDC) catalog number LDC2002T31 and ISBN 1-58563-240-6.
    • This corpus consists of newswire text data in English, drawn from three sources: the Xinhua News Service (People's Republic of China), the New York Times News Service, and the Associated Press Worldstream News Service. It was prepared by the LDC for the AQUAINT Project, and will be used in official benchmark evaluations conducted by the National Institute of Standards and Technology (NIST).