GATE System


See: ANNIE System, Text Mining System, UIMA Architecture, General Architecture for Text Engineering.




    GATE is:
    • the Eclipse of Natural Language Engineering, the Lucene of Information Extraction, a leading toolkit for Text Mining
    • used worldwide by thousands of scientists, companies, teachers and students
    • composed of an architecture, a free open-source framework (or SDK), and a graphical development environment
    • used for all sorts of language processing tasks, including Information Extraction in many languages
    • funded by the EPSRC, BBSRC, AHRC, the EU and commercial users
    • 100% Java reference implementation of ISO TC37/SC4 and used with XCES in the ANC
    • 10 years old in 2005, used in many research projects and compatible with IBM's UIMA
    • based on MVC, mobile code, continuous integration, and test-driven development, with code hosted on SourceForge
    • Which paper to cite if I used GATE/ANNIE:
    • Which paper to cite if I used the ontology features in GATE:
      • K. Bontcheva, V. Tablan, D. Maynard, H. Cunningham. Evolving GATE to Meet New Challenges in Language Engineering. Natural Language Engineering. 10 (3/4), pp. 349-373. (2004).
    • Which paper to cite on GATE and multimedia:
      • M. Dowman, V. Tablan, H. Cunningham, and B. Popov. Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News. 14th International World Wide Web Conference. Chiba, Japan. (2005).
    • Which paper to cite if I used the machine learning facilities in GATE:
      • Y. Li, K. Bontcheva, and H. Cunningham. SVM Based Learning System For Information Extraction. J. Winkler, M. Niranjan, and N. Lawrence (Eds.): Deterministic and Statistical Methods in Machine Learning, LNAI 3635, Springer Verlag, pp. 319-339. (2005).