657,294
edits
m (Text replacement - ", sentence splitter" to ", sentence splitter") |
m (Text replacement - " tokeniser," to " tokeniser,") |
||
Line 22: | Line 22: | ||
* ([[2002_GATEaFramework|Cunningham & al, 2002]]) ⇒ Hamish Cunningham, [[Diana Maynard]], Kalina Bontcheva, and [[Valentin Tablan]]. (2001). "[http://gate.ac.uk/sale/acl02/acl-main.pdf GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications]." In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL 2002). | * ([[2002_GATEaFramework|Cunningham & al, 2002]]) ⇒ Hamish Cunningham, [[Diana Maynard]], Kalina Bontcheva, and [[Valentin Tablan]]. (2001). "[http://gate.ac.uk/sale/acl02/acl-main.pdf GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications]." In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL 2002). | ||
** QUOTE: Provided with GATE is a set of reusable processing resources for common NLP tasks. (None of them are definitive, and the user can replace and/or extend them as necessary.) These are packaged together to form '''ANNIE''', A Nearly-New IE system, but can also be used individually or coupled together with new modules in order to create new applications. For example, many other NLP tasks might require a sentence splitter and POS tagger, but would not necessarily require resources more specific to IE tasks such as a named entity transducer. The system is in use for a variety of IE and other tasks, sometimes in combination with other sets of application-specific modules. | ** QUOTE: Provided with GATE is a set of reusable processing resources for common NLP tasks. (None of them are definitive, and the user can replace and/or extend them as necessary.) These are packaged together to form '''ANNIE''', A Nearly-New IE system, but can also be used individually or coupled together with new modules in order to create new applications. For example, many other NLP tasks might require a sentence splitter and POS tagger, but would not necessarily require resources more specific to IE tasks such as a named entity transducer. The system is in use for a variety of IE and other tasks, sometimes in combination with other sets of application-specific modules. | ||
* '''ANNIE consists of''' the following main processing resources: tokeniser, [[sentence splitter]], [[POS tagger]], [[gazetteer]], finite state transducer (based on GATE’s built-in regular expressions over annotations language (Cunningham et al., 2002)), orthomatcher and coreference resolver. The resources communicate via GATE’s annotation API, which is a directed graph of arcs bearing arbitrary feature/value data, and nodes rooting this data into document content (in this case text). | * '''ANNIE consists of''' the following main processing resources: [[tokeniser]], [[sentence splitter]], [[POS tagger]], [[gazetteer]], finite state transducer (based on GATE’s built-in regular expressions over annotations language (Cunningham et al., 2002)), orthomatcher and coreference resolver. The resources communicate via GATE’s annotation API, which is a directed graph of arcs bearing arbitrary feature/value data, and nodes rooting this data into document content (in this case text). | ||
---- | ---- |