Text Segmentation Task

From GM-RKB
(Redirected from Text Chunking Task)
Jump to: navigation, search

A Text Segmentation Task is a sequence segmentation task that requires the text annotation of coherent text segments.



References

2011

  • (Wikipedia, 2011) ⇒ http://en.wikipedia.org/wiki/Text_segmentation
    • QUOTE: Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics. The term applies both to mental processes used by humans when reading text, and to artificial processes implemented in computers, which are the subject of natural language processing. The problem is non-trivial, because while some written languages have explicit word boundary markers, such as the word spaces of written English and the distinctive initial, medial and final letter shapes of Arabic, such signals are sometimes ambiguous and not present in all written languages.

      Compare speech segmentation, the process of dividing speech into linguistically meaningful portions.

2005

2000

1999

  • (Beeferman et al, 1999) ⇒ Doug Beeferman, Adam Berger, and John D. Lafferty. (1999). "Statistical Models for Text Segmentation." In: Machine Learning, 34(1–3).
    • QUOTE: This paper introduces a new statistical approach to automatically partitioning text into coherent segments. ... Assessment of our approach on quantitative and qualitative grounds demonstrates its effectiveness in two very different domains, Wall Street Journal news articles and television broadcast news story transcripts. Quantitative results on these domains are presented using a new probabilistically motivated error metric, which combines precision and recall in a natural and flexible way. This metric is used to make a quantitative assessment of the relative contributions of the different feature types, as well as a comparison with decision trees and previously proposed text segmentation algorithms.

1988


Personal tools
Namespaces

Variants
Views
Actions
Navigation
Toolbox