Children's Book Test (CBT) Dataset

From GM-RKB
Jump to navigation Jump to search

A Children's Book Test (CBT) Dataset is a reading comprehension dataset that contains text data from books freely available through Project Gutenberg.



References

2016

2016 TheGoldilocksPrincipleReadingCh Fig1.png
Figure 1: A Named Entity question from the CBT (right), created from a book passage (left, in blue). In this case, the candidate answers $C$ are both entities and common nouns, since fewer than ten named entities are found in the context.