Sennrich-Haddow-Birch Rare Words Neural Machine Translation Task

A Sennrich-Haddow-Birch Rare Words Neural Machine Translation Task is a Neural Machine Translation Task that translates rare and out-of-vocabulary (OOV) words by representing them as sequences of subword units produced by a BPE-based Word Segmentation.
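The BPE-based segmentation mentioned above can be illustrated with a minimal sketch of byte pair encoding merge learning. The toy vocabulary, the `</w>` end-of-word marker, and the function names (`get_pair_counts`, `merge_pair`, `learn_bpe`) are illustrative assumptions, not the configuration used for the results reported on this page.

```python
import re
from collections import Counter


def get_pair_counts(vocab):
    """Count adjacent symbol pairs, weighted by word frequency."""
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for left, right in zip(symbols, symbols[1:]):
            pairs[(left, right)] += freq
    return pairs


def merge_pair(pair, vocab):
    """Replace every occurrence of the symbol pair with its concatenation."""
    # Lookarounds ensure we only match whole, space-delimited symbols.
    pattern = re.compile(r"(?<!\S)" + re.escape(" ".join(pair)) + r"(?!\S)")
    merged = "".join(pair)
    return {pattern.sub(merged, word): freq for word, freq in vocab.items()}


def learn_bpe(vocab, num_merges):
    """Greedily merge the most frequent adjacent pair, num_merges times."""
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(vocab)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # ties: first pair encountered
        vocab = merge_pair(best, vocab)
        merges.append(best)
    return merges, vocab


# Hypothetical toy corpus: words as space-separated characters plus </w>.
toy_vocab = {
    "l o w </w>": 5,
    "l o w e r </w>": 2,
    "n e w e s t </w>": 6,
    "w i d e s t </w>": 3,
}
merges, segmented = learn_bpe(toy_vocab, 3)
print(merges)
print(segmented)
```

With only three merges the frequent suffix "est" becomes a single unit, while infrequent words such as "lower" stay split into characters. This is how a rare or unseen word can still be encoded with a fixed-size symbol vocabulary.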

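English→German translation results:

| name | segmentation | shortlist | source vocabulary | target vocabulary | BLEU (single) | BLEU (ens-8) | CHRF3 (single) | CHRF3 (ens-8) | unigram F1 % (all) | unigram F1 % (rare) | unigram F1 % (OOV) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| syntax-based (Sennrich and Haddow, 2015) | | | | | 24.4 | - | 55.3 | - | 59.1 | 46.0 | 37.7 |
| WUnk | - | - | 300,000 | 500,000 | 20.6 | 22.8 | 47.2 | 48.9 | 56.7 | 20.4 | 0.0 |
| WDict | - | - | 300,000 | 500,000 | 22.0 | 24.2 | 50.5 | 52.4 | 58.1 | 36.8 | 36.8 |
| C2-50k | char-bigram | 50,000 | 60,000 | 60,000 | 22.8 | 25.3 | 51.9 | 53.5 | 58.4 | 40.5 | 30.9 |
| BPE-60k | BPE | - | 60,000 | 60,000 | 21.5 | 24.5 | 52.0 | 53.9 | 58.4 | 40.9 | 29.3 |
| BPE-J90k | BPE (joint) | - | 90,000 | 90,000 | 22.8 | 24.7 | 51.7 | 54.1 | 58.5 | 41.8 | 33.6 |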
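
English→Russian translation results:

| name | segmentation | shortlist | source vocabulary | target vocabulary | BLEU (single) | BLEU (ens-8) | CHRF3 (single) | CHRF3 (ens-8) | unigram F1 % (all) | unigram F1 % (rare) | unigram F1 % (OOV) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| phrase-based (Haddow et al., 2015) | | | | | 24.3 | - | 53.8 | - | 56.0 | 31.3 | 16.5 |
| WUnk | - | - | 300,000 | 500,000 | 18.8 | 22.4 | 46.5 | 49.9 | 54.2 | 25.2 | 0.0 |
| WDict | - | - | 300,000 | 500,000 | 19.1 | 22.8 | 47.5 | 51.0 | 54.8 | 26.5 | 6.6 |
| C2-50k | char-bigram | 50,000 | 60,000 | 60,000 | 20.9 | 24.1 | 49.0 | 51.6 | 55.2 | 27.8 | 17.4 |
| BPE-60k | BPE | - | 60,000 | 60,000 | 20.5 | 23.6 | 49.8 | 52.7 | 55.3 | 29.7 | 15.6 |
| BPE-J90k | BPE (joint) | - | 90,000 | 100,000 | 20.4 | 24.1 | 49.7 | 53.0 | 55.8 | 29.7 | 18.3 |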
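
The unigram F1 columns can be read with the help of a small sketch. It assumes unigram F1 is the harmonic mean of clipped unigram precision and recall; the `keep` filter used to restrict scoring to a word category (such as OOV words) is a hypothetical simplification, and the example sentences and training vocabulary are invented for illustration.

```python
from collections import Counter


def unigram_f1(hypothesis, reference, keep=None):
    """Harmonic mean of clipped unigram precision and recall.

    `keep` is an optional predicate (an assumption of this sketch) that
    restricts scoring to one word category, e.g. rare or OOV words.
    """
    if keep is not None:
        hypothesis = [tok for tok in hypothesis if keep(tok)]
        reference = [tok for tok in reference if keep(tok)]
    hyp_counts = Counter(hypothesis)
    ref_counts = Counter(reference)
    # Counter intersection clips each word's count to the smaller side.
    overlap = sum((hyp_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


# Invented example sentences, tokenized on whitespace.
hyp = "the cat sat on the mat".split()
ref = "the cat is on the mat".split()
score_all = unigram_f1(hyp, ref)

# Hypothetical training vocabulary: words outside it count as OOV.
train_vocab = {"the", "cat", "is", "on", "sat"}
score_oov = unigram_f1(hyp, ref, keep=lambda tok: tok not in train_vocab)
print(score_all, score_oov)
```

Under this reading, a system that emits an unknown-word symbol for every OOV word can never match an OOV reference word, which is consistent with the 0.0 OOV score of the WUnk rows.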


References

2016

2015a

2015b

1994