2003 BenchmarkTestforSpeechRecogniti

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Japanese ASR.

Notes

Cited By

Quotes

Abstract

We present benchmark results of automatic speech recognition using the Corpus of Spontaneous Japanese (CSJ), which has been developed in the five-year national project and will be the largest spontaneous speech databases. New test-sets are designed for both academic presentation speech and extemporaneous public speech, which are the two major categories in the corpus. The test-sets are selected to cover the variation of acoustic and linguistic factors in spontaneous speech: word perplexity, degree of disfluency, and the speaking rate. Baseline acoustic and language models are set up using an almost complete set (500 hours and 6.67M words) of the CSJ. Statistical modeling of pronunciation variation is also incorporated into the language model based on the alignment of large-scale transcriptions. The benchmark results verified the effects of the factors considered in the test-set design.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2003 BenchmarkTestforSpeechRecognitiTatsuya Kawahara
Hiroaki Nanjo
Takahiro Shinozaki
Sadaoki Furui
Benchmark Test for Speech Recognition Using the Corpus of Spontaneous Japanese2003