Large Vocabulary Continuous Speech Recognition (LVCSR) System

From GM-RKB
Jump to: navigation, search

A Large Vocabulary Continuous Speech Recognition (LVCSR) System is an Automatic Speech Recognition (ASR) System in which the audio file is split into phonemes that matched with words and phrases in a dictionary to produce a full text transcript.



References

2022

2021

  • (Wikipedia, 2021) ⇒ https://en.wikipedia.org/wiki/Audio_mining#Large_Vocabulary_Continuous_Speech_Recognizers Retrieved:2021-6-20.
    • In text-based indexing or large vocabulary continuous speech recognition (LVCSR), the audio file is first broken down into recognizable phonemes. It is then run through a dictionary that can contain several hundred thousand entries and matched with words and phrases to produce a full text transcript. A user can then simply search a desired word term and the relevant portion of the audio content will be returned.

      If the text or word could not be found in the dictionary, the system will choose the next most similar entry it can find. The system uses a language understanding model to create a confidence level for its matches. If the confidence level be below 100 percent, the system will provide options of all the found matches.

2011