We are interested in the problem of learning stochastic language models on-line (without speech transcriptions) for adaptive speech recognition and understanding. In this paper we propose an algorithm to adapt to variations in the language model distributions based on the speech input only and without its true transcription. The on-line probability estimate is defined as a function of the prior and word error distributions. We show the effectiveness of word-lattice based error probability distributions in terms of Receiver operating Characteristics (ROC) curves and word accuracy. We apply the new estimates Padapt (w) to the task of adapting on-line and initial large vocabulary trigram language model and show improvement in word accuracy with respect to the baseline speech recognizer
On-Line Learning of Language Models with Word Error Probability Distribution
Gretter, Roberto;
2001-01-01
Abstract
We are interested in the problem of learning stochastic language models on-line (without speech transcriptions) for adaptive speech recognition and understanding. In this paper we propose an algorithm to adapt to variations in the language model distributions based on the speech input only and without its true transcription. The on-line probability estimate is defined as a function of the prior and word error distributions. We show the effectiveness of word-lattice based error probability distributions in terms of Receiver operating Characteristics (ROC) curves and word accuracy. We apply the new estimates Padapt (w) to the task of adapting on-line and initial large vocabulary trigram language model and show improvement in word accuracy with respect to the baseline speech recognizerI documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.