Automatic Speech Recognition (ASR) is a challenging classification task over sequences of acoustic features, extracted from the vocal signal, according to some linguistic models. Hidden Markov Models (HMM) are the most common and successful tool for ASR, allowing for high recognition performance in a variety of difficult tasks (speaker independent, large vocabulary, continuous speech). In spite of that, standard HMMs suffer from relevant limitations. Artificial neural networks (ANN) have been proposed as an alternative, nonparametric paradigm for ASR since the late Eighties. ANNs were successfully applied in reduced size tasks (e.g., phoneme or isolated word recognition) but they did not succeed in solving the general ASR problem, due to their lack of capability to model long-term dependencies. Combining ANN and HMM within a unifying framework is a suitable approach to ASR, which takes benefit from both techniques and overcomes the corresponding limitations. This paper reviews the main topics concerning hybrid HMM/ANN systems for ASR, summarizing some major architectures of this kind

Combining Neural Networks and Hidden Markov Models for Speech Recognition

Trentin, Edmondo;Gori, Marco
1999

Abstract

Automatic Speech Recognition (ASR) is a challenging classification task over sequences of acoustic features, extracted from the vocal signal, according to some linguistic models. Hidden Markov Models (HMM) are the most common and successful tool for ASR, allowing for high recognition performance in a variety of difficult tasks (speaker independent, large vocabulary, continuous speech). In spite of that, standard HMMs suffer from relevant limitations. Artificial neural networks (ANN) have been proposed as an alternative, nonparametric paradigm for ASR since the late Eighties. ANNs were successfully applied in reduced size tasks (e.g., phoneme or isolated word recognition) but they did not succeed in solving the general ASR problem, due to their lack of capability to model long-term dependencies. Combining ANN and HMM within a unifying framework is a suitable approach to ASR, which takes benefit from both techniques and overcomes the corresponding limitations. This paper reviews the main topics concerning hybrid HMM/ANN systems for ASR, summarizing some major architectures of this kind
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/1527
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact