This paper describes the system used to process the data of the CHiME Pascal 2011 competition, whose goal is to separate the desired speech and recognize the commands being spoken. The binaural recorded mixtures are processed by an on-line Semi- Blind Source Extraction algorithm. The algorithm is based on a multi-stage architecture combining the advantages of con- strained Independent Component Analysis and Wiener-based processing, allowing the estimation of the target signal with lim- ited distortion. The recovered target signal is then fed to the rec- ognizer which uses noise robust features based on Gammatone Frequency Cepstral Coefficients. Moreover, model adaptation to actual processing is applied as a further stage to reduce the acoustic mismatch. Performance comparison between differ- ent model/algorithmic settings is reported for both development and test data sets.

Robust Automatic Speech Recognition through On-line Semi Blind Signal Extraction

Nesta, Francesco;Matassoni, Marco
2011-01-01

Abstract

This paper describes the system used to process the data of the CHiME Pascal 2011 competition, whose goal is to separate the desired speech and recognize the commands being spoken. The binaural recorded mixtures are processed by an on-line Semi- Blind Source Extraction algorithm. The algorithm is based on a multi-stage architecture combining the advantages of con- strained Independent Component Analysis and Wiener-based processing, allowing the estimation of the target signal with lim- ited distortion. The recovered target signal is then fed to the rec- ognizer which uses noise robust features based on Gammatone Frequency Cepstral Coefficients. Moreover, model adaptation to actual processing is applied as a further stage to reduce the acoustic mismatch. Performance comparison between differ- ent model/algorithmic settings is reported for both development and test data sets.
File in questo prodotto:
File Dimensione Formato  
pS22_nesta.pdf

accesso aperto

Licenza: Dominio pubblico
Dimensione 321.14 kB
Formato Adobe PDF
321.14 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/34202
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact