IRIS Institutional Research Information System

This paper describes the system used to process the data of the CHiME Pascal 2011 competition, whose goal is to separate the desired speech and recognize the commands being spoken. The binaural recorded mixtures are processed by an on-line Semi- Blind Source Extraction algorithm. The algorithm is based on a multi-stage architecture combining the advantages of con- strained Independent Component Analysis and Wiener-based processing, allowing the estimation of the target signal with lim- ited distortion. The recovered target signal is then fed to the rec- ognizer which uses noise robust features based on Gammatone Frequency Cepstral Coefficients. Moreover, model adaptation to actual processing is applied as a further stage to reduce the acoustic mismatch. Performance comparison between differ- ent model/algorithmic settings is reported for both development and test data sets.

Robust Automatic Speech Recognition through On-line Semi Blind Signal Extraction

Nesta, Francesco;Matassoni, Marco

2011-01-01

Abstract

This paper describes the system used to process the data of the CHiME Pascal 2011 competition, whose goal is to separate the desired speech and recognize the commands being spoken. The binaural recorded mixtures are processed by an on-line Semi- Blind Source Extraction algorithm. The algorithm is based on a multi-stage architecture combining the advantages of con- strained Independent Component Analysis and Wiener-based processing, allowing the estimation of the target signal with lim- ited distortion. The recovered target signal is then fed to the rec- ognizer which uses noise robust features based on Gammatone Frequency Cepstral Coefficients. Moreover, model adaptation to actual processing is applied as a further stage to reduce the acoustic mismatch. Performance comparison between differ- ent model/algorithmic settings is reported for both development and test data sets.

Scheda breve

Scheda completa

Scheda completa (DC)

Anno

2011

Appare nelle tipologie:

4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
pS22_nesta.pdf accesso aperto Licenza: Dominio pubblico Dimensione 321.14 kB Formato Adobe PDF Visualizza/Apri	321.14 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/34202

Citazioni

ND

social impact