We address the problem of assigning binary quality labels to automatically transcribed utterances when neither reference transcripts nor information about the decoding process are accessible. Our quality estimation models are evaluated in a large vocabulary continuous speech recognition setting (the transcription of English TED talks). In this setting, we apply different learning algorithms and strategies and measure performance in two testing conditions characterized by different distributions of “good” and “bad” instances. The positive results of our experiments pave the way towards the use of binary estimators of ASR output quality in a number of application scenarios.
Reference-free and Confidence-independent Binary Quality Estimation for Automatic Speech Recognition
Negri, Matteo;Turchi, Marco;Falavigna, Giuseppe Daniele
2015-01-01
Abstract
We address the problem of assigning binary quality labels to automatically transcribed utterances when neither reference transcripts nor information about the decoding process are accessible. Our quality estimation models are evaluated in a large vocabulary continuous speech recognition setting (the transcription of English TED talks). In this setting, we apply different learning algorithms and strategies and measure performance in two testing conditions characterized by different distributions of “good” and “bad” instances. The positive results of our experiments pave the way towards the use of binary estimators of ASR output quality in a number of application scenarios.File | Dimensione | Formato | |
---|---|---|---|
clic2015.pdf
solo utenti autorizzati
Descrizione: Conferenza Clic-it 2015
Tipologia:
Documento in Post-print
Licenza:
Creative commons
Dimensione
152.06 kB
Formato
Adobe PDF
|
152.06 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.