Evaluation of hardness of a given automatic speech recognition task - once a specific language has been chosen - is usually accomplished in terms of such factors as the vocabulary size, the complexity of the grammar, isolated vs. continuous speech, and the acoustic conditions. In practice, little attention has been paid so far to the fact that, once a vocabulary has been fixed, along with a given grammar and setup of environmental conditions, the difficulty of the recognition task may vry according to the amount of 'confusable' words in the vocabylary, as well as to the specific 'degree of confusability' among them. This report informally introduces the concept of confusability, briefly surveying technical literature where the problem is faced. It then introduces two formal definitions of the problem, the former having a statistical root in Bayes' decision theory, and the latter relying on a geometric approach, involving the concept of 'distance' between pairs of words. Practical hints on possible directions to follow in order to apply the definitions are finally outlined

Word Confusability as an Evaluation Criterion of the Hardness of a Speech Recognition Task

Trentin, Edmondo;Matassoni, Marco
2000

Abstract

Evaluation of hardness of a given automatic speech recognition task - once a specific language has been chosen - is usually accomplished in terms of such factors as the vocabulary size, the complexity of the grammar, isolated vs. continuous speech, and the acoustic conditions. In practice, little attention has been paid so far to the fact that, once a vocabulary has been fixed, along with a given grammar and setup of environmental conditions, the difficulty of the recognition task may vry according to the amount of 'confusable' words in the vocabylary, as well as to the specific 'degree of confusability' among them. This report informally introduces the concept of confusability, briefly surveying technical literature where the problem is faced. It then introduces two formal definitions of the problem, the former having a statistical root in Bayes' decision theory, and the latter relying on a geometric approach, involving the concept of 'distance' between pairs of words. Practical hints on possible directions to follow in order to apply the definitions are finally outlined
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/291
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact