This paper proposes a new speaker localization method that is based on a preliminary estimation of the head orientation. The basic information on which the estimation is accomplished is called Oriented Global Coherence Field (OGCF).The new algorithm is shown to be significantly more robust than the traditional ones so far explored. Its robustness is also due to an effective speech activity detection, implicitly performed by a thresholding technique applied to OGCF information.To show the performance of the proposed system, experiments were conducted on the NIST RT-05 Spring Evaluation source localization task, which is based on real recordings of lectures in noisy and reverberant environments.

Speaker Localiztion Based on Oriented Global Coherence Field

Brutti, Alessio;Omologo, Maurizio;Svaizer, Piergiorgio
2006-01-01

Abstract

This paper proposes a new speaker localization method that is based on a preliminary estimation of the head orientation. The basic information on which the estimation is accomplished is called Oriented Global Coherence Field (OGCF).The new algorithm is shown to be significantly more robust than the traditional ones so far explored. Its robustness is also due to an effective speech activity detection, implicitly performed by a thresholding technique applied to OGCF information.To show the performance of the proposed system, experiments were conducted on the NIST RT-05 Spring Evaluation source localization task, which is based on real recordings of lectures in noisy and reverberant environments.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/2982
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact