According to the physical meaning of the frequencydomain blind source separation (FD-BSS), each mixing matrix estimated by Independent Component Analysis (ICA) contains information on the physical acoustic propagation related to each source and then can be used for localization purposes. In this paper we analyze the Generalized State Coherence Transform (GSCT) which is a non-linear transform of the space represented by the whole demixing matrices. The transform enables an accurate estimation of the propagation time-delay of multiple sources in multiple dimensions. Furthermore, it is shown that with appropriate non-linearities and a statistical model for the reverberation, GSCT can be considered an approximated kernel density estimator of the acoustic propagation time-delay. Experimental results confirm the good properties of the transform and its effectiveness in addressing multiple source TDOA detection (e.g. 2D TDOA estimation of several sources with only three microphones).

Generalized State Coherence Transform for multidimensional TDOA estimation of multiple sources

Nesta, Francesco;Omologo, Maurizio
2012

Abstract

According to the physical meaning of the frequencydomain blind source separation (FD-BSS), each mixing matrix estimated by Independent Component Analysis (ICA) contains information on the physical acoustic propagation related to each source and then can be used for localization purposes. In this paper we analyze the Generalized State Coherence Transform (GSCT) which is a non-linear transform of the space represented by the whole demixing matrices. The transform enables an accurate estimation of the propagation time-delay of multiple sources in multiple dimensions. Furthermore, it is shown that with appropriate non-linearities and a statistical model for the reverberation, GSCT can be considered an approximated kernel density estimator of the acoustic propagation time-delay. Experimental results confirm the good properties of the transform and its effectiveness in addressing multiple source TDOA detection (e.g. 2D TDOA estimation of several sources with only three microphones).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/34200
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact