In Gaussian model based audio source separation, source spatial images are modelled by Gaussian distributions. The covariance matrices of the distributions are represented by source variances and spatial covariance matrices. Accordingly, the likelihood of an observed mixture of independent source signals is parametrized by the variances and the covariance matrices. The separation is performed by estimating the parameters and applying multichannel Wiener filtering. Assuming that spectral basis matrices trained on source power spectra are available, this work proposes a method to estimate the parameters by maximizing the likelihood using Expectation-Maximization. In terms of normalization, the variances are estimated applying singular value decomposition. Furthermore, by building weighted matrices from vectors of the trained matrices, semi-supervised non-negative matrix factorization is applied to estimate the spatial covariance matrices. The experimental results prove the efficiency of the proposed algorithm in reverberant environments.

Estimation of the spatial information in Gaussian model based audio source separation using weighted spectral bases

Abdelraheem, Mahmoud Fakhry Mahmoud;Svaizer, Piergiorgio;Omologo, Maurizio
2016

Abstract

In Gaussian model based audio source separation, source spatial images are modelled by Gaussian distributions. The covariance matrices of the distributions are represented by source variances and spatial covariance matrices. Accordingly, the likelihood of an observed mixture of independent source signals is parametrized by the variances and the covariance matrices. The separation is performed by estimating the parameters and applying multichannel Wiener filtering. Assuming that spectral basis matrices trained on source power spectra are available, this work proposes a method to estimate the parameters by maximizing the likelihood using Expectation-Maximization. In terms of normalization, the variances are estimated applying singular value decomposition. Furthermore, by building weighted matrices from vectors of the trained matrices, semi-supervised non-negative matrix factorization is applied to estimate the spatial covariance matrices. The experimental results prove the efficiency of the proposed algorithm in reverberant environments.
978-0-9928-6266-4
File in questo prodotto:
File Dimensione Formato  
Paper_Eusipco_2016.pdf

non disponibili

Descrizione: Articolo principale
Tipologia: Documento in Pre-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 361.34 kB
Formato Adobe PDF
361.34 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11582/307031
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact