Spatial location priors for Gaussian model-based reverberant audio source separation - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2012

Spatial location priors for Gaussian model-based reverberant audio source separation

Résumé

This article addresses the under-determined reverberant audio source separation when prior knowledge about the spatial source locations and the room characteristics is available. We consider the Gaussian modeling framework whereby the contribution of each source to all mixture channels in the time-frequency domain is modeled as a zero-mean Gaussian random variable whose covariance represents the spatial characteristics of the source. We advocate the use of rigorous Bayesian estimation by defining three different priors over the spatial parameters, whose means are given by the theory of statistical room acoustics and whose variances are learned from training data. We then derive corresponding Expectation-Maximization (EM) algorithms to estimate the model parameters in the Maximum A Posteriori (MAP) sense. These algorithms provide a principled solution to the well-known permutation problem and two of them achieve better separation performance, as shown in our experiment, than Maximum Likelihood (ML) based EM algorithms exploiting the same prior knowledge.
Fichier principal
Vignette du fichier
RR-8057.pdf (285.95 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00727781 , version 1 (04-09-2012)
hal-00727781 , version 2 (02-04-2013)

Identifiants

  • HAL Id : hal-00727781 , version 1

Citer

Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval. Spatial location priors for Gaussian model-based reverberant audio source separation. [Research Report] RR-8057, 2012. ⟨hal-00727781v1⟩

Collections

INRIA-RRRT
461 Consultations
441 Téléchargements

Partager

Gmail Facebook X LinkedIn More