Journal article. IEEE Transactions on Audio, Speech and Language Processing. Year: 2010

Under-determined reverberant audio source separation using a full-rank spatial covariance model

Abstract

This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covariance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation-maximization (EM) algorithms to estimate the parameters of each model and propose suitable procedures adapted from the state-of-the-art to initialize the parameters and to align the order of the estimated sources across all frequency bins. Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach.
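To make the modeling idea concrete, here is a minimal sketch of how a zero-mean Gaussian model with a per-source spatial covariance could be used to separate one time-frequency bin by multichannel Wiener filtering. It rests on assumptions that the abstract does not spell out: each source's covariance is taken to factor into a scalar variance `v[j]` times a spatial covariance matrix `R[j]`, the sources are taken to be uncorrelated so that the mixture covariance is the sum of the source covariances, and the parameters are taken as already known. The function name `wiener_separate` and all variable names are illustrative. The sketch does not cover EM parameter estimation, initialization, or the alignment of source order across frequency bins.

```python
import numpy as np


def wiener_separate(x, v, R):
    """Estimate each source's contribution to all channels at one TF bin.

    x : (I,) complex mixture STFT coefficients over I channels.
    v : (J,) nonnegative variances of the J sources (assumed known).
    R : (J, I, I) Hermitian positive-definite spatial covariances
        (assumed known; full rank in the unconstrained case).
    Returns a (J, I) complex array of estimated source contributions.
    """
    # Covariance of each source contribution: v_j * R_j (assumed factorization).
    sigma_c = v[:, None, None] * R
    # Sources assumed uncorrelated, so the mixture covariance is their sum.
    sigma_x = sigma_c.sum(axis=0)
    # Wiener estimate: sigma_c[j] @ inv(sigma_x) @ x, computed with a linear
    # solve rather than an explicit matrix inverse.
    y = np.linalg.solve(sigma_x, x)
    return sigma_c @ y


# Usage with made-up parameters: 3 sources, 2 channels (under-determined).
rng = np.random.default_rng(0)
n_chan, n_src = 2, 3
A = rng.standard_normal((n_src, n_chan, n_chan)) \
    + 1j * rng.standard_normal((n_src, n_chan, n_chan))
R = A @ A.conj().transpose(0, 2, 1)      # random full-rank Hermitian matrices
v = rng.uniform(0.5, 2.0, size=n_src)    # arbitrary source variances
x = rng.standard_normal(n_chan) + 1j * rng.standard_normal(n_chan)

c_hat = wiener_separate(x, v, R)
print(c_hat.shape)
print(np.allclose(c_hat.sum(axis=0), x))
```

Because the Wiener gains of all sources sum to the identity matrix, the estimated contributions add back up to the observed mixture, which is what the final check verifies. With more sources than channels, a full-rank `R[j]` still gives every source a distinct gain matrix, which is why this kind of filtering remains usable in the under-determined case.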
Main file
duong_TASLP10.pdf (806.02 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00541865, version 1 (27-01-2011)

Cite

Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval. Under-determined reverberant audio source separation using a full-rank spatial covariance model. IEEE Transactions on Audio, Speech and Language Processing, 2010, 18 (7), pp. 1830-1840. ⟨10.1109/TASL.2010.2050716⟩. ⟨inria-00541865⟩