Audio Source Separation With a Single Sensor - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Audio, Speech and Language Processing Année : 2006

Audio Source Separation With a Single Sensor

Résumé

In this work we present a method to perform a complete audiovisual source separation without need of previous information. This method is based on the assumption that sounds are caused by moving structures. Thus, an efficient representation of audio and video sequences allows to build relationships between synchronous structures on both modalities. A robust clustering algorithm groups video structures exhibiting strong correlations with the audio so that sources are counted and located in the image. Using such information and exploiting audio-video correlation, the audio sources activity is determined. Next, spectral Gaussian Mixture Models (GMMs) are learnt in time slots with only one source active so that it is possible to separate them in case of an audio mixture. Audio source separation performances are rigorously evaluated, clearly showing that the proposed algorithm performs efficiently and robustly.
Fichier principal
Vignette du fichier
2006_IEEE_TSALP_BenaroyaBimbotGribonval_AudioSepOneSensor.pdf (412.37 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

inria-00544949 , version 1 (11-12-2010)

Identifiants

Citer

Laurent Benaroya, Frédéric Bimbot, Rémi Gribonval. Audio Source Separation With a Single Sensor. IEEE Transactions on Audio, Speech and Language Processing, 2006, 14 (1), pp.191--199. ⟨10.1109/TSA.2005.854110⟩. ⟨inria-00544949⟩
304 Consultations
1123 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More