Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation

Résumé

Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through a multichannel convolutive mixture and in which the source power spectrograms are jointly modeled by a 3-valence (time/frequency/source) tensor. Our user-guided separation method produced competitive results at the 2010 Signal Separation Evaluation Campaign, with sufficient quality for real-world music editing applications.
Fichier principal
Vignette du fichier
Ozerov_et_al_icassp11.pdf (237.36 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00564851 , version 1 (10-02-2011)

Identifiants

  • HAL Id : inria-00564851 , version 1

Citer

Alexey Ozerov, Cédric Févotte, Raphaël Blouet, Jean-Louis Durrieu. Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), May 2011, Prague, Czech Republic. ⟨inria-00564851⟩
378 Consultations
692 Téléchargements

Partager

Gmail Facebook X LinkedIn More