Kernel Spectrogram models for source separation

Antoine Liutkus; Zafar Rafii; Bryan Pardo; Derry Fitzgerald; Laurent Daudet

Communication Dans Un Congrès Année : 2014

Kernel Spectrogram models for source separation

(1) , (2) , (2) , (3) , (4)

1
2
3
4

Antoine Liutkus

Fonction : Auteur
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Analysis, perception and recognition of speech

Zafar Rafii

Fonction : Auteur

Northwestern University [Evanston]

Bryan Pardo

Fonction : Auteur

Northwestern University [Evanston]

Derry Fitzgerald

Fonction : Auteur

NIMBUS Centre [Cork]

Laurent Daudet

Fonction : Auteur

Institut Langevin - Ondes et Images (UMR7587)

Résumé

In this study, we introduce a new framework called Kernel Additive Modelling for audio spectrograms that can be used for multichannel source separation. It assumes that the spectrogram of a source at any time-frequency bin is close to its value in a neighbourhood indicated by a source-specific proximity kernel. The rationale for this model is to easily account for features like periodicity, stability over time or frequency, self-similarity, etc. In many cases, such local dynamics are indeed much more natural to assess than any global model such as a tensor factorization. This framework permits one to use different proximity kernels for different sources and to estimate them blindly using their mixtures only. Estimation is performed using a variant of the kernel backfitting algorithm that allows for multichannel mixtures and permits parallelization. Experimental results on the separation of vocals from musical backgrounds demonstrate the efficiency of the approach.

Mots clés

audio source separation spatial filtering spectrogram models

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

KAM_HSCMA.pdf (113.61 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Antoine Liutkus : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00959384

Soumis le : samedi 15 mars 2014-12:11:01

Dernière modification le : vendredi 19 avril 2024-16:18:59

Archivage à long terme le : dimanche 15 juin 2014-10:37:04

Dates et versions

hal-00959384 , version 1 (14-03-2014)

hal-00959384 , version 2 (15-03-2014)

hal-00959384 , version 3 (21-03-2014)

hal-00959384 , version 4 (16-02-2015)

Identifiants

HAL Id : hal-00959384 , version 2

Citer

Antoine Liutkus, Zafar Rafii, Bryan Pardo, Derry Fitzgerald, Laurent Daudet. Kernel Spectrogram models for source separation. HSCMA, May 2014, Nancy, France. ⟨hal-00959384v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

681 Consultations

1155 Téléchargements

Kernel Spectrogram models for source separation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Partager