A mixture model-based real-time audio sources classification method

Maxime Baelde; Christophe Biernacki; Raphaël Greff

Communication Dans Un Congrès Année : 2017

A mixture model-based real-time audio sources classification method

(1, 2, 3) , (1, 2) , (3)

1
2
3

Maxime Baelde

Fonction : Auteur

Laboratoire Paul Painlevé - UMR 8524

MOdel for Data Analysis and Learning

A-Volute [Roubaix]

Christophe Biernacki

Fonction : Auteur

Laboratoire Paul Painlevé - UMR 8524

MOdel for Data Analysis and Learning

Raphaël Greff

Fonction : Auteur

A-Volute [Roubaix]

Résumé

Recent research on machine learning focuses on audio source identification in complex environments. They rely on extracting features from audio signals and use machine learning techniques to model the sound classes. However, such techniques are often not optimized for a real-time implementation and in multi-source conditions. We propose a new real-time audio single-source classification method based on a dictionary of sound models (that can be extended to a multi-source setting). The sound spectrums are modeled with mixture models and form a dictionary. The classification is based on a comparison with all the elements of the dictionary by computing likelihoods and the best match is used as a result. We found that this technique outperforms classic methods within a temporal horizon of 0.5s per decision (achieved 6% of errors on a database composed of 50 classes). Future works will focus on the multi-sources classification and reduce the computational load.

Mots clés

real-time audio identification statistical learning mixture models sound classification machine learn-

Domaines

Méthodologie [stat.ME]

Fichier principal

baelde.pdf (268.48 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Christophe Biernacki : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01420677

Soumis le : mercredi 28 décembre 2016-10:25:31

Dernière modification le : vendredi 19 avril 2024-14:04:05

Archivage à long terme le : mardi 28 mars 2017-01:45:10

Dates et versions

hal-01420677 , version 1 (22-12-2016)

hal-01420677 , version 2 (28-12-2016)

Identifiants

HAL Id : hal-01420677 , version 2

Citer

Maxime Baelde, Christophe Biernacki, Raphaël Greff. A mixture model-based real-time audio sources classification method. The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP2017, Mar 2017, New Orleans, United States. ⟨hal-01420677v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRIA2 UNIV-LILLE LPP-MATH

447 Consultations

1713 Téléchargements

A mixture model-based real-time audio sources classification method

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager