A mixture model-based real-time audio sources classification method - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

A mixture model-based real-time audio sources classification method

Résumé

Recent research on machine learning focuses on audio source identification in complex environments. They rely on extracting features from audio signals and use machine learning techniques to model the sound classes. However, such techniques are often not optimized for a real-time implementation and in multi-source conditions. We propose a new real-time audio single-source classification method based on a dictionary of sound models (that can be extended to a multi-source setting). The sound spectrums are modeled with mixture models and form a dictionary. The classification is based on a comparison with all the elements of the dictionary by computing likelihoods and the best match is used as a result. We found that this technique outperforms classic methods within a temporal horizon of 0.5s per decision (achieved 6% of errors on a database composed of 50 classes). Future works will focus on the multi-sources classification and reduce the computational load.
Fichier principal
Vignette du fichier
baelde.pdf (268.48 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01420677 , version 1 (22-12-2016)
hal-01420677 , version 2 (28-12-2016)

Identifiants

  • HAL Id : hal-01420677 , version 2

Citer

Maxime Baelde, Christophe Biernacki, Raphaël Greff. A mixture model-based real-time audio sources classification method. The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP2017, Mar 2017, New Orleans, United States. ⟨hal-01420677v2⟩
447 Consultations
1713 Téléchargements

Partager

Gmail Facebook X LinkedIn More