Music feature maps with convolutional neural networks for music genre classification - Université Toulouse III - Paul Sabatier - Toulouse INP Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Music feature maps with convolutional neural networks for music genre classification

Résumé

Nowadays, deep learning is more and more used for Music Genre Classification: particularly Convolutional Neural Networks (CNN) taking as entry a spectrogram considered as an image on which are sought different types of structure. But, facing the criticism relating to the difficulty in understanding the underlying relationships that neural networks learn in presence of a spectrogram, we propose to use, as entries of a CNN, a small set of eight music features chosen along three main music dimensions: dynamics, timbre and tonality. With CNNs trained in such a way that filter dimensions are interpretable in time and frequency, results show that only eight music features are more efficient than 513 frequency bins of a spectrogram and that late score fusion between systems based on both feature types reaches 91% accuracy on the GTZAN database.
Fichier principal
Vignette du fichier
Senac_22107.pdf (599.58 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-02874986 , version 1 (19-06-2020)

Identifiants

Citer

Christine Sénac, Thomas Pellegrini, Florian Mouret, Julien Pinquier. Music feature maps with convolutional neural networks for music genre classification. International Workshop on Content-Based Multimedia Indexing (CBMI), Jun 2017, Florence, Italy. pp.1-5, ⟨10.1145/3095713.3095733⟩. ⟨hal-02874986⟩
61 Consultations
693 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More