A Wavelet-Based Parameterization for Speech/Music Segmentation - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

A Wavelet-Based Parameterization for Speech/Music Segmentation

Dominique Fohr
Irina Illina
Odile Mella

Résumé

The problem of speech/music discrimination is a challenging research problem which significantly impacts Automatic Speech Recognition (ASR) performance. This paper proposes new features for the Speech/Music discrimination task. We propose to use a decomposition of the audio signal based on wavelets, which allows a good analysis of non stationary signal like speech or music. We compute different energy types in each frequency band obtained from wavelet decomposition. Two class/non-class classifiers are used : one for speech/non-speech, one for music/non-music. On the different test corpora, the proposed wavelet approach gives better results than the MFCC one. For instance, we have a significant relative improvements of the error rate of 71.6% on the ``Scheirer'' corpus for the speech/music discrimination task.
Fichier non déposé

Dates et versions

hal-00103569 , version 1 (04-10-2006)

Identifiants

  • HAL Id : hal-00103569 , version 1

Citer

Emmanuel Didiot, Dominique Fohr, Jean-Paul Haton, Irina Illina, Odile Mella. A Wavelet-Based Parameterization for Speech/Music Segmentation. 2006, pp.653. ⟨hal-00103569⟩
107 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More