Audio keyword extraction by unsupervised word discovery

Armando Muscariello; Guillaume Gravier; Frédéric Bimbot

Conference paper, Year: 2009

Armando Muscariello

Role: Author
PersonId : 885855

Speech and sound data modeling and processing

Guillaume Gravier

Role: Author
PersonId : 1046
IdHAL : guig
ORCID : 0000-0002-2266-5682
IdRef : 110355415

Speech and sound data modeling and processing

Frédéric Bimbot

Role: Author
PersonId : 830967

Speech and sound data modeling and processing

Abstract

In real audio data, frequently occurring patterns often convey relevant information about the overall content. Identifying such key patterns makes it possible to extract meaningful portions of the main content, which can be exploited to provide audio summaries and to speed up access to relevant parts of the data. We refer to these patterns as audio motifs, by analogy with the nomenclature of the counterpart task in biology. We describe a framework for discovering audio motifs in streams in an unsupervised fashion, that is, without acoustic or linguistic models. We define the fundamental problem by decomposing the overall task into elementary subtasks; we then propose a solution that combines a one-pass strategy exploiting the local repetitiveness of motifs with a dynamic programming technique for detecting repetitions in audio streams. Results of an experiment on a radio broadcast show illustrate the effectiveness of the technique in providing audio summaries of real data.
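To make the dynamic programming idea concrete, here is a minimal sketch using classic dynamic time warping (DTW) to compare two short sequences of feature frames. It is not the authors' algorithm: the abstract does not say which dynamic programming variant, features, or local distance the paper uses. The names `frame_dist` and `dtw_distance`, the Euclidean local distance, the length normalization, and the toy one-dimensional "features" are all assumptions made for illustration.

```python
import math


def frame_dist(a, b):
    """Euclidean distance between two feature vectors of equal length (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Length-normalized DTW cost between two non-empty sequences of frames."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j]: minimal accumulated cost of aligning seq_a[:i] with seq_b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_dist(seq_a[i - 1], seq_b[j - 1])
            # Allowed moves: advance in seq_a, advance in seq_b, or advance in both.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Normalize by total length so segments of different durations are comparable.
    return cost[n][m] / (n + m)


# Toy 1-D "features": `stretched` is a time-stretched copy of `motif`,
# while `other` is an unrelated segment.
motif = [[0.0], [1.0], [2.0], [1.0], [0.0]]
stretched = [[0.0], [1.0], [1.0], [2.0], [2.0], [1.0], [0.0]]
other = [[5.0], [5.0], [4.0], [5.0], [5.0]]

# A repetition should align with a lower cost than an unrelated segment.
is_repetition = dtw_distance(motif, stretched) < dtw_distance(motif, other)
```

In a motif discovery setting, a low normalized cost between a candidate segment and an earlier portion of the stream would flag a likely repetition. The decision threshold and the way candidate segments are chosen are design choices that the abstract does not specify.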

Domains

Signal and Image Processing [eess.SP]

Main file

is_09_motif.pdf (73.62 KB)

Origin: Files produced by the author(s)

Contributor: Armando Muscariello

https://inria.hal.science/inria-00551769

Submitted on: Sunday, February 20, 2011, 19:13:21

Last modified on: Friday, March 24, 2023, 14:52:54

Long-term archiving on: Saturday, May 21, 2011, 02:31:15

Dates and versions

inria-00551769 , version 1 (20-02-2011)

Identifiers

HAL Id : inria-00551769 , version 1

Cite

Armando Muscariello, Guillaume Gravier, Frédéric Bimbot. Audio keyword extraction by unsupervised word discovery. INTERSPEECH 2009: 10th Annual Conference of the International Speech Communication Association, Sep 2009, Brighton, United Kingdom. ⟨inria-00551769⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM
