About vocabulary adaptation for automatic speech recognition of video data

Denis Jouvet; David Langlois; Mohamed Amine Menacer; Dominique Fohr; Odile Mella; Kamel Smaïli

Conference paper, Year: 2017

Denis Jouvet

Role: Author
PersonId: 15904
IdHAL: denis-jouvet
IdRef: 029418666

Speech Modeling for Facilitating Oral-Based Communication

David Langlois

Role: Author
PersonId: 298
IdHAL: david-langlois
IdRef: 070239509

Statistical Machine Translation and Speech Modelization and Text

Mohamed Amine Menacer

Role: Author
PersonId: 14275
IdHAL: mohamed-amine-menacer
IdRef: 25240937X

Statistical Machine Translation and Speech Modelization and Text

Dominique Fohr

Role: Author
PersonId: 15652
IdHAL: dominique-fohr
IdRef: 031092942

Speech Modeling for Facilitating Oral-Based Communication

Odile Mella

Role: Author
PersonId: 15902
IdHAL: odile-mella
IdRef: 12011903X

Speech Modeling for Facilitating Oral-Based Communication

Kamel Smaïli

Role: Author
PersonId: 2521
IdHAL: kamel-smaili
IdRef: 034429700

Statistical Machine Translation and Speech Modelization and Text

Abstract

This paper discusses the adaptation of vocabularies for automatic speech recognition, in the context of transcribing videos in French, English and Arabic. Baseline automatic speech recognition systems have been developed using available data. However, the available text data, including the GigaWord corpora from LDC, are becoming dated with respect to the recent videos that are to be transcribed. The paper presents the collection of recent textual data from the internet for updating the speech recognition vocabularies and training the language models, as well as the construction of the development data sets needed for the vocabulary selection process. The paper also compares the coverage of the training data collected from the internet with that of the GigaWord data, using finite-size vocabularies made of the most frequent words. Finally, the paper presents and discusses the number of out-of-vocabulary word occurrences, before and after the vocabularies are updated, for the three languages.
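As a rough illustration of the kind of coverage measurement the abstract describes, the sketch below builds a finite-size vocabulary from the most frequent words of a training text and measures the share of out-of-vocabulary (OOV) word occurrences on a separate text. It is a minimal sketch and not the authors' procedure: the function names `select_vocabulary` and `oov_rate`, the whitespace tokenization, and the toy sentences are all assumptions made for illustration.

```python
from collections import Counter


def select_vocabulary(train_tokens, size):
    """Return the `size` most frequent word types in the training tokens.

    Hypothetical helper: a plain frequency cut-off, used here only to
    illustrate a vocabulary "made of the most frequent words".
    """
    counts = Counter(train_tokens)
    return {word for word, _ in counts.most_common(size)}


def oov_rate(vocabulary, tokens):
    """Fraction of word occurrences in `tokens` that are not in `vocabulary`."""
    if not tokens:
        return 0.0
    missing = sum(1 for token in tokens if token not in vocabulary)
    return missing / len(tokens)


# Toy data (invented for the example, not taken from the paper).
train = "the cat sat on the mat the dog sat".split()
dev = "the cat sat near the drone".split()

vocab = select_vocabulary(train, 2)   # keeps "the" (3 occurrences) and "sat" (2)
rate = oov_rate(vocab, dev)           # "cat", "near", "drone" are OOV: 3 of 6 tokens
print(f"vocabulary={sorted(vocab)} oov_rate={rate:.2f}")
```

Running the same measurement with vocabularies built from an older corpus and from recently collected text, at the same vocabulary size, gives a simple way to compare how well each source covers recent data.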

Keywords

Speech recognition; vocabulary; vocabulary adaptation; vocabulary selection

Domains

Signal and Image Processing [eess.SP]

Main file

AboutTaskAdaptation-v1.2-upload.01November2017.pdf (1.14 MB)

Origin: Files produced by the author(s)

Contributor: Denis Jouvet

https://inria.hal.science/hal-01649057

Submitted on: Monday, 27 November 2017, 10:45:02

Last modified on: Monday, 11 September 2023, 17:41:19

Dates and versions

hal-01649057, version 1 (27-11-2017)

Identifiers

HAL Id: hal-01649057, version 1

Cite

Denis Jouvet, David Langlois, Mohamed Amine Menacer, Dominique Fohr, Odile Mella, et al. About vocabulary adaptation for automatic speech recognition of video data. ICNLSSP'2017 - International Conference on Natural Language, Signal and Speech Processing, Dec 2017, Casablanca, Morocco. pp.1-5. ⟨hal-01649057⟩

Collections

CNRS INRIA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD SILECS
