Retrieving phrases by selecting the history: application to Automatic Speech Recognition - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2002

Retrieving phrases by selecting the history: application to Automatic Speech Recognition

Résumé

This paper focuses on statistical language modeling for automatic speech recognition. We present a method which aims at finding linguistic units in corpus. This method, called the Selected History Principle, consists in finding strong distant relationships between words. The new units are phrases made up of basic units of our vocabulary linked by these distant relationships. We adapt the multigram principle to large vocabularies in order to introduce an optimal subset of these sequences into a bigram model. The bigram model using these sequences outperforms the baseline bigram model by 21% in terms of Perplexity, and increases the recognition rate of the large vocabulary system Sirocco by 8.7%. The word error rate is decreased by 12.7%.
Fichier non déposé

Dates et versions

inria-00100805 , version 1 (26-09-2006)

Identifiants

  • HAL Id : inria-00100805 , version 1

Citer

David Langlois, Kamel Smaïli, Jean-Paul Haton. Retrieving phrases by selecting the history: application to Automatic Speech Recognition. 7th International Conference on Spoken Language Processing - ICSLP'2002, Sep 2002, Denver, USA, pp.721. ⟨inria-00100805⟩
71 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More