Conference paper, Year: 1999

A New Based Distance Language Model for a Dictation Machine: application to MAUD

Abstract

This paper presents a stochastic language model based on splitting the word history into its d individual words, where d is the length of the history. One of our aims is to model the semantic and syntactic relationships between words, and this model can be considered a first step toward that goal. We evaluated the model with the Shannon game (on 10,000 truncated sentences) and implemented it in MAUD, our dictation machine. Tests on MAUD were carried out on 300 sentences spoken by several women and men. In the Shannon game, this model predicts more words than any other method previously developed in our team, even though those methods are more sophisticated than the one described here. Moreover, when unknown words are included, the results are better than those of the models we presented in a recent work, in terms of mean rank, ranks from 1 to 5, and perplexity. This work required two interpolation methods inspired by Markov models. We also discuss the problem of modelling unknown words.
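As a rough illustration of the idea in the abstract, the sketch below keeps one distance-k bigram table per position in the history, combines the d estimates by linear interpolation, and ranks candidate words as a Shannon-game evaluation would. The function names (`train_distance_counts`, `predict_proba`, `rank_of`), the uniform interpolation weights, the add-one smoothing, and the toy corpus are all assumptions made for this example; the record does not give the paper's formulas or its two interpolation methods, so this should not be read as the authors' implementation.

```python
from collections import Counter, defaultdict


def train_distance_counts(sentences, d):
    """counts[k][v][w] = times w occurs exactly k positions after v (1 <= k <= d)."""
    counts = {k: defaultdict(Counter) for k in range(1, d + 1)}
    for sent in sentences:
        for i, w in enumerate(sent):
            for k in range(1, d + 1):
                if i - k >= 0:
                    counts[k][sent[i - k]][w] += 1
    return counts


def predict_proba(history, counts, vocab, d, weights=None):
    """Linearly interpolate the d distance-k estimates for every word in vocab."""
    # Assumption: uniform weights; the paper's interpolation schemes are not reproduced.
    weights = weights or [1.0 / d] * d
    v_size = len(vocab)
    probs = {}
    for w in vocab:
        p = 0.0
        for k in range(1, d + 1):
            if k <= len(history):
                ctx = counts[k].get(history[-k], Counter())
                total = sum(ctx.values())
                # Assumption: add-one smoothing so unseen pairs keep non-zero mass.
                p_k = (ctx[w] + 1) / (total + v_size)
            else:
                # History shorter than k: fall back to a uniform estimate.
                p_k = 1.0 / v_size
            p += weights[k - 1] * p_k
        probs[w] = p
    return probs


def rank_of(target, probs):
    """1-based rank of the true next word (ties broken alphabetically)."""
    ordered = sorted(probs, key=lambda w: (-probs[w], w))
    return ordered.index(target) + 1


# Toy usage: predict the word following the truncated sentence "the cat".
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"], ["the", "cat", "ran"]]
vocab = sorted({w for s in corpus for w in s})
counts = train_distance_counts(corpus, d=2)
probs = predict_proba(["the", "cat"], counts, vocab, d=2)
print(rank_of("sat", probs))
```

Averaging `rank_of` over many truncated sentences gives a mean rank, and counting how often the rank falls between 1 and 5 gives the kind of top-5 figure the abstract refers to.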

Domains

Other [cs.OH]
No file deposited

Dates and versions

inria-00098984, version 1 (26-09-2006)

Identifiers

  • HAL Id: inria-00098984, version 1

Cite

David Langlois, Kamel Smaïli. A New Based Distance Language Model for a Dictation Machine: application to MAUD. 6th European Conference on Speech Communication & Technology - EUROSPEECH'99, 1999, Budapest, Hungary, pp.1779-1782. ⟨inria-00098984⟩