Integrating Speech Recognition and Natural Language LTAG Techniques with Weighted Synchonized Automata

Jean-Luc Husson; Patrice Lopez

Communication Dans Un Congrès Année : 1999

Integrating Speech Recognition and Natural Language LTAG Techniques with Weighted Synchonized Automata

(1) , (1)

Jean-Luc Husson

Fonction : Auteur
PersonId : 835344

Human-machine dialogue with a significant language component

Patrice Lopez

Fonction : Auteur
PersonId : 2984
IdHAL : patricelopez
ORCID : 0000-0002-9959-9441
IdRef : 157929930

Human-machine dialogue with a significant language component

Résumé

This paper will focus on the conceptual and technical design of a system architecture devoted to the integration of the various natural spoken language processing levels. Our goal is to avoid the well known constraints and limitations imposed by the usual combination of stochastic recognition models and statistical n-gram language models. To do so, we investigate the intensive use of weighted finite-state automata at each processing level to gather the static information. These automata are obtained directly or by linearizing tree structures and are then minimalized to get a optimal sharing of common substructures. Our hypothesis is that this property of sharing directly leads to a high processing factorization and consequently to an improved computational efficiency. Experiments are currently carried out to integrate an analytical segmentation system, a stochastic phonetic recognition module and a parser based on synchronous LTAG (Lexicalized Tree Adjoining Grammars) [Shieber 90] [Lopez 98] for syntactic and semantic descriptions to analyse a spoken task-oriented man-machine dialog corpus. The segmentation stage is based on hierarchical multi-level lattices and produces a weighted automaton of predicted broad phonetic classes [Husson 96]. The phoneme recognition stage is led by usual phonetic hidden Markov models. Minimilized phonetic automata are computed from the canonical phonetic transcription of all words of the corpus [Laporte 93]. The syntactic and semantic lexicalized tree grammar are semi-automatically built from a corpus collected by an wizard of Oz experiment. The trees are linearized and the resulting automata are minimalized with similar techniques as described in [Evans 98]. We provide some results which exhibit the high syntactical substructures sharing rate which is obtained. These different automata are synchronized with links between states. The decoding and parsing modules produce temporary concurrent analysis structures attached to at least one state of the corresponding automaton. Distributed control modules are needed to apply and exploit local synchronization constraints on the activated synchronized states. These synchonization techniques allow us to implement all kind of hypothesis manipulation for both deductive and abductive reasoning.

Mots clés

grammaire lexicalisée d'arbres adjoints traitement du langage naturel techniques d'intégration integration techniques reconnaissance de parole speech recognition finite-state automata lexicalized tree grammar natural language processing automates à états finis

Domaines

Autre [cs.OH]

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00098903

Soumis le : mardi 26 septembre 2006-08:40:06

Dernière modification le : vendredi 24 mars 2023-14:52:48

Dates et versions

inria-00098903 , version 1 (26-09-2006)

Identifiants

HAL Id : inria-00098903 , version 1

Citer

Jean-Luc Husson, Patrice Lopez. Integrating Speech Recognition and Natural Language LTAG Techniques with Weighted Synchonized Automata. International Workshop Speech & Computer - SPECOM'99, 1999, Moscow, Russia, 6 p. ⟨inria-00098903⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

92 Consultations

0 Téléchargements

Integrating Speech Recognition and Natural Language LTAG Techniques with Weighted Synchonized Automata

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager