Weighted Automata Computation of Edit Distances with Consolidations and Fragmentations - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue Information and Computation Année : 2022

Weighted Automata Computation of Edit Distances with Consolidations and Fragmentations

Résumé

We study edit distances between strings, based on operations of character substitutions, insertions, deletions and additionally consolidations and fragmentations. The two latter operations transform a sequence of characters into one character and vice-versa. They correspond to the compression and expansion in Dynamic Time-Warping algorithms for speech recognition and are also used for the formal analysis of written music. Such edit distances are not computable in general for arbitrary rulesets. We propose weighted automaton constructions to compute an edit distance taking into account both consolidations and deletions, or both fragmentations and insertions. Assuming that the operation ruleset has a constant size, these constructions are polynomial into the lengths of the involved strings. We finally show that the optimal weight of sequences made of consolidations chained with fragmentations, in that order, is computable for arbitrary rulesets, and not computable if we reverse the order of fragmentations and consolidations.
Fichier principal
Vignette du fichier
ms-automata.pdf (236.4 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01857267 , version 1 (14-08-2018)
hal-01857267 , version 2 (17-08-2018)
hal-01857267 , version 3 (31-10-2018)
hal-01857267 , version 4 (16-10-2019)

Identifiants

Citer

Mathieu Giraud, Florent Jacquemard. Weighted Automata Computation of Edit Distances with Consolidations and Fragmentations. Information and Computation, 2022, 282, pp.104652. ⟨10.1016/j.ic.2020.104652⟩. ⟨hal-01857267v4⟩
549 Consultations
769 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More