Few-shot learning through contextual data augmentation - INRIA - Institut National de Recherche en Informatique et en Automatique
Conference paper · Year: 2021

Few-shot learning through contextual data augmentation

Abstract

Machine translation (MT) models used in industries with constantly changing topics, such as translation or news agencies, need to adapt to new data to maintain their performance over time. Our aim is to teach a pre-trained MT model to translate previously unseen words accurately, based on very few examples. We propose (i) an experimental setup allowing us to simulate novel vocabulary appearing in human-submitted translations, and (ii) corresponding evaluation metrics to compare our approaches. We extend a data augmentation approach that uses a pre-trained language model to create training examples with similar contexts for novel words. We compare different fine-tuning and data augmentation approaches and show that adaptation on the scale of one to five examples is possible. Combining data augmentation with randomly selected training sentences leads to the highest BLEU score and accuracy improvements. Impressively, with only 1 to 5 examples, our model reports better accuracy scores than a reference system trained on an average of 313 parallel examples.
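The core idea of the augmentation step described above can be sketched as follows. This is a hedged, simplified illustration, not the authors' code: where the paper uses a pre-trained language model to produce sentences with similar contexts for a novel word, the toy function below stands in with simple context-word overlap against a set of slot-filling template sentences (the function name, `<mask>` slot token, and overlap heuristic are all illustrative assumptions).

```python
def augment_novel_word(novel_word, seed_contexts, template_sentences, slot="<mask>"):
    """Create extra training sentences for a rarely seen word by inserting it
    into template contexts that resemble the few seed examples.

    Toy stand-in for LM-based contextual augmentation: the real approach
    scores context similarity with a pre-trained language model.
    """
    # Words that co-occur with the novel word in the handful of seed examples.
    seed_vocab = {w for s in seed_contexts for w in s.lower().split()}
    augmented = []
    for tmpl in template_sentences:
        context_words = {w for w in tmpl.lower().split() if w != slot}
        # Keep only templates whose context overlaps the seed contexts
        # (a crude proxy for "similar context").
        if context_words & seed_vocab:
            augmented.append(tmpl.replace(slot, novel_word))
    return augmented


# Example: one seed sentence containing the novel word "booster".
extra = augment_novel_word(
    "booster",
    ["The vaccine booster dose was approved"],
    ["The new <mask> was approved yesterday", "She sang a <mask>"],
)
# Only the first template shares context words with the seed sentence,
# so it is the only one instantiated.
```

In the paper's setting, sentences generated this way are paired with their translations and mixed with randomly selected original training sentences before fine-tuning.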
Main file
Lifelong_learning_EACL_2021_submission-3.pdf (593.23 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03121971 , version 1 (26-01-2021)

Identifiers

  • HAL Id : hal-03121971 , version 1

Cite

Farid Arthaud, Rachel Bawden, Alexandra Birch. Few-shot learning through contextual data augmentation. EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Kiev / Virtual, Ukraine. ⟨hal-03121971⟩
137 views
180 downloads
