The Zero Resource Speech Challenge 2021: Spoken language modelling

We present the Zero Resource Speech Challenge 2021, which asks participants to learn a language model directly from audio, without any text or labels. The challenge is based on the Libri-light dataset, which provides up to 60k hours of audio from English audio books without any associated text. We provide a pipeline baseline system consisting on an encoder based on contrastive predictive coding (CPC), a quantizer ($k$-means) and a standard language model (BERT or LSTM). The metrics evaluate the learned representations at the acoustic (ABX discrimination), lexical (spot-the-word), syntactic (acceptability judgment) and semantic levels (similarity judgment). We present an overview of the eight submitted systems from four groups and discuss the main results.

Mots clés

Unsupervised speech Lowresource Language modelling Zero-resource Cognitive benchmarks

Domaines

Informatique et langage [cs.CL] Intelligence artificielle [cs.AI]

Fichier principal

2104.14700 (1).pdf (129.19 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Emmanuel Dupoux : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03329301

Soumis le : lundi 11 octobre 2021-15:39:36

Dernière modification le : lundi 18 mars 2024-10:24:06

Dates et versions

hal-03329301 , version 1 (30-08-2021)

hal-03329301 , version 2 (11-10-2021)

Identifiants

HAL Id : hal-03329301 , version 2
ARXIV : 2104.14700
DOI : 10.1109/TPAMI.2021.3083839

Citer

Ewan Dunbar, Mathieu Bernard, Nicolas Hamilakis, Tu Anh Nguyen, Maureen de Seyssel, et al.. The Zero Resource Speech Challenge 2021: Spoken language modelling. Interspeech 2021 - Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. ⟨10.1109/TPAMI.2021.3083839⟩. ⟨hal-03329301v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS CNRS INRIA EHESS LSCP DEC INRIA2 PSL ANR PRAIRIE-IA

127 Consultations

170 Téléchargements