The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions

Résumé

Multiword expressions (MWEs) are known as a "pain in the neck" for NLP due to their idiosyncratic behaviour. While some categories of MWEs have been addressed by many studies, verbal MWEs (VMWEs), such as to take a decision, to break one's heart or to turn off, have been rarely modelled. This is notably due to their syntactic variability, which hinders treating them as " words with spaces ". We describe an initiative meant to bring about substantial progress in understanding, modelling and processing VMWEs. It is a joint effort, carried out within a European research network, to elaborate universal terminologies and annotation guidelines for 18 languages. Its main outcome is a multilingual 5-million-word annotated corpus which underlies a shared task on automatic identification of VMWEs. This paper presents the corpus annotation methodology and outcome, the shared task organisation and the results of the participating systems.
Fichier principal
Vignette du fichier
W17-1704.pdf (278.76 Ko) Télécharger le fichier
W17-7610.pdf (441.44 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01504624 , version 1 (10-04-2017)

Identifiants

  • HAL Id : hal-01504624 , version 1

Citer

Agata Savary, Carlos Ramisch, Silvio Ricardo Cordeiro, Federico Sangati, Veronika Vincze, et al.. The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions. MWE 2017 - Proceedings of the 13th Workshop on Multiword Expressions, Apr 2017, Valencia, Spain. pp.31 - 47. ⟨hal-01504624⟩
774 Consultations
311 Téléchargements

Partager

Gmail Facebook X LinkedIn More