Tweet Data Mining: the Cultural Microblog Contextualization Data Set - Université Toulouse III - Paul Sabatier - Toulouse INP Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Tweet Data Mining: the Cultural Microblog Contextualization Data Set

Résumé

This paper presents an overview of the data set that was used for the Cultural Microblog Contextualization Workshop at CLEF 2016 and more specifically for the task 1: tweet contextualization. In this paper we first present a descriptive analysis of the data: we consider the variables or features associated with the tweets and analyse them. Then we also analyse the tweet textual content. The results of this work correspond to a first step toward data quality checking. It can also useful in order to understand better the data and its usefulness for some tasks or case studies.
Fichier principal
Vignette du fichier
chaham_18770.pdf (901.96 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01671365 , version 1 (22-12-2017)

Identifiants

  • HAL Id : hal-01671365 , version 1
  • OATAO : 18770

Citer

Yassine Rkha Chaham, Clémentine Scohy, Sébastien Dejean, Josiane Mothe. Tweet Data Mining: the Cultural Microblog Contextualization Data Set. Conference and Labs of the Evaluation forum (CLEF 2016), Sep 2016, Evora, Portugal. pp. 1246-1259. ⟨hal-01671365⟩
113 Consultations
63 Téléchargements

Partager

Gmail Facebook X LinkedIn More