Navigating the Maze of Wikidata Query Logs - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Navigating the Maze of Wikidata Query Logs

Résumé

This paper provides an in-depth and diversified analysis of the Wikidata query logs, recently made publicly available. Although the usage of Wikidata queries has been the object of recent studies, our analysis of the query traffic reveals interesting and unforeseen findings concerning the usage, types of recursion, and the shape classification of complex recursive queries. Wikidata specific features combined with recursion let us identify a significant subset of the entire corpus that can be used by the community for further assessment. We considered and analyzed the queries across many different dimensions, such as the robotic and organic queries, the presence/absence of constants along with the correctly executed and timed out queries. A further investigation that we pursue in this paper is to find, given a query, a number of queries structurally similar to the given query. We provide a thorough characterization of the queries in terms of their expressive power, their topological structure and shape, along with a deeper understanding of the usage of recursion in these logs. We make the code for the analysis available as open source.

Domaines

Web
Fichier principal
Vignette du fichier
3308558.3313472.pdf (988.34 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02096714 , version 1 (25-09-2020)

Identifiants

Citer

Angela Bonifati, Wim Martens, Thomas Timm. Navigating the Maze of Wikidata Query Logs. WWW 2019 - The World Wide Web Conference, May 2019, San Francisco, United States. pp.127-138, ⟨10.1145/3308558.3313472⟩. ⟨hal-02096714⟩
492 Consultations
518 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More