Mining redescriptions with Siren - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue ACM Transactions on Knowledge Discovery from Data (TKDD) Année : 2018

Mining redescriptions with Siren

Esther Galbrun

Résumé

In many areas of science, scientists need to find distinct common characterizations of the same objects and, vice versa, to identify sets of objects that admit multiple shared descriptions. For example, in biology, an important task is to identify the bioclimatic constraints that allow some species to survive, that is, to describe geographical regions both in terms of the fauna that inhabits them and of their bioclimatic conditions. In data analysis, the task of automatically generating such alternative characterizations is called redescription mining. If a domain expert wants to use redescription mining in his research, merely being able to find redescriptions is not enough. He must also be able to understand the redescriptions found, adjust them to better match his domain knowledge, test alternative hypotheses with them, and guide the mining process towards results he considers interesting. To facilitate these goals, we introduce Siren, an interactive tool for mining and visualizing redescriptions. Siren allows to obtain redescriptions in an anytime fashion through efficient, distributed mining, to examine the results in various linked visualizations, to interact with the results either directly or via the visualizations, and to guide the mining algorithm toward specific redescriptions. In this paper, we explain the features of Siren and why they are useful for redescription mining. We also propose two novel redescription mining algorithms that improve the generalizability of the results compared to the existing ones.
Fichier principal
Vignette du fichier
GM18_mining.pdf (2.96 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01399213 , version 1 (25-05-2018)

Identifiants

Citer

Esther Galbrun, Pauli Miettinen. Mining redescriptions with Siren. ACM Transactions on Knowledge Discovery from Data (TKDD), 2018, 12 (1), pp.6:1--6:30. ⟨10.1145/3007212⟩. ⟨hal-01399213⟩
187 Consultations
210 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More