Correlation Clustering with Adaptive Similarity Queries

Marco Bressan; Nicolo Cesa-Bianchi; Andrea Paudice; Fabio Vitale

Conference paper, Year: 2019

Marco Bressan

Role: Author
PersonId: 945551

Università degli Studi di Roma "La Sapienza" = Sapienza University [Rome]

Nicolo Cesa-Bianchi

Role: Author

Dipartimento di Scienze dell'Informazione [Milano]

Andrea Paudice

Role: Author

Università degli Studi di Milano = University of Milan

Fabio Vitale

Role: Author

Machine Learning in Information Networks

Abstract

In correlation clustering, we are given $n$ objects together with a binary similarity score between each pair of them. The goal is to partition the objects into clusters so as to minimise the disagreements with the scores. In this work we investigate correlation clustering as an active learning problem: each similarity score can be learned by making a query, and the goal is to minimise both the disagreements and the total number of queries. On the one hand, we describe simple active learning algorithms, which provably achieve an almost optimal trade-off while giving cluster recovery guarantees, and we test them on different datasets. On the other hand, we prove information-theoretic bounds on the number of queries necessary to guarantee a prescribed disagreement bound. These results give a rich characterisation of the trade-off between queries and clustering error.
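To make the query setting concrete, here is a minimal sketch of a classic pivot-based clustering heuristic that learns similarities only through a query oracle and counts how many queries it makes. It is an illustration under stated assumptions, not the algorithm analysed in the paper: the names `pivot_cluster_with_queries`, `query`, and `disagreements` are hypothetical, and the sketch has no query budget or recovery guarantee.

```python
import random


def pivot_cluster_with_queries(n, query, seed=0):
    """Cluster objects 0..n-1 using only calls to query(u, v) -> bool.

    Hypothetical sketch: repeatedly pick a pivot, query it against every
    remaining object, and group the objects reported as similar.
    Returns the clusters and the number of queries made.
    """
    rng = random.Random(seed)
    remaining = list(range(n))
    rng.shuffle(remaining)  # random pivot order
    clusters = []
    num_queries = 0
    while remaining:
        pivot = remaining[0]
        cluster = [pivot]
        rest = []
        for v in remaining[1:]:
            num_queries += 1  # one query per (pivot, v) pair
            if query(pivot, v):
                cluster.append(v)
            else:
                rest.append(v)
        clusters.append(cluster)
        remaining = rest
    return clusters, num_queries


def disagreements(clusters, similar, n):
    """Count pairs whose cluster co-membership contradicts similar(u, v)."""
    label = {v: idx for idx, c in enumerate(clusters) for v in c}
    cost = 0
    for u in range(n):
        for v in range(u + 1, n):
            same_cluster = label[u] == label[v]
            if same_cluster != similar(u, v):
                cost += 1
    return cost


# Assumed toy ground truth: objects 0, 1, 2 form one cluster, 3 and 4 another.
def toy_similarity(u, v):
    return (u < 3) == (v < 3)


clusters, num_queries = pivot_cluster_with_queries(5, toy_similarity)
cost = disagreements(clusters, toy_similarity, 5)
```

On this consistent toy input the sketch recovers both clusters with zero disagreements while making fewer queries than the 10 pairs available, which is the kind of trade-off between queries and clustering error that the abstract refers to.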

Domains

Artificial Intelligence [cs.AI]

Main file

1905.11902.pdf (651.95 KB)

Origin: Files produced by the author(s)

Contributor: Team Magnet

https://inria.hal.science/hal-02376961

Submitted on: Friday, 22 November 2019, 19:16:27

Last modified on: Wednesday, 24 January 2024, 09:54:24

Dates and versions

hal-02376961, version 1 (22-11-2019)

Identifiers

HAL Id: hal-02376961, version 1
arXiv: 1905.11902

Cite

Marco Bressan, Nicolo Cesa-Bianchi, Andrea Paudice, Fabio Vitale. Correlation Clustering with Adaptive Similarity Queries. Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada. ⟨hal-02376961⟩

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-MAGNET UNIV-LILLE
