Bicocca Open Archive

The Correlation Clustering problem has been introduced recently [N. Bansal, A. Blum, S. Chawla, Correlation Clustering, in: Proc. 43rd Symp. Foundations of Computer Science, FOCS, 2002, pp. 238-247] as a model for clustering data when a binary relationship between data points is known. More precisely, for each pair of points we have two scores measuring the similarity and dissimilarity respectively, of the two points, and we would like to compute an optimal partition where the value of a partition is obtained by summing up the similarity scores of pairs involving points from the same cluster and the dissimilarity scores of pairs involving points from different clusters. A closely related problem is Consensus Clustering, where we are given a set of partitions and we would like to obtain a partition that best summarizes the input partitions. The latter problem is a restricted case of Correlation Clustering. In this paper we prove that Minimum Consensus Clustering is APX-hard even for three input partitions, answering an open question in the literature, while Maximum Consensus Clustering admits a PTAS. We exhibit a combinatorial and practical frac(4, 5)-approximation algorithm based on a greedy technique for Maximum Consensus Clustering on three partitions. Moreover, we prove that a PTAS exists for Maximum Correlation Clustering when the maximum ratio between two scores is at most a constant. © 2007 Elsevier Inc. All rights reserved.

Bonizzoni, P., DELLA VEDOVA, G., Dondi, R., Jiang, T. (2008). On the Approximation of Correlation Clustering and Consensus Clustering. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 74(5), 671-696 [10.1016/j.jcss.2007.06.024].

On the Approximation of Correlation Clustering and Consensus Clustering

BONIZZONI, PAOLA;DELLA VEDOVA, GIANLUCA;Dondi, R;Jiang, T.

2008

Abstract

The Correlation Clustering problem has been introduced recently [N. Bansal, A. Blum, S. Chawla, Correlation Clustering, in: Proc. 43rd Symp. Foundations of Computer Science, FOCS, 2002, pp. 238-247] as a model for clustering data when a binary relationship between data points is known. More precisely, for each pair of points we have two scores measuring the similarity and dissimilarity respectively, of the two points, and we would like to compute an optimal partition where the value of a partition is obtained by summing up the similarity scores of pairs involving points from the same cluster and the dissimilarity scores of pairs involving points from different clusters. A closely related problem is Consensus Clustering, where we are given a set of partitions and we would like to obtain a partition that best summarizes the input partitions. The latter problem is a restricted case of Correlation Clustering. In this paper we prove that Minimum Consensus Clustering is APX-hard even for three input partitions, answering an open question in the literature, while Maximum Consensus Clustering admits a PTAS. We exhibit a combinatorial and practical frac(4, 5)-approximation algorithm based on a greedy technique for Maximum Consensus Clustering on three partitions. Moreover, we prove that a PTAS exists for Maximum Correlation Clustering when the maximum ratio between two scores is at most a constant. © 2007 Elsevier Inc. All rights reserved.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
			Articolo in rivista - Articolo scientifico
		
	Parole chiave
	
			correlation clustering; consensus clustering; ptas
		
	Lingua del contenuto
	
			English
		
	Data di pubblicazione
	
			2008
		
	Rivista
	
			JOURNAL OF COMPUTER AND SYSTEM SCIENCES
		
	Numero del volume
	
			74
		
	Fascicolo
	
			5
		
	Pagina iniziale
	
			671
		
	Pagina finale
	
			696
		
	DOI dell'articolo
	
			https://dx.doi.org/10.1016/j.jcss.2007.06.024
		
	Fulltext
	
			none
		
	Citazione
	
			Bonizzoni, P., DELLA VEDOVA, G., Dondi, R., Jiang, T. (2008). On the Approximation of Correlation Clustering and Consensus Clustering. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 74(5), 671-696 [10.1016/j.jcss.2007.06.024].
		
	Appare nelle tipologie:
	
			01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/1889

Citazioni

29

25

Social impact