Bicocca Open Archive

Publishing personal data without giving up privacy is becoming an increasingly important problem in different fields. In the last years, different interesting approaches have been proposed, i.e. k-Anonymity and l-Diversity. Given an input table, these approaches partition its rows so that the computed partition satisfies some constraint, in order to prevent the inference of the individuals the data belong to. Then, the rows in a same set of the partition are related to the same rows by suppressing some of their entries. Here we focus on the l-Diversity problem, where the attributes of the input table are distinguished in sensitive attributes and quasi-identifier attributes. The goal is to partition the rows of the input table, so that each set C of the partition contains at most 1l |C| rows having a specific value in the sensitive attribute, and the number of suppressions is minimized. In this paper we investigate the approximation and parameterized complexity of l-Diversity. First, we prove that the problem is not approximable within factor c ln l, for some constant c > 0, even if the input table consists of only two columns, and that the problem is APX-hard, even if l = 4 and the input table contains exactly three columns. Then we give an approximation algorithm of factor m (where m+1 is the number of columns in the input table), when the sensitive attribute ranges over an alphabet of constant size. Concerning the parameterized complexity, we prove that the problem is W[1]-hard when parameterized by the cost-bound, by l, and by the size of the alphabet. Then we prove that the problemadmits a fixed-parameter algorithm when both the maximum number of different values in a column and the number of columns are parameters.

Dondi, R., Mauri, G., Zoppis, I. (2011). On the complexity of the l-diversity problem. In Proceedings of the 36th International Conference on Mathematical Foundations of Computer Science (pp.266-277). Berlin : Springer [10.1007/978-3-642-22993-0_26].

On the complexity of the l-diversity problem

Dondi, R;MAURI, GIANCARLO;ZOPPIS, ITALO FRANCESCO

2011

Abstract

Publishing personal data without giving up privacy is becoming an increasingly important problem in different fields. In the last years, different interesting approaches have been proposed, i.e. k-Anonymity and l-Diversity. Given an input table, these approaches partition its rows so that the computed partition satisfies some constraint, in order to prevent the inference of the individuals the data belong to. Then, the rows in a same set of the partition are related to the same rows by suppressing some of their entries. Here we focus on the l-Diversity problem, where the attributes of the input table are distinguished in sensitive attributes and quasi-identifier attributes. The goal is to partition the rows of the input table, so that each set C of the partition contains at most 1l |C| rows having a specific value in the sensitive attribute, and the number of suppressions is minimized. In this paper we investigate the approximation and parameterized complexity of l-Diversity. First, we prove that the problem is not approximable within factor c ln l, for some constant c > 0, even if the input table consists of only two columns, and that the problem is APX-hard, even if l = 4 and the input table contains exactly three columns. Then we give an approximation algorithm of factor m (where m+1 is the number of columns in the input table), when the sensitive attribute ranges over an alphabet of constant size. Concerning the parameterized complexity, we prove that the problem is W[1]-hard when parameterized by the cost-bound, by l, and by the size of the alphabet. Then we prove that the problemadmits a fixed-parameter algorithm when both the maximum number of different values in a column and the number of columns are parameters.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				slide + paper
			
	Parole chiave
	
				Data privacy, k-anonymity, ℓ-diversity, privacy-preserving, data publishing, approximation, parameterized complexity
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				MFCS'11 36th International Conference on Mathematical Foundations of Computer Science
			
	Anno del convegno
	
				2011
			
	Curatori della monografia
	
				Murlak, Filip; Sankowski, Piotr
			
	Titolo degli atti
	
				Proceedings of the 36th International Conference on Mathematical Foundations of Computer Science
			
	ISBN del volume degli atti
	
				978-3-642-22992-3
			
	Collana o serie
	
				Lecture Notes in Computer Science
			
	Data di pubblicazione
	
				2011
			
	Numero del volume
	
				6907
			
	Pagina iniziale
	
				266
			
	Pagina finale
	
				277
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1007/978-3-642-22993-0_26
			
	URL alternativo
	
				http://dl.acm.org/citation.cfm?id=2034006.2034034
			
	Fulltext
	
				none
			
	Citazione
	
				Dondi, R., Mauri, G., Zoppis, I. (2011). On the complexity of the l-diversity problem. In Proceedings of the 36th International Conference on Mathematical Foundations of Computer Science (pp.266-277). Berlin : Springer [10.1007/978-3-642-22993-0_26].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/27999

Citazioni

7

5

Social impact