Estimating the probability density of a population while preserving the privacy of individuals in that population is an important and challenging problem that has received considerable attention in recent years. While the previous literature focused on frequentist approaches, in this article, we propose a Bayesian nonparametric mixture model under differential privacy (DP) and present two Markov chain Monte Carlo (MCMC) algorithms for posterior inference. One is a marginal approach, resembling Neal’s algorithm 5 with a pseudo-marginal Metropolis-Hastings move, and the other is a conditional approach. Although our focus is primarily on local DP, we show that our MCMC algorithms can be easily extended to deal with global differential privacy mechanisms. Moreover, for some carefully chosen mechanisms and mixture kernels, we show how auxiliary parameters can be analytically marginalized, allowing standard MCMC algorithms (i.e., non-privatized, such as Neal’s Algorithm 2) to be efficiently employed. Our approach is general and applicable to any mixture model and privacy mechanism. In several simulations and a real case study, we discuss the performance of our algorithms and evaluate different privacy mechanisms proposed in the frequentist literature. Supplementary materials for this article are available online.

Beraha, M., Favaro, S., Rao, V. (2024). MCMC for Bayesian Nonparametric Mixture Modeling Under Differential Privacy. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1-19 [10.1080/10618600.2024.2410911].

MCMC for Bayesian Nonparametric Mixture Modeling Under Differential Privacy

Beraha, M;
2024

Abstract

Estimating the probability density of a population while preserving the privacy of individuals in that population is an important and challenging problem that has received considerable attention in recent years. While the previous literature focused on frequentist approaches, in this article, we propose a Bayesian nonparametric mixture model under differential privacy (DP) and present two Markov chain Monte Carlo (MCMC) algorithms for posterior inference. One is a marginal approach, resembling Neal’s algorithm 5 with a pseudo-marginal Metropolis-Hastings move, and the other is a conditional approach. Although our focus is primarily on local DP, we show that our MCMC algorithms can be easily extended to deal with global differential privacy mechanisms. Moreover, for some carefully chosen mechanisms and mixture kernels, we show how auxiliary parameters can be analytically marginalized, allowing standard MCMC algorithms (i.e., non-privatized, such as Neal’s Algorithm 2) to be efficiently employed. Our approach is general and applicable to any mixture model and privacy mechanism. In several simulations and a real case study, we discuss the performance of our algorithms and evaluate different privacy mechanisms proposed in the frequentist literature. Supplementary materials for this article are available online.
Articolo in rivista - Articolo scientifico
Data augmentation; Dirichlet process; Pseudo-marginal MCMC;
English
19-nov-2024
2024
1
19
open
Beraha, M., Favaro, S., Rao, V. (2024). MCMC for Bayesian Nonparametric Mixture Modeling Under Differential Privacy. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1-19 [10.1080/10618600.2024.2410911].
File in questo prodotto:
File Dimensione Formato  
Beraha-2024-Journal of Computational and Graphical Statistics-Preprint.pdf

accesso aperto

Descrizione: Depositato in arXiv
Tipologia di allegato: Submitted Version (Pre-print)
Licenza: Altro
Dimensione 876.95 kB
Formato Adobe PDF
876.95 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/545386
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact