There is a very rich literature proposing Bayesian approaches for clustering starting with a prior probability distribution on partitions. Most approaches assume exchangeability, leading to simple representations in terms of Exchangeable Partition Probability Functions (EPPF). Gibbs-type priors encompass a broad class of such cases, including Dirichlet and Pitman-Yor processes. Even though there have been some proposals to relax the exchangeability assumption, allowing covariate-dependence and partial exchangeability, limited consideration has been given on how to include concrete prior knowledge on the partition. For example, we are motivated by an epidemiological application, in which we wish to cluster birth defects into groups and we have prior knowledge of an initial clustering provided by experts. As a general approach for including such prior knowledge, we propose a Centered Partition (CP) process that modifies the EPPF to favor partitions close to an initial one. Some properties of the CP prior are described, a general algorithm for posterior computation is developed, and we illustrate the methodology through simulation examples and an application to the motivating epidemiology study of birth defects.

Rigon, T., Aliverti, E., Russo, M., Scarpa, B. (2021). Contributed discussion on: "Centered Partition Processes: informative priors for clustering". BAYESIAN ANALYSIS, 16(1 (March 2021)), 348-350 [10.1214/20-BA1197].

Contributed discussion on: "Centered Partition Processes: informative priors for clustering"

Rigon, Tommaso;Scarpa, Bruno
2021

Abstract

There is a very rich literature proposing Bayesian approaches for clustering starting with a prior probability distribution on partitions. Most approaches assume exchangeability, leading to simple representations in terms of Exchangeable Partition Probability Functions (EPPF). Gibbs-type priors encompass a broad class of such cases, including Dirichlet and Pitman-Yor processes. Even though there have been some proposals to relax the exchangeability assumption, allowing covariate-dependence and partial exchangeability, limited consideration has been given on how to include concrete prior knowledge on the partition. For example, we are motivated by an epidemiological application, in which we wish to cluster birth defects into groups and we have prior knowledge of an initial clustering provided by experts. As a general approach for including such prior knowledge, we propose a Centered Partition (CP) process that modifies the EPPF to favor partitions close to an initial one. Some properties of the CP prior are described, a general algorithm for posterior computation is developed, and we illustrate the methodology through simulation examples and an application to the motivating epidemiology study of birth defects.
Articolo in rivista - Contributo a Forum/Dibattito, Introduzione
Bayesian clustering; Bayesian nonparametrics; centered process; Dirichlet Process; exchangeable probability partition function; mixture model; product partition model;
English
13-feb-2020
2021
16
1 (March 2021)
348
350
none
Rigon, T., Aliverti, E., Russo, M., Scarpa, B. (2021). Contributed discussion on: "Centered Partition Processes: informative priors for clustering". BAYESIAN ANALYSIS, 16(1 (March 2021)), 348-350 [10.1214/20-BA1197].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/314161
Citazioni
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 9
Social impact