Assessing homogeneity of distributions is an old problem that has received considerable attention, especially in the nonparametric Bayesian literature. To this effect, we propose the semi-hierarchical Dirichlet process, a novel hierarchical prior that extends the hierarchical Dirichlet process of Teh et al. (2006) and that avoids the degeneracy issues of nested processes recently described by Camerlenghi et al. (2019a). We go beyond the simple yes/no answer to the homogeneity question and embed the proposed prior in a random partition model; this procedure allows us to give a more comprehensive response to the above question and in fact find groups of populations that are internally homogeneous when I ≥ 2 such populations are considered. We study theoretical properties of the semihierarchical Dirichlet process and of the Bayes factor for the homogeneity test when I = 2. Extensive simulation studies and applications to educational data are also discussed.

Beraha, M., Guglielmi, A., Quintana, F. (2021). The Semi-Hierarchical Dirichlet Process and Its Application to Clustering Homogeneous Distributions*. BAYESIAN ANALYSIS, 16(4), 1187-1219 [10.1214/21-BA1278].

The Semi-Hierarchical Dirichlet Process and Its Application to Clustering Homogeneous Distributions*

Beraha, M;
2021

Abstract

Assessing homogeneity of distributions is an old problem that has received considerable attention, especially in the nonparametric Bayesian literature. To this effect, we propose the semi-hierarchical Dirichlet process, a novel hierarchical prior that extends the hierarchical Dirichlet process of Teh et al. (2006) and that avoids the degeneracy issues of nested processes recently described by Camerlenghi et al. (2019a). We go beyond the simple yes/no answer to the homogeneity question and embed the proposed prior in a random partition model; this procedure allows us to give a more comprehensive response to the above question and in fact find groups of populations that are internally homogeneous when I ≥ 2 such populations are considered. We study theoretical properties of the semihierarchical Dirichlet process and of the Bayes factor for the homogeneity test when I = 2. Extensive simulation studies and applications to educational data are also discussed.
Articolo in rivista - Articolo scientifico
Bayes factors; Bayesian nonparametrics; Homogeneity test; Partial exchangeability; Posterior consistency;
English
2021
16
4
1187
1219
reserved
Beraha, M., Guglielmi, A., Quintana, F. (2021). The Semi-Hierarchical Dirichlet Process and Its Application to Clustering Homogeneous Distributions*. BAYESIAN ANALYSIS, 16(4), 1187-1219 [10.1214/21-BA1278].
File in questo prodotto:
File Dimensione Formato  
Beraha-2021-Bayesian Analysis-VoR.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 727.57 kB
Formato Adobe PDF
727.57 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/545391
Citazioni
  • Scopus 12
  • ???jsp.display-item.citation.isi??? 11
Social impact