Bicocca Open Archive

Motivated by the problem of accurately predicting gap times between successive blood donations, we present here a general class of Bayesian nonparametric models for clustering. These models allow for the prediction of new recurrences, accommodating covariate information that describes the personal characteristics of the sample individuals. We introduce a prior for the random partition of the sample individuals, which encourages two individuals to be co-clustered if they have similar covariate values. Our prior generalizes product partition models with covariates (PPMx) models in the literature, which are defined in terms of cohesion and similarity functions. We assume cohesion functions that yield mixtures of PPMx models, while our similarity functions represent the denseness of a cluster. We show that including covariate information in the prior specification improves the posterior predictive performance and helps interpret the estimated clusters in terms of covariates in the blood donation application.

Argiento, R., Corradin, R., Guglielmi, A., Lanzarone, E. (2024). Clustering blood donors via mixtures of product partition models with covariates. BIOMETRICS, 80(1) [10.1093/biomtc/ujad021].

Clustering blood donors via mixtures of product partition models with covariates

Argiento R.;Corradin R.;Guglielmi A.;Lanzarone E.

2024

Abstract

Motivated by the problem of accurately predicting gap times between successive blood donations, we present here a general class of Bayesian nonparametric models for clustering. These models allow for the prediction of new recurrences, accommodating covariate information that describes the personal characteristics of the sample individuals. We introduce a prior for the random partition of the sample individuals, which encourages two individuals to be co-clustered if they have similar covariate values. Our prior generalizes product partition models with covariates (PPMx) models in the literature, which are defined in terms of cohesion and similarity functions. We assume cohesion functions that yield mixtures of PPMx models, while our similarity functions represent the denseness of a cluster. We show that including covariate information in the prior specification improves the posterior predictive performance and helps interpret the estimated clusters in terms of covariates in the blood donation application.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Bayesian cluster models; blood donations; non-exchangeable prior; prediction; random partition; recurrent events;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				16-feb-2024
			
	Data di pubblicazione
	
				2024
			
	Rivista
	
				BIOMETRICS
			
	Numero del volume
	
				80
			
	Fascicolo
	
				1
			
	Article number
	
				ujad021
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1093/biomtc/ujad021
			
	Fulltext
	
				open
			
	Citazione
	
				Argiento, R., Corradin, R., Guglielmi, A., Lanzarone, E. (2024). Clustering blood donors via mixtures of product partition models with covariates. BIOMETRICS, 80(1) [10.1093/biomtc/ujad021].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Argiento-2024-Biometrics-VoR.pdf accesso aperto Descrizione: This is an Open Access article distributed under the terms of the CreativeCommons Attribution License (https://creativecommons.org/licenses/by/4.0/) Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 1.01 MB Formato Adobe PDF Visualizza/Apri	1.01 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/516822

Citazioni

1

1

Social impact