Barile, F., Lunagómez, S., Nipoti, B. (2025). Bayesian Nonparametric Modeling of Heterogeneous Populations of Networks. BAYESIAN ANALYSIS [10.1214/26-BA1588].

Bayesian Nonparametric Modeling of Heterogeneous Populations of Networks

Barile, F. (first author); Lunagómez, S.; Nipoti, B.
2025

Abstract

The increasing availability of multiple network data has highlighted the need for statistical models for heterogeneous populations of networks. A convenient framework makes use of metrics to measure similarity between networks. In this context, we propose a novel Bayesian nonparametric model that identifies clusters of networks characterized by similar connectivity patterns. Our approach relies on a location-scale Dirichlet process mixture of centered Erdős–Rényi kernels, with components parametrized by a unique network representative, or mode, and a univariate measure of dispersion around the mode. We demonstrate that this model has full support in the Kullback–Leibler sense and is strongly consistent. An efficient Markov chain Monte Carlo scheme is proposed for posterior inference and clustering of multiple network data. The performance of the model is validated through extensive simulation studies, showing improvements over state-of-the-art methods. Additionally, we present a heuristic strategy to extend the application of the proposed model to datasets with a large number of nodes. We illustrate our approach with the analysis of human brain network data.
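To make the kernel in the abstract concrete, the following is a minimal sketch of a single centered Erdős–Rényi component. It assumes the usual definition of that distribution, in which every dyad of the mode is flipped independently with probability alpha, so that the probability of a graph depends only on its Hamming distance from the mode. The abstract names the kernel but does not spell out this definition, so it is an assumption here, and the function names (`sample_cer`, `hamming_distance`, `cer_log_pmf`) and the 4-node example are hypothetical. The sketch is not the authors' implementation and does not cover the Dirichlet process mixture or the MCMC scheme.

```python
import math
import random
from itertools import combinations


def sample_cer(mode_edges, n_nodes, alpha, rng):
    """Draw one undirected graph by flipping each dyad of the mode with probability alpha."""
    sampled = set()
    for dyad in combinations(range(n_nodes), 2):  # dyads (i, j) with i < j
        in_mode = dyad in mode_edges
        flip = rng.random() < alpha
        if in_mode != flip:  # edge is present iff exactly one of the two holds
            sampled.add(dyad)
    return sampled


def hamming_distance(edges_a, edges_b):
    """Number of dyads on which two graphs on the same node set disagree."""
    return len(edges_a ^ edges_b)


def cer_log_pmf(edges, mode_edges, n_nodes, alpha):
    """Log-probability of a graph under the assumed CER(mode, alpha) kernel."""
    n_dyads = n_nodes * (n_nodes - 1) // 2
    d = hamming_distance(edges, mode_edges)
    return d * math.log(alpha) + (n_dyads - d) * math.log(1 - alpha)


rng = random.Random(0)
mode = {(0, 1), (1, 2), (2, 3)}  # hypothetical mode: a path on 4 nodes
draws = [sample_cer(mode, 4, 0.1, rng) for _ in range(200)]
mean_distance = sum(hamming_distance(g, mode) for g in draws) / len(draws)
```

Under this assumed definition, alpha plays the role of the univariate dispersion parameter: small values concentrate the draws near the mode, and `mean_distance` gives an empirical check of how far the simulated networks sit from it.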
Journal article - Scientific article
Centered Erdős–Rényi distribution; Consensus subgraph clustering; Dirichlet process; Multiple network data
English
11-Mar-2026
2025
open
Files in this item:

Barile-2025-Bayesian Anal-VoR.pdf
  Open access
  Attachment type: Publisher's Version (Version of Record, VoR)
  License: Creative Commons
  Size: 922.33 kB
  Format: Adobe PDF

Barile-2025-Bayesian Anal.pdf
  Open access
  Description: Supplementary material
  Attachment type: Other attachments
  License: Not specified
  Size: 730.4 kB
  Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/596848