We propose a model to address the overlooked problem of node clustering in simple hypergraphs. Simple hypergraphs are suitable when a node may not appear multiple times in the same hyperedge, such as in co-authorship datasets. Our model generalizes the stochastic blockmodel for graphs and assumes the existence of latent node groups and hyperedges are conditionally independent given these groups. We first establish the generic identifiability of the model parameters. We then develop a variational approximation Expectation-Maximization algorithm for parameter inference and node clustering, and derive a statistical criterion for model selection. To illustrate the performance of our R package HyperSBM, we compare it with other node clustering methods using synthetic data generated from the model, as well as from a line clustering experiment and a co-authorship dataset.

Brusa, L., Matias, C. (2024). Model‐based clustering in simple hypergraphs through a stochastic blockmodel. SCANDINAVIAN JOURNAL OF STATISTICS [10.1111/sjos.12754].

Model‐based clustering in simple hypergraphs through a stochastic blockmodel

Brusa, Luca
;
2024

Abstract

We propose a model to address the overlooked problem of node clustering in simple hypergraphs. Simple hypergraphs are suitable when a node may not appear multiple times in the same hyperedge, such as in co-authorship datasets. Our model generalizes the stochastic blockmodel for graphs and assumes the existence of latent node groups and hyperedges are conditionally independent given these groups. We first establish the generic identifiability of the model parameters. We then develop a variational approximation Expectation-Maximization algorithm for parameter inference and node clustering, and derive a statistical criterion for model selection. To illustrate the performance of our R package HyperSBM, we compare it with other node clustering methods using synthetic data generated from the model, as well as from a line clustering experiment and a co-authorship dataset.
Articolo in rivista - Articolo scientifico
co-authorship network; high-order interactions; latent variable model; line clustering; non-uniform hypergraph; variational expectation-maximization;
English
17-set-2024
2024
open
Brusa, L., Matias, C. (2024). Model‐based clustering in simple hypergraphs through a stochastic blockmodel. SCANDINAVIAN JOURNAL OF STATISTICS [10.1111/sjos.12754].
File in questo prodotto:
File Dimensione Formato  
Brusa-2024-ScandJStat-VoR.pdf

accesso aperto

Descrizione: This is an open access article under the terms of the Creative Commons Attribution License
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 944.94 kB
Formato Adobe PDF
944.94 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/514979
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact