Human microbiome data are becoming extremely common in biomedical research due to the relevant connections with different types of diseases. A widespread discrete distribution to analyze this kind of data is the Dirichletmultinomial. Despite its popularity, this distribution often fails in modeling microbiome data due to the strict parameterization imposed on its covariance matrix. The aim of this work is to propose a new distribution for analyzing microbiome data and to define a regression model based on it. The new distribution can be expressed as a structured finite mixture model with Dirichlet-multinomial components. We illustrate how this mixture structure can improve a microbiome data analysis to cluster patients into “enterotypes”, which are a classification based on the bacteriological composition of gut microbiota. The comparison between the two models is performed through an application to a real gut microbiome dataset.

Ascari, R., Migliorati, S. (2023). A New Regression Model for the Analysis of Microbiome Data. In P. Brito, J.G. Dias, B. Lausen, A. Montanari, R. Nugent (a cura di), Classification and Data Science in the Digital Age (pp. 35-42). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-09034-9_5].

A New Regression Model for the Analysis of Microbiome Data

Ascari, R;Migliorati, S
2023

Abstract

Human microbiome data are becoming extremely common in biomedical research due to the relevant connections with different types of diseases. A widespread discrete distribution to analyze this kind of data is the Dirichletmultinomial. Despite its popularity, this distribution often fails in modeling microbiome data due to the strict parameterization imposed on its covariance matrix. The aim of this work is to propose a new distribution for analyzing microbiome data and to define a regression model based on it. The new distribution can be expressed as a structured finite mixture model with Dirichlet-multinomial components. We illustrate how this mixture structure can improve a microbiome data analysis to cluster patients into “enterotypes”, which are a classification based on the bacteriological composition of gut microbiota. The comparison between the two models is performed through an application to a real gut microbiome dataset.
Capitolo o saggio
Bayesian inference; Count data; Mixture model; Multivariate regression;
English
Classification and Data Science in the Digital Age
Brito, P; Dias, JG; Lausen, B; Montanari, A; Nugent, R
8-dic-2023
2023
9783031090332
Springer Science and Business Media Deutschland GmbH
35
42
Ascari, R., Migliorati, S. (2023). A New Regression Model for the Analysis of Microbiome Data. In P. Brito, J.G. Dias, B. Lausen, A. Montanari, R. Nugent (a cura di), Classification and Data Science in the Digital Age (pp. 35-42). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-09034-9_5].
none
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/453018
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact