Motivation: Understanding how bacterial species relate to clinical health indicators can reveal microbiome signatures of disease, offering insights into conditions such as obesity or liver disease. However, analyzing such data requires methods that address compositionality, high dimensionality, sparsity, and outliers. Results: We tackle the challenge of identifying microbiome components linked to health indicators through a robust multivariate compositional regression model. Our method addresses the high dimensionality, sparsity, and compositional nature of microbiome data while maintaining control of the false discovery rate (FDR). By incorporating outlier robustness and a derandomization step, we enhance the stability and reproducibility of results, surpassing current techniques like the Multi-Response Knockoff Filter (MRKF). In simulation studies, our method outperforms MRKF in terms of FDR control, power, and robustness. In real data applications, it leads to valuable biological insights, such as identifying microbial species associated with specific clinical parameters.

Monti, G., Pujolassos, M., Calle Rosingana, M., Filzmoser, P. (2025). Robust multivariate regression controlling false discoveries for microbiome data. BIOINFORMATICS, 41(9) [10.1093/bioinformatics/btaf506].

Robust multivariate regression controlling false discoveries for microbiome data

Monti, G S
;
2025

Abstract

Motivation: Understanding how bacterial species relate to clinical health indicators can reveal microbiome signatures of disease, offering insights into conditions such as obesity or liver disease. However, analyzing such data requires methods that address compositionality, high dimensionality, sparsity, and outliers. Results: We tackle the challenge of identifying microbiome components linked to health indicators through a robust multivariate compositional regression model. Our method addresses the high dimensionality, sparsity, and compositional nature of microbiome data while maintaining control of the false discovery rate (FDR). By incorporating outlier robustness and a derandomization step, we enhance the stability and reproducibility of results, surpassing current techniques like the Multi-Response Knockoff Filter (MRKF). In simulation studies, our method outperforms MRKF in terms of FDR control, power, and robustness. In real data applications, it leads to valuable biological insights, such as identifying microbial species associated with specific clinical parameters.
Articolo in rivista - Articolo scientifico
e-values; false discovery rate (FDR); high-dimensional multivariate regression; knockoffs; log-ratio transformation; microbial feature selection; robustness; variable selection
English
17-set-2025
2025
41
9
btaf506
none
Monti, G., Pujolassos, M., Calle Rosingana, M., Filzmoser, P. (2025). Robust multivariate regression controlling false discoveries for microbiome data. BIOINFORMATICS, 41(9) [10.1093/bioinformatics/btaf506].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/568083
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact