Single-cell biology has revolutionized the way we understand biological processes. In this paper, we provide a more tailored approach to clustering and analyzing spatial single-cell data coming from immunofluorescence imaging techniques. We propose Bayesian Reduction for Amplified Quantization in UMAP Embedding (BRAQUE) as an integrative novel approach, from data preprocessing to phenotype classification. BRAQUE starts with an innovative preprocessing, named Lognormal Shrinkage, which is able to enhance input fragmentation by fitting a lognormal mixture model and shrink each component towards its median, in order to help further the clustering step in finding more separated and clear clusters. Then, BRAQUE’s pipeline consists of a dimensionality reduction step performed using UMAP, and a clustering performed using HDBSCAN on UMAP embedding. In the end, clusters are assigned to a cell type by experts, using effects size measures to rank markers and identify characterizing markers (Tier 1), and possibly characterize markers (Tier 2). The number of total cell types in one lymph node detectable with these technologies is unknown and difficult to predict or estimate. Therefore, with BRAQUE, we achieved a higher granularity than other similar algorithms such as PhenoGraph, following the idea that merging similar clusters is easier than splitting unclear ones into clear subclusters.

Dall'Olio, L., Bolognesi, M., Borghesi, S., Cattoretti, G., Castellani, G. (2023). BRAQUE: Bayesian Reduction for Amplified Quantization in UMAP Embedding. ENTROPY, 25(2) [10.3390/e25020354].

BRAQUE: Bayesian Reduction for Amplified Quantization in UMAP Embedding

Bolognesi, Maddalena
Co-primo
;
Borghesi, Simone
Penultimo
;
Cattoretti, Giorgio
Co-ultimo
;
2023

Abstract

Single-cell biology has revolutionized the way we understand biological processes. In this paper, we provide a more tailored approach to clustering and analyzing spatial single-cell data coming from immunofluorescence imaging techniques. We propose Bayesian Reduction for Amplified Quantization in UMAP Embedding (BRAQUE) as an integrative novel approach, from data preprocessing to phenotype classification. BRAQUE starts with an innovative preprocessing, named Lognormal Shrinkage, which is able to enhance input fragmentation by fitting a lognormal mixture model and shrink each component towards its median, in order to help further the clustering step in finding more separated and clear clusters. Then, BRAQUE’s pipeline consists of a dimensionality reduction step performed using UMAP, and a clustering performed using HDBSCAN on UMAP embedding. In the end, clusters are assigned to a cell type by experts, using effects size measures to rank markers and identify characterizing markers (Tier 1), and possibly characterize markers (Tier 2). The number of total cell types in one lymph node detectable with these technologies is unknown and difficult to predict or estimate. Therefore, with BRAQUE, we achieved a higher granularity than other similar algorithms such as PhenoGraph, following the idea that merging similar clusters is easier than splitting unclear ones into clear subclusters.
Articolo in rivista - Articolo scientifico
Bayesian; cell type; clustering; dimensionality reduction; effect size; Gaussian mixture; lognormal; lymphoid tissue; multiplex immunostaining; single-cell;
English
14-feb-2023
2023
25
2
354
none
Dall'Olio, L., Bolognesi, M., Borghesi, S., Cattoretti, G., Castellani, G. (2023). BRAQUE: Bayesian Reduction for Amplified Quantization in UMAP Embedding. ENTROPY, 25(2) [10.3390/e25020354].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/404556
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
Social impact