Single cell profiling has become a common practice to investigate the complexity of tissues, organs and organisms. Recent technological advances are expanding our capabilities to profile various molecular layers beyond the transcriptome such as, but not limited to, the genome, the epigenome and the proteome. Depending on the experimental procedure, these data can be obtained from separate assays or from the very same cells. Despite development of computational methods for data integration is an active research field, most of the available strategies have been devised for the joint analysis of two modalities and cannot accommodate a high number of them. To solve this problem, we here propose a multiomic data integration framework based on Wasserstein Generative Adversarial Networks (MOWGAN) suitable for the analysis of paired or unpaired data with high number of modalities (>2). At the core of our strategy is a single network trained on all modalities together, limiting the computational burden when many molecular layers are evaluated. Source code of our framework is available at https://github.com/vgiansanti/MOWGAN.

Giansanti, V., Giannese, F., Botrugno, O., Gandolfi, G., Balestrieri, C., Antoniotti, M., et al. (2023). Scalable Integration of Multiomic Single Cell Data Using Generative Adversarial Networks [Altro] [10.1101/2023.06.26.546547].

Scalable Integration of Multiomic Single Cell Data Using Generative Adversarial Networks

Giansanti, Valentina
Primo
;
Balestrieri, Chiara;Antoniotti, Marco;
2023

Abstract

Single cell profiling has become a common practice to investigate the complexity of tissues, organs and organisms. Recent technological advances are expanding our capabilities to profile various molecular layers beyond the transcriptome such as, but not limited to, the genome, the epigenome and the proteome. Depending on the experimental procedure, these data can be obtained from separate assays or from the very same cells. Despite development of computational methods for data integration is an active research field, most of the available strategies have been devised for the joint analysis of two modalities and cannot accommodate a high number of them. To solve this problem, we here propose a multiomic data integration framework based on Wasserstein Generative Adversarial Networks (MOWGAN) suitable for the analysis of paired or unpaired data with high number of modalities (>2). At the core of our strategy is a single network trained on all modalities together, limiting the computational burden when many molecular layers are evaluated. Source code of our framework is available at https://github.com/vgiansanti/MOWGAN.
Altro
Bioinformatics, scRNA, scATAC, AI, Generative Adversarial Networks
English
29-giu-2023
2023
biorXiv preprint.
Giansanti, V., Giannese, F., Botrugno, O., Gandolfi, G., Balestrieri, C., Antoniotti, M., et al. (2023). Scalable Integration of Multiomic Single Cell Data Using Generative Adversarial Networks [Altro] [10.1101/2023.06.26.546547].
none
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/459518
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact