Cancer is the result of mutagenic processes that can be inferred from tumor genomes by analyzing rate spectra of point mutations, or "mutational signatures". Here we present SparseSignatures, a novel framework to extract signatures from somatic point mutation data. Our approach incorporates a user-specified background signature, employs regularization to reduce noise in non-background signatures, uses cross-validation to identify the number of signatures, and is scalable to large datasets. We show that SparseSignatures outperforms current state-of-the-art methods on simulated data using a variety of standard metrics. We then apply SparseSignatures to whole genome sequences of pancreatic and breast tumors, discovering well-differentiated signatures that are linked to known mutagenic mechanisms and are strongly associated with patient clinical features.

Lal, A., Liu, K., Tibshirani, R., Sidow, A., Ramazzotti, D. (2021). De novo mutational signature discovery in tumor genomes using SparseSignatures. PLOS COMPUTATIONAL BIOLOGY, 17(6) [10.1371/journal.pcbi.1009119].

De novo mutational signature discovery in tumor genomes using SparseSignatures

Ramazzotti, Daniele
Ultimo
2021

Abstract

Cancer is the result of mutagenic processes that can be inferred from tumor genomes by analyzing rate spectra of point mutations, or "mutational signatures". Here we present SparseSignatures, a novel framework to extract signatures from somatic point mutation data. Our approach incorporates a user-specified background signature, employs regularization to reduce noise in non-background signatures, uses cross-validation to identify the number of signatures, and is scalable to large datasets. We show that SparseSignatures outperforms current state-of-the-art methods on simulated data using a variety of standard metrics. We then apply SparseSignatures to whole genome sequences of pancreatic and breast tumors, discovering well-differentiated signatures that are linked to known mutagenic mechanisms and are strongly associated with patient clinical features.
Articolo in rivista - Articolo scientifico
Cancer; Mutational Signatures; Non-negative matrix factorization;
English
28-giu-2021
2021
17
6
e1009119
reserved
Lal, A., Liu, K., Tibshirani, R., Sidow, A., Ramazzotti, D. (2021). De novo mutational signature discovery in tumor genomes using SparseSignatures. PLOS COMPUTATIONAL BIOLOGY, 17(6) [10.1371/journal.pcbi.1009119].
File in questo prodotto:
File Dimensione Formato  
journal.pcbi.1009119.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 3.99 MB
Formato Adobe PDF
3.99 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/318909
Citazioni
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 13
Social impact