Motivation: Recent single-cell DNA sequencing technologies enable whole-genome sequencing of hundreds to thousands of individual cells. However, these technologies have ultra-low sequencing coverage (<0.5x per cell) which has limited their use to the analysis of large copy-number aberrations (CNAs) in individual cells. While CNAs are useful markers in cancer studies, single-nucleotide mutations are equally important, both in cancer studies and in other applications. However, ultra-low coverage sequencing yields single-nucleotide mutation data that are too sparse for current single-cell analysis methods. Results: We introduce SBMClone, a method to infer clusters of cells, or clones, that share groups of somatic single-nucleotide mutations. SBMClone uses a stochastic block model to overcome sparsity in ultra-low coverage single-cell sequencing data, and we show that SBMClone accurately infers the true clonal composition on simulated datasets with coverage at low as 0:2x. We applied SBMClone to single-cell whole-genome sequencing data from two breast cancer patients obtained using two different sequencing technologies. On the first patient, sequenced using the 10X Genomics CNV solution with sequencing coverage ≈0:03x, SBMClone recovers the major clonal composition when incorporating a small amount of additional information. On the second patient, where pre- and post-treatment tumor samples were sequenced using DOP-PCR with sequencing coverage ≈0:5x, SBMClone shows that tumor cells are present in the post-treatment sample, contrary to published analysis of this dataset.

Myers, M., Zaccaria, S., Raphael, B. (2020). Identifying tumor clones in sparse single-cell mutation data. BIOINFORMATICS, 36(supplement 1), 186-193 [10.1093/BIOINFORMATICS/BTAA449].

Identifying tumor clones in sparse single-cell mutation data

Zaccaria S.;
2020

Abstract

Motivation: Recent single-cell DNA sequencing technologies enable whole-genome sequencing of hundreds to thousands of individual cells. However, these technologies have ultra-low sequencing coverage (<0.5x per cell) which has limited their use to the analysis of large copy-number aberrations (CNAs) in individual cells. While CNAs are useful markers in cancer studies, single-nucleotide mutations are equally important, both in cancer studies and in other applications. However, ultra-low coverage sequencing yields single-nucleotide mutation data that are too sparse for current single-cell analysis methods. Results: We introduce SBMClone, a method to infer clusters of cells, or clones, that share groups of somatic single-nucleotide mutations. SBMClone uses a stochastic block model to overcome sparsity in ultra-low coverage single-cell sequencing data, and we show that SBMClone accurately infers the true clonal composition on simulated datasets with coverage at low as 0:2x. We applied SBMClone to single-cell whole-genome sequencing data from two breast cancer patients obtained using two different sequencing technologies. On the first patient, sequenced using the 10X Genomics CNV solution with sequencing coverage ≈0:03x, SBMClone recovers the major clonal composition when incorporating a small amount of additional information. On the second patient, where pre- and post-treatment tumor samples were sequenced using DOP-PCR with sequencing coverage ≈0:5x, SBMClone shows that tumor cells are present in the post-treatment sample, contrary to published analysis of this dataset.
Articolo in rivista - Articolo scientifico
Algorithms; Clone Cells; Genomics; High-Throughput Nucleotide Sequencing; Humans; Mutation; Software; Whole Genome Sequencing
English
2020
36
supplement 1
186
193
open
Myers, M., Zaccaria, S., Raphael, B. (2020). Identifying tumor clones in sparse single-cell mutation data. BIOINFORMATICS, 36(supplement 1), 186-193 [10.1093/BIOINFORMATICS/BTAA449].
File in questo prodotto:
File Dimensione Formato  
Myers-2020-Bioinformatics-VoR.pdf

accesso aperto

Descrizione: CC BY-NC 4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/)
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 683.24 kB
Formato Adobe PDF
683.24 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/508699
Citazioni
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 12
Social impact