The reconstruction of the haplotype pair for each chromosome is a hot topic in Bioinformatics and Genome Analysis. In Haplotype Assembly (HA), all heterozygous Single Nucleotide Polymorphisms (SNPs) have to be assigned to exactly one of the two chromosomes. In this work, we outline the state-of-the-art on HA approaches and present an in-depth analysis of the computational performance of GenHap, a recent method based on Genetic Algorithms. GenHap was designed to tackle the computational complexity of the HA problem by means of a divide-et-impera strategy that effectively leverages multi-core architectures. In order to evaluate GenHap’s performance, we generated different instances of synthetic (yet realistic) data exploiting empirical error models of four different sequencing platforms (namely, Illumina NovaSeq, Roche/454, PacBio RS II and Oxford Nanopore Technologies MinION). Our results show that the processing time generally decreases along with the read length, involving a lower number of sub-problems to be distributed on multiple cores.

Tangherloni, A., Rundo, L., Spolaor, S., Nobile, M., Merelli, I., Besozzi, D., et al. (2019). High performance computing for haplotyping: Models and platforms. In Euro-Par 2018: Parallel Processing Workshops. Euro-Par 2018 International Workshop. Turin, Italy, August 27-28 2018. Revised Selected Papers (pp.650-661). Cham : Springer Verlag [10.1007/978-3-030-10549-5_51].

High performance computing for haplotyping: Models and platforms

Tangherloni, Andrea;Rundo, Leonardo;Spolaor, Simone;Nobile, Marco S.;Merelli, Ivan;Besozzi, Daniela;Mauri, Giancarlo;Cazzaniga, Paolo;
2019

Abstract

The reconstruction of the haplotype pair for each chromosome is a hot topic in Bioinformatics and Genome Analysis. In Haplotype Assembly (HA), all heterozygous Single Nucleotide Polymorphisms (SNPs) have to be assigned to exactly one of the two chromosomes. In this work, we outline the state-of-the-art on HA approaches and present an in-depth analysis of the computational performance of GenHap, a recent method based on Genetic Algorithms. GenHap was designed to tackle the computational complexity of the HA problem by means of a divide-et-impera strategy that effectively leverages multi-core architectures. In order to evaluate GenHap’s performance, we generated different instances of synthetic (yet realistic) data exploiting empirical error models of four different sequencing platforms (namely, Illumina NovaSeq, Roche/454, PacBio RS II and Oxford Nanopore Technologies MinION). Our results show that the processing time generally decreases along with the read length, involving a lower number of sub-problems to be distributed on multiple cores.
slide + paper
Future-generation sequencing; Genome Analysis; Haplotype Assembly; High Performance Computing; Master-Slave paradigm
English
Euro-Par 2018 - Parallel Processing Workshops
2018
Mencagli, G; Heras, DB; Cardellini, V; Casalicchio, E; Jeannot, E; Wolf, F; Salis, A; Schifanella, C; Manumachu, RR; Ricci, L; Beccuti, M; Antonelli, L; Garcia Sanchez, JD; Scott, SL
Euro-Par 2018: Parallel Processing Workshops. Euro-Par 2018 International Workshop. Turin, Italy, August 27-28 2018. Revised Selected Papers
9783030105488
2019
11339
650
661
none
Tangherloni, A., Rundo, L., Spolaor, S., Nobile, M., Merelli, I., Besozzi, D., et al. (2019). High performance computing for haplotyping: Models and platforms. In Euro-Par 2018: Parallel Processing Workshops. Euro-Par 2018 International Workshop. Turin, Italy, August 27-28 2018. Revised Selected Papers (pp.650-661). Cham : Springer Verlag [10.1007/978-3-030-10549-5_51].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/214344
Citazioni
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 1
Social impact