Bicocca Open Archive

Efficiently finding maximal exact matches (MEMs) between a sequence read and a database of genomes is a key first step in read alignment. But until recently, it was unknown how to build a data structure in space that supports efficient MEM finding, where r is the number of runs in the Burrows-Wheeler Transform. In 2021, Rossi et al. showed how to build a small auxiliary data structure called thresholds in addition to the r-index in space. This addition enables efficient MEM finding using the r-index. In this article, we present the tool that implements this solution, which we call MONI. Namely, we give a high-level view of the main components of the data structure and show how the source code can be downloaded, compiled, and used to find MEMs between a set of sequence reads and a set of genomes.

Rossi, M., Oliva, M., Bonizzoni, P., Langmead, B., Gagie, T., Boucher, C. (2022). Finding Maximal Exact Matches Using the r-Index. JOURNAL OF COMPUTATIONAL BIOLOGY, 29(2), 188-194 [10.1089/cmb.2021.0445].

Finding Maximal Exact Matches Using the r-Index

Rossi M.;Oliva M.;Bonizzoni P.;Langmead B.;Gagie T.;Boucher C.

2022

Abstract

Efficiently finding maximal exact matches (MEMs) between a sequence read and a database of genomes is a key first step in read alignment. But until recently, it was unknown how to build a data structure in space that supports efficient MEM finding, where r is the number of runs in the Burrows-Wheeler Transform. In 2021, Rossi et al. showed how to build a small auxiliary data structure called thresholds in addition to the r-index in space. This addition enables efficient MEM finding using the r-index. In this article, we present the tool that implements this solution, which we call MONI. Namely, we give a high-level view of the main components of the data structure and show how the source code can be downloaded, compiled, and used to find MEMs between a set of sequence reads and a set of genomes.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				MEM finding; r-index; run-length-encoded Burrows-Wheeler transform; thresholds;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				17-gen-2022
			
	Data di pubblicazione
	
				2022
			
	Rivista
	
				JOURNAL OF COMPUTATIONAL BIOLOGY
			
	Numero del volume
	
				29
			
	Fascicolo
	
				2
			
	Pagina iniziale
	
				188
			
	Pagina finale
	
				194
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1089/cmb.2021.0445
			
	Fulltext
	
				none
			
	Citazione
	
				Rossi, M., Oliva, M., Bonizzoni, P., Langmead, B., Gagie, T., Boucher, C. (2022). Finding Maximal Exact Matches Using the r-Index. JOURNAL OF COMPUTATIONAL BIOLOGY, 29(2), 188-194 [10.1089/cmb.2021.0445].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/362759

Citazioni

8

6

Social impact