Text-indexing structures provide significant advantages in the solution of many problems related to string analysis and comparison, and are nowadays widely used in the analysis of biological sequences. In this paper, we present some applications of affix trees to problems of exact and approximate pattern matching and discovery in RNA sequences. By allowing bidirectional search for symmetric patterns in the sequences, affix trees permit to discover and locate in the sequences patterns describing not only sequence regions, but also containing information about the secondary structure that a given region could form, with improvements in terms of theoretical and practical efficiency over the existing methods. The search can be either exact or approximate, where the approximation can be defined simultaneously both for the sequence and the structure of patterns. The approach presented in this paper could provide significant help in the analysis of RNA sequences, where the functional motifs often involve not only sequence, but also the structural constraints. © 2005 Elsevier B.V. All rights reserved.

Pavesi, G., Mauri, G. (2005). Algorithms for Pattern Matching and Discovery In Rna Secondary Structure. THEORETICAL COMPUTER SCIENCE, 335(1), 29-51 [10.1016/j.tcs.2004.12.015].

Algorithms for Pattern Matching and Discovery In Rna Secondary Structure

MAURI, GIANCARLO
2005

Abstract

Text-indexing structures provide significant advantages in the solution of many problems related to string analysis and comparison, and are nowadays widely used in the analysis of biological sequences. In this paper, we present some applications of affix trees to problems of exact and approximate pattern matching and discovery in RNA sequences. By allowing bidirectional search for symmetric patterns in the sequences, affix trees permit to discover and locate in the sequences patterns describing not only sequence regions, but also containing information about the secondary structure that a given region could form, with improvements in terms of theoretical and practical efficiency over the existing methods. The search can be either exact or approximate, where the approximation can be defined simultaneously both for the sequence and the structure of patterns. The approach presented in this paper could provide significant help in the analysis of RNA sequences, where the functional motifs often involve not only sequence, but also the structural constraints. © 2005 Elsevier B.V. All rights reserved.
Articolo in rivista - Articolo scientifico
Text-indexing; affix trees; pattern discovery
English
2005
335
1
29
51
none
Pavesi, G., Mauri, G. (2005). Algorithms for Pattern Matching and Discovery In Rna Secondary Structure. THEORETICAL COMPUTER SCIENCE, 335(1), 29-51 [10.1016/j.tcs.2004.12.015].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/2552
Citazioni
  • Scopus 25
  • ???jsp.display-item.citation.isi??? 19
Social impact