Pattern discovery in biological sequences is the problem of finding patterns that are overrepresented in a set of unaligned DNA or protein sequences of related biological function. Such patterns could correspond to regions of the sequences responsible for the function itself, and could be used later for the functional annotation of newly determined sequences. Despite many studies, this problem can be considered far from being solved. The main difficulty lies in the fact that significant patterns can appear within each sequence with mutations, insertions or deletions of nucleotides or amino acids, without losing their biological function. This paper provides a survey of a number of existing pattern discovery algorithms, focusing both on the methods underlying them and their availability for the scientific community.

Pavesi, G., Mauri, G., Pesole, G. (2001). Methods for pattern discovery in unaligned biological sequences. BRIEFINGS IN BIOINFORMATICS, 2(4), 417-430.

Methods for pattern discovery in unaligned biological sequences

MAURI, GIANCARLO;
2001

Journal article - Scientific article
pattern discovery, sequence alignment, bioinformatics
English

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/2563