Background: Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manifests a unique theoretical hybridization pattern (a binary fingerprint) to a probe set. It does so without taking into account similarity between training gene sequences or their putative taxonomic classifications, however. We present an improved algorithm for probe set selection that utilizes the available taxonomic information of training gene sequences and attempts to choose probes such that the resultant binary fingerprints cluster into real taxonomic groups.Results: Gene sequences manifesting identical fingerprints with probes chosen by the new algorithm are more likely to be from the same taxonomic group than probes chosen by the previous algorithm. In cases where they are from different taxonomic groups, underlying DNA sequences of identical fingerprints are more similar to each other in probe sets made with the new versus the previous algorithm. Complete removal of large taxonomic groups from training data does not greatly decrease the ability of probe sets to distinguish those groups.Conclusions: Probe sets made from the new algorithm create fingerprints that more reliably cluster into biologically meaningful groups. The method can readily distinguish microbial phylotypes that were excluded from the training sequences, suggesting novel microbes can also be detected. © 2011 Ruegger et al; licensee BioMed Central Ltd.

Ruegger, P., DELLA VEDOVA, G., Jiang, T., Borneman, J. (2011). Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences. BMC BIOINFORMATICS, 12(1), 394 [10.1186/1471-2105-12-394].

Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences

DELLA VEDOVA, GIANLUCA;
2011

Abstract

Background: Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manifests a unique theoretical hybridization pattern (a binary fingerprint) to a probe set. It does so without taking into account similarity between training gene sequences or their putative taxonomic classifications, however. We present an improved algorithm for probe set selection that utilizes the available taxonomic information of training gene sequences and attempts to choose probes such that the resultant binary fingerprints cluster into real taxonomic groups.Results: Gene sequences manifesting identical fingerprints with probes chosen by the new algorithm are more likely to be from the same taxonomic group than probes chosen by the previous algorithm. In cases where they are from different taxonomic groups, underlying DNA sequences of identical fingerprints are more similar to each other in probe sets made with the new versus the previous algorithm. Complete removal of large taxonomic groups from training data does not greatly decrease the ability of probe sets to distinguish those groups.Conclusions: Probe sets made from the new algorithm create fingerprints that more reliably cluster into biologically meaningful groups. The method can readily distinguish microbial phylotypes that were excluded from the training sequences, suggesting novel microbes can also be detected. © 2011 Ruegger et al; licensee BioMed Central Ltd.
Articolo in rivista - Articolo scientifico
probe selection; microbial communities
English
2011
12
1
394
394
none
Ruegger, P., DELLA VEDOVA, G., Jiang, T., Borneman, J. (2011). Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences. BMC BIOINFORMATICS, 12(1), 394 [10.1186/1471-2105-12-394].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/26491
Citazioni
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
Social impact