Summary: Having multiple datasets is a key aspect of robust bioinformatics analyses, because it allows researchers to find possible confirmation of the discoveries made on multiple cohorts. For this purpose, Gene Expression Omnibus (GEO) can be a useful database, since it provides hundreds of thousands of microarray gene expression datasets freely available for download and usage. Despite this large availability, collecting prognostic datasets of a specific cancer type from GEO can be a long, time-consuming and energy-consuming activity for any bioinformatician, who needs to execute it manually by first performing a search on the GEO website and then by checking all the datasets found one by one. To solve this problem, we present here geoCancerPrognosticDatasetsRetriever, a Perl 5 application which reads a cancer type and a list of microarray platforms, searches for prognostic gene expression datasets of that cancer type and based on those platforms available on GEO, and returns the GEO accession codes of those datasets, if found. Our bioinformatics tool can easily generate in a few minutes a list of cancer prognostic datasets that otherwise would require numerous hours of manual work to any bioinformatician. geoCancerPrognosticDatasetsRetriever can handily retrieve multiple prognostic datasets of gene expression of any cancer type, laying the foundations for numerous bioinformatics studies and meta-analyses that can have a strong impact on oncology research.

Alameer, A., Chicco, D. (2022). geoCancerPrognosticDatasetsRetriever: A bioinformatics tool to easily identify cancer prognostic datasets on Gene Expression Omnibus (GEO). BIOINFORMATICS, 38(6), 1761-1763 [10.1093/bioinformatics/btab852].

geoCancerPrognosticDatasetsRetriever: A bioinformatics tool to easily identify cancer prognostic datasets on Gene Expression Omnibus (GEO)

Chicco, D
2022

Abstract

Summary: Having multiple datasets is a key aspect of robust bioinformatics analyses, because it allows researchers to find possible confirmation of the discoveries made on multiple cohorts. For this purpose, Gene Expression Omnibus (GEO) can be a useful database, since it provides hundreds of thousands of microarray gene expression datasets freely available for download and usage. Despite this large availability, collecting prognostic datasets of a specific cancer type from GEO can be a long, time-consuming and energy-consuming activity for any bioinformatician, who needs to execute it manually by first performing a search on the GEO website and then by checking all the datasets found one by one. To solve this problem, we present here geoCancerPrognosticDatasetsRetriever, a Perl 5 application which reads a cancer type and a list of microarray platforms, searches for prognostic gene expression datasets of that cancer type and based on those platforms available on GEO, and returns the GEO accession codes of those datasets, if found. Our bioinformatics tool can easily generate in a few minutes a list of cancer prognostic datasets that otherwise would require numerous hours of manual work to any bioinformatician. geoCancerPrognosticDatasetsRetriever can handily retrieve multiple prognostic datasets of gene expression of any cancer type, laying the foundations for numerous bioinformatics studies and meta-analyses that can have a strong impact on oncology research.
Articolo in rivista - Articolo scientifico
Computational Biology; Gene Expression; Gene Expression Profiling; Humans; Neoplasms; Prognosis
English
22-dic-2021
2022
38
6
1761
1763
reserved
Alameer, A., Chicco, D. (2022). geoCancerPrognosticDatasetsRetriever: A bioinformatics tool to easily identify cancer prognostic datasets on Gene Expression Omnibus (GEO). BIOINFORMATICS, 38(6), 1761-1763 [10.1093/bioinformatics/btab852].
File in questo prodotto:
File Dimensione Formato  
Alameer-2022-Bioinformatics-VoR.pdf

Solo gestori archivio

Descrizione: Article
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 230.57 kB
Formato Adobe PDF
230.57 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/430241
Citazioni
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 13
Social impact