The annotation of genomic information is a major challenge in biology and bioinformatics. Existing databases of known gene functions are incomplete and prone to errors, and the bimolecular experiments needed to improve these databases are slow and costly. While computational methods are not a substitute for experimental verification, they can help in two ways: Algorithms can aid in the curation of gene anno- Tations by automatically suggesting inaccuracies, and they can predict previously-unidentified gene functions, acceler- Ating the rate of gene function discovery. In this work, we develop an algorithm that achieves both goals using deep autoencoder neural networks. With experiments on gene annotation data from the Gene Ontology project, we show that deep autoencoder networks achieve better performance than other standard machine learning methods, including the popular truncated singular value decomposition.

Chicco, D., Sadowski, P., Baldi, P. (2014). Deep Autoencoder Neural Networks for Gene Ontology Annotation Predictions. In ACM BCB '14 Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (pp.533-540). Association for Computing Machinery [10.1145/2649387.2649442].

Deep Autoencoder Neural Networks for Gene Ontology Annotation Predictions

Chicco, D
;
2014

Abstract

The annotation of genomic information is a major challenge in biology and bioinformatics. Existing databases of known gene functions are incomplete and prone to errors, and the bimolecular experiments needed to improve these databases are slow and costly. While computational methods are not a substitute for experimental verification, they can help in two ways: Algorithms can aid in the curation of gene anno- Tations by automatically suggesting inaccuracies, and they can predict previously-unidentified gene functions, acceler- Ating the rate of gene function discovery. In this work, we develop an algorithm that achieves both goals using deep autoencoder neural networks. With experiments on gene annotation data from the Gene Ontology project, we show that deep autoencoder networks achieve better performance than other standard machine learning methods, including the popular truncated singular value decomposition.
paper
Autoencoders; Biomolecular annotations; Gene ontology; Matrix-completion; Neural networks; Principal component analysis; Truncated singular value decomposition;
English
5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014 - 20 September 2014 through 23 September 2014
2014
ACM BCB '14 Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
9781450328944
2014
533
540
http://dl.acm.org/citation.cfm?id=2649442
reserved
Chicco, D., Sadowski, P., Baldi, P. (2014). Deep Autoencoder Neural Networks for Gene Ontology Annotation Predictions. In ACM BCB '14 Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (pp.533-540). Association for Computing Machinery [10.1145/2649387.2649442].
File in questo prodotto:
File Dimensione Formato  
Chicco-2014-ACM BCB'14-VoR.pdf

Solo gestori archivio

Descrizione: Intervento a convegno
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 514.38 kB
Formato Adobe PDF
514.38 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/435462
Citazioni
  • Scopus 147
  • ???jsp.display-item.citation.isi??? ND
Social impact