Discovering the models explaining the hidden relationship between genetic material and tumor pathologies is one of the most important open challenges in biology and medicine. Given the large amount of data made available by the DNA Microarray technique, Machine Learning is becoming a popular tool for this kind of investigations. In the last few years, we have been particularly involved in the study of Genetic Programming for mining large sets of biomedical data. In this paper, we present a comparison between four variants of Genetic Programming for the classification of two different oncologic datasets: the first one contains data from healthy colon tissues and colon tissues affected by cancer; the second one contains data from patients affected by two kinds of leukemia (acute myeloid leukemia and acute lymphoblastic leukemia). We report experimental results obtained using two different fitness criteria: the receiver operating characteristic and the percentage of correctly classified instances. These results, and their comparison with the ones obtained by three nonevolutionary Machine Learning methods (Support Vector Machines, MultiBoosting, and Random Forests) on the same data, seem to hint that Genetic Programming is a promising technique for this kind of classification.

Vanneschi, L., Archetti, F., Castelli, M., Giordani, I. (2009). Classification of Oncologic Data with Genetic Programming. JOURNAL OF ARTIFICIAL EVOLUTION AND APPLICATIONS, 2009 [10.1155/2009/848532].

Classification of Oncologic Data with Genetic Programming

Vanneschi, L;Archetti, FA;Castelli, M;Giordani, I
2009

Abstract

Discovering the models explaining the hidden relationship between genetic material and tumor pathologies is one of the most important open challenges in biology and medicine. Given the large amount of data made available by the DNA Microarray technique, Machine Learning is becoming a popular tool for this kind of investigations. In the last few years, we have been particularly involved in the study of Genetic Programming for mining large sets of biomedical data. In this paper, we present a comparison between four variants of Genetic Programming for the classification of two different oncologic datasets: the first one contains data from healthy colon tissues and colon tissues affected by cancer; the second one contains data from patients affected by two kinds of leukemia (acute myeloid leukemia and acute lymphoblastic leukemia). We report experimental results obtained using two different fitness criteria: the receiver operating characteristic and the percentage of correctly classified instances. These results, and their comparison with the ones obtained by three nonevolutionary Machine Learning methods (Support Vector Machines, MultiBoosting, and Random Forests) on the same data, seem to hint that Genetic Programming is a promising technique for this kind of classification.
Articolo in rivista - Articolo scientifico
Classification, Genetic Programming, Oncological Data, Feature Selection
English
13
Vanneschi, L., Archetti, F., Castelli, M., Giordani, I. (2009). Classification of Oncologic Data with Genetic Programming. JOURNAL OF ARTIFICIAL EVOLUTION AND APPLICATIONS, 2009 [10.1155/2009/848532].
Vanneschi, L; Archetti, F; Castelli, M; Giordani, I
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/7725
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact