A Genetic Programming (GP) framework for classification is presented in this paper and applied to a publicly available biomedical microarray dataset representing a collection of expression measurements from colon biopsy experiments [3]. We report experimental results obtained using two different well known fitness criteria: the area under the receiving operating curve (ROC) and the percentage of correctly classified instances (CCI). These results, and their comparison with the ones obtained by three non-evolutionary Machine Learning methods (Support Vector Machines, Voted Perceptron and Random Forests) on the same data, seem to hint that GP is a promising technique for this kind of classification both from the viewpoint of the accuracy of the proposed solutions and of the generalization ability. These results are encouraging and should pave the way to a deeper study of GP for classification applied to biomedical microarray datasets.
Archetti, F., Giordani, I., Vanneschi, L., Castelli, M. (2008). Classification of colon tumor tissues using genetic programming. In Artificial Life and Evolutionary Computation (pp.49-58). Singapore : World Scientific Publ Co. [10.1142/9789814287456_0004].
Classification of colon tumor tissues using genetic programming
ARCHETTI, FRANCESCO ANTONIO;GIORDANI, ILARIA;VANNESCHI, LEONARDO;CASTELLI, MAURO
2008
Abstract
A Genetic Programming (GP) framework for classification is presented in this paper and applied to a publicly available biomedical microarray dataset representing a collection of expression measurements from colon biopsy experiments [3]. We report experimental results obtained using two different well known fitness criteria: the area under the receiving operating curve (ROC) and the percentage of correctly classified instances (CCI). These results, and their comparison with the ones obtained by three non-evolutionary Machine Learning methods (Support Vector Machines, Voted Perceptron and Random Forests) on the same data, seem to hint that GP is a promising technique for this kind of classification both from the viewpoint of the accuracy of the proposed solutions and of the generalization ability. These results are encouraging and should pave the way to a deeper study of GP for classification applied to biomedical microarray datasets.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.