In this work, we study the influence of the population size on the learning ability of Geometric Semantic Genetic Programming for the task of symbolic regression. A large set of experiments, considering different population size values on different regression problems, has been performed. Results show that, on real-life problems, having small populations results in a better training fitness with respect to the use of large populations after the same number of fitness evaluations. However, performance on the test instances varies among the different problems: in datasets with a high number of features, models obtained with large populations present a better performance on unseen data, while in datasets characterized by a relative small number of variables a better generalization ability is achieved by using small population size values. When synthetic problems are taken into account, large population size values represent the best option for achieving good quality solutions on both training and test instances

Castelli, M., Manzoni, L., Silva, S., Vanneschi, L., Popovič, A. (2017). The influence of population size in geometric semantic GP. SWARM AND EVOLUTIONARY COMPUTATION, 32, 110-120 [10.1016/j.swevo.2016.05.004].

The influence of population size in geometric semantic GP

Castelli, Mauro
;
Manzoni, Luca;Silva, Sara;Vanneschi, Leonardo;
2017

Abstract

In this work, we study the influence of the population size on the learning ability of Geometric Semantic Genetic Programming for the task of symbolic regression. A large set of experiments, considering different population size values on different regression problems, has been performed. Results show that, on real-life problems, having small populations results in a better training fitness with respect to the use of large populations after the same number of fitness evaluations. However, performance on the test instances varies among the different problems: in datasets with a high number of features, models obtained with large populations present a better performance on unseen data, while in datasets characterized by a relative small number of variables a better generalization ability is achieved by using small population size values. When synthetic problems are taken into account, large population size values represent the best option for achieving good quality solutions on both training and test instances
Articolo in rivista - Articolo scientifico
Genetic programming; Population size; Semantics; Computer Science (all); Mathematics (all)
English
2017
32
110
120
none
Castelli, M., Manzoni, L., Silva, S., Vanneschi, L., Popovič, A. (2017). The influence of population size in geometric semantic GP. SWARM AND EVOLUTIONARY COMPUTATION, 32, 110-120 [10.1016/j.swevo.2016.05.004].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/210498
Citazioni
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 8
Social impact