Operator equalisation was recently proposed as a new bloat control technique for genetic programming. By controlling the distribution of program lengths inside the population, it can bias the search towards smaller or larger programs. In this paper we propose a new implementation of operator equalisation and compare it to a previous version, using a hard real-world regression problem where bloat and overfitting are major issues. The results show that both implementations of operator equalisation are completely bloat-free, producing smaller individuals than standard genetic programming, without compromising the generalization ability. We also show that the new implementation of operator equalisation is more efficient and exhibits a more predictable and reliable behavior than the previous version. We advance some arguable ideas regarding the relationship between bloat and overfitting, and support them with our results. Copyright 2009 ACM.

Vanneschi, L., Silva, S. (2009). Operator equalisation, bloat and overfitting: A study on human oral bioavailability prediction. In Proceedings of the 11th Annual Genetic and Evolutionary Computation Conference, GECCO-2009 (pp.1115-1122). New York : ACM [10.1145/1569901.1570051].

Operator equalisation, bloat and overfitting: A study on human oral bioavailability prediction

VANNESCHI, LEONARDO;
2009

Abstract

Operator equalisation was recently proposed as a new bloat control technique for genetic programming. By controlling the distribution of program lengths inside the population, it can bias the search towards smaller or larger programs. In this paper we propose a new implementation of operator equalisation and compare it to a previous version, using a hard real-world regression problem where bloat and overfitting are major issues. The results show that both implementations of operator equalisation are completely bloat-free, producing smaller individuals than standard genetic programming, without compromising the generalization ability. We also show that the new implementation of operator equalisation is more efficient and exhibits a more predictable and reliable behavior than the previous version. We advance some arguable ideas regarding the relationship between bloat and overfitting, and support them with our results. Copyright 2009 ACM.
paper
operator, equalisation, bloat, overfitting, study, human, oral, bioavailability, prediction
English
11th Annual Genetic and Evolutionary Computation Conference, GECCO-2009
2009
Proceedings of the 11th Annual Genetic and Evolutionary Computation Conference, GECCO-2009
9781605583259
2009
1115
1122
none
Vanneschi, L., Silva, S. (2009). Operator equalisation, bloat and overfitting: A study on human oral bioavailability prediction. In Proceedings of the 11th Annual Genetic and Evolutionary Computation Conference, GECCO-2009 (pp.1115-1122). New York : ACM [10.1145/1569901.1570051].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/16081
Citazioni
  • Scopus 31
  • ???jsp.display-item.citation.isi??? ND
Social impact