Machine Learning Consensus To Predict the Binding to the Androgen Receptor within the CoMPARA Project

Grisoni, Francesca; Consonni, Viviana; Ballabio, Davide
2019

Abstract

The nuclear androgen receptor (AR) is one of the most relevant biological targets of Endocrine Disrupting Chemicals (EDCs), which produce adverse effects by interfering with hormonal regulation and endocrine system functioning. This paper describes novel in silico models to identify organic AR modulators in the context of the Collaborative Modeling Project for Androgen Receptor Activity (CoMPARA), coordinated by the National Center for Computational Toxicology (U.S. Environmental Protection Agency). The collaborative project involved 35 international research groups and aimed to prioritize the experimental testing of approximately 40k compounds, based on the predictions provided by each participant. We describe our machine learning approach to predict binding to AR, which is based on a consensus of multivariate Bernoulli Naive Bayes, Random Forest, and N-Nearest Neighbor classification models. The approach was developed in compliance with the Organisation for Economic Co-operation and Development (OECD) principles, trained on 1,687 ToxCast molecules classified according to 11 in vitro assays, and further validated on a set of 3,882 external compounds. The models provided robust and reliable predictions and were used to gather novel data-driven insights on the structural features related to AR binding, agonism, and antagonism.
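To illustrate what a consensus of three classifiers can look like, here is a minimal sketch using a simple agreement-based vote. The abstract does not state the exact consensus rule, so the voting scheme, the function name `majority_consensus`, the `min_agreement` parameter, and the example predictions are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import Optional, Sequence


def majority_consensus(votes: Sequence[int], min_agreement: int = 2) -> Optional[int]:
    """Return the label predicted by at least `min_agreement` models, else None.

    Assumption: a plain vote over per-model class labels; the source does not
    specify how the individual predictions are actually combined.
    """
    label, count = Counter(votes).most_common(1)[0]
    return label if count >= min_agreement else None


# Hypothetical predictions for four molecules (1 = binder, 0 = non-binder),
# one list per model named in the abstract.
nb_pred = [1, 0, 1, 0]  # Bernoulli Naive Bayes
rf_pred = [1, 0, 0, 0]  # Random Forest
n3_pred = [0, 0, 1, 1]  # N-Nearest Neighbor

# Combine the three models molecule by molecule.
consensus = [majority_consensus(v) for v in zip(nb_pred, rf_pred, n3_pred)]
print(consensus)
```

Setting `min_agreement=3` would require unanimity and leave a molecule unclassified (`None`) whenever the models disagree, trading coverage for stricter agreement.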
Type: Journal article - Scientific article
Keywords: QSAR; chemometrics; machine learning; Androgen Receptor
Language: English
Date: 22 Jan 2019
Year: 2019
Volume: 59
Issue: 5
First page: 1839
Last page: 1848
Access: reserved
Grisoni, F., Consonni, V., Ballabio, D. (2019). Machine Learning Consensus To Predict the Binding to the Androgen Receptor within the CoMPARA Project. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 59(5), 1839-1848 [10.1021/acs.jcim.8b00794].

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/231149