Bicocca Open Archive

The sense of taste plays a critical role in food science, since it directly impacts food consumption, human nutrition, and overall health. Computational models that predict the taste of molecular tastants based on their chemical structure and machine learning classifiers serve as powerful tools in the advancing field of foodinformatics. This study describes the development of ChemTastesPredictor designed to predict the taste of 4075 molecular tastants included in the extended version of ChemTastesDB (https://zenodo.org/records/14963136). To the best of our knowledge, this represents the largest dataset with a broad-based chemical space used to calibrate machine learning (ML) models for taste prediction based on molecular descriptors and fingerprints. For validation, datasets were randomly split into training and test sets in a 75:25 ratio, ensuring balanced class distributions. In binary classification tasks, the Random Forest classifier demonstrated the highest predictive performance for sweet/bitter (NER = 0.928 and F-score = 0.927) and bitter/non-bitter (NER = 0.902 and F-score = 0.903) classification. Adaptive Boosting excelled in the prediction of sweet/non-sweet (NER = 0.861 and F-score = 0.862). The N-Nearest Neighbors classifier emerged as the optimal classifier for umami/non-umami (NER = 0.957 and F-score = 0.860) and sweet/bitter/umami (NER = 0.870 and F-score = 0.843). These models may be useful in the development and analysis of new chemical tastants.

Rojas, C., Abril-González, M., Ballabio, D., García, F. (2025). ChemTastesPredictor: An ensemble of machine learning classifiers to predict the taste of molecular tastants. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 261(15 June 2025) [10.1016/j.chemolab.2025.105380].

ChemTastesPredictor: An ensemble of machine learning classifiers to predict the taste of molecular tastants

Rojas, Cristian;Abril-González, Mónica;Ballabio, Davide;García, Fernando

2025

Abstract

The sense of taste plays a critical role in food science, since it directly impacts food consumption, human nutrition, and overall health. Computational models that predict the taste of molecular tastants based on their chemical structure and machine learning classifiers serve as powerful tools in the advancing field of foodinformatics. This study describes the development of ChemTastesPredictor designed to predict the taste of 4075 molecular tastants included in the extended version of ChemTastesDB (https://zenodo.org/records/14963136). To the best of our knowledge, this represents the largest dataset with a broad-based chemical space used to calibrate machine learning (ML) models for taste prediction based on molecular descriptors and fingerprints. For validation, datasets were randomly split into training and test sets in a 75:25 ratio, ensuring balanced class distributions. In binary classification tasks, the Random Forest classifier demonstrated the highest predictive performance for sweet/bitter (NER = 0.928 and F-score = 0.927) and bitter/non-bitter (NER = 0.902 and F-score = 0.903) classification. Adaptive Boosting excelled in the prediction of sweet/non-sweet (NER = 0.861 and F-score = 0.862). The N-Nearest Neighbors classifier emerged as the optimal classifier for umami/non-umami (NER = 0.957 and F-score = 0.860) and sweet/bitter/umami (NER = 0.870 and F-score = 0.843). These models may be useful in the development and analysis of new chemical tastants.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				chemometrics; QSPR; molecular tastants; taste
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				12-mar-2025
			
	Data di pubblicazione
	
				2025
			
	Rivista
	
				CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
			
	Numero del volume
	
				261
			
	Fascicolo
	
				15 June 2025
			
	Article number
	
				105380
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.chemolab.2025.105380
			
	Fulltext
	
				reserved
			
	Citazione
	
				Rojas, C., Abril-González, M., Ballabio, D., García, F. (2025). ChemTastesPredictor: An ensemble of machine learning classifiers to predict the taste of molecular tastants. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 261(15 June 2025) [10.1016/j.chemolab.2025.105380].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Rojas-2025-Chemometrics and Intelligent Laboratory Systems-VoR.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Tutti i diritti riservati Dimensione 4.88 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	4.88 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/546882

Citazioni

ND

ND

Social impact