The sense of taste plays a critical role in food science, since it directly impacts food consumption, human nutrition, and overall health. Computational models that predict the taste of molecular tastants based on their chemical structure and machine learning classifiers serve as powerful tools in the advancing field of foodinformatics. This study describes the development of ChemTastesPredictor designed to predict the taste of 4075 molecular tastants included in the extended version of ChemTastesDB (https://zenodo.org/records/14963136). To the best of our knowledge, this represents the largest dataset with a broad-based chemical space used to calibrate machine learning (ML) models for taste prediction based on molecular descriptors and fingerprints. For validation, datasets were randomly split into training and test sets in a 75:25 ratio, ensuring balanced class distributions. In binary classification tasks, the Random Forest classifier demonstrated the highest predictive performance for sweet/bitter (NER = 0.928 and F-score = 0.927) and bitter/non-bitter (NER = 0.902 and F-score = 0.903) classification. Adaptive Boosting excelled in the prediction of sweet/non-sweet (NER = 0.861 and F-score = 0.862). The N-Nearest Neighbors classifier emerged as the optimal classifier for umami/non-umami (NER = 0.957 and F-score = 0.860) and sweet/bitter/umami (NER = 0.870 and F-score = 0.843). These models may be useful in the development and analysis of new chemical tastants.
Rojas, C., Abril-González, M., Ballabio, D., García, F. (2025). ChemTastesPredictor: An ensemble of machine learning classifiers to predict the taste of molecular tastants. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 261(15 June 2025) [10.1016/j.chemolab.2025.105380].
ChemTastesPredictor: An ensemble of machine learning classifiers to predict the taste of molecular tastants
Ballabio, Davide;
2025
Abstract
The sense of taste plays a critical role in food science, since it directly impacts food consumption, human nutrition, and overall health. Computational models that predict the taste of molecular tastants based on their chemical structure and machine learning classifiers serve as powerful tools in the advancing field of foodinformatics. This study describes the development of ChemTastesPredictor designed to predict the taste of 4075 molecular tastants included in the extended version of ChemTastesDB (https://zenodo.org/records/14963136). To the best of our knowledge, this represents the largest dataset with a broad-based chemical space used to calibrate machine learning (ML) models for taste prediction based on molecular descriptors and fingerprints. For validation, datasets were randomly split into training and test sets in a 75:25 ratio, ensuring balanced class distributions. In binary classification tasks, the Random Forest classifier demonstrated the highest predictive performance for sweet/bitter (NER = 0.928 and F-score = 0.927) and bitter/non-bitter (NER = 0.902 and F-score = 0.903) classification. Adaptive Boosting excelled in the prediction of sweet/non-sweet (NER = 0.861 and F-score = 0.862). The N-Nearest Neighbors classifier emerged as the optimal classifier for umami/non-umami (NER = 0.957 and F-score = 0.860) and sweet/bitter/umami (NER = 0.870 and F-score = 0.843). These models may be useful in the development and analysis of new chemical tastants.File | Dimensione | Formato | |
---|---|---|---|
Rojas-2025-Chemometrics and Intelligent Laboratory Systems-VoR.pdf
Solo gestori archivio
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Licenza:
Tutti i diritti riservati
Dimensione
4.88 MB
Formato
Adobe PDF
|
4.88 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.