Some plants contain useful molecules for human health but identifying them requires time and resources. To optimize the discovery of new useful compounds, a strategic approach is needed to first select plants with high potential. One strategy is the phylogenetic approach, as closely related plant species tend to share biochemistry and medicinal properties. The aim is to propose a pipeline of phylogenetic methods to identify plants most likely to contain beneficial molecules. Thus, a phylogenetic tree was downloaded from the literature, and five monophyletic subtrees were extracted from it, containing a total of 30,388 plant species. Then, lists of medicinal plants correlated with 12 diseases were obtained from the CMAUP medicinal plants database, and lists of medicinal plants correlated with 12 biological activities were obtained from Dr. Duke's Phytochemical and Ethnobotanical Databases. The data from each list were combined with the data from each monophyletic subtree, and for all of them, the phylogenetic signal was measured using different methods. In the presence of phylogenetic signals, subsequent analyses were done to determine the exact position of phylogenetic clumping by identifying the “hot nodes”, nodes related to the plants with the most potential. The trees containing the species descending from the hot nodes were extracted, and those with more than 14 tips were plotted. Then, for all trees with more than 100 tips, another method called “hidden state prediction” was applied to calculate the probability of a plant on the tree having medicinal properties. From the analysis of the 12 lists, a total of 198 trees were plotted. On each tree, the plants descending from the most hot nodes and with the highest probability are the plants with the highest potential to have medicinal properties. This work was able to apply a pipeline of different phylogenetic methods that can be used for the selection of potential plants for drug discovery; the functions used in the pipeline will soon be available in the R package “pm4mp”.

Toini, E., Zecca, G., Labra, M., Grassi, F. (2025). Which plants would you choose to study for new drug discovery? A pipeline for a phylogenetic approach. Intervento presentato a: BtbsDay 2025, Milano, Italy.

Which plants would you choose to study for new drug discovery? A pipeline for a phylogenetic approach

Elisa Toini;Giovanni Zecca;Massimo Labra;Fabrizio Grassi
2025

Abstract

Some plants contain useful molecules for human health but identifying them requires time and resources. To optimize the discovery of new useful compounds, a strategic approach is needed to first select plants with high potential. One strategy is the phylogenetic approach, as closely related plant species tend to share biochemistry and medicinal properties. The aim is to propose a pipeline of phylogenetic methods to identify plants most likely to contain beneficial molecules. Thus, a phylogenetic tree was downloaded from the literature, and five monophyletic subtrees were extracted from it, containing a total of 30,388 plant species. Then, lists of medicinal plants correlated with 12 diseases were obtained from the CMAUP medicinal plants database, and lists of medicinal plants correlated with 12 biological activities were obtained from Dr. Duke's Phytochemical and Ethnobotanical Databases. The data from each list were combined with the data from each monophyletic subtree, and for all of them, the phylogenetic signal was measured using different methods. In the presence of phylogenetic signals, subsequent analyses were done to determine the exact position of phylogenetic clumping by identifying the “hot nodes”, nodes related to the plants with the most potential. The trees containing the species descending from the hot nodes were extracted, and those with more than 14 tips were plotted. Then, for all trees with more than 100 tips, another method called “hidden state prediction” was applied to calculate the probability of a plant on the tree having medicinal properties. From the analysis of the 12 lists, a total of 198 trees were plotted. On each tree, the plants descending from the most hot nodes and with the highest probability are the plants with the highest potential to have medicinal properties. This work was able to apply a pipeline of different phylogenetic methods that can be used for the selection of potential plants for drug discovery; the functions used in the pipeline will soon be available in the R package “pm4mp”.
poster
phylogenetic methods, medicinal plant, drug discovery
English
BtbsDay 2025
2025
2025
https://www.btbs.unimib.it/it/dipartimento/btbsday/btbsday2025/poster-session-p61-p70
none
Toini, E., Zecca, G., Labra, M., Grassi, F. (2025). Which plants would you choose to study for new drug discovery? A pipeline for a phylogenetic approach. Intervento presentato a: BtbsDay 2025, Milano, Italy.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/566061
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact