Antibiotic resistance, particularly among Gram-negative bacteria, poses a significant healthcare challenge due to their ability to evade antibiotic action through various mechanisms. In this study, we explore the prediction of small molecule accumulation in Gram-negative bacteria by using machine learning techniques enhanced with statistical descriptors derived from molecular dynamics simulations. We begin by identifying a minimal set of molecular descriptors that maximize the model's predictive power while preserving human interpretability. We optimize model accuracy, precision, and the area under the receiver operating characteristic curve through an iterative process. We demonstrate that the inclusion of statistical descriptors significantly improves model performance across various prediction metrics. Particularly, the addition of statistical descriptors related to dipole moment and minimum projection radius enhances the model's predictive capabilities, shedding light on the physicochemical properties crucial for small molecule accumulation. Our findings highlight the importance of considering statistical moments beyond mean values in predictive modeling and suggest avenues for future research. Overall, our study provides insights into the complex dynamics of antibiotic accumulation in Escherichia coli bacterial cells, generalizable to other Gram-negative species, offering a promising approach for the discovery of effective antibacterial agents, identifying new hits, and improving them to define effective lead agents.
Machine Learning Prediction of Small Molecule Accumulation in Escherichia Coli Enhanced with Descriptor Statistics
Milenkovic S.
Primo
;Boi S.Secondo
;Scorciapino M. A.Penultimo
;Ceccarelli M.Ultimo
2024-01-01
Abstract
Antibiotic resistance, particularly among Gram-negative bacteria, poses a significant healthcare challenge due to their ability to evade antibiotic action through various mechanisms. In this study, we explore the prediction of small molecule accumulation in Gram-negative bacteria by using machine learning techniques enhanced with statistical descriptors derived from molecular dynamics simulations. We begin by identifying a minimal set of molecular descriptors that maximize the model's predictive power while preserving human interpretability. We optimize model accuracy, precision, and the area under the receiver operating characteristic curve through an iterative process. We demonstrate that the inclusion of statistical descriptors significantly improves model performance across various prediction metrics. Particularly, the addition of statistical descriptors related to dipole moment and minimum projection radius enhances the model's predictive capabilities, shedding light on the physicochemical properties crucial for small molecule accumulation. Our findings highlight the importance of considering statistical moments beyond mean values in predictive modeling and suggest avenues for future research. Overall, our study provides insights into the complex dynamics of antibiotic accumulation in Escherichia coli bacterial cells, generalizable to other Gram-negative species, offering a promising approach for the discovery of effective antibacterial agents, identifying new hits, and improving them to define effective lead agents.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.