AI-assisted Aqueous Solubility Prediction

Aqueous solubility is crucial to the processes of drug discovery and development. It is an important factor affecting oral absorption and bioavailability of drugs and is considered a relevant parameter in ADMET (absorption, distribution, metabolism, excretion, and toxicity) studies. Many drug development failures have been linked to poor solubility, and increasing the water solubility of bioactive compounds is a significant challenge in medicinal chemistry. In addition, aqueous solubility is also a key determinant of the environmental impact of pollutants and agricultural chemicals. To determine the water solubility compounds, a variety of experimental techniques have been used, such as variations of the shake-flask method. However, experimental methods are difficult, expensive, and time-consuming to determine the aqueous solubility. It is also unrealistic to test thousands or millions of compounds in high throughput screening (HTS). Therefore, the prediction of solubility by in silico approaches is highly valuable. Based on the advanced AI-assisted platform, Creative Biolabs can accurately predict the ADMET properties and help customers accelerate the drug screening process and reduce R&D costs.

The effectiveness of popular machine learning modeling methods and molecular featurization techniques in predicting aqueous solubility.Fig.1 Machine learning approaches in predicting aqueous solubility.1

In Silico Solubility Prediction Tool

Water solubility is an important physicochemical property of compounds in anticancer drug discovery and development, which affects pharmacokinetic properties and formulations. Given the limited predictive performance of many published solubility models, some groups have developed innovative QSPR models using new recursive algorithms in machine learning methods for data and variable selection. During the early phases of drug discovery and development, the automatic workflow showed high predictive performance and can offer superior predictions of aqueous solubility. For the prediction of aqueous solubility, some machine learning (ML) methods have been used, including the convolutional and recurrent networks, random forests (RF), support vector machines (SVM), and k-nearest neighbors (k-NN).

A variety of artificial intelligence solubility prediction tools have been developed by utilizing deep learning, regression, and modeling machine learning to facilitate solubility assessment. These tools have achieved outstanding results with high R2 and low RMSE values. However, because different data sets are used, the reported performance can vary considerably even with the same tools. It is necessary to enhance solubility prediction for novel compounds, which can be further achieved through deep learning. Solubility prediction may improve as deep learning progresses. The deeper net model also outperformed other models in predicting the solubility values of a series of newly synthesized compounds for anticancer drug discovery.

As a reliable partner to the world's leading pharmaceutical companies and research institutions, Creative Biolabs brings together the market-leading in silico expertise to build our AI-assisted aqueous solubility prediction service team that can provide the custom professional aqueous solubility prediction.

Reference

  1. Zheng, Tianyuan, John BO Mitchell, and Simon Dobson. "Revisiting the application of machine learning approaches in predicting aqueous solubility." ACS omega 9.32 (2024): 35209-35222. Distributed under Open Access license CC BY 4.0, without modification.
For Research Use Only
Let's Get Started
Contact Us

USA
UK
Germany
Follow us on
ISO 9001 Certified - Creative Biolabs Quality Management System.
Copyright © 2025 Creative Biolabs. All Rights Reserved.