Summary
ADME properties of molecules are important for the features of compounds as a drug. If molecules will have poor solubility and chemical instability it will lead to low bioavailability. It means molecules will be less effective. Also, patients will need to take them many times per day.
Drugs that absorb poorly when taken orally must be administered in some less desirable way, like intravenously or by inhalation. Another important feature of new compounds is toxicity.
Scientists need to make sure molecules that they suggest as a drug candidate that will be safe and not toxic.
Machine learning methods are useful tools in prediction of ADME and toxicity properties. In this project we implement and investigate the performance of the machine learning model using the Random Forest algorithm.
This model was able to predict solubility and toxicity properties of chemical compounds. We implemented this model using Python language and modern Python libraries such as TensorFlow and PyTorch. We trained the model with experimentally identified solubility of 1128 compounds and toxicity data of 1491 drug compounds. After training we validated and tested the performance of the machine learning model on the test set of experimental data. Machine learning that we developed demonstrated sufficient accuracy.
References Cited
L. Tao, P. Zhang, C. Qin, S.Y. Chen, C. Zhang, Z. Chen, F. Zhu, S.Y. Yang, Y.Q. Wei, Y.Z. Chen,
Recent progresses in the exploration of machine learning methods as in-silico ADME prediction tools, Advanced Drug Delivery Reviews,Volume 86,2015,Pages 83-100,ISSN 0169-409X,
Jeonghee Jo, Bumju Kwak, Hyun-Soo Choi, Sungroh Yoon, The message passing neural networks for chemical property prediction on SMILES,Methods,Volume 179,2020,Pages 65-72,ISSN 1046-2023
https://doi.org/10.1016/j.ymeth.2020.05.009. Wu, Z., Ramsundar, B., Evan, Gomes, J., Geniesse, C., Pappu, A.S., Leswing, K., Pande, V., 2018. MoleculeNet: a benchmark for molecular machine learning. Chemical Science 9, 513–530.. doi:10.1039/c7sc02664a