Rapid traversal of vast chemical space using machine learning-guided docking screens

Luttens, Andreas; Cabeza de Vaca, Israel; Sparring, Leonard; Brea Floriani, José Manuel; Martínez Rodríguez, Antón Leandro; Kahlous, Nour Aldin; Radchenko, Dmytro S.; Moroz, Yurii S.; Loza García, María Isabel; Norinder, Ulf; Carlsson, Jens

doi:10.1038/s43588-025-00777-x

Rapid traversal of vast chemical space using machine learning-guided docking screens

Files

2025_NatCompSci_Luttens_Rapid.pdf (3.33 MB)

Identifiers

URI: https://hdl.handle.net/10347/45594

E-ISSN: 2662-8457

DOI: 10.1038/s43588-025-00777-x

Publication date

2025-03-13

Publisher

Springer Nature

Metrics

Export

Abstract

The accelerating growth of make-on-demand chemical libraries provides unprecedented opportunities to identify starting points for drug discovery with virtual screening. However, these multi-billion-scale libraries are challenging to screen, even for the fastest structure-based docking methods. Here we explore a strategy that combines machine learning and molecular docking to enable rapid virtual screening of databases containing billions of compounds. In our workflow, a classification algorithm is trained to identify top-scoring compounds based on molecular docking of 1 million compounds to the target protein. The conformal prediction framework is then used to make selections from the multi-billion-scale library, reducing the number of compounds to be scored by docking. The CatBoost classifier showed an optimal balance between speed and accuracy and was used to adapt the workflow for screens of ultralarge libraries. Application to a library of 3.5 billion compounds demonstrated that our protocol can reduce the computational cost of structure-based virtual screening by more than 1,000-fold. Experimental testing of predictions identified ligands of G protein-coupled receptors and demonstrated that our approach enables discovery of compounds with multi-target activity tailored for therapeutic effect

Keywords

Cheminformatics| Computational chemistry| Machine learning| Structure-based drug design| Virtual drug screening

Bibliographic citation

Luttens, A., Cabeza de Vaca, I., Sparring, L. et al. Rapid traversal of vast chemical space using machine learning-guided docking screens. Nat Comput Sci 5, 301–312 (2025). https://doi.org/10.1038/s43588-025-00777-x

Publisher version

https://doi.org/10.1038/s43588-025-00777-x

Sponsors

A.L. was supported by a postdoctoral scholarship from the Knut and Alice Wallenberg Foundation (KAW2022.0347). J.C. received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement 715052), the Swedish Cancer Society, the Swedish Research Council and the Olle Engkvist Foundation. This research was partially supported by the project AI4Research at Uppsala University. I.C.d.V. was funded by a postdoctoral fellowship provided by the Sven och Lilly Lawski foundation. The computations were enabled using resources provided by the National Academic Infrastructure for Supercomputing in Sweden (NAISS) (partially funded by the Swedish Research Council through grant agreement number 2022-06725) and the supercomputing resource Berzelius provided by the National Supercomputer Centre at Linköping University and the Knut and Alice Wallenberg Foundation. J.B., A.L.M. and M.I.L. were funded by Agencia Estatal de Investigación (PID2020-119428RB-I00), Xunta de Galicia (ED431C 2022/20) and European Regional Development Fund (ERDF). A.L., I.C.d.V. and J.C. thank OpenEye Scientific Software for the use of OEToolkits at no cost. We thank J. Zhang for providing the initial deep neural network code

Rights

© The Author(s) 2025. This article is licensed under a Creative Commons Attribution 4.0 International License
Attribution 4.0 International

Collections

Farmacoloxía, Farmacia e Tecnoloxía Farmacéutica
Centro de Investigación en Medicina Molecular e Enfermidades Crónicas (CiMUS)

Full item page

Rapid traversal of vast chemical space using machine learning-guided docking screens

Files

Identifiers

Publication date

Authors

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Metrics

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Publisher version

Sponsors

Rights

Collections