Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Discovery of Chemical Probes through Structure-based Virtual Screening of Vast Compound Databases
Uppsala University, Disciplinary Domain of Science and Technology, Biology, Department of Cell and Molecular Biology, Computational Biology and Bioinformatics. Uppsala University, Science for Life Laboratory, SciLifeLab.
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Description
Abstract [en]

Bioactive molecules have traditionally been discovered through labor-intensive screening methods in which individual compounds are tested against specific protein targets or cells to identify those that produce the desired biological effect. However, these approaches have significant limitations. Firstly, the number of molecules that can be tested in a standard laboratory is restricted, and the acquisition and curation of these compounds come at a high cost. Secondly, these methods are time-consuming because each compound must be tested individually, and they are confined to small libraries with very limited chemical space coverage. In contrast, structure-based virtual screening can rapidly predict a molecule's interaction with a target protein, allowing for the evaluation of enormous libraries of chemical substances. Furthermore, this approach is not restricted to physically available molecules and can be extended to virtual compounds. Commercial chemical space has recently grown exponentially and currently contains several billion molecules that can be readily synthesized and delivered for experimental testing within weeks. Despite the enormous potential of these databases for drug discovery, they also pose new challenges, and development of effective strategies is required to explore ultralarge libraries. The goal of this thesis was to develop and apply novel strategies focused on exploring the potential of ultralarge chemical libraries using structure-based virtual screening. Publication I summarizes best practices on large-scale virtual screening and benchmarking protocols for molecular docking calculations. Publication II describes a docking screen of several hundred million lead-like molecules against the SARS-CoV-2 main protease, leading to promising starting points for development of coronavirus inhibitors. The binding modes predicted by docking were confirmed experimentally by X-ray crystallography. After several rounds of optimization, nanomolar broad-spectrum inhibitors with antiviral effects against coronaviruses in cell models were discovered. Manuscript III demonstrates how machine learning can be used to accelerate virtual screening campaigns. Classification models were trained on docking scores to identify promising molecules in ultralarge libraries relevant to the protein target of interest. The classification algorithms were able to reduce a multi-billion-scale library to a subset of high-confidence candidates with improved docking scores. Manuscript IV focuses on large-scale fragment docking to identify compounds binding to 8-oxoguanine glycosylase 1 and how to efficiently optimize them to potent inhibitors. The docking scoring function was able to correctly predict binding modes of the experimental hits and optimization led to submicromolar inhibitors with anti-inflammatory and anti-cancer effects in cell models. Publication V presents how docking of tailored virtual libraries of nature-inspired macrocycles led to potent disruptors of the KEAP1-Nrf2 complex. The results of this thesis highlight that large-scale virtual screening is a resourceful tool to discover ligands of a wide variety of drug targets.

Place, publisher, year, edition, pages
Uppsala: Acta Universitatis Upsaliensis, 2023. , p. 68
Series
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology, ISSN 1651-6214 ; 2261
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:uu:diva-500083ISBN: 978-91-513-1792-2 (print)OAI: oai:DiVA.org:uu-500083DiVA, id: diva2:1749936
Public defence
2023-06-02, A1:111a, BMC, Husargatan 3, Uppsala, 13:15 (English)
Opponent
Supervisors
Available from: 2023-05-09 Created: 2023-04-12 Last updated: 2023-05-09
List of papers
1. A practical guide to large-scale docking
Open this publication in new window or tab >>A practical guide to large-scale docking
Show others...
2021 (English)In: Nature Protocols, ISSN 1754-2189, E-ISSN 1750-2799, Vol. 16, no 10, p. 4799-4832Article in journal (Refereed) Published
Abstract [en]

Structure-based docking screens of large compound libraries have become common in early drug and probe discovery. As computer efficiency has improved and compound libraries have grown, the ability to screen hundreds of millions, and even billions, of compounds has become feasible for modest-sized computer clusters. This allows the rapid and cost-effective exploration and categorization of vast chemical space into a subset enriched with potential hits for a given target. To accomplish this goal at speed, approximations are used that result in undersampling of possible configurations and inaccurate predictions of absolute binding energies. Accordingly, it is important to establish controls, as are common in other fields, to enhance the likelihood of success in spite of these challenges. Here we outline best practices and control docking calculations that help evaluate docking parameters for a given target prior to undertaking a large-scale prospective screen, with exemplification in one particular target, the melatonin receptor, where following this procedure led to direct docking hits with activities in the subnanomolar range. Additional controls are suggested to ensure specific activity for experimentally validated hit compounds. These guidelines should be useful regardless of the docking software used. Docking software described in the outlined protocol (DOCK3.7) is made freely available for academic research to explore new hits for a range of targets. Structure-based docking screens of compound libraries are common in early drug and probe discovery. This protocol outlines best practices and control calculations to evaluate docking parameters prior to undertaking a large-scale prospective screen.

Place, publisher, year, edition, pages
Springer NatureSpringer Nature, 2021
National Category
Biochemistry Molecular Biology
Identifiers
urn:nbn:se:uu:diva-469391 (URN)10.1038/s41596-021-00597-z (DOI)000698885200001 ()34561691 (PubMedID)
Funder
Swedish Research Council, 2017-04676EU, Horizon 2020, 715052
Note

De två första författarna delar förstaförfattarskapet.

Available from: 2022-03-11 Created: 2022-03-11 Last updated: 2025-02-20Bibliographically approved
2. Ultralarge Virtual Screening Identifies SARS-CoV-2 Main Protease Inhibitors with Broad-Spectrum Activity against Coronaviruses
Open this publication in new window or tab >>Ultralarge Virtual Screening Identifies SARS-CoV-2 Main Protease Inhibitors with Broad-Spectrum Activity against Coronaviruses
Show others...
2022 (English)In: Journal of the American Chemical Society, ISSN 0002-7863, E-ISSN 1520-5126, Vol. 144, no 7, p. 2905-2920Article in journal (Refereed) Published
Abstract [en]

Drugs targeting SARS-CoV-2 could have saved millions of lives during the COVID-19 pandemic, and it is now crucial to develop inhibitors of coronavirus replication in preparation for future outbreaks. We explored two virtual screening strategies to find inhibitors of the SARS-CoV-2 main protease in ultralarge chemical libraries. First, structure-based docking was used to screen a diverse library of 235 million virtual compounds against the active site. One hundred top-ranked compounds were tested in binding and enzymatic assays. Second, a fragment discovered by crystallographic screening was optimized guided by docking of millions of elaborated molecules and experimental testing of 93 compounds. Three inhibitors were identified in the first library screen, and five of the selected fragment elaborations showed inhibitory effects. Crystal structures of target-inhibitor complexes confirmed docking predictions and guided hit-to-lead optimization, resulting in a noncovalent main protease inhibitor with nanomolar affinity, a promising in vitro pharmacokinetic profile, and broad-spectrum antiviral effect in infected cells.

Place, publisher, year, edition, pages
American Chemical Society (ACS)American Chemical Society (ACS), 2022
National Category
Infectious Medicine
Identifiers
urn:nbn:se:uu:diva-470953 (URN)10.1021/jacs.1c08402 (DOI)000765779100012 ()35142215 (PubMedID)
Funder
Knut and Alice Wallenberg Foundation, 2020.0182Knut and Alice Wallenberg Foundation, 2020.0182EU, European Research Council, 715052Swedish Research Council, 2018-07152Swedish Research Council, 2018-06454Vinnova, 2018-04969Swedish Research Council Formas, 2019-02496Swedish Research Council Formas, ZW13-02
Available from: 2022-04-01 Created: 2022-04-01 Last updated: 2024-01-15Bibliographically approved
3. Rapid Traversal of Ultralarge Chemical Space using Machine Learning Guided Docking Screens
Open this publication in new window or tab >>Rapid Traversal of Ultralarge Chemical Space using Machine Learning Guided Docking Screens
Show others...
2025 (English)In: nature computational scienceArticle in journal (Refereed) Published
Abstract [en]

The accelerating growth of make-on-demand chemical libraries provides unprecedented opportunities to identify starting points for drug discovery with virtual screening. However, these multi-billion-scale libraries are challenging to screen, even for the fastest structure-based docking methods. Here we explore a strategy that combines machine learning and molecular docking to enable rapid virtual screening of databases containing billions of compounds. In our workflow, a classification algorithm is trained to identify top-scoring compounds based on molecular docking of 1 million compounds to the target protein. The conformal prediction framework is then used to make selections from the multi-billion-scale library, reducing the number of compounds to be scored by docking. The CatBoost classifier showed an optimal balance between speed and accuracy and was used to adapt the workflow for screens of ultralarge libraries. Application to a library of 3.5 billion compounds demonstrated that our protocol can reduce the computational cost of structure-based virtual screening by more than 1,000-fold. Experimental testing of predictions identified ligands of G protein-coupled receptors and demonstrated that our approach enables discovery of compounds with multi-target activity tailored for therapeutic effect.

Place, publisher, year, edition, pages
Springer Nature, 2025
National Category
Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:uu:diva-500081 (URN)10.1038/s43588-025-00777-x (DOI)001443841300001 ()40082701 (PubMedID)
Available from: 2023-04-11 Created: 2023-04-11 Last updated: 2025-05-22Bibliographically approved
4. Virtual Fragment Screening for DNA Repair Inhibitors in Vast Chemical Space
Open this publication in new window or tab >>Virtual Fragment Screening for DNA Repair Inhibitors in Vast Chemical Space
Show others...
2025 (English)In: Nature Communications, E-ISSN 2041-1723, Vol. 16, no 1, article id 1741Article in journal (Refereed) Published
Abstract [en]

Fragment-based screening can catalyze drug discovery by identifying novel scaffolds, but this approach is limited by the small chemical libraries studied by biophysical experiments and the challenging hit optimization step. In efforts to identify DNA repair inhibitors, we explored the use of structure-based virtual screening to access ultralarge fragment libraries that cover four orders of magnitude larger fractions of chemical space than traditional techniques. A set of 14 million fragments were docked to 8-oxoguanine DNA glycosylase (OGG1), a challenging drug target involved in cancer and inflammation. Of the 29 top-ranked fragments that were experimentally evaluated, four compounds were shown to bind to OGG1 and X-ray crystallography confirmed the predicted binding modes. Docking of readily synthesizable elaborations guided fragment optimization, leading to the discovery of submicromolar OGG1 inhibitors with anti-inflammatory and anti-cancer effects in cell models. Our results demonstrate that fragment-based virtual screening enables efficient exploration of vast chemical libraries.

Place, publisher, year, edition, pages
Springer Nature, 2025
National Category
Medical Life Sciences
Identifiers
urn:nbn:se:uu:diva-500079 (URN)10.1038/s41467-025-56893-9 (DOI)001425285400024 ()39966348 (PubMedID)2-s2.0-85218501018 (Scopus ID)
Available from: 2023-04-11 Created: 2023-04-11 Last updated: 2025-10-15Bibliographically approved
5. Importance of Binding Site Hydration and Flexibility Revealed When Optimizing a Macrocyclic Inhibitor of the Keap1-Nrf2 Protein-Protein Interaction
Open this publication in new window or tab >>Importance of Binding Site Hydration and Flexibility Revealed When Optimizing a Macrocyclic Inhibitor of the Keap1-Nrf2 Protein-Protein Interaction
Show others...
2022 (English)In: Journal of Medicinal Chemistry, ISSN 0022-2623, E-ISSN 1520-4804, Vol. 65, no 4, p. 3473-3517Article in journal (Refereed) Published
Abstract [en]

Upregulation of the transcription factor Nrf2 by inhibition of the interaction with its negative regulator Keap1 constitutes an opportunity for the treatment of disease caused by oxidative stress. We report a structurally unique series of nanomolar Keap1 inhibitors obtained from a natural product-derived macrocyclic lead. Initial exploration of the structure-derived macrocyclic lead. Initial exploration of the structure-activity relationship of the lead, followed by structure-guided optimization, resulted in a 100-fold improvement in inhibitory potency. The macrocyclic core of the nanomolar inhibitors positions three pharmacophore units for productive interactions with key residues of Keap1, including R415, R483, and Y572. Ligand optimization resulted in the displacement of a coordinated water molecule from the Keap1 binding site and a significantly altered thermodynamic profile. In addition, minor reorganizations of R415 and R483 were accompanied by major differences in affinity between ligands. This study therefore indicates the importance of accounting both for the hydration and flexibility of the Keap1 binding site when designing high-affinity ligands.

Place, publisher, year, edition, pages
American Chemical Society (ACS)American Chemical Society (ACS), 2022
National Category
Biochemistry Molecular Biology Medicinal Chemistry
Identifiers
urn:nbn:se:uu:diva-477522 (URN)10.1021/acs.jmedchem.1c01975 (DOI)000797940600046 ()35108001 (PubMedID)
Funder
Swedish Research Council, 2016-05160AstraZenecaSwedish National Infrastructure for Computing (SNIC), 2020-5-435Swedish National Infrastructure for Computing (SNIC), 2020-3-21
Available from: 2022-06-20 Created: 2022-06-20 Last updated: 2025-02-20Bibliographically approved

Open Access in DiVA

UUThesis_Luttens,A-2023(2036 kB)517 downloads
File information
File name FULLTEXT01.pdfFile size 2036 kBChecksum SHA-512
b3b9623f9976aaed9b5eb9071d4f8f085073430591d426ee6a74c57ae10cb009867c83f2fbf05470c329ac6f2d56639b0aec8ec530608aa82ef65373e26bfccb
Type fulltextMimetype application/pdf

Authority records

Luttens, Andreas

Search in DiVA

By author/editor
Luttens, Andreas
By organisation
Computational Biology and BioinformaticsScience for Life Laboratory, SciLifeLab
Bioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 517 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1310 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf