Advancing the Understanding of the Starch Structure-function Relationship with Molecular Simulation

Posted on March 29, 2022 (updated January 23, 2025) by Anonymous

Speaker:
Jeffrey Sanders, Principal Scientist

Abstract:
Starch is one of the most common biopolymers in food products, and its interaction with other ingredients influences the overall physical and chemical characteristics, including nutrition and texture quality. Given the relative size and complexity of starch polymers, traditional biophysical experiments lack the necessary resolution to study key nanostructure properties. Molecular dynamics simulations provide critical insight into the structural and dynamic properties of amorphous starch structures. By combining structural analyses with grand canonical Monte Carlo methods, the moisture uptake behavior is also explored, along with its effects on thermophysical properties.

Benchmark study of DeepAutoQSAR, ChemProp, and DeepPurpose on the ADMET subset of the Therapeutic Data Commons

Posted on March 28, 2022 (updated July 15, 2024) by Anonymous

Comparing performance metrics for Schrödinger’s automated ML model building engine to ChemProp and DeepPurpose.

Abstract

With the advent of more powerful hardware and methods, the use of machine learning (ML) has recently seen a significant upsurge in chemistry-related applications. Specifically in drug discovery, the prediction of ADMET (absorption, distribution, metabolism, excretion and toxicity) properties is a main target for ML applications. Herein, we present performance metrics for Schrödinger’s automated ML model building engine, DeepAutoQSAR, on the ADMET subset of the Therapeutic Data Commons (TDC), a large collection of public data for ML model building and benchmarking. We also compare the performance of DeepAutoQSAR to that of two open-source projects, namely ChemProp and DeepPurpose.

DeepAutoQSAR is among the top-performing methods in 20 of the 22 investigated cases, clearly outperforming the other methods in 9 of those. For the other 11 cases, at least one of the other tested methods performs similarly. We believe that continuous development and further improvement of DeepAutoQSAR in accuracy, robustness to chemical data shift, and label efficiency will enable faster and more cost-effective means of drug discovery, ultimately leading to the introduction of novel therapeutics.

Introduction

It is widely recognized that the ADMET (absorption, distribution, metabolism, excretion and toxicity) profile of novel molecules plays a key role in the successful development of new drugs. This is reinforced by the amount of time and effort spent both in academia and the pharmaceutical industry to develop reliable models to measure and predict numerous related endpoints.¹ Due to the potentially catastrophic impact of an unfavorable ADMET profile in the later stages of drug development, a common goal is to identify potential issues as early as possible.

With the rise of ultra-large on-demand libraries and DNA-encoded libraries (for example Enamine REAL Space or WuXi LabNetwork), early identification of liabilities requires methods that are computationally fast, cheap, and accurate enough to evaluate hundreds of millions of compounds without discarding potentially good candidates. This precludes the use of experimental in vivo or even in vitro methods. Modern machine learning (ML) approaches, often termed artificial intelligence (AI), can process millions of molecules on short timescales and at low computational cost with acceptable accuracy.

In contrast to physics-based in silico methods, ML/AI methods require high-fidelity data to be trained to predict a given endpoint. High-quality training data is often unavailable; data need to be clean and well-curated, and datasets in chemistry applications are often smaller than those used in other domains like ML on images or text. These strict data requirements can limit the application of more complex ML/AI approaches, since there is often too little training data to fit complex and accurate models.

However, recognizing the importance of profiling ADMET properties, large pharmaceutical companies have generated a wealth of data over the past decades, which is unfortunately often non-public and applied exclusively to internal programs. Public data is rarer, but there are efforts to collect and aggregate public data² and also to share non-public data in smart ways to improve existing models while retaining data confidentiality.³

The successes of deep learning (DL) approaches have led to a renaissance of ML/AI in chemistry applications, with a large number of both open-source and commercial software to pick from when targeting ADMET endpoints. While open-source software oftentimes can profit from faster development cycles and thus implements new scientific insights more quickly, application is often limited to domain experts. On the other hand, commercial software has the benefits of structured quality assurance (QA), documentation and support, and comes coupled with comprehensive user interfaces which significantly lower the barrier to entry for non-experts.

In this paper, we will take a closer look at the performance of two of the more popular open-source packages, ChemProp and DeepPurpose, and Schrödinger’s ML/AI package DeepAutoQSAR, demonstrating their comparative performance on a recently published set of benchmarks.

Computational Design and Biological Evaluation of Analogs of Lupin Peptide P5 Endowed with Dual PCSK9/HMG-CoAR Inhibiting Activity

Posted on March 18, 2022 (updated September 16, 2024) by Anonymous

Hit to lead design of novel d-amino-acid oxidase inhibitors using a comprehensive digital chemistry strategy

Posted on February 11, 2022 (updated April 7, 2026) by Anonymous

Computational platform grounded in highly accurate predictive models enables team-based discovery of a novel chemical series engaging a complex CNS target.

Overview

Inhibition of D-amino-acid oxidase (DAO) has been hypothesized as a potential therapeutic strategy for schizophrenia. Schrödinger’s Drug Discovery Team engaged in a discovery effort with a collaborator to identify novel DAO inhibitors with potential best-in-class properties.

Program Challenges

Identify novel chemical matter while striving for best-in-class molecules that cross the blood-brain-barrier
Simultaneously optimize drug-like properties, CNS exposure, and affinity

Approach

The Drug Discovery Team deployed a large-scale digital chemistry strategy leveraging:

A centralized project data platform to facilitate knowledge-based medicinal chemistry design collaboration (LiveDesign, AutoQSAR)
Physics-based methods to predict affinity and prioritize design ideas for synthesis (FEP+)
Computationally-driven ideation and scoring workflow to amplify common enumeration strategies and screen hundreds of millions of compounds using machine learning coupled with physics-based free energy methods (FEP+, AutoDesigner)

Results

The team discovered a novel class of DAO inhibitors with desirable drug-like properties by confidently exploring synthetically-challenging chemistry. The team also identified a previously unexplored subpocket for further evaluation. The novelty of the compounds, coupled with well-balanced properties, demonstrates the extraordinary power of the approach to unleash project team creativity. By leveraging a digital platform, the team explored vast chemical space while simultaneously optimizing for drug-like properties in a challenging disease area.

Why use a digital chemistry approach?

A digital chemistry approach uses physics-based modelling, machine learning, and a team-based collective intelligence platform to design better molecules on accelerated timelines.

How to achieve optimal drug-like properties?

The development of CNS drugs poses several unique challenges. Fine-tuning physicochemical properties for optimal brain exposure is an essential element of CNS drug development. Many companies have discontinued neuroscience discovery because these challenges lead to longer development timelines and a lower probability of success.

Schrödinger’s Drug Discovery Group developed property prediction models, using AutoQSAR deployed via LiveDesign, a web-based collaborative design platform. LiveDesign enabled teams of medicinal chemists to crowdsource designs and simultaneously optimize CNS properties with push-button workflows in a single interface (see figure 1). Compounds predicted in the desired property space were triaged using free energy perturbation (FEP+), a physics-based method for accurately predicting compound binding affinity. This workflow empowers teams to confidently pursue synthetically challenging compounds.

Figure 1. Schrödinger’s digital collaboration platform, LiveDesign, facilitates design optimization through custom multiple parameter optimization models by centralizing program data and improving team communication and collaboration.

How did a digital chemistry strategy enable improved hypothesis testing?

The team delivered high-quality molecules by working in an ecosystem that facilitated exploration of vast, novel chemical space and simultaneous optimization for desired properties through accurate physics-based modeling and machine learning.

The digital chemistry approach allowed the team to discover and quickly overcome many medicinal chemistry challenges in the pursuit of best-in-class molecules (see figure 2).²

Figure 2. SAR progression to achieve key milestones through late lead optimization with key compounds series represented. Crucial discovery and medicinal chemistry outcomes are highlighted. DHP represents dihydropyrazine and NHP, N-hydroxyl pyrimidine.

The team interrogated the atypical polarity of the DAO binding site, shown in panel A of figure 3. Chemists pursued challenging chemistry to reduce conformational flexibility and displace a high-energy water molecule by cyclizing and methylating cmpd 4 and 5 (see figure 3, panels B and C).

Finally, while literature and crystallographic structures suggested limited pocket volume for SAR exploration,³ FEP+ revealed the opportunity to interrogate this vector with larger chemical groups such as cmpd 6, as shown in panel D of figure 3.

Figure 3. A) Polarity of the DAO binding site required a polar warhead. B) Cyclization of the ligand linker reduced entropy, improving affinity. C) Displacement of a high-energy water near the binding site improved affinity (compare water present in panel B with panel C). D) FEP+ suggested exploring a novel subpocket predicted to improve potency (compare gray surface in panel A with the green surface in panel D).

How to interrogate vast chemical space and deliver novel chemical matter?

Pursuing best-in-class molecules required exploring vast chemical space outside of previously characterized drug-like molecules. The team utilized AutoDesigner, a multifaceted large-scale enumeration workflow (figure 4), to generate ideas exploring the novel DAO binding subpocket suggested by FEP+ (see figure 3, panel D).⁴

Figure 4. AutoDesigner enumeration and triage workflow explores SAR from the lead molecule while optimizing CNS drug-like properties to discover best-in-class molecules by covering vast chemical space.

To explore SAR and tune physicochemical properties, the team performed iterative cycles of AutoDesigner in the newly discovered subpocket. After triaging with appropriate filters, all molecular ideas were prioritized using free energy methods. The team trained active learning models using physics-based affinity predictions (FEP+) to prioritize compounds for synthesis. In total, more than 350 million ideas were generated and triaged.
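The loop described above (score a small batch with an expensive method, fit a cheap model on those scores, and use it to pick the next batch) can be sketched generically as follows. This is a toy illustration, not the AutoDesigner or FEP+ workflow: `expensive_score`, `surrogate_predict`, the single-number "descriptor", and all batch sizes are hypothetical stand-ins chosen so the example is self-contained.

```python
import random

def expensive_score(x):
    """Hypothetical stand-in for a costly physics-based affinity prediction."""
    return (x - 0.7) ** 2  # lower is better in this toy problem

def surrogate_predict(x, labeled):
    """Cheap model: inverse-distance-weighted average of already known scores."""
    weights = {known_x: 1.0 / (abs(known_x - x) + 1e-6) ** 2 for known_x in labeled}
    total = sum(weights.values())
    return sum(w * labeled[known_x] for known_x, w in weights.items()) / total

def active_learning(pool, n_rounds=5, batch_size=10, seed=0):
    rng = random.Random(seed)
    # Initial round: score a random batch with the expensive method.
    labeled = {x: expensive_score(x) for x in rng.sample(pool, batch_size)}
    for _ in range(n_rounds):
        unlabeled = [x for x in pool if x not in labeled]
        if not unlabeled:
            break
        # Rank the remaining pool with the cheap surrogate and send only
        # the most promising candidates to the expensive method.
        unlabeled.sort(key=lambda x: surrogate_predict(x, labeled))
        for x in unlabeled[:batch_size]:
            labeled[x] = expensive_score(x)
    return labeled

# 1,000 hypothetical candidates, each described by one number in [0, 1).
pool = [i / 1000 for i in range(1000)]
labeled = active_learning(pool)
best = min(labeled, key=labeled.get)
print(f"scored {len(labeled)} of {len(pool)} candidates with the expensive method")
```

The point of the design is that the expensive method is only ever run on a small fraction of the pool, while the surrogate, retrained implicitly each round on all scores gathered so far, decides where that budget goes.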

What was the project impact?

Typically, as drug discovery programs progress, teams struggle to balance the desired properties, and deficits in key drug properties emerge as novel scaffolds are explored and optimized.

Through a computational platform rooted in creative team collaboration and highly accurate predictive modeling, and enhanced by machine learning, a promising CNS DAO inhibitor series transitioned from hit discovery to lead optimization with approximately 11,000 compounds scored by FEP+ and only 208 synthesized. Of the 208 compounds synthesized, only 20 were inactive (> 10 μM) against DAO. By discovering novel compounds and concurrently performing multi-parameter optimization of critical CNS properties, the team delivered high project impact for this challenging disease area.
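As a quick check of what these figures imply, the arithmetic can be worked through in a few lines. The input numbers are those quoted above; treating every compound that was not inactive as "active" is an assumption made for this illustration.

```python
# Figures quoted in the case study text.
scored_by_fep = 11_000   # approximate number of compounds scored by FEP+
synthesized = 208
inactive = 20            # > 10 μM against DAO

# Assumption: every synthesized compound that was not inactive counts as active.
active = synthesized - inactive
active_fraction = active / synthesized
synthesized_fraction = synthesized / scored_by_fep

print(f"active: {active} of {synthesized} ({active_fraction:.1%})")
print(f"synthesized: {synthesized_fraction:.1%} of FEP+-scored compounds")
```

Under that assumption, 188 of the 208 synthesized compounds (about 90%) showed activity, and fewer than 2% of the FEP+-scored ideas had to be made.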

References

1. Bromet E.J., Fennig S. Epidemiology and natural history of schizophrenia. Biol Psychiatry. 1999, 46 (7), 871–881.
2. Tang et al. Discovery of a Novel Class of D-amino Acid Oxidase (DAO) Inhibitors with the Schrödinger Computational Platform. ChemRxiv. Preprint. https://doi.org/10.33774/chemrxiv-2021-dkf1k.
3. Hondo et al. 4-Hydroxypyridazin-3(2H)-one Derivatives as Novel d-Amino Acid Oxidase Inhibitors. J Med Chem. 2013, 56 (9), 3582–3592.
4. Bos et al. AutoDesigner, a De Novo Design Algorithm for Rapidly Exploring Large Chemical Space for Lead Optimization: Application to the Design and Synthesis of D-Amino Acid Oxidase Inhibitors. ChemRxiv. Preprint.

CovDock

Posted on January 5, 2022 (updated May 12, 2025) by Anonymous

The Advantages of Covalent Docking

With the recent resurgence in covalent drug research, computational insight into covalent docking is becoming key to understanding how covalent inhibitors can be used to address selectivity and potency challenges.

Covalent inhibitors derive their activity not only from the formation of a covalent bond between the target and the ligand but also from stabilizing non-covalent forces in the binding pocket. CovDock selects the top covalent complexes using the extensively validated Prime energy model, and calculates an apparent affinity score that captures these essential elements of a successful covalent docking process:

The pre-reactive ligand form occupies the binding pocket with enough residence time to facilitate the reaction of the ligand warhead with the reactive protein residue; and
unfavorable steric clashes and poor electrostatic contacts are prevented as the reaction proceeds.

CovDock begins with Glide docking to a receptor with the reactive residue trimmed to alanine. The receptor reactive residue is then added and sampled to form a covalent bond with the ligand in different poses. Covalent complexes are minimized using the Prime VSGB2.0 energy model to score the top covalent complexes. An apparent affinity score, based on the Glide score of pre-reactive and post-reactive poses, is also calculated to estimate binding energies for use in virtual screening.
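To make the idea of an apparent affinity score concrete, here is a minimal sketch of how pre-reactive and post-reactive docking scores might be combined and used to rank ligands. The text above says only that the score is "based on" the two Glide scores, so the simple average used here is an assumption, and the `CovalentPose` class, function names, and example values are hypothetical; none of this is the CovDock API.

```python
from dataclasses import dataclass

@dataclass
class CovalentPose:
    ligand: str
    pre_reactive_score: float    # docking score before bond formation (more negative is better)
    post_reactive_score: float   # docking score of the covalently attached ligand

def apparent_affinity(pose: CovalentPose) -> float:
    # Assumption: combine the two scores as a plain average.
    return 0.5 * (pose.pre_reactive_score + pose.post_reactive_score)

def rank_for_screening(poses: list[CovalentPose]) -> list[CovalentPose]:
    # Most favorable (most negative) apparent affinity first.
    return sorted(poses, key=apparent_affinity)

# Hypothetical example values, for illustration only.
poses = [
    CovalentPose("lig_A", pre_reactive_score=-6.0, post_reactive_score=-8.0),
    CovalentPose("lig_B", pre_reactive_score=-7.5, post_reactive_score=-5.5),
]
for pose in rank_for_screening(poses):
    print(pose.ligand, apparent_affinity(pose))
```

Using both scores reflects the two requirements listed earlier: the ligand must bind reasonably well before the reaction, and the covalent complex must remain free of clashes afterwards.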

Features

Accurate binding mode prediction:

CovDock is built upon a foundation of the time-tested Glide docking algorithm and Prime structure refinement methodology for accurate prediction of non-covalently docked poses. Glide quickly samples a large pool of initial poses for the pre-reactive species, and Prime simultaneously optimizes the ligand pose and attachment residue to produce physically sound chemistry. The resultant accuracy outperforms other docking programs in achieving lower RMS deviations from native co-crystallized structures.

Complete workflow:

CovDock performs a series of automated steps based on a simple setup from the Maestro graphical interface or from the command line. First, CovDock docks the pre-reactive ligand to determine viable poses that bring the reactive group into close proximity with the reactive receptor residue. Then the covalent bond is formed for the top scoring complex structures, the covalently attached ligand is sampled, and the complexes are scored using all-atom molecular mechanics with the OPLS force field and VSGB2.0 implicit solvent model.

Intuitive graphical interface:

Schrödinger’s intuitive graphical user interface, Maestro, provides easy-to-use panels for straightforward set-up of experiments, easy visualization, and efficient analysis of CovDock results.

Covalent Reactions Repository

Schrödinger has made available several custom reactions that can be used in CovDock studies, which can be found on the Covalent Reactions Repository documentation page.

Publications

“Docking covalent inhibitors: A parameter free approach to pose prediction and scoring”
Zhu, K.; Borrelli, K.W.; Greenwood, J.R.; Day, T.; Abel, R.; Farid, R.S.; Harder, E., J. Chem. Inf. Model., 2014, 54, 1932−1940.
“A Structure-Based Virtual Screening Approach for Discovery of Covalently Bound Ligands”
Toledo Warshaviak, D.; Golan, G.; Borrelli, K.W.; Zhu, K.; Kalid, O., J. Chem. Inf. Model., 2014, 54(7), 1941–1950.

Citations

Zhu, K.; Borrelli, K.W.; Greenwood, J.R.; Day, T.; Abel, R.; Farid, R.S.; Harder, E., “Docking covalent inhibitors: A parameter free approach to pose prediction and scoring,”
J. Chem. Inf. Model., 2014, 54, 1932−1940.