Hit to development candidate in 10 months: Rapid discovery of SGR-1505, a novel, potent MALT1 inhibitor

Digital chemistry platform provides scale and accuracy to drive high precision molecular design

8.2 billion

compounds computationally evaluated

78

total compounds synthesized in lead series

10 months

to discovery of development candidate

Target
MALT1, protease
Program Type
Schrödinger proprietary program, small molecule
Indication
Relapsed or refractory B-cell lymphoma, chronic lymphocytic leukemia
Stage
Phase 1 clinical trial

“The ability to leverage the computational platform to rapidly identify not just one, but several novel, highly potent series with well-balanced properties is unique in my many years of experience in industry.”

Zhe Nie
Project Lead, Executive Director, Medicinal Chemistry,
Schrödinger Therapeutics Group

Design challenge

Mucosa-associated lymphoid tissue lymphoma translocation protein 1 (MALT1) is a genetically validated target for the treatment of diseases associated with lymphocyte regulation. MALT1 consists of three domains: a paracaspase protease domain, an Ig3 domain, and a linking helix. First generation MALT1 inhibitors consisted of large peptidomimetics targeting the protease domain; due to their poor drug-like properties, none made it into the clinic. Second generation MALT1 inhibitors targeting an allosteric region at the interface of the caspase-like and Ig3 domains have been more successful, resulting in a clinical stage compound.

Significant challenges exist in optimizing the properties of second generation MALT1 inhibitors, specifically permeability, efflux, and solubility, while maintaining on-target potency. The aim of this program was to discover a potent inhibitor with good overall drug-like properties to support combinations with standard of care agents for treatment of relapsed or refractory B-cell malignancies.

Scale and accuracy of digital assays drives efficient DMTA cycles

Finding a novel molecule with the right balance of on-target affinity and desired physicochemical properties is the essential challenge of every drug discovery program. In principle, increasing the number of rationally designed compounds assessed across these various properties increases the odds of success. Designing molecules in silico — with the speed and accuracy to traverse billions of molecules — is the guiding ethos of Schrödinger’s digital chemistry strategy. Specifically, this project combines rigorous physics-based modeling with machine learning (ML), predictive ADMET models, and data analytics to search and triage a chemical space consisting of more than 8B compounds. Ultimately, execution of this strategy enabled the identification of multiple novel series.

First, the team performed structure-activity relationship (SAR) analysis of existing chemical matter, followed by computational assessment of the allosteric binding site using WaterMap. As a result, the team identified a number of displaceable high-energy water molecules in regions of the binding site that provided an opportunity to gain potency while exploring different chemotypes.1 Schrödinger’s drug discovery team used this information to drive the evaluation of billions of compounds via a De Novo Design strategy for iterative large-scale design and scoring. This strategy included synthetically aware, reaction-based enumeration, crowdsourced medicinal chemistry ideation, and FEP+ for free energy perturbation modeling. The accuracy and utility of FEP+ as a computational assay for the prediction of relative binding energies of molecules has been validated extensively, generating predictions within one kcal/mol of experimental values on average.2 By combining FEP+ with high performance cloud computing and machine learning (Active Learning FEP+), over 1,700 molecules were evaluated in the first three months of the project. All ideas and corresponding modeled data crowdsourced by the team were captured and analyzed with LiveDesign, a best-in-class, modeling-enabled collaborative enterprise platform for real-time project ideation (Figure 1). In less than three months, with fewer than 50 total compounds synthesized, the team was able to identify two novel and distinct series of highly potent MALT1 inhibitors, affording progression to in vivo testing.
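The general shape of an active-learning loop of this kind can be sketched in a few lines: an expensive physics-based assay is reserved for the candidates a cheap, retrainable surrogate model ranks highest. Everything below is a hypothetical stand-in (a toy 1-D "library", a mock `expensive_assay` in place of FEP+, and a nearest-neighbor `cheap_model` in place of a real ML model); only the loop structure is the point.

```python
import random

random.seed(0)

def expensive_assay(x):          # stand-in for a costly physics-based assay
    return -(x - 0.7) ** 2       # best candidates sit near x = 0.7

library = [i / 999 for i in range(1000)]   # 1,000 candidate "compounds"
scored = {}                                 # compound -> assay result

for cycle in range(3):
    if scored:
        # Retrain the surrogate on everything scored so far
        # (here: 1-nearest-neighbor lookup on the scored points).
        known = sorted(scored)
        def cheap_model(x):
            return scored[min(known, key=lambda k: abs(k - x))]
    else:
        def cheap_model(x):      # no data yet: explore at random
            return random.random()
    # Promote the 20 most promising unscored candidates to the full assay.
    batch = sorted((x for x in library if x not in scored),
                   key=cheap_model, reverse=True)[:20]
    for x in batch:
        scored[x] = expensive_assay(x)

best = max(scored, key=scored.get)
```

Only 60 of the 1,000 candidates ever see the expensive assay, yet the loop homes in on the high-scoring region, which is the economy that makes triaging billion-scale spaces feasible.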

Figure 1: Modeling strategy and design-predict-make-test-analyze (DPMTA) cycle employed for MALT1 inhibitor program, in which development candidate SGR-1505 was discovered in 10 months.

Overcoming the MPO challenge by tuning potency, solubility, and permeability simultaneously

Once potent chemical series were identified, the team focused on tuning physicochemical properties to meet the target product profile (TPP). They employed a multiparameter optimization (MPO) scoring system to triage molecules rapidly based on their predicted ability to satisfy the TPP. Calculation of the MPO score was based on values derived from predictive models for solubility, permeability, and potency. Using this strategy the design team assessed over 5,000 ideas and identified 43 compounds that met the program’s criteria. A handful progressed to synthesis and experimental testing, reducing cost and time significantly.
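As a rough illustration of such a scoring system, each predicted property can be mapped onto a 0-to-1 desirability and the desirabilities combined into a single score. The property names and thresholds below are hypothetical placeholders, not the program's actual TPP criteria.

```python
import math

def desirability(value, ideal, limit):
    """1.0 inside the ideal range, decaying linearly to 0.0 at the hard limit."""
    lo, hi = ideal
    if lo <= value <= hi:
        return 1.0
    edge = lo if value < lo else hi
    bound = limit[0] if value < lo else limit[1]
    span = abs(bound - edge)
    return max(0.0, 1.0 - abs(value - edge) / span) if span else 0.0

def mpo_score(predictions):
    """Geometric mean of per-property desirabilities (0 = fail, 1 = ideal)."""
    # Illustrative criteria only: (ideal range, hard limits) per property.
    criteria = {
        "pIC50":    ((8.0, 12.0), (6.0, 12.0)),    # potency: want >= 8
        "log_sol":  ((-4.0, 0.0), (-6.0, 0.0)),    # solubility, log mol/L
        "log_papp": ((-5.0, -3.0), (-7.0, -3.0)),  # permeability, log cm/s
    }
    ds = [desirability(predictions[k], *criteria[k]) for k in criteria]
    return math.prod(ds) ** (1.0 / len(ds))

score = mpo_score({"pIC50": 8.5, "log_sol": -4.5, "log_papp": -5.5})
```

The geometric mean is a common choice here because a single failed property drags the whole score toward zero, which is exactly the behavior wanted when triaging against a target product profile.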

Within 10 months, and with a total of 78 compounds synthesized in the lead series (and 129 compounds program-wide), the project team identified SGR-1505, a potential best-in-class MALT1 inhibitor with balanced properties and on-target activity (Figure 2). In June 2025, SGR-1505 was reported to have a favorable safety profile and to be well tolerated in its ongoing Phase 1, open-label, dose-escalation study, with encouraging preliminary efficacy in patients with relapsed/refractory B-cell malignancies.4 Responses were observed across a broad range of B-cell malignancies, including monotherapy responses in patients with chronic lymphocytic leukemia (CLL) and Waldenström macroglobulinemia (Figure 3).5

Figure 2: Comparison of SGR-1505 with a competitor’s MALT1 inhibitor. *Structure of JNJ-6633 first disclosed by Tianbao Lu at the 2021 ACS Spring meeting. All competitor data was generated internally by contract research organizations. Yin et al., ASH 2023.
Figure 3: Initial results of the SGR-1505 Phase 1 study showing encouraging preliminary efficacy across a range of B-cell malignancies including chronic lymphocytic leukemia/small lymphocytic leukemia (CLL/SLL), marginal zone lymphoma (MZL), and Waldenström macroglobulinemia (WM).

Enabling digital technologies to drive discovery programs

FEP+

Digital assay for predicting protein-ligand binding across broad chemical space at an accuracy matching experimental methods.

De Novo Design Workflow

Ultra-large scale chemical space exploration combining multiple compound enumeration strategies with an advanced filtering cascade.

WaterMap

Calculation of the positions and energies of water sites in a protein binding pocket.

LiveDesign

Collaborative enterprise informatics platform for centralizing access to virtual and wet lab project data and powerful computational predictions.

References

  1. Calculating water thermodynamics in the binding site of proteins – Applications of WaterMap to drug discovery.

    Cappel et al. Curr. Top. Med. Chem. 2017, 17(23), 2586-2598.

  2. Advancing drug discovery through enhanced free energy calculations.

    Abel et al. Acc. Chem. Res. 2017, 50(7), 1625–1632.

  3. Characterization of potent paracaspase MALT1 inhibitors for hematological malignancies.

    Yin et al. ASH Presentation 2021.

  4. Schrödinger reports encouraging initial Phase 1 clinical data for SGR-1505 at EHA Annual Congress.

    Schrödinger. 2025.

  5. A Phase 1 study of SGR-1505, an oral, potent, MALT1 inhibitor for relapsed/refractory (R/R) B-cell malignancies, including chronic lymphocytic leukemia/small lymphocytic leukemia (CLL/SLL).

    Spurgeon et al. European Hematology Association (EHA) Annual Congress. 2025.

Software and services to meet your organizational needs

Industry-Leading Software Platform

Deploy digital drug discovery workflows using a comprehensive and user-friendly platform for molecular modeling, design, and collaboration.

Modeling Services

Leverage Schrödinger’s team of expert computational scientists to advance your projects through key stages in the drug discovery process.

Scientific and Technical Support

Access expert support, educational materials, and training resources designed for both novice and experienced users.

Schrödinger solutions for small molecule protonation state enumeration and pKa prediction


Executive Summary

The pKa of a drug is a key physicochemical property to consider in the drug discovery process, given its importance in determining the ionization state of a molecule at physiological pH. Schrödinger provides several solutions for predicting pKa values, protonation state distributions, and derived properties that can be applied across a range of drug discovery stages, from screening through lead optimization. Here we provide an overview of each technology solution and use case examples of how they can be applied in drug discovery.

 

Background

Small molecules can undergo ionization in solution, where they either lose or gain protons (H+) at different ionizing sites. The propensity of a site or molecule to ionize by the association/dissociation of one or more protons is quantified by a pKa value. If the pKa value refers to a particular ionizable site, it is a microscopic pKa (micro-pKa); if it refers to the entire molecule, it is a macroscopic pKa (macro-pKa). The specific arrangement of protons around the ionizing sites constitutes a protonation state, and different protonation states of the same charge level are called tautomers. Each protonation state is in thermodynamic equilibrium with the others and therefore has a free energy associated with its population within this collection of protonation states, which may be derived either from micro-pKa values through thermodynamic equations or obtained directly by comparing the free energies of the states. In drug design, understanding the different protonation states of a molecule is critical, since they drive properties including solubility, membrane permeability, and activity.

 

Challenges of pKa Prediction

Determining which states predominate at a given pH, and by how much, is a challenging task both experimentally and computationally, because the number of states in thermodynamic equilibrium grows roughly as 2^n with the number, n, of singly protonatable sites. Thus, molecules with many titratable sites can have a large number of different protonation states, all of which need to be enumerated and energetically scored.
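The combinatorial growth is easy to see in a short sketch: treating each singly protonatable site as either protonated or not yields 2^n candidate states, some of which share a net charge and are therefore tautomers of one another. The sites below are illustrative only.

```python
from itertools import product

# Hypothetical ionizable sites: an acid contributes charge -1 when
# deprotonated; a base contributes +1 when protonated.
sites = [("carboxylic acid", "acid"), ("amine", "base"), ("tetrazole", "acid")]

# Each singly protonatable site is either protonated (1) or not (0),
# so n sites give 2**n candidate protonation states.
states = []
for protons in product((0, 1), repeat=len(sites)):
    charge = sum((-1 if kind == "acid" and p == 0 else 0) +
                 (+1 if kind == "base" and p == 1 else 0)
                 for (_, kind), p in zip(sites, protons))
    states.append({"protons": protons, "charge": charge})

n_states = len(states)                                   # 2**3 = 8
neutral_tautomers = [s for s in states if s["charge"] == 0]
```

For these three sites there are eight states, three of which are neutral tautomers; every one of them must be enumerated and energetically scored before populations can be assigned.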

Computationally, Schrödinger uses two main approaches to score states: 1) through evaluating thermodynamic equilibrium equations with micro-pKa values, and 2) directly predicting the states’ relative free energies. Predicting pKa values is an important step to calculating state distributions, which in turn enables prediction of important related quantities that would otherwise be inaccessible.

Figure 1: Relationships between macro-pKa, micro-pKa, protonation states, and tautomers and the corresponding speciation diagram.

 

Overview of Schrödinger Solutions

Epik Classic

Epik Classic, previously known simply as Epik,1 is an expert system for rapidly and accurately predicting the micro-pKa values and the most populated protonation states for a ligand at a given pH. The underlying pKa prediction technology is the empirical Hammett-Taft linear free energy relationship (LFER), which identifies an ionizing group, takes its root pKa value, perturbs it by the bonded chemical fragments, and applies charge spreading to arrive at its effective micro-pKa value. Epik Classic then uses the predicted pKa values to enumerate a ligand’s protonation states, rank them by energy, and return the most populated states. Because Epik Classic uses SMARTS pattern-based rules, it is fast enough for high-throughput use, although at the expense of being unaware of both conformational and stereochemical effects.
 

Epik 7

Epik 7 is a complete redesign of Epik that leverages Schrödinger’s powerful machine learning (ML) technology for more accurate results across broader chemical space. Ionizing groups are initially identified by SMARTS patterns and are then used to enumerate the protonation states for a range of ionizations.2 The micro-pKa values of each site in each state are predicted with 3-layer atomic graph convolutional neural networks (GCNNs) extending out radially six bonds from the ionizing atom. The predicted pKa values for the states are then used to predict the relative energies of the states to both allow determination of the most populated states at a pH and calculation of macro-pKa values. The topological nature of the ML approach means that Epik 7, like previous versions, is rapid but agnostic to 3D geometry and stereochemistry.
 

Jaguar pKa

Jaguar pKa takes a third, more physics-based approach to predicting micro-pKa values for a ligand. This workflow calculates the pKa values at the user-defined ionizing sites in a query ligand by first generating the conjugate pair, on which conformational searches are then executed to locate the lowest energy structures,3 followed by density functional theory (DFT) based geometry optimizations and single-point energy evaluations. The resulting conformationally-averaged, “raw” micro-pKa values are then corrected using empirically-parametrized relationships to give accurate predictions. Jaguar pKa performs best on non-tautomerizable structures. Being physics-based, it does take into account geometric and stereochemical effects, but at the expense of speed.
 

Macro-pKa

Macro-pKa follows the same philosophy as Jaguar pKa by combining physics-based DFT calculations with empirical corrections, but extends its applicability to enable calculation of tautomerizable ligands. Macro-pKa automatically identifies ionizing sites, enumerates the protonation states, and calculates the micro-pKa values following a similar workflow to Jaguar pKa, but with an enhanced scheme for generating empirical corrections. Finally, the calculated micro-pKa values are used to rank the protonation states by energy, return the most populated states for a user-supplied pH, and determine the macro-pKa values for the ligand. The exhaustiveness of this approach comes at a larger time and resource cost than Jaguar pKa.

 

Use Cases

Here we outline several use cases for pKa prediction in the drug discovery workflow.
Note: Each use case example outlined below could be approached with any of the listed solutions within that section. The dataset presented highlights the applicability of just one of the possible solutions.

I. Querying microscopic pKa values

Applicable Solutions

  • Epik Classic
  • Epik 7
  • Jaguar pKa

When investigating the binding modes of a ligand, the micro-pKa value of an ionizing site is an indicator of the propensity for it to become ionized at a given pH. The ionization state of the ligand directly influences how it interacts with another molecule such as a protein, e.g., whether or not it can participate in a salt bridge.

Figure 2: Jaguar pKa micro-pKa predictions for a dataset of small molecules.

II. Querying apparent or macroscopic pKa values

Applicable Solutions

  • Epik 7
  • Macro-pKa

For monoprotic or polyprotic compounds with a single dominant tautomer at each charge level, micro-pKa values may very closely match the apparent or macro-pKa value that is most commonly obtained through titration experiments. However, for compounds or ionization states with multiple competitive tautomers, the micro-pKa value of a single tautomer may not fully reproduce the experimentally observed macroscopic value. To obtain this apparent value, all states must first be enumerated and evaluated so that all their micro-pKa values are considered in the macro-pKa calculation.
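The bookkeeping that links micro- and macro-pKa values can be written down directly for a single deprotonation step. The standard relationships are 1/Ka_macro = sum of 1/Ka_i when the competing tautomers sit on the protonated side, and Ka_macro = sum of Ka_j when they sit on the deprotonated side. A minimal sketch (the pKa values in the assertions below are invented, not measured data):

```python
from math import log10

def macro_pka_protonated_tautomers(micro_pkas):
    """Macro-pKa when the protonated charge level has several tautomers,
    each with its own micro-pKa to a common deprotonated form:
    1/Ka_macro = sum_i 1/Ka_i  =>  pKa_macro = log10(sum_i 10**pKa_i)."""
    return log10(sum(10 ** pka for pka in micro_pkas))

def macro_pka_deprotonated_tautomers(micro_pkas):
    """Macro-pKa when deprotonation can yield several tautomers:
    Ka_macro = sum_j Ka_j  =>  pKa_macro = -log10(sum_j 10**-pKa_j)."""
    return -log10(sum(10 ** -pka for pka in micro_pkas))
```

Note the sanity checks these formulas pass: with a single tautomer the macro-pKa collapses to the micro-pKa, and two equally populated tautomers shift the macroscopic value by log10(2) ≈ 0.3, in the direction you would expect (less acidic when tautomerism stabilizes the protonated side, more acidic when it stabilizes the deprotonated side).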

Figure 3: Macro-pKa macro-pKa predictions for a dataset of tautomeric molecules.

III. Ligand preparation and high-throughput screening

Applicable Solutions

  • Epik Classic
  • Epik 7

Physics-based simulations typically require specification of all atoms in the simulation system, including all hydrogen atoms. Thus, structure-based simulations including Glide docking, molecular dynamics, and free energy perturbation with FEP+ should be performed using an ensemble of the highly-populated protonation states of a ligand. Therefore, a crucial first step in any structure-based screen of a small molecule ligand library is to prepare the ligands by obtaining the most populated protonated states. Epik Classic and Epik 7 are integrated with our automated ligand preparation workflow, LigPrep, to allow preparation of large ligand libraries for high-throughput screening. Additionally, both Epik Classic and Epik 7 and their LigPrep implementations allow for the generation and scoring of additional states that may potentially bind to metal ions in the pocket.
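The "most populated states" selection can be sketched as a Boltzmann weighting over relative state energies followed by a population cutoff. The state names, energies, and cutoff below are hypothetical, not output from any Schrödinger tool.

```python
from math import exp

RT = 0.593  # kcal/mol at ~298 K

def populated_states(state_energies, cutoff=0.05):
    """Boltzmann-weight protonation states by relative free energy (kcal/mol)
    and keep those whose population exceeds the cutoff."""
    weights = {s: exp(-e / RT) for s, e in state_energies.items()}
    z = sum(weights.values())
    return {s: w / z for s, w in weights.items() if w / z >= cutoff}

# Hypothetical relative state energies for one ligand:
kept = populated_states({"neutral": 0.0, "zwitterion": 0.4, "anion": 2.5})
```

Here the high-energy anionic state falls below the population cutoff and is dropped, while the neutral form and zwitterion are both retained for the downstream structure-based ensemble.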

Figure 4: Epik Classic micro-pKa predictions for a dataset of 152 drug molecules.

IV. Hit-to-lead optimization

Applicable Solutions

  • Epik Classic
  • Epik 7

Once hits are identified, a series of analogs is synthesized to explore the relevant chemical space in greater detail and arrive at compounds with improved properties. It is important to be able to screen potential candidates rapidly and accurately to assess which to optimize further. The < 0.5 log unit accuracy and sub-second calculation speed of Epik Classic and Epik 7 make them excellent tools for rapid idea generation and testing. In addition to pKa value and protonation state distribution prediction, they have been implemented in other ADMET or property predictors, such as those for membrane permeability and solvation energy.

Figure 5: Epik 7 macro-pKa predictions for a dataset of congeneric tricyclic thrombin inhibitors.

V. Early-stage lead optimization

Applicable Solutions

  • Epik 7
  • Jaguar pKa
  • Macro-pKa

Optimizing the many physical characteristics required can be laborious and costly, from ideation through synthesis and assay. In this environment, where high quality property predictions are required and time permits, Schrödinger’s physics-based predictors, Jaguar pKa and Macro-pKa, take into account more molecular characteristics, including conformational and stereochemical effects, to improve pKa prediction accuracy. Additionally, Macro-pKa and Epik 7 both offer detailed speciation reports for a queried ligand. These are especially helpful for understanding the distribution of tautomeric states across the pH spectrum.

Figure 6: A Macro-pKa report detailing the macro-pKa value and the distribution of protonation states across a pH range.

 

Feature Comparison Table

Table 1: Comparison of features of the small molecule protonation state enumeration and pKa prediction technologies. a) Easily adjustable; b) Strongly influenced by the number of conformers (and tautomers in Macro-pKa); c) Only by internal experts at this time.

References

  1. Epik: A Software Program for pKa Prediction and Protonation State Generation for Drug-like Molecules.

    Shelley, J. C. et al. J. Comput. Aided Mol. Des. 2007, 21 (12), 681–691

  2. Epik: pKa and Protonation State Prediction through Machine Learning.

    J. Chem. Theory Comput. 2023, 19 (8), 2380–2388

  3. Multiconformation, Density Functional Theory-Based pKa Prediction in Application to Large, Flexible Organic Molecules with Diverse Functional Groups.

    Bochevarov, A. D. J. Chem. Theory Comput. 2016, 12 (12), 6001–6019.

Software and services to meet your organizational needs

Software Platform

Deploy digital materials discovery workflows with a comprehensive and user-friendly platform grounded in physics-based molecular modeling, machine learning, and team collaboration.

Research Services

Leverage Schrödinger’s expert computational scientists to assist at key stages in your materials discovery and development process.

Support & Training

Access expert support, educational materials, and training resources designed for both novice and experienced users.

De novo design of hole-conducting molecules for organic electronics


Panasonic and Schrödinger scientists designed over 50 novel molecules with improved hole mobility by performing large-scale density functional theory (DFT) calculations and machine learning inverse design.

Executive Summary

Tremendous Time Saved and Cost Reduced

  • 3 de novo design methods developed and assessed
  • 14 million molecules enumerated and screened
  • 9,000 DFT calculations performed
  • Over 50 molecules identified with target performance profile

Performance Improved

Identified molecules with lower hole reorganization energy (up to 22% reduction) than the lowest value in the training dataset

Highly Predictive Machine Learning (ML) Models Developed

Leveraged data based on DFT calculations of 250,000 molecules

New Insights Proposed

High quality de novo design complements molecular enumeration and virtual screening

 

Charge carrier mobility is one of the most important characteristics of semiconductor materials.

Applications in printed electronics demand molecules with high mobility. Despite rapid progress toward the discovery of new molecules with improved mobility, challenges persist. For example, the impact of a molecule’s topological shape on the magnitude of its hole mobility is not well understood, complicating optimized molecular design, and it can be extremely costly and time-consuming to synthesize and assess every candidate molecule. Atomistic simulations and machine learning technologies can reveal novel insights that are inaccessible to experimental methods alone.
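One such simulation-accessible quantity is the hole reorganization energy screened in this work, which is commonly estimated from four DFT single-point energies via the standard four-point (Nelsen) scheme. The energies below are placeholders, not values from the Panasonic dataset.

```python
def hole_reorganization_energy(e_n_at_n, e_n_at_c, e_c_at_c, e_c_at_n):
    """Four-point (Nelsen) estimate of the hole reorganization energy.
    e_X_at_Y = energy of charge state X at the optimized geometry of Y
    (n = neutral, c = cation); all energies in the same units (e.g. eV)."""
    lambda_1 = e_c_at_n - e_c_at_c   # cation relaxing on the cation surface
    lambda_2 = e_n_at_c - e_n_at_n   # neutral distorted at the cation geometry
    return lambda_1 + lambda_2

# Placeholder energies in eV: lambda = (5.95 - 5.80) + (0.12 - 0.00) = 0.27 eV
lam = hole_reorganization_energy(0.00, 0.12, 5.80, 5.95)
```

Because a lower reorganization energy means the molecule distorts less upon gaining or losing a hole, minimizing this single scalar is a practical proxy for maximizing hopping-regime charge mobility, which is why it makes a good screening objective for a quarter-million-molecule DFT campaign.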

“With Schrödinger’s advanced simulation tools, we were able to explore millions of molecules and target tens of potential candidates within a short period of time, which is simply unfeasible with traditional approaches. This level of computational power changes the way we innovate. Both the scientific expertise and the excellence of technology Schrödinger brings to the table give us confidence in future collaborations.”
Nobuyuki N. Matsuzawa, General Manager, Panasonic Corporation

Approach

Scientists at Panasonic are challenged to develop novel organic semiconductor materials with higher efficiency. To drive innovation, Panasonic teamed up with Schrödinger for de novo design of new molecules, leveraging Schrödinger’s computational power and expertise in high-throughput DFT calculations, machine learning/deep learning model building, and chemical enumeration.

 


 

Results

Scientists from Panasonic and Schrödinger performed a thorough benchmark study of three de novo methods and identified molecular structures in the heteroacene family that may show improved carrier transport properties.1 Schrödinger demonstrated strong large-scale computing capabilities and in-house expertise in machine learning to develop de novo methods based on knowledge and literature reports, building Bayesian optimizers and reward engineering.

 

Conclusion

Scientists from Panasonic and Schrödinger have applied three major classes of de novo molecular design (inverse design) methods to the challenging problem of improving charge carrier mobility in materials science. They evaluated the performance of these methods via large-scale DFT calculation of hole reorganization energy. These methods present an attractive complement to molecular enumeration and virtual screening, and recent advances in deep learning for de novo design have yielded promising results for the design of novel materials.

  • Over 50 molecules were identified to have lower hole reorganization energy than the lowest value in the training set (up to 22% reduction). We expect significant enhancement in hole mobility from the reduction of the reorganization energy in the newly designed molecules.
  • The best scoring compound was found by the JTNN method, followed by REINVENT. However, on the whole, the REINVENT method generated the best top 1,000 molecules.
  • Based on the findings, the scientists propose that high-quality de novo methods should optimize for compounds that “fill holes” in the space of the enumeration, generating highly targeted molecules.

References

  1. De Novo Design of Molecules with Low Hole Reorganization Energy Based on a Quarter-Million Molecule DFT Screen

    Gabriel Marques*, Karl Leswing, Tim Robertson, David Giesen, Mathew D. Halls, Alexander Goldberg, Kyle Marshall, Joshua Staker, Tsuguo Morisato, Hiroyuki Maeshima, Hideyuki Arai, Masaru Sasago, Eiji Fujii, and Nobuyuki N. Matsuzawa* J. Phys. Chem. A 2021, 125, 33, 7331–7343.


An automated workflow for rapid large-scale computational screening to meet the demands of modern catalyst development


Executive Summary

First-principles simulation has become a reliable tool for the prediction of structures, chemical mechanisms, and reaction energetics for the fundamental steps in homogeneous catalysis. Details of reaction coordinates for competing pathways reveal a fundamental understanding of observed catalytic activity, selectivity, and specificity. Such predictive capability raises the opportunity to accelerate computational discovery and design of new single-site catalysts with enhanced properties.

However, alongside rapid technology development and materials innovation, challenges persist:

  • The complexity of chemical reactions, and the associated need for computational research, has increased as demands for innovation grow
  • The traditional rate of catalyst discovery is limited and unable to keep pace with demands for improved catalysts
  • Existing computational frameworks are manually intensive, limited in scale, and require a high level of expertise and training
  • Cataloging and maintaining databases of novel catalysts is challenging and time-consuming

To democratize the fundamental understanding, design, and discovery of novel catalysts, Schrödinger developed an automated reaction workflow called AutoRW. AutoRW combines the elements of enumeration, mapping, organization, and output needed for high-throughput screening of catalysts, reagents, and substrates, requiring only a pre-built reaction coordinate, a novel chemical fragment, and any R-groups for enumeration. By automating processes and computing the reaction coordinates, rates, energies, transition states, structures, and properties for each reaction, AutoRW streamlines the process of large-scale computational catalyst screening.

 

Solution: AutoRW for automated large-scale catalyst screening

  • Simplified, customizable workflows that enhance reproducibility and predictability
  • Easy to use for both expert and non-expert computational users
  • Increased productivity for highly-complex problems and challenges
  • Enhanced coverage where conformers could be missed by manual methods
  • Improved organization of files and properties to save time and reduce errors
  • Dedicated scientific & technical support and vast learning resources

Case Studies: How AutoRW Accelerates Innovation in Catalysis and Reactivity

Understanding the Effects of Catalyst Selectivity on Polypropylene Tacticity

Production of olefin-based polymer products has surpassed 100 million tonnes. Of these, polypropylene is the second most produced polymer. Its physical properties are directly influenced by the regularity of adjacent stereocenters. This regularity, or tacticity, is determined by the catalyst’s kinetic selectivity, and controlling the incorporation of α-olefin monomer allows for fine-tuning of the polymer’s physical properties. In this project, scientists studied 13 isotactic catalysts using AutoRW to fundamentally understand the adjacent stereoselectivity of polypropylene. The results were in good agreement with the experimental selectivities (R = 0.8). This quick and accurate approach allows for optimized polypropylene design and synthesis with target structures and properties.
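The link between computed barriers and observed selectivity comes from transition-state theory: the ratio of rates for two competing pathways depends exponentially on the difference in their free-energy barriers. A small sketch with illustrative numbers only:

```python
from math import exp

R = 8.314462618e-3   # gas constant, kJ/(mol*K)

def selectivity_ratio(ddg_kj_mol, temp_k=298.15):
    """Rate ratio k1/k2 for two competing pathways from the difference in
    their free-energy barriers, ddG = G2_barrier - G1_barrier:
    k1/k2 = exp(ddG / RT)."""
    return exp(ddg_kj_mol / (R * temp_k))

# Illustrative: a ~5.7 kJ/mol barrier difference at 298 K corresponds to
# roughly a 10:1 preference for the lower-barrier (e.g. stereoretentive) path.
ratio = selectivity_ratio(5.7)
```

The exponential dependence is why even modest accuracy gains in computed barriers matter: an error of a few kJ/mol changes the predicted stereoselectivity by an order of magnitude.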

 

Screening Epoxy Amine Reactions for Efficient High-performance Polymer Design

Epoxy amine crosslinking reaction. The initial reaction occurs between a primary amine and a single epoxide. The resulting product contains a secondary amine that can further react with additional epoxide.

 

Thermoset polymers have gained interest in recent years due to their favorable thermomechanical properties for applications in aerospace, automotive, defense and high performance athletics equipment. While thermosets are very versatile, the cost to incorporate these polymers into new materials is high. Accelerating the development process pipeline would not only reduce costs, it would also decrease accumulation of thermoset waste which is difficult to recycle. Towards this end, the scientists studied a library of 12 amines and 21 epoxides to build a relative reaction barrier heat map. Each epoxy/amine combination was subjected to the Reaction Workflow to locate all stationary points in the reaction as well as compute energetic barriers for all reaction steps, enabling efficient design and synthesis of high-performance polymers.

 

Investigating Comonomer Selectivity for Optimized Block Copolymerization

Block copolymers have unique properties that include high strength, flexibility, and melting temperature. Their physical properties are directly influenced by the polymer block length and distribution, which are then determined by the kinetic comonomer selectivity and chain shuttling agent activity. The comonomer selectivity is influenced by both the catalyst and existing polymer product. In this project, scientists ran AutoRW to screen and test 35 catalyst derivatives with different polymer substrates to understand the effect of catalysts on comonomer selectivity for different copolymerization reactions. This approach enables quick screening of catalysts and substrates to design block copolymers with target structures and properties.

Empower advanced catalyst discovery across entire R&D project teams

As modern R&D processes evolve in a more collaborative and globalized manner, tremendous effort has been put into data storage and sharing, communication, and project management across teams and geographies. Enterprise-scale informatics platforms for R&D have been developed to break down silos and barriers, and have been adopted by many companies across several molecular design industries.

The benefits are clear: project teams can work across departments and sites, across geographies and time zones, and even across companies, with live sharing of all project data, including experiments, designs, processes, and simulations. Teams can share, analyze, and communicate data seamlessly and make rapid decisions, accelerating collaboration and project progress. Enterprise informatics platforms also simplify and optimize data management for organizations, eliminating the chaos of storing large datasets on individual computer drives and of transferring data between teams or during personnel changes.

Schrödinger’s LiveDesign is a powerful, web-based informatics and molecular design platform that enables teams to rapidly advance materials discovery projects by collaborating, designing, experimenting, analyzing, tracking, and reporting in a centralized platform. By incorporating AutoRW into LiveDesign, scientists can get the most benefits from both.

 

AutoRW in LiveDesign for enterprise-wide time and cost savings

  • Streamlined automated enterprise solution for catalysis and reactivity
  • Scalable to satisfy the needs of large global organizations
  • Integrated with advanced machine learning systems
  • End-to-end collaborative discovery between chemists, modelers and engineers on a single web-based platform
  • Live sharing of ideas and results for rapid decision-making
  • Intuitive cheminformatics: visualization, data and model analysis for experimental and computational data simultaneously

Using LiveDesign, a team can collaborate to virtually screen over 2,000 catalysts per year, whereas a single modeling user can screen only about 150 catalysts annually. Employing automated, enterprise-scale workflows leads to much higher cost efficiency and rates of success.

Conclusions

Scientists across industries are entering a new paradigm for catalysis research. Work historically based on purely experimental trial and error is moving to computationally driven workflows. Rapid technology development and evolving project collaborations demand simplified, automated workflows at enterprise scale. Schrödinger’s AutoRW and LiveDesign enable rational catalyst design in an automated, accelerated, and collaborative manner on a single web-based platform that is easy to use and deploy across teams and organizations.

The tools empower scientists to solve high-level challenges of even more complex reactions and catalysis systems with reduced time and cost while enhancing predictability and productivity.

Software and services to meet your organizational needs

Industry-Leading Software Platform

Deploy digital drug discovery workflows using a comprehensive and user-friendly platform for molecular modeling, design, and collaboration.

Research Enablement Services

Leverage Schrödinger’s team of expert computational scientists to advance your projects through key stages in the drug discovery process.

Scientific and Technical Support

Access expert support, educational materials, and training resources designed for both novice and experienced users.

Accelerating the Design of Asymmetric Catalysts with a Digital Chemistry Platform

APR 18, 2023


Speaker

Pavel A. Dub
Senior Principal Scientist

Abstract

Asymmetric catalysis became an integral part of the science-driven technological revolution of the second half of the 20th century, leading to decreased energy demands, sustainable chemical processes, and the realization of “impossible” transformations. Asymmetric catalysis based on chiral transition-metal complexes plays an important role in the synthesis of single-enantiomer drugs, perfumes, and agrochemicals. The importance of the field is recognized by two Nobel Prizes, in 2001 (transition-metal catalysis) and 2021 (organocatalysis).

Asymmetric catalysts are traditionally designed by experimental trial-and-error methods, which are resource-, time- and labor-consuming, and thus extremely expensive. Digital methods offer the opportunity to expedite catalyst design. Until recently, computational chemistry, typically quantum chemical studies, indirectly contributed to asymmetric catalyst design by providing rationalization for the mechanism of generation of chirality. With the development of more advanced methods, algorithms and an included layer of automation, computational catalysis is now providing the possibility for direct asymmetric catalyst design.

In this webinar, we will demonstrate how Schrödinger’s advanced digital chemistry platform can be used to accelerate the direct design and discovery of asymmetric catalysts.

Key Learning Objectives:

  • Learn how to design an asymmetric catalyst with computational chemistry
  • Learn how automated high-throughput simulation workflows enable rapid asymmetric catalyst design
  • Understand the intersection of physics-based and machine learning techniques in asymmetric catalyst design

Battery Tech – Leveraging Atomic Scale Modeling for Design and Discovery of Next-Generation Battery Materials

MAR 29, 2023


Speaker

Garvit Agarwal
Senior Scientist

Abstract

Rechargeable Li-ion batteries (LIBs) are revolutionizing electric vehicles and portable devices, but improvements are needed in areas such as power density, safety, reliability, and lifetime. Reliable atomic scale modeling enables rapid initial evaluation of large chemical and material design space, accelerating the development cycle of next-generation battery technologies.

Attend this webinar to learn about an advanced digital chemistry platform for developing next-generation battery materials with improved properties. The presentation will include use of physics-based and machine learning techniques for understanding structure-property relationships of different battery components. It will also outline an automated active learning framework for the development of neural network force fields to predict critical bulk properties of high-performance liquid electrolytes used in advanced batteries.

Attend this webinar and learn:

  • Predictive capabilities of physics-based modeling for battery materials
  • How automated high-throughput simulation workflows enable rapid screening of new material candidates
  • How advanced neural network force fields can be applied for accurate electrolyte property prediction

Expect Success: Modern Virtual Screening Technologies that Actually Deliver High-Quality, Developable Hits

MAR 14, 2023


Speakers

Jeremie Vendome
Steve Jerome

Abstract

For years, traditional virtual screening approaches have suffered from low hit rates, lack of novelty, and poor developability of the molecules identified. These shortcomings have been attributed both to the performance of screening technologies and to the size of the chemical libraries that could realistically be screened, and they have led to a lack of confidence in virtual screening as a reliable approach for hit discovery.

Today, Schrödinger is pioneering a new, modern virtual screening workflow that leverages game-changing technologies, including AI/ML-powered active learning and accurate Absolute Binding FEP+ (ABFEP), to screen and rescore ultra-large chemical libraries, including nearly comprehensive fragment-based libraries, in a cost-effective way. The performance of this workflow has been transformational.

In this webinar, we will describe several recent case studies from the Schrödinger Therapeutics Group where this modern large-scale virtual screening workflow resulted in double-digit hit rates across a diverse range of targets. We will also describe how to access these technologies today via Schrödinger’s Research Enablement Services.

Highlights

  • Hear case studies of successfully achieving double-digit hit rates from virtual screens across broad target classes
  • Learn about a new modern screening workflow that combines physics-based methods (docking and absolute binding free energy calculations) with machine learning for large scale screening and rescoring of whole ligands and fragments
  • Learn about how to easily access these technologies and expertise via Schrödinger’s Research Enablement Services for Hit Discovery
  • Ask questions to gain further insight from the speakers to apply to your work

DeepAutoQSAR hardware benchmark

Executive Summary

  • This benchmark evaluates the performance of DeepAutoQSAR on two datasets of different sizes using different hardware configurations and model training times.
  • Our general recommendations, based on the results and the hardware costs, are to use the NVIDIA T4 GPU hardware with the following training times: 2 hrs for datasets with less than 1,000 data points; 4 hrs for 1,000 to 10,000 data points; and 8 hrs for more than 10,000 data points.
  • While performance ultimately depends on the data, the intended purpose of this benchmark is to serve as a starting point for choosing the hardware to train the ML model(s) with and the specific model training time to use. Actual performance is highly dependent on the specific dataset and may require increasing the training time or choosing a different GPU to achieve the desired results.

 

Introduction

The application of machine learning (ML) to predict the molecular properties of drug candidates is an important area of research that has the potential to reduce drug development timelines and accelerate the creation of medicines for patients with serious unmet medical needs.

The successful application of ML relies on sufficient data quantity and quality, a suitable model architecture(s) for the given problem, proper hyperparameter choices (the parameters for a particular ML model architecture), and appropriate model training time for a chosen hardware configuration.

DeepAutoQSAR is a machine learning product that allows users to predict molecular properties based on chemical structure. The automated supervised learning pipeline enables both novice and experienced users to create and deploy best-in-class quantitative structure activity/property relationship (QSAR/QSPR) models.

The purpose of this benchmark, which builds on the work of an earlier whitepaper [1], is to characterize the performance of DeepAutoQSAR on two datasets of different sizes using different hardware configurations and model training times. While performance ultimately depends on the data, the intended purpose of this benchmark is to serve as a starting point for choosing the hardware to train the ML model(s) with and the specific model training time to use.

 

Datasets

The datasets used in the benchmark were obtained from the Therapeutics Data Commons (TDC). TDC provides ML-ready datasets that can be used for learning tasks that are valuable to pharmaceutical research and development and that cover different therapeutic modalities and stages of the drug development lifecycle [2].

We use two datasets that contain assay data for one Absorption, Distribution, Metabolism, and Excretion (ADME) property each:

  1. Caco2 (Human Epithelial Cell Effective Permeability)
  2. AqSolDB (Aqueous Solubility)

Performance is measured by the median accuracy of the ADME property prediction for a sample of train-test data splits; note that the specific train-test data splits used are different from the splits provided by TDC for its benchmark leaderboard.

Dataset Descriptions

Caco2 (Human Epithelial Cell Effective Permeability) [3]*

The human colon epithelial cancer cell line, Caco-2, is used as an in vitro model of human intestinal tissue. Experimental measurements of the rate at which a drug passes through Caco-2 cells approximate the rate at which it permeates human intestinal tissue.

This dataset contains numerical data for use in regression, and there are 906 compounds.

AqSolDB (Aqueous Solubility) [4]*

Aqueous solubility measures a drug’s ability to dissolve in water. Poor water solubility can lead to slow drug absorption, inadequate bioavailability, and even toxicity. More than 40% of new chemical entities are poorly soluble in water.

This dataset contains numeric, non-integer data for use in regression, and there are 9845 compounds.

*Note: The datasets have been modified from their original form to remove structural redundancies and experimental errors.

 

Hardware

The hardware used in the benchmark was provisioned from the Google Cloud Platform (GCP); therefore, the hardware configurations chosen were based on the machine types offered by Google.

These limitations on hardware configurations, dictated by the cloud provider, mean that only specific hardware pairings are available, such as a particular GPU platform that can only be used with a given CPU platform. For example, NVIDIA A100 GPUs can only be run on an A2 machine type, which only uses the Intel Cascade Lake CPU platform. Constrained by these limitations, every effort was made to keep hardware-specific options consistent across machine types, to provide hardware diversity when reasonable, and to use cost-effective high-performance computing hardware.

 

| Hardware Key | GCP Machine Type | CPU Platform | vCPUs* | RAM (GB) | GPU Platform | GPUs | Cost ($) per Hour+ |
| 2 vCPUs | n2-standard-2 | Intel Ice Lake | 2 | 8 | N/A | None | $0.10 |
| 4 vCPUs | n2-standard-4 | Intel Ice Lake | 4 | 16 | N/A | None | $0.19 |
| 8 vCPUs | n2-standard-8 | Intel Ice Lake | 8 | 32 | N/A | None | $0.39 |
| 16 vCPUs | n2-standard-16 | Intel Ice Lake | 16 | 64 | N/A | None | $0.78 |
| T4 GPU | n1-standard-4 | Intel Ice Lake** | 4 | 15 | Nvidia T4 | 1 | $0.54 |
| V100 GPU | n1-standard-4 | Intel Ice Lake** | 4 | 15 | Nvidia V100 | 1 | $2.67 |
| A100 GPU | a2-highgpu-1g | Intel Cascade Lake | 12 | 85 | Nvidia A100 | 1 | $3.67 |

* For these machine types, GCP defines vCPUs as the number of threads; 2 vCPUs (threads) per core.
** Up to the Intel Ice Lake generation; GCP auto-assigns the CPU platform on node pool creation.
+ Prices in November 2022. Includes sustained use discounts.
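As a quick sanity check on these rates (hourly prices taken from the table above; the helper function itself is hypothetical):

```python
# Hourly GCP prices from the table above (November 2022, sustained-use discounts included).
HOURLY_USD = {
    "2 vCPUs": 0.10, "4 vCPUs": 0.19, "8 vCPUs": 0.39, "16 vCPUs": 0.78,
    "T4 GPU": 0.54, "V100 GPU": 2.67, "A100 GPU": 3.67,
}

def run_cost(hardware: str, hours: float) -> float:
    """Cost in USD of a single training run on the given hardware."""
    return round(HOURLY_USD[hardware] * hours, 2)

# A 4-hour training run is several times cheaper on a T4 than on an A100.
print(run_cost("T4 GPU", 4), run_cost("A100 GPU", 4))
```

This cost gap is the main reason the recommendations later in this benchmark default to the T4 rather than the fastest available GPU.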

 

Benchmarking Methods & Results

Our benchmark is a two-stage process. In the first stage, DeepAutoQSAR models are trained to fit the TDC datasets using a standard cross-validation procedure that selects the top-performing ML models for the model ensemble and optimizes their hyperparameters; the end result of this stage is an ensemble of top-performing models whose predictions, under normal usage, are averaged to provide a mean prediction and an associated ensemble standard deviation. We detail the specific protocol in our white paper, Benchmark Study of DeepAutoQSAR, ChemProp, and DeepPurpose on the ADMET Subset of the Therapeutic Data Commons [1]. In the second stage, random train-test splits of the data are computed, and the previously determined ensemble of top ML model architectures, with their specific hyperparameter configurations, is retrained on the new training splits. Predictions are then generated for the corresponding test splits. These multi-split metrics provide a more robust estimate of model performance by reducing the potential bias introduced by any single train-test data split. Model performance in this hardware benchmark is reported as the median R2 coefficient of determination [5] across these random train-test splits for each hardware configuration and model training time.

In the first stage, the initial training procedure runs continuously for each training time allotment. Due to the stochastic nature of hyperparameter optimization and model architecture selection, each hardware and training time combination can explore a different number of model architectures and hyperparameter combinations each time a benchmark job is run. The model training times evaluated were 0.5, 1, 2, 4, 8, and 16 hours. As a general rule, more capable hardware running for longer training times on smaller datasets (e.g., a machine with an A100 GPU training for 16 hrs on the smaller Caco2 permeability dataset) will explore more hyperparameterizations than less capable hardware running for shorter training times on larger datasets (e.g., a two-core machine training for 2 hrs on the larger AqSolDB dataset).

Since model architecture selection and hyperparameter sampling are stochastic, we run each benchmark configuration (a particular hardware and training time combination) three times and report average performance; this is especially relevant when fewer hyperparameter combinations are explored, as model performance is then more sensitive to hyperparameter sampling. The output of the first stage is an ensemble of top models, determined by cross validation, with specific hyperparameter choices for each.

The second stage of our benchmark runs for half the training time of the first stage. Increasing training time leads to more robust statistics as the median performance converges to a split-independent value, but comes at the expense of increased computational cost; in practice, computational expense must be balanced against the need to train the ensemble model for a sufficiently long time. For performance reporting, we provide the median R2 coefficient of determination [5] computed over multiple train-test splits, which aims to reduce the potential bias introduced by any single train-test split. To compute this R2, we repeatedly split the data into training and testing sets via bootstrap sampling with replacement: we take N samples with replacement from the dataset of N total data points and remove any duplicates to form the training subset. The selected points are used to train the specific model architectures found in stage one, and the unselected points serve as the test holdout. We do this until the time limit is reached and report the median R2 across all resamplings.
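The stage-two resampling procedure can be sketched in plain Python; here a toy one-dimensional least-squares line stands in for the DeepAutoQSAR ensemble, and all data are synthetic:

```python
import random
import statistics

def r2(y_true, y_pred):
    """Coefficient of determination, as computed by sklearn.metrics.r2_score."""
    mean = statistics.fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def fit_line(xs, ys):
    """Toy stand-in for model training: 1D least-squares line (slope, intercept)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return slope, my - slope * mx

def median_bootstrap_r2(xs, ys, n_splits=25, seed=0):
    """Bootstrap train-test splits: draw N indices with replacement, deduplicate
    to form the training set; the unselected points are the test holdout."""
    rng = random.Random(seed)
    n, scores = len(xs), []
    for _ in range(n_splits):
        train = {rng.randrange(n) for _ in range(n)}
        test = [i for i in range(n) if i not in train]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        scores.append(r2([ys[i] for i in test], [a * xs[i] + b for i in test]))
    return statistics.median(scores)

# Synthetic regression data: y = 2x + 1 plus a little noise.
gen = random.Random(1)
xs = [i / 10 for i in range(200)]
ys = [2 * x + 1 + gen.gauss(0, 0.5) for x in xs]
print(round(median_bootstrap_r2(xs, ys), 3))
```

Because each bootstrap draw leaves out roughly a third of the points, every resampling yields a fresh holdout, and the median over many resamplings is far less sensitive to any one lucky or unlucky split.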

As both TDC datasets pose numerical regression problems, this metric is a reasonable measure of model performance; however, the choice of performance metric in real-world applications should always be determined by the use case of the ensemble model. Sometimes MAE or RMSE is more appropriate for assessing whether a model is sufficiently performant. The output of the second stage is a distribution of ensemble model performances over different train-test splits; the reported value is the median of that distribution.

We plot the benchmark results, which is the median R2 coefficient of determination from the second stage, below. Our first plot shows performance on the AqSolDB dataset, and the second plot shows performance on the Caco2 permeability dataset. For each of these datasets, we highlight the progression of performance over time grouped by hardware type, where hardware type is on the x-axis, training time in hours is the bar color, and median R2 score is on the y-axis. The data used to generate the plots are provided in the supplementary tables.

 

Figure 1: Grouping R2 score by hardware configurations on the AqSolDB regression dataset.

 

Figure 2: Grouping R2 score by hardware configurations on the Caco2 permeability regression dataset.

 

Based on these results and the hardware costs, our general recommendations are the following:

| Number of Data Points | Hardware | Training Time (hr) |
| <1,000 | Nvidia T4 GPU | 2 |
| 1,000 – 10,000 | Nvidia T4 GPU | 4 |
| >10,000 | Nvidia T4 GPU | 8 |
These recommendations are a starting point and a lower bound. Actual performance is highly dependent on the specific dataset, and you may need to increase the training time or choose a different GPU to achieve your desired results.
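Encoded as a small helper (hypothetical; it simply restates the recommendations above, with the NVIDIA T4 GPU assumed), the starting point becomes:

```python
def recommended_training_hours(n_points: int) -> int:
    """Starting-point training time in hours on an NVIDIA T4 GPU,
    per the benchmark recommendations; treat the result as a lower bound."""
    if n_points < 1_000:
        return 2
    if n_points <= 10_000:
        return 4
    return 8

# The two benchmark datasets: Caco2 (906 compounds) and AqSolDB (9,845 compounds).
print(recommended_training_hours(906), recommended_training_hours(9845))
```

Under these recommendations, the Caco2 dataset would start at 2 hours of training and AqSolDB at 4.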

Selected publications

  1. Kaplan, Z.; Ehrlich, S.; Leswing, K. Benchmark study of DeepAutoQSAR, ChemProp, and DeepPurpose on the ADMET subset of the Therapeutic Data Commons. Schrödinger, Inc., 2022.

    https://www.schrodinger.com/science-articles/benchmark-study-deepautoqsar-chemprop-and-deeppurpose-admet-subset-therapeutic-data (accessed 2022-11-29).

  2. Therapeutics Data Commons.

    https://tdcommons.ai/ (accessed 2022-06-15).

  3. ADME – TDC.

    https://tdcommons.ai/single_pred_tasks/adme/#caco-2-cell-effective-permeability-wang-et-al (accessed 2022-06-15).

  4. ADME – TDC.

    https://tdcommons.ai/single_pred_tasks/adme/#solubility-aqsoldb (accessed 2022-06-15).

  5. Sklearn.metrics.r2_score — scikit-learn 1.1.3 documentation.

    https://scikit-learn.org/stable/modules/generated/sklearn.metrics.r2_score.html#sklearn-metrics-r2-score (accessed 2022-11-29).


The role of digital chemistry across the polymer supply chain


Molecular modeling and simulation tools have proven effective in materials development and are increasing in use throughout the polymer industry, from raw materials suppliers to end product manufacturers. Computational workflows also open new avenues for developing polymers with improved recyclability. Physics-based simulations offer reliable predictions of structures, morphologies, properties, and chemical reactivity for polymers. Recent advances in machine learning, deep learning, and enterprise informatics platforms have accelerated the speed, accuracy, and automation of novel materials and solutions discovery. A paradigm shift to computer-driven molecular design is occurring throughout the industry.

Vision of a digital chemistry catalyzed polymer supply chain:

Raw Materials Suppliers: Employ atomistic simulations to improve understanding and predict properties of downstream products; offer optimized raw materials to compounders.

Polymer Compounders: Use atomistic simulations to understand formulation chemistry and predict properties; provide detailed requirements to raw materials suppliers and offer optimized products to end product manufacturers.

End Product Manufacturers: Leverage atomistic simulations to predict the performance of final products and quickly identify causes of failure; give specific formulation requirements to compounders.

 


 

Solution Overview

Schrödinger’s Materials Science platform offers tailored solutions for research and business throughout the polymer supply chain, with differentiated model builders, efficient simulation engines accelerated by GPU computing power, automated thermophysical and mechanical response workflows, and accurate analysis tools.

  • Broad molecular simulation and property prediction tools for: Thermal Properties, Mechanical and Dielectric Properties, Reactivity and Kinetics, Aggregation in Polymer Production, Solvent Sensitivity, Gas, Ion, Additive Diffusivity, Phase Morphology and SAXS Scattering, Semi-crystalline Morphology
  • Applicable to all polymer types: thermoplastic homo- and copolymers, crosslinked polymers, elastomers, and dendrimers
  • Intuitive user interface with automated workflows for experts or non-experts
  • Dedicated scientific/technical support and vast learning resources

 

Digital Chemistry Value Across Polymer Supply Chain (Example: Transportation Industry)

 


 

1. Raw Materials Suppliers

Suppliers of petrochemical and chemical feedstocks, additives, and various monomers and resins

Design new chemistries from alternative sources and discover new applications through simulating downstream products properties

  • Predict polymer crosslinker performance in composite matrix resins such as epoxy-amine and cyanate esters
  • Simulate the interaction between thermoplastic styrene-butadiene and crosslinkers

Speed decision making for catalyst selection in raw materials production

  • Simulate and understand the catalysis mechanisms, selectivity, and reactivity of epoxy amine, urethane, and other reactions

Develop alternative greener raw materials that are more environmentally sustainable

  • Simulate the impact of degradation on modulus for a chemistry of focus

 

2. Polymer Compounders

Suppliers who prepare polymer formulations by mixing or/and blending polymers and additives into process-ready products

Predict the performance of alternative raw materials in formulations and end products

  • Predict glass transition, thermal stability, and thermal expansion with new polymers
  • Quantify the diffusion of additives in polymers
  • Understand water transport and morphological stability of polymer formulations

Efficiently optimize formulation properties

  • Predict and track water uptake in polymer composites
  • Predict curing kinetics and processing properties

Develop greener formulations that are more environmentally sustainable

  • Simulate and screen for optimal formulation with new bio-based chemistry

 

3. End Product Manufacturers

Processors of resins/formulations who make them into finished products on the market

Enable reliable decision-making through predictive modeling of end product properties

  • Predict tire materials performance with different additives and cross-linkers

Obtain best chemistry from upstream suppliers by targeted chemical design to properties critical to product and processing constraints

  • High-throughput screening of epoxy-amine reactions to identify the unique combinations for target properties

Accelerate the manufacturing process pipeline

  • Predict polymer gelling during manufacturing process

Quickly screen and identify potential causes and impacts of manufacturing and material source deviations

  • Predict sensitivity of matrix to cleaning solvents

Design greener products that are more environmentally sustainable

  • Simulate and predict properties of high-performance resins with bio-based materials and automate discovery of new biomaterials

 

4. Polymer Recycling

Research and design for recyclability throughout the polymer supply chain

Design polymers for recyclability

  • Predict selectivity of chemical recycling reaction

Expand use of recycled materials

  • Simulate impact of recycled polymers in packaging

Determine impact on product with use of recycled material

  • Screen for property changes with recycling driven microstructure changes

 

About Schrödinger

Schrödinger is transforming the way materials are discovered. Schrödinger has pioneered a physics-based software platform that enables discovery of high-quality, novel molecules for materials applications more rapidly and at lower cost compared to traditional methods.

Learn how digital chemistry is driving innovation across materials science industries


2022 Schrödinger Fall Chinese Life Science Webinar | Breaking New Ground for Structure-Based Drug Discovery with the Latest Physics-Based Computational Methods

NOV 24, 2022


Speaker

Dr. Jianxin Duan
Fellow

Abstract


The value of pursuing a structure-based drug discovery (SBDD) strategy has amplified in recent years as new highly-predictive, physics-based methods have evolved and demonstrated the ability to accelerate the discovery of novel clinical compounds. However, these approaches are limited by the availability of high-quality structural models of the target protein. Recent advances in structural biology such as cryo-EM and computationally-predicted protein models (using machine learning and physics-based methods) have the potential to open a new world of targets to pursue. In this webinar, you’ll learn how new advances in computational workflows are enabling structure-based drug discovery on these historically challenging targets and off-targets.

Key topics covered:

Overview of new computational approaches for building and validating high-quality protein structural models for use in SBDD in the absence of an experimental crystal structure (i.e. homology models or AlphaFold structures)

Case studies demonstrating the impact of these approaches to:

1) progress initial hits from high-throughput screens

2) dial out off-target liabilities

3) progress entire programs using homology models

Computational chemistry applications


An in-depth exploration of computational chemistry applications to solve real-life biological science, materials, and engineering problems.

Computational chemistry allows researchers to explore a large, diverse range of chemical space since it is much easier to draw a molecule on the computer than to synthesize, purify, and characterize a molecule in a lab.

When deployed appropriately, computational chemistry applications can effectively bring molecules to life on the computer by accurately simulating and predicting relevant properties. For instance, the binding affinity of a small-molecule ligand to a protein target can be calculated with a similar accuracy to that of wet lab assays.

Within computational chemistry, physics-based methods grounded in first-principles can enable prediction accuracy matching experimental accuracy and are broadly applicable, but they tend to be more computationally expensive than other methods. Alternatively, machine learning (ML) methods, which develop a model by training on a data set, are also being deployed for molecular design. These ML approaches can generate results much faster but are most effective when exploring chemical space that is related to the data set the machine learning model is built upon, thus limiting their domain of applicability.

Combining physics-based and ML approaches incorporates the strengths of both to speed up scientific advances in molecular design. For example, integrating active learning into physics-based molecular docking allows very large chemical libraries to be assessed efficiently while retaining a high level of performance. With active learning incorporated into docking algorithms, roughly 30,000 compounds can be evaluated per second, compared with roughly one compound per 30 seconds for typical non-ML methods, a speedup of nearly six orders of magnitude.
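To make the active-learning idea concrete, here is a deliberately minimal sketch: a one-dimensional toy "chemical space", a quadratic function standing in for the expensive docking oracle, and a nearest-neighbor lookup standing in for the ML surrogate. None of this reflects any actual docking implementation:

```python
import random

def expensive_dock(x):
    """Stand-in for a physics-based docking score (lower = better); invented form
    with its best candidate at x = 0.7."""
    return (x - 0.7) ** 2

def surrogate_rank(scored, pool):
    """Tiny 'ML surrogate': predict each pool member's score from its nearest
    already-scored neighbor, then rank best-first."""
    def predict(x):
        nearest = min(scored, key=lambda s: abs(s[0] - x))
        return nearest[1]
    return sorted(pool, key=predict)

def active_learning_screen(library, n_rounds=3, batch=50, seed=0):
    """Dock a small random seed batch, then repeatedly retrain the surrogate
    and dock only the most promising candidates it proposes."""
    rng = random.Random(seed)
    pool = list(library)
    rng.shuffle(pool)
    scored = [(x, expensive_dock(x)) for x in pool[:batch]]  # seed round: random batch
    pool = pool[batch:]
    for _ in range(n_rounds - 1):
        ranked = surrogate_rank(scored, pool)          # cheap surrogate pass over the pool
        picks, pool = ranked[:batch], ranked[batch:]   # dock only the most promising
        scored += [(x, expensive_dock(x)) for x in picks]
    return min(scored, key=lambda s: s[1])

library = [i / 1000 for i in range(1000)]
best_x, best_score = active_learning_screen(library)
print(round(best_x, 3))
```

Only 150 of the 1,000 candidates are ever "docked", yet the loop homes in on the optimum; this is the mechanism that lets active learning cover ultra-large libraries with a small budget of expensive calculations.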

Putting Computational Chemistry to Work

Many industries are using computational chemistry methods and molecular modeling to drive innovations in pharmaceutical drugs, packaging materials, batteries, and more. Some applications for computational chemistry include:

  • Drug design
  • Medicinal chemistry design
  • Chemoinformatics
  • Consumer packaged goods
  • Protein/antibody engineering
  • Enzyme design
  • Organic electronics
  • Pharmaceutical formulations
  • Catalysis design
  • Polymer design
  • Surface chemistry
  • Energy capture and storage
  • Lead optimization
  • Drug target validation
  • Semiconductors
  • Peptide design
  • Metals, alloys, and ceramics design

Benefits of Using Computational Chemistry

Computational chemistry aims to simulate and predict molecular structures and properties using different kinds of calculations based on quantum and classical physics. Advances in machine learning are also making computational chemistry more effective by increasing the speed at which calculations can be done.

Computational chemistry methods reduce the time, money, and reagent resources spent on synthesis, assays, and other experimental work. Machine learning applications can further enhance computational chemistry by increasing the speed of complex calculations, sometimes by several orders of magnitude. By carefully integrating machine learning with physics-based algorithms, digital chemical design can easily outpace wet lab design. This time savings directly translates into cost savings. Additionally, these methods allow for a broader expanse of chemical space to be explored, which can result in a greater likelihood of finding unexpected, novel molecules. In the fast-paced world of molecular design, where first-to-patent can mean the difference between success and the loss of a research program, the increase in the speed and breadth afforded by digital chemistry increases the chances of owning intellectual property.

Real-World Computational Chemistry Applications

Computational Chemistry Accelerates Drug Design

When used in drug discovery programs, computational tools allow chemical space to be explored at speeds and costs that cannot be matched by wet-lab experiments.

For example, the lead optimization process was recently accelerated by using a broad search algorithm and cloud computing to explore a huge chemical space, with more than 1 billion molecules computationally characterized, toward the goal of designing new inhibitors of D-amino acid oxidase (DAO), a target for the treatment of schizophrenia. This work shows how chemical enumeration, property filtering, machine learning, and rigorous free energy perturbation (FEP) calculations can be applied to design new small-molecule drugs and tackle the multiparameter optimization problem.
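The enumerate-filter-rank-shortlist funnel described above can be illustrated with a minimal sketch. The property values, cutoffs, and "ML model" below are invented placeholders; a real campaign would use computed descriptors and a trained activity model, with only the final shortlist sent on to expensive free energy calculations.

```python
# Illustrative multiparameter funnel: enumerate -> property filter -> ML rank
# -> shortlist for (expensive) free energy calculations. All values are toy
# placeholders, not real chemistry.
candidates = [
    {"id": f"cmpd-{i}", "mol_wt": 250 + 7 * i, "logp": -1.0 + 0.15 * i}
    for i in range(60)
]  # stands in for a large enumerated library

# Stage 1: cheap property filters (drug-likeness-style cutoffs).
filtered = [c for c in candidates
            if c["mol_wt"] <= 500 and 0.0 <= c["logp"] <= 5.0]

# Stage 2: rank with a mock ML activity model (here, a simple scoring stub
# that pretends the model prefers logP near 2.5).
def ml_score(c):
    return -abs(c["logp"] - 2.5)

ranked = sorted(filtered, key=ml_score, reverse=True)

# Stage 3: only the top of the ranked list goes on to rigorous FEP.
fep_shortlist = ranked[:5]
print([c["id"] for c in fep_shortlist])
```

The point of the funnel is the shrinking cost profile: cheap filters prune most of the billion-scale library, an ML model ranks the survivors, and only a handful of compounds incur the cost of rigorous physics-based calculations.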

R&D for Product Development in Consumer Packaged Goods

In the consumer packaged goods (CPG) industry, manufacturers need to consider cost, performance and sustainability when developing new products.

Computational chemistry models and simulations decrease the development timeline and costs by allowing for fast screening, design and testing of new materials. Reckitt, which produces health, hygiene and nutrition consumer products, uses quantum mechanics and molecular dynamics computational tools in their R&D process to speed innovation. They have described how they used digital chemistry in their efforts to design more sustainable materials and how this approach has sped up timelines by 10x on average compared to a solely experimental approach.

Physics-Based Simulations to Develop New Energy Solutions

Another exciting application of computational chemistry approaches is the use of atomic-scale materials modeling in the design of new battery and energy storage solutions.

Some behaviors of materials that have been studied include ion diffusion, electrochemical response in electrodes and electrolytes, dielectric properties, mechanical response, and more. This computational approach has been used to screen for Li-ion battery additives that form a stable solid electrolyte interphase.
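A common screening criterion for SEI-forming additives is that the additive should reduce before the bulk solvent, which in frontier-orbital terms means a LUMO below the solvent's. The sketch below applies that filter to a handful of candidates; the names and energy values are invented placeholders, not computed data.

```python
# Illustrative screen: keep additive candidates predicted to reduce before
# the bulk solvent (LUMO below the solvent's, i.e. higher reduction
# potential). All energies here are invented placeholders.
SOLVENT_LUMO_EV = -0.9  # hypothetical LUMO of the bulk electrolyte solvent

candidates = {
    "additive-A": -1.4,
    "additive-B": -0.5,
    "additive-C": -1.1,
    "additive-D": -0.8,
}

# Keep additives whose LUMO lies below the solvent's, most easily
# reduced first.
hits = sorted(
    (name for name, lumo in candidates.items() if lumo < SOLVENT_LUMO_EV),
    key=lambda name: candidates[name],
)
print(hits)  # ['additive-A', 'additive-C']
```

In a production setting the placeholder energies would come from quantum-mechanical calculations on each candidate, but the ranking logic of the screen stays the same.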

Driving R&D with Schrödinger’s Pioneering Computational Platform

At Schrödinger, our physics-based computational platform allows companies worldwide to harness the capabilities of computational chemistry methods and apply these to their R&D programs quickly and with ease. Over the last 30 years, Schrödinger’s modeling software and services have enabled the discovery of high-quality, novel molecules and materials across industries, as illustrated by some of the examples described above.

Molecules come to life in Maestro, the streamlined portal for structural visualization and access to cutting-edge predictive computational modeling and machine learning workflows. And researchers can bring their digital and experimental data side-by-side within LiveDesign, Schrödinger’s enterprise informatics platform for collaborative analysis, molecular design, and program management.

As the predictive and analytical capabilities of physics-based modeling continue to advance and are enhanced by the addition of new ML models, the myriad applications that are impacted by computational chemistry will continue to grow.

References

  1. Advancing Drug Discovery through Enhanced Free Energy Calculations

    2017. Abel R, Wang R, Harder ED, Berne BJ, and Friesner RA. Accounts of Chemical Research. 50(7):1625-1632. DOI: 10.1021/acs.accounts.7b00083

  2. Docking and scoring in virtual screening for drug discovery: methods and applications

    2004. Kitchen D, Decornez H, Furr J, et al. Nature Reviews Drug Discovery. 3:935-949. DOI: 10.1038/nrd1549

  3. Efficient Exploration of Chemical Space with Docking and Deep Learning

    2021. Yang Y, Yao K, Repasky MP, Leswing K, Abel R, Shoichet BK, and Jerome SV. Journal of Chemical Theory and Computation. 17(11):7106-7119. DOI: 10.1021/acs.jctc.1c00810

Leveraging Atomic Scale Modeling for Design and Discovery of Next-Generation Battery Materials

SEPT 22, 2022

Speaker

Garvit Agarwal
Senior Scientist

Abstract

The development of rechargeable Li-ion batteries (LIBs) has revolutionized electric vehicles and portable electronic devices. Further advancements are needed to improve the power density, safety, reliability, and lifetime of LIBs. Over the past few decades, atomistic modeling of battery materials has complemented experimental characterization techniques and has become an integral part of the development of new technologies. Reliable atomic-scale modeling enables rapid initial evaluation of a large chemical and material design space, accelerating the development cycle of next-generation battery technologies.

In this webinar, we will demonstrate how Schrödinger’s advanced digital chemistry platform can be leveraged to accelerate the design and discovery of next-generation battery materials with improved properties. We will discuss the application of both physics-based and machine learning techniques for understanding structure-property relationships of different components of batteries, including electrodes, electrolytes, and electrode-electrolyte interfaces. We will also discuss the automated active-learning framework for the development of state-of-the-art neural network force fields for modeling liquid electrolytes. The framework allows the force field to be trained on highly accurate range-separated hybrid density functional theory data, which enables accurate prediction of critical bulk properties of high-performance liquid electrolytes for application in advanced batteries.

Key Learning Objectives:

  • Understand predictive capabilities of physics-based modeling for battery materials
  • Learn how automated high throughput simulation workflows enable rapid screening of new battery material candidates
  • Learn how advanced neural network force fields enable accurate electrolyte property prediction