– October 2011 Newsletters

Phase Shape: A Fast and Versatile Tool for Shape-Based Screening
Dr. Steve Dixon, Phase Product Manager

As Phase Product Manager, Dr. Steve Dixon oversees and personally contributes to the development of Schrödinger's program for shape-based screening. In this article, Dr. Dixon describes recently published work on the Phase Shape methodology. 

Background

Under increasing pressure to identify novel leads in a crowded patent space, modelers are continually in search of fast, lead-finding techniques that promise an expansion into regions of active chemical space not covered by traditional docking and pharmacophore‑based screens. In particular, there is a marked demand for methods that attempt to match the overall shape of one or more known actives.1 These shape‑based screens are typically orders of magnitude faster than docking, and, unlike pharmacophore matching, there is no need to develop a model that encodes key ligand‑receptor interactions.

In developing an alternative to existing shape‑based technologies,2 it is important to focus on the speed at which structures can be processed and the quality of the superpositions provided, while still retaining the ability to selectively identify actives within a drug‑like database. Hence, it is worthwhile to assess whether a given level of rigor is really necessary to achieve a given result. For example, if a more approximate model of overlap allows more rapid computation of the shape similarity between two structures, it is possible to explore a greater number and variety of structural poses per unit time, thereby increasing the chances of finding more satisfactory superpositions. These sorts of considerations have led to the development of Phase Shape, a fast and versatile tool for shape‑based screening, which provides highly intuitive overlays of chemical structures, and virtual screening enrichments that, on average, surpass those of competing methodologies across broad classes of targets.

Shape Similarity Models

The basic concept of shape similarity is illustrated in Figure 1. Given a superposition of two structures A and B, the shared or jointly occupied volume $V_{A \cap B}$ is normalized by the total volume $V_{A \cup B}$ to arrive at a shape similarity $\mathrm{Sim}_{AB}$ that ranges between 0 and 1:

$$\mathrm{Sim}_{AB} = \frac{V_{A \cap B}}{V_{A \cup B}}$$

A basic goal of shape screening is to determine the alignment of A and B that maximizes $\mathrm{Sim}_{AB}$. The complexity of this task depends upon the mathematical representation of shape, and the way in which volumes are calculated.
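To make the volume-based definition concrete, the following minimal sketch estimates the ratio of shared volume to total volume for one fixed superposition by counting grid points. It is only an illustration and not how Phase Shape computes volumes: the sphere-list representation `(x, y, z, radius)`, the function names, and the grid spacing `step` are all assumptions introduced here.

```python
import itertools

def inside(point, spheres):
    """Return True if point lies within any (x, y, z, radius) sphere."""
    px, py, pz = point
    return any((px - x) ** 2 + (py - y) ** 2 + (pz - z) ** 2 <= r * r
               for x, y, z, r in spheres)

def volume_similarity(a, b, step=0.25):
    """Estimate shared volume / total volume by counting grid points.

    a and b are lists of (x, y, z, radius) spheres (hypothetical format).
    """
    spheres = a + b
    # Bounding box that encloses every sphere of both structures.
    lo = [min(s[k] - s[3] for s in spheres) for k in range(3)]
    hi = [max(s[k] + s[3] for s in spheres) for k in range(3)]
    axes = []
    for k in range(3):
        n = int((hi[k] - lo[k]) / step) + 1
        axes.append([lo[k] + i * step for i in range(n)])
    shared = total = 0
    for p in itertools.product(*axes):
        in_a, in_b = inside(p, a), inside(p, b)
        shared += in_a and in_b   # point occupied by both structures
        total += in_a or in_b     # point occupied by at least one
    return shared / total if total else 0.0
```

Identical, perfectly superimposed structures give 1, and structures that do not touch give 0. A finer `step` improves the estimate at a cost that grows with the cube of the grid resolution, which is one reason cheaper overlap models are attractive for screening.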

Figure 1: A basic representation of chemical shape illustrating the shared volume $V_{A \cap B}$ and total volume $V_{A \cup B}$ of two overlapping structures A and B.

Phase Shape represents a structure as a set of hard atomic van der Waals spheres, with one sphere for each heavy atom and polar hydrogen. The overlap $O_{AB}$ between structures A and B is computed as the sum of pairwise atomic overlaps, and it is normalized by the largest self-overlap to obtain the following measure of shape similarity:

$$\mathrm{Sim}_{AB} = \frac{O_{AB}}{\max(O_{AA}, O_{BB})}$$

This differs from the previous definition in that an alternate normalization scheme is employed, and rigorously computed volumes are replaced by overlaps that ignore the effects of intersections among three or more atoms. Although ignoring higher-order overlaps results in an overestimation of the true volumes, normalization by the largest self-overlap computed in the same manner tends to cancel errors, as shown in Figure 2. These approximations allow exceedingly fast shape similarity calculations compared to Gaussian-based methods,3,4 and the use of hard spheres eliminates the need to consider overlap between pairs of atoms separated by more than the sum of their van der Waals radii.
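The pairwise hard-sphere approximation can be sketched in a few lines. The example below uses the standard closed-form intersection volume of two spheres and sums it over all atom pairs; the data layout `(x, y, z, radius)` and the function names are assumptions for illustration, and the self-overlap is assumed here to include every atom pair within a structure (each atom with itself and with its neighbors). It is not the Phase Shape implementation.

```python
import math

def sphere_overlap(r1, r2, d):
    """Intersection volume of two hard spheres of radii r1, r2 at distance d."""
    if d >= r1 + r2:
        return 0.0  # spheres do not touch, so the pair can be skipped
    if d <= abs(r1 - r2):
        # The smaller sphere lies entirely inside the larger one.
        return 4.0 / 3.0 * math.pi * min(r1, r2) ** 3
    return (math.pi * (r1 + r2 - d) ** 2
            * (d * d + 2 * d * (r1 + r2) - 3 * (r1 - r2) ** 2) / (12 * d))

def pairwise_overlap(a, b):
    """Sum of pairwise overlaps; intersections of 3+ spheres are ignored."""
    total = 0.0
    for xa, ya, za, ra in a:
        for xb, yb, zb, rb in b:
            d = math.dist((xa, ya, za), (xb, yb, zb))
            total += sphere_overlap(ra, rb, d)
    return total

def shape_similarity(a, b):
    """Overlap of A with B normalized by the larger self-overlap."""
    o_ab = pairwise_overlap(a, b)
    return o_ab / max(pairwise_overlap(a, a), pairwise_overlap(b, b))
```

For two single-atom structures with unit radii whose centers are one unit apart, the intersection volume is 5π/12 and the self-overlap is 4π/3, so the similarity is 5/16. The early return for non-touching spheres is where the hard-sphere model saves work relative to functions with infinite range.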

Figure 2: The relationship between shape similarities derived from rigorously computed volumes (y axis) and sums of pairwise atomic overlaps (x axis).

When computing overlaps, Phase Shape has the ability to treat all atoms equivalently, a so-called “pure shape” approach, or to distinguish atoms by type and consider overlap only between atoms of the same type. In the latter case, Phase Shape provides progressively more specific schemes that differentiate by Phase QSAR atom type,5 by element, and by MacroModel atom type.
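The difference between the "pure shape" and typed approaches can be illustrated with a small sketch in which each atom carries a type label and typed scoring counts overlap only between atoms whose labels match. The tuple layout `(x, y, z, radius, type)`, the `by_type` flag, and the radii and element labels in the example are assumptions chosen for illustration; they do not reproduce the actual Phase QSAR or MacroModel typing schemes.

```python
import math

def sphere_overlap(r1, r2, d):
    """Intersection volume of two hard spheres of radii r1, r2 at distance d."""
    if d >= r1 + r2:
        return 0.0
    if d <= abs(r1 - r2):
        return 4.0 / 3.0 * math.pi * min(r1, r2) ** 3
    return (math.pi * (r1 + r2 - d) ** 2
            * (d * d + 2 * d * (r1 + r2) - 3 * (r1 - r2) ** 2) / (12 * d))

def overlap(a, b, by_type=False):
    """Sum pairwise overlaps; with by_type, skip pairs whose types differ."""
    total = 0.0
    for xa, ya, za, ra, ta in a:
        for xb, yb, zb, rb, tb in b:
            if by_type and ta != tb:
                continue
            d = math.dist((xa, ya, za), (xb, yb, zb))
            total += sphere_overlap(ra, rb, d)
    return total

def similarity(a, b, by_type=False):
    """Overlap normalized by the larger self-overlap, computed the same way."""
    return overlap(a, b, by_type) / max(overlap(a, a, by_type),
                                        overlap(b, b, by_type))

# Two hypothetical two-atom fragments that differ only in the second element.
frag_a = [(0.0, 0.0, 0.0, 1.70, "C"), (1.5, 0.0, 0.0, 1.55, "N")]
frag_b = [(0.0, 0.0, 0.0, 1.70, "C"), (1.5, 0.0, 0.0, 1.52, "O")]
pure = similarity(frag_a, frag_b)                  # atoms treated equivalently
typed = similarity(frag_a, frag_b, by_type=True)   # only C-C overlap counts
```

In this toy case the fragments are nearly identical in pure shape, while the typed score drops because the nitrogen and oxygen no longer contribute to the shared overlap. A more specific typing scheme corresponds to a finer set of labels in the same comparison.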

As an alternative to the atom-based approach, Phase Shape can represent a structure as a set of pharmacophore sites that encode the locations of hydrogen bond acceptors and donors, hydrophobic regions, positive and negative ionizable functions, and aromatic rings. No particular pharmacophore model is implied by this approach, since all sites in a given structure are encoded into the shape, not just those that are hypothesized to be required for binding to a particular target. Pharmacophore sites are mapped to a structure using Phase feature definitions,5 and each site is represented by a 2 Å hard sphere. Figure 3 illustrates the various models of shape that are supported.

Figure 3: The three models of chemical shape that are supported in Phase Shape.

Whether an atom-based or pharmacophore-based approach is used, Phase Shape identifies numerous pairs of triplets with similar geometries and similar local environments in structures A and B and superimposes the two structures based on a least-squares alignment of each pair of triplets (Fig. 4). The superposition with the highest shape similarity is then refined by realigning on additional pairs of atoms/sites that lie within 0.5 Å of each other in the triplet-based alignment.
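The first step described above, finding pairs of triplets with similar geometries, can be sketched as follows. This simplified illustration compares only the sorted side lengths of the triangle formed by each triplet; it ignores the local-environment comparison and does not perform the least-squares superposition or the refinement step. The function names and the tolerance `tol` are assumptions introduced here, not Phase Shape parameters.

```python
import itertools
import math

def triplet_signature(points, idx):
    """Sorted side lengths of the triangle formed by three point indices."""
    i, j, k = idx
    return sorted((math.dist(points[i], points[j]),
                   math.dist(points[j], points[k]),
                   math.dist(points[i], points[k])))

def similar_triplets(a, b, tol=0.3):
    """Yield (triplet in A, triplet in B) whose side lengths agree within tol.

    a and b are lists of (x, y, z) coordinates; tol is a hypothetical
    tolerance in the same distance units.
    """
    sigs_b = [(t, triplet_signature(b, t))
              for t in itertools.combinations(range(len(b)), 3)]
    for ta in itertools.combinations(range(len(a)), 3):
        sa = triplet_signature(a, ta)
        for tb, sb in sigs_b:
            if all(abs(x - y) <= tol for x, y in zip(sa, sb)):
                yield ta, tb
```

Because the signature depends only on internal distances, it is unchanged by rotation and translation, so candidate triplet pairs can be found before any superposition is attempted. Each candidate pair would then seed one trial alignment to be scored by shape similarity.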

Figure 4: A triplet-based alignment of structure B onto structure A.

For each pair of structures, hundreds of alignments may ultimately be considered in a tiny fraction of a second. This is possible thanks to an optimized triplet alignment algorithm, ultra-fast hard sphere overlap calculations, and a shape similarity estimation technique which allows poorer overlays to be rejected after computing only a fraction of the total overlap. These time-saving measures allow Phase Shape to screen a multi-conformer Phase database at a rate of about 600 conformers per second on a 2 GHz processor. Phase Shape calculations are trivially parallelizable, and any desired speedup is achievable by dividing the screen over multiple processors.
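As a rough back-of-the-envelope aid, the quoted throughput can be turned into a wall-clock estimate. The sketch below assumes ideal linear scaling across processors and the rate of about 600 conformers per second stated above; the function name and the database size used in the example are hypothetical.

```python
def screening_time_seconds(n_conformers, n_processors=1,
                           rate_per_processor=600.0):
    """Estimated wall-clock time, assuming perfectly linear scaling."""
    return n_conformers / (rate_per_processor * n_processors)

# Hypothetical database of 6,000,000 conformers.
single = screening_time_seconds(6_000_000)        # 10,000 s on one processor
ten = screening_time_seconds(6_000_000, 10)       # 1,000 s on ten processors
```

Real runs would deviate from this estimate because of I/O, job startup, and uneven distribution of conformers across processors.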

Phase Shape Applications

Figure 5 illustrates the quality of overlays that can be achieved with Phase Shape using elemental atom types. Here, the CDK2 X-ray ligand structure 2G9X was used as a rigid template onto which nine other CDK2 ligands were aligned. Results are reported for the highest scoring X‑ray to template alignment, and for the highest scoring conformer to template alignment, where conformational ensembles were generated using both MacroModel and Phase Shape on‑the‑fly ConfGen sampling. In all cases, Phase Shape yields a multi-ligand alignment with low average RMSD values, and clean superposition of common structural elements.

Figure 5: The results of various Phase Shape alignments for CDK2 ligands onto the crystallographically determined bound conformation of the ligand from PDB structure 2G9X. RMSDs are reported for the alignment of experimentally determined ligand geometries, and also for alignments performed using conformer sets created with either MacroModel or ConfGen.

In addition to producing intuitive, high quality overlays, Phase Shape has been shown to be quite effective at selectively identifying known actives within a database of drug‑like decoys.6 Table 1 summarizes results of Phase Shape virtual screening exercises performed according to the protocols described by McGaughey et al.7 Briefly, multi‑conformer actives for 11 diverse targets were seeded within a multi‑conformer database of 25,000 MDDR decoys. A single active for each target was used as a rigid template for shape-based screening, and database structures were ranked in order of decreasing similarity to that template.

As evidenced by the average enrichment factors in the top 1% of the screened database, results consistently improve with the use of more specific atom typing schemes. Analogous behavior was observed when 2D fingerprint screens were performed on the same data,8 so the relationship between atom type specificity and enrichment is not surprising. Although this trend is promising, improvements for most targets are only incremental, and it is unlikely that devising ever-more discriminating atom typing schemes will lead to a true breakthrough in performance. This threshold for performance improvement is not crossed until the atom-based model of shape is replaced with a pharmacophoric representation. Doing so boosts enrichments for eight of 11 targets, including a two‑fold or greater increase in four cases, and a 66% improvement over MacroModel atom types on average.
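For readers unfamiliar with the metric, the enrichment factor at 1% is commonly defined as the hit rate among the top 1% of the ranked database divided by the hit rate in the whole database. The sketch below follows that common definition; the function name, the input format (a list of booleans in rank order), and the rounding of the cutoff are assumptions, and the protocol of McGaughey et al. may differ in detail.

```python
def enrichment_factor(ranked_is_active, fraction=0.01):
    """Enrichment factor for the top fraction of a ranked database.

    ranked_is_active lists True/False per compound, best score first.
    """
    n_total = len(ranked_is_active)
    n_actives = sum(ranked_is_active)
    if n_total == 0 or n_actives == 0:
        return 0.0
    n_top = max(1, round(n_total * fraction))
    hits_top = sum(ranked_is_active[:n_top])
    # (hits_top / n_top) / (n_actives / n_total), rearranged to stay exact
    # for integer counts as long as possible.
    return hits_top * n_total / (n_top * n_actives)

# Toy ranking: 1000 compounds, 10 actives, 5 of them ranked in the top 10.
ranking = [True] * 5 + [False] * 5 + [True] * 5 + [False] * 985
ef_1pct = enrichment_factor(ranking)
```

In the toy ranking, half of the top 1% are actives against a background rate of 1%, giving an enrichment factor of 50. The maximum attainable value at 1% is 100, reached only when every compound in the top 1% is active.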

Table 1: The enrichment factors at 1% screened for various Phase Shape approaches performed according to protocols described by McGaughey et al.7 Increasingly specific atom typing schemes are shown from left to right.

The pharmacophore-based approach also competes very well with other 3D virtual screening methods which have been applied to the McGaughey data set. Table 2 compares Phase Shape pharmacophore-based enrichments to those obtained using the ROCS-color technique9 and the SQW superposition method developed at Merck.7,10 Phase Shape surpasses both of these methods by 30-40% in terms of average and median enrichments, and outperforms each of them head-to-head in eight of 11 cases. Since publication of the McGaughey paper in 2007, ROCS‑color has been viewed by many as the gold standard for shape-based screening, so these latest results are of particular significance. 

Table 2: A comparison of the Phase Shape pharmacophore-based approach to other 3D virtual screening methods.

Other versatile features of Phase Shape include the ability to score poses in place, force the alignment of specific atoms by way of SMARTS matching, compute similarities to multiple shape queries in a single run, apply alternate similarity normalization schemes that facilitate the identification of embedded shapes, and filter hits using excluded volumes. The Phase Shape technology has also been employed to develop a fast, multi-ligand superposition method, where the template and the structures being aligned to it are all treated in a flexible manner, and the template conformer that yields the best overall alignment of all ligands is utilized.

 

References

1. Kirchmair, J.; Distinto, S.; Markt, P.; Schuster, D.; Spitzer, G. M.; Liedl, K. R.; Wolber, G. How To Optimize Shape-Based Virtual Screening: Choosing the Right Query and Including Chemical Information. J. Chem. Inf. Model. 2009, 49, 678-692.
2. Putta, S.; Beroza, P. Shapes of Things: Computer Modeling of Molecular Shape in Drug Discovery. Curr. Top. Med. Chem. 2007, 7, 1514-1524.
3. Grant, J. A.; Pickup, B. T. A Gaussian Description of Molecular Shape. J. Phys. Chem. 1995, 99, 3503-3510.
4. Rush, T. S., III; Grant, J. A.; Mosyak, L.; Nicholls, A. A Shape-Based 3-D Scaffold Hopping Method and its Application to a Bacterial Protein-Protein Interaction. J. Med. Chem. 2005, 48, 1489-1495.
5. Dixon, S.; Smondyrev, A.; Knoll, E.; Rao, S.; Shaw, D.; Friesner, R. PHASE: A New Engine for Pharmacophore Perception, 3D QSAR Model Development, and 3D Database Screening: 1. Methodology and Preliminary Results. J. Comput.-Aided Mol. Des. 2006, 20, 647-671.
6. Sastry, M.; Dixon, S. L.; Sherman, W. Rapid Shape-Based Ligand Alignment and Virtual Screening Method Based on Atom/Feature-Pair Similarities and Volume. J. Chem. Inf. Model. In press.
7. McGaughey, G. B.; Sheridan, R. P.; Bayly, C. I.; Culberson, J. C.; Kreatsoulas, C.; Lindsley, S.; Maiorov, V.; Truchon, J.-F.; Cornell, W. D. Comparison of Topological, Shape, and Docking Methods in Virtual Screening. J. Chem. Inf. Model. 2007, 47, 1504-1519.
8. Sastry, M.; Lowrie, J. F.; Dixon, S. L.; Sherman, W. Large-Scale Systematic Analysis of 2D Fingerprint Methods and Parameters to Improve Virtual Screening Enrichments. J. Chem. Inf. Model. 2010, 50, 771-784.
9. Hawkins, P. C. D. A Comparison of Structure-Based and Shape-Based Tools for Virtual Screening. Abstracts of Papers, 231st ACS National Meeting, Atlanta, GA, United States, March 26-30, 2006.
10. Miller, M. D.; Sheridan, R. P.; Kearsley, S. L. SQ: A Program for Rapidly Producing Pharmacophorically Relevant Molecular Superpositions. J. Med. Chem. 1999, 42, 1505-1514.

 


Copyright © 2005-2013 Schrödinger, LLC