Force Field
Improvements in the OPLS3 Force Field
Background
Realistic atomistic simulation of complex molecular systems is contingent upon the availability of an accurate and reliable molecular mechanics force field. The treatment of systems of interest in biology and materials science typically requires models on the order of thousands to millions of atoms, and timescales from nanoseconds to seconds; even at the low end of both of these ranges, purely quantum mechanical dynamics is extremely expensive (and not necessarily more accurate, as quantum chemical methods that are tractable for systems of this size have deficiencies of their own). Hence, molecular mechanics force field development has been a central goal of computational chemistry for the past four decades.
OPLS (optimized potentials for liquid simulations) first developed by Jorgensen and coworkers,1,2 was one of the first models in which parameters were extensively optimized to reproduce liquid state thermodynamic properties for a variety of small organic molecules. This effort yielded a core set of nonbonded (van der Waals) parameters, which still remain at the heart of OPLS development. OPLS33 represents a next generation model that builds upon this OPLS framework with the following key improvements:
A. Valence and Torsion Parameters
Extensive use of high-quality quantum chemical calculations to generate more accurate valence and torsional parameterizations. The OPLS3 model contains nearly two orders of magnitude more explicitly fit torsional parameters than alternative small molecule force fields leading to greater accuracy in modeling small molecule conformational barriers and propensities:
Table 1. Distribution of RMS error with respect to QM over the 11845 torsion profiles.
Model | %<1.0 kcal/mol | %>1.0 and <2.0 kcal/mol | %>2.0 kcal/mol |
MMFF | 33 | 32 | 35 |
OPLS_2005 | 33 | 33 | 32 |
OPLS2.1 | 89 | 8 | 3 |
OPLS3 | 88 | 9 | 3 |
B. Parameterization of Bond Charge Correction Terms (BCC)
The OPLS3 force field employs a CM1A-BCC based charge model based on a combination of Cramer-Truhlar CM1A charges1 with an extensive parameterization of bond charge correction terms (BCC). In OPLS3, we also introduce virtual (off-atom centered) sites to better represent lone pair and sigma hole charge distributions. Overall, these improvements improve the modeling of key interactions (Figure 1) and are reflected in the observed high level of accuracy seen across an expansive set of aqueous solvation free energies.2,3
Figure 1. Interaction energy between aryl chloride and carbonyl oxygen comparing QM to OPLS2.1 and OPLS3. The halogen bond gives ride to an attractive interaction at close distances which is captured with the addition of a virtual charge site for the aryl chloride in OPLS3.
Table 2. Error summary for hydration free energy results for 239 small molecules.
Force Field | MUE (kcal/mol) | RMSE (kcal/mol) | % > 2 kcal/mol |
ChelpG/CHARMM | 1.93 | 2.28 | 44.7 |
AM1-BCC/GAFF | 1.17 | 1.39 | 15.0
|
OPLS_2005 | 1.10 | 1.33 | 8.5 |
OPLS2.0/OPLS2.1 | 0.73 | 0.88 | 2.1 |
OPLS3 | 0.72 | 0.87 | 2.1 |
aValues for ChelpG/CHARMM and AM1-BCC/GAFF, OPLS_2005 and OPLS2.0/2.1 are those reported in Shivakumar et al.
C. Quantum Chemical and Training Simulation Data
For predictive biomolecular simulation, the protein force field must be refined to a very high degree of accuracy and stability. Small errors in, for example, the torsional potential of the protein dihedral angles, can translate into significant disagreements with experiment, ranging from discrepancies in the prediction of NMR spectra to incorrect prediction of the stability of proteins and peptides. OPLS3 incorporates a greatly increased amount of quantum chemical data, and training to simulation data as compared with experiment, leading to significant improvements in the representation of secondary structure elements in simulated peptides and native structure stability over a number of proteins.
Table 3. Average RMS difference with respect to X-ray or NMR structure.
Protein | OPLS_2005 | OPLS2.1 | OPLS3 | C22* | AMBER99sb-ILDN | |
Name | PDB | |||||
trpcage | 1L2Y | 5.3 | 2.7 | 1.2 | 1.7 | 1.7 |
GB3 | 1P7E | 1.7 | 1.8 | 0.8 | 0.7 | 0.8 |
Ubiquitin | 1UBQ | 1.1 | 1.1 | 0.9 | 0.8 | 0.9 |
Sumo2 | 1WM3 | 1.9 | 1.8 | 1.3 | 1.3 | 1.4 |
BPTI | 1BPI | 1.7 | 1.1 | 1.2 | 1.1 | 1.1 |
Crambin | 1CRN | 1.7 | 0.9 | 0.8 | 0.7 | 0.8 |
Lysozyme | 6LYT | 1.6 | 1.6 | 1.3 | 1.1 | 1.1 |
Average | 2.1 | 1.6 | 1.1 | 1.1 | 1.1 |
D. Protein-Ligand Binding
Together, the improvements made to both the small molecule and protein force field lead to a high level of accuracy in predicting protein-ligand binding measured over a wide range of targets and ligands (less than 1 kcal/mol RMS error) representing a 30% improvement over earlier variants of the OPLS force field.
Table 4. Relative binding free energy results.a
System | No.cmpds | Series ref | No. Pert. | OPLS_2005 | OPLS2.1 | OPLS3 | |||
R2 | RMSE | R2 | RMSE | R2 | RMSE | ||||
BACE | 36 | 54 | 58 | 0.01 | 1.35 | 0.56 | 1.03 | 0.64 | 0.89 |
CDK2 | 16 | 55 | 25 | 0.48 | 0.98 | 0.07 | 1.27 | 0.51 | 0.86 |
JNK1 | 21 | 56 | 34 | 0.75 | 0.87 | 0.74 | 0.87 | 0.76 | 0.62 |
MCL1 | 42 | 57 | 71 | 0.46 | 1.77 | 0.62 | 1.44 | 0.37 | 1.40 |
P38 | 34 | 58 | 56 | 0.32 | 0.95 | 0.54 | 0.97 | 0.57 | 1.05 |
PTP1B | 23 | 59 | 49 | 0.55 | 1.55 | 0.50 | 1.05 | 0.79 | 0.57 |
Thrombin | 11 | 60 | 16 | 0.21 | 1.35 | 0.40 | 0.97 | 0.38 | 0.83 |
Tyk2 | 16 | 61,62 | 24 | 0.86 | 0.75 | 0.80 | 0.98 | 0.84 | 0.98 |
Weighted Average | 1.28 +/- 0.06 | 1.11 +/- 0.05 | 0.95 +/- 0.04 |
aIncludes root mean square error (RMSE) reported in kcal/mol and square of the correlation coefficient (R2). Uncertainty in weighted average RMSE calculated based on bootstrapping method.
Case Study 1: Impact of an Improved Protein Force Field on Protein-Ligand Binding
Experimental versus FEP-predicted binding free energies over 6 BACE ligands distinguished by their R1 substituent. Left panel shows the binding mode of the methoxy derivative indicating the position of the functional group varied across the series. Binding affinities as a function of the R1 functional group shown in the middle panel. At right, representative structures, taken from simulations of OPLS2.1 and OPLS3, of the Ala396 residue and its associated helix, which enclose the S3 pocket of the BACE1 binding site.
Case Study 2: Impact of an Improved Charge Model on Protein-Ligand Binding
Sub-set of FXA series ligands illustrating electron rich receptor environment from the carboxylate of Asp189 and the backbone carbonyl of Gly219 coordinating aryl nitrogen lone pairs of 17i and 17h. Binding free energies relative to 17d are tabulated in Table.
Table 5. Experimental and predicted relative binding free energies for FXA ligands.a
Ligand | ΔΔG(Exp.) | ΔΔG(OPLS2.1) | ΔΔG(OPLS3) |
17d → 17i | 3.9 | 0.5 | 2.1 |
17d → 17h | 4.9 | 1.0 | 3.9 |
aFree energies are in kcal/mol. Specified ligands are illustrated in Figure 8. Ligand names are taken from Chan et al.71
References
- Jorgensen, W. L.; Maxwell, D. S.; Tirado-Rives, J., Development and testing of the OPLS all-atom force field on conformational energetics and properties of organic liquids. J. Am. Chem. Soc., 1996, 118, 11225-11236.
- Harder, E. et al., OPLS3: A Force Field Providing Broad Coverage of Drug-like Small Molecules and Proteins. J. Chem. Theory Comput., 2016, 12(1), 281–296.
- Storer, J. W.; Giesen, D. J.; Cramer, C. J.; Truhlar, D. G., Class IV charge models: a new semiempirical approach in quantum chemistry. J. Comput. Aided. Mol. Des., 1995, 9(1), 87-110.
- Shivakumar, D.; Harder, E.; Damm, W.; Friesner, R. A.; Sherman, W. J., Improving the Prediction of Absolute Solvation Free Energies Using the Next Generation OPLS Force Field. J. Chem. Theory Comput., 2012, 8, 2553–2558.