Archive for the 'Truhlar' Category

Solubility in olive oil

Here’s a nice example of the application of computed solvation energies in non-aqueous studies. Cramer and Truhlar have employed their latest SM8 technique, which is parameterized for organic solvents and for water, to estimate solvation energies in olive oil.1 Now you may wonder why solvation in olive oil of all things? But the partitioning of molecules between water and olive oil has been shown to be a good predictor of lipophilicity and therefore bioavailability of drugs! The model works reasonably well in reproducing experimental solvation energies and partition coefficients. They do make the case that fluorine substitution which appears to improve solubility in organics,originates not to more favorable solvation in organic solvents (like olive oil) but rather that fluorine substitution dramatically decreases solubility in water.

References

(1) Chamberlin, A. C.; Levitt, D. G.; Cramer, C. J.; Truhlar, D. G., "Modeling Free Energies of Solvation in Olive Oil," Mol. Pharmaceutics, 2008, 5, 1064-1079, DOI: 10.1021/mp800059u

Cramer & Solvation & Truhlar Steven Bachrach 17 Feb 2009 1 Comment

SM8 performance

Cramer and Truhlar have tested their latest solvation model SM8 against a test set of 17 small, drug-like molecules.1 Their best result comes with the use of SM8, the MO5-2X functional, the 6-31G(d) basis set and CM4M charge model. This computational model yields a root mean squared error for the solvation free energy of 1.08 kcal mol-1 across this test set. This is the first time these authors have recommended a particular computational model. Another interesting point is that use of solution-phase optimized geometries instead of gas-phase geometries leads to only marginally improved solvation energies, so that the more cost effective use of gas-phase structures is encouraged.

These authors note in conclusion that further improvement of solvation prediction rests upon “an infusion of new experimental data for molecules characterized by high degrees of functionality (i.e. druglike)”.

References

(1) Chamberlin, A. C.; Cramer, C. J.; Truhlar, D. G., “Performance of SM8 on a Test To Predict Small-Molecule Solvation Free Energies,” J. Phys. Chem. B, 2008, 112, 8651-8655, DOI: 10.1021/jp8028038.

Cramer & Solvation & Truhlar Steven Bachrach 21 Oct 2008 No Comments

Review of SM8

Cramer and Truhlar1 have published a nice review of their SM8 approach to evaluated solvation energy. Besides a quick summary of the theoretical approach behind the model, they detail a few applications. Principle among these is (a) the very strong performance of SM8 relative to some of the standard approaches in the major QM codes (see my previous blog post), (b) modeling interfaces, and (c) computing pKa values of organic compounds.

References

(1) Cramer, C. J.; Truhlar, D. G., "A Universal Approach to Solvation Modeling," Acc. Chem. Res. 2008, 41, 760-768, DOI: 10.1021/ar800019z.

Cramer & Solvation & Truhlar Steven Bachrach 23 Jul 2008 No Comments

New solvation model: SM8

Truhlar and Cramer have updated their Solvation Model to SM8.1 This model allows for any solvent to be utilized (both water and organic solvents) and treats both neutral and charged solutes. While there are some small theoretical changes to the model, the major change is in how the parameters are selected, the number of parameters, and a much more extensive data set is used for the fitting procedure.

Of note is how well this new model works. Table 1 compares the errors in solvation free energies computed using the new SM8 model against some other popular continuum methods. Clearly, SM8 provides much better results. As they point out, what is truly discouraging is the performance of the 3PM model against the continuum methods. 3PM stands for “three-parameter model”, where the solvation energies of all the neutral solute in water is set to their average experimental value (-2.99 kcal mol-1), and the same for the neutral solutes in organic solvents (-5.38 kcal mol-1), and for ions (-65.0 kcal mol-1). The 3PM outperforms most of the continuum methods!

Table 1. Mean unsigned error (kcal mol-1) for the solvation
free energies computed with different methods.1


Method

Aqueous neutrala

Organic neutralsb

Ionsc

SM8d

0.55

0.61

4.31

IEF-PCM/UA0e

4.87

5.99

9.73

IEF-PCM/UAHFf

1.18

3.94

8.15

C-PCM/GAMESSg

1.57

2.78

8.39

PB/Jaguarh

0.86

2.28

6.72

3PM

2.65

1.49

8.60


a274 data points. b666 data points spread among 16 solvents. c332 data points spread among acetonitrile, water, DMSO, and methanol. dUsing mPW1PW/6-31G(d). eUsing mPW1PW/6-31G(d) and the UA0 atomic radii in Gaussian. fUsing mPW1PW/6-31G(d) and the UAHF atomic radii in Gaussian. gUsing B3LYP/6-31G(d) and conductor-PCM in GAMESS. hUsing B3LYP/6-31G(d) and the PB method in Jaguar.

References

(1) Marenich, A. V.; Olson, R. M.; Kelly, C. P.; Cramer, C. J.; Truhlar, D. G., "Self-Consistent Reaction Field Model for Aqueous and Nonaqueous Solutions Based on Accurate Polarized Partial Charges," J. Chem. Theory Comput., 2007, 3, 2011-2033. DOI: 10.1021/ct7001418.

Cramer & Solvation & Truhlar Steven Bachrach 19 Nov 2007 1 Comment

Problems with DFT

We noted in Chapter 2.1 some serious errors in the prediction of bond dissociation energies using B3LYP. For example, Gilbert examined the C-C bond dissociation energy of some simple branched alkanes.1 The mean absolute deviation (MAD) for the bond dissociation energy predicted by G3MP2 is 1.7 kcal mol-1 and 2.8 kcal mol-1 using MP2. In contrast, the MAD for the B3LYP predicted values is 13.7 kcal mol-1, with some predictions in error by more than 20 kcal mol-1. Furthermore, the size of the error increases with the size of the molecule. Consistent with this trend, Curtiss and co-workers noted a systematic underestimation of the heat of formation of linear alkanes of nearly 0.7 kcal mol-1 per bond using B3LYP.2

Further evidence disparaging the general performance of DFT methods (and B3LYP in particular) was presented in a paper by Grimme and in two back-to-back Organic Letters articles, one by Schreiner and one by Schleyer. Grimme3 noted that the relative Energy of two C8H18 isomers, octane and 2,2,3,3-tetramethylbutane are incorrectly predicted by DFT methods (Table 1). While MP2 and CSC-MP2 (spin-component-scaled MP2) correctly predict that the more branched isomer is more stable, the DFT methods predict the inverse! Grimme attributes this error to a failure of these DFT methods to adequately describe medium-range electron correlation.


Table 1. Energy (kcal mol-1) of 2,2,3,3-tetramethylbutane relative to octane.

Method ΔE
Expta 1.9 ± 0.5
MP2b,c 4.6
SCS-MP2b,c 1.4
PBEb,c -5.5
TPSShb,c -6.3
B3LYPb,c -8.4
BLYPb,c -9.9
M05-2Xd,e 2.0
M05-2Xc,d 1.4

aNIST Webbook (http://webbook.nist.gov) bRef. 3. cUsing the cQZV3P basis set and MP2/TZV(d,p) optimized geometries. dRef. 4. eCalculated at M05-2X/6-311+G(2df,2p).

Schreiner5 also compared the energies of hydrocarbon isomers. For example, the three lowest energy isomers of C12H12 are 1-3, whose B3LYP/6-31G(d) structures are shown in Figure 1. What is disturbing is that the relative energies of these three isomers depends strongly upon the computational method (Table 2), especially since these three compounds appear to be quite ordinary hydrocarbons. CCSD(T) predicts that 2 is about 15 kcal mol-1 less stable than 1 and that 3 lies another 10 kcal mol-1 higher in energy. MP2 exaggerates the separation by a few kcal mol-1. HF predicts that 1 and 2 are degenerate. The large HF component within B3LYP leads to this DFT method’s poor performance. B3PW91 does reasonably well in reproducing the CCSD(T) results.


Table 2. Energies (kcal mol-1) of 2 and 3 relative to 1.

Method 2 3
CCSD(T)/cc-pVDZ//MP2(fc)/aug-cc-pVDZa 14.3 25.0
CCSD(T)/cc-pVDZ//B3LYP/6-31+G(d)a 14.9 25.0
MP2(fc)/aug-cc-pVDZa 21.6 29.1
MP2(fc)/6-31G(d)a 23.0 30.0
HF/6-311+(d) a 0.1 6.1
B3LYP/6-31G(d)a 4.5 7.2
B3LYP/aug-cc-pvDZa 0.4 3.1
B3PW91/6-31+G(d) a 17.3 23.7
B3PW91/aug-cc-pVDZa 16.8 23.5
KMLYP/6-311+G(d,p)a 28.4 41.7
M05-2X/6-311+G(d,p)b 16.9 25.4
M05-2X/6-311+G(2df,2p)b 14.0 21.4

aRef. 5. bRef. 4.

 
DFT 1

1

xyz file

DFT 2

2

xyz file

DFT 3

3

xyz file

Figure 1. Structures of 1-3 at B3LYP/6-31G(d).

Another of Schreiner’s examples is the relative energies of the C1010 isomers; Table 3 compares their relative experimental heats of formation with their computed energies. MP2 adequately reproduces the isomeric energy differences. B3LYP fairs quite poorly in this task. The errors seem to be most egregious for compounds with many single bonds. Schreiner recommends that while DFT-optimized geometries are reasonable, their energies are unreliable and some non-DFT method should be utilized instead.


Table 3. Relative C10H10 isomer energies (kcal mol-1)5


Compound

Rel.ΔHf

Rel. E(B3LYP)

Rel. E(MP2)

0.0

0.0

0.0

5.9

-8.5

0.2

16.5

0.7

10.7

20.5

3.1

9.4

26.3

20.3

22.6

32.3

17.6

31

64.6

48.8

61.4

80.8

71.2

78.8

r2

0.954a

0.986b


aCorrelation coefficient between Rel. ΔHf and Rel. E(B3LYP). bCorrelation coefficient between Rel. ΔHf and Rel. E(MP2).

Schleyer’s example of poor DFT performance is in the isodesmic energy of Reaction 1 evaluated for the n-alkanes.6 The energy of this reaction becomes more positive with increasing chain length, which Schleyer attributes to stabilizing 1,3-interactions between methyl or methylene groups. (Schleyer ascribes the term “protobranching” to this phenomenon.) The stabilization energy of protobranching using experimental heats of formation increases essentially linearly with the length of the chain, as seen in Figure 2.

n-CH3(CH2)mCH3 + mCH4 → (m + 1)C2H6         Reaction 1

Schleyer evaluated the protobranching energy using a variety of methods, and these energies are also plotted in Figure 2. As expected, the G3 predictions match the experimental values quite closely. However, all of the DFT methods underestimate the stabilization energy. Most concerning is the poor performance of B3LYP. All three of these papers clearly raise concerns over the continued widespread use of B3LYP as the de facto DFT method. Even the new hybrid meta-GGA functionals fail to adequately predict the protobranching phenomenon, leading Schleyer to conclude: “We hope that Check and Gilbert’s pessimistic admonition that ‘a computational chemist cannot trust a one-type DFT calculation’1 can be overcome eventually”. These papers provide a clear challenge to developers of new functionals.

Figure 2. Comparison of computed and experimental protobranching stabilization energy (as defined in Reaction 1) vs. m, the number of methylene groups of the n-alkane chain.6

Truhlar believes that one of his newly developed functionals answers the call for a reliable method. In a recent article,4 Truhlar demonstrates that the M05-2X7 functional performs very well in all three of the cases discussed here. In the case of the C8H18 isomers (Table 1), M05-2X properly predicts that 2,2,3,3-tetramethylbutane is more stable than octane, and estimates their energy difference within the error limit of the experiment. Second, M05-2X predicts the relative energies of the C12H12 isomers 1-3 within a couple of kcal mol-1 of the CCSD(T) results (see Table 2). Last, in evaluating the isodesmic energy of Reaction 1 for hexane and octane, M05-2X/6-311+G(2df,2p) predicts energies of 11.5 and 17.2 kcal mol-1 respectively. These are in excellent agreement with the experimental values of 13.1 kcal mol-1 for butane and 19.8 kcal mol-1 for octane.

Truhlar has also touted the M05-2X functional’s performance in handling noncovalent interactions.8 For example, the mean unsigned error (MUE) in the prediction of the binding energies of six hydrogen-bonded dimers is 0.20 kcal mol-1. This error is comparable to that from G3 and much better than CCSD(T). With the M05-2X functional already implemented within NWChem and soon to be released within Gaussian and Jaguar, it is likely that M05-2X may supplant B3LYP as the new de facto functional in standard computational chemical practice.

Schleyer has now examined the bond separation energies of 72 simple organic molecules computed using a variety of functionals,9 including the workhorse B3LYP and Truhlar’s new M05-2X. Bond separation energies are defined by reactions of each compound, such as three shown below:

The new M05-2X functional performed the best, with a mean absolute deviation (MAD) from the experimental energy of only 2.13 kcal mol-1. B3LYP performed much worse, with a MAD of 3.96 kcal mol-1. As noted before, B3LYP energies become worse with increasing size of the molecules, but this problem is not observed for the other functionals examined (including PW91, PBE, and mPW1PW91, among others). So while M05-2X overall appears to solve many of the problems noted with common functionals, it too has some notable failures. In particular, the error is the bond separation energies of 4, 5, and 6 is -8.8, -6.8, and -6.0 kcal mol-1, respectively.

References

(1) Check, C. E.; Gilbert, T. M., “Progressive Systematic Underestimation of Reaction Energies by the B3LYP Model as the Number of C-C Bonds Increases: Why Organic Chemists Should Use Multiple DFT Models for Calculations Involving Polycarbon Hydrocarbons,” J. Org. Chem. 2005, 70, 9828-9834, DOI: 10.1021/jo051545k.

(2) Redfern, P. C.; Zapol, P.; Curtiss, L. A.; Raghavachari, K., “Assessment of Gaussian-3 and Density Functional Theories for Enthalpies of Formation of C1-C16 Alkanes,” J. Phys. Chem. A 2000, 104, 5850-5854, DOI: 10.1021/jp994429s.

(3) Grimme, S., “Seemingly Simple Stereoelectronic Effects in Alkane Isomers and the Implications for Kohn-Sham Density Functional Theory,” Angew. Chem. Int. Ed. 2006, 45, 4460-4464, DOI: 10.1002/anie.200600448

(4) Zhao, Y.; Truhlar, D. G., “A Density Functional That Accounts for Medium-Range Correlation Energies in Organic Chemistry,” Org. Lett. 2006, 8, 5753-5755, DOI: 10.1021/ol062318n

(5) Schreiner, P. R.; Fokin, A. A.; Pascal, R. A.; deMeijere, A., “Many Density Functional Theory Approaches Fail To Give Reliable Large Hydrocarbon Isomer Energy Differences,” Org. Lett. 2006, 8, 3635-3638, DOI: 10.1021/ol0610486

(6) Wodrich, M. D.; Corminboeuf, C.; Schleyer, P. v. R., “Systematic Errors in Computed Alkane Energies Using B3LYP and Other Popular DFT Functionals,” Org. Lett. 2006, 8, 3631-3634, DOI: 10.1021/ol061016i

(7) Zhao, Y.; Schultz, N. E.; Truhlar, D. G., “Design of Density Functionals by Combining the Method of Constraint Satisfaction with Parametrization for Thermochemistry, Thermochemical Kinetics, and Noncovalent Interactions,” J. Chem. Theory Comput. 2006, 2, 364-382, DOI: 10.1021/ct0502763.

(8) Zhao, Y.; Truhlar, D. G., “Assessment of Model Chemistries for Noncovalent Interactions,” J. Chem. Theory Comput. 2006, 2, 1009-1018, DOI: 10.1021/ct060044j.

(9) Wodrich, M. D.; Corminboeuf, C.; Schreiner, P. R.; Fokin, A. A.; Schleyer, P. v. R., “How Accurate Are DFT Treatments of Organic Energies?,” Org. Lett., 2007, 9, 1851-1854, DOI: 10.1021/ol070354w.

InChI

1: InChI=1/C11H10/c1-2-5-7-3(1)4(1)8-6(2)10-9(5)11(7,8)10/h1-10H

2: InChI=1/C12H12/c1-2-4-10-6-8-11-7-5-9(3-1)12(10)11/h1-12H

3: InChI=1/C12H12/c1-2-4-8-11(7-3-1)12-9-5-6-10-12/h1-12H

4: InChI=1/C6H6/c1-4-5(2)6(4)3/h1-3H2

5: InChI=1/C8H10/c1-3-7-5-6-8(7)4-2/h3-4H,1-2,5-6H2

6: InChI=1/C10H10/c1-2-8-5-6-9-4-3-7(1)10(8)9/h1-10H

DFT & Schleyer & Truhlar Steven Bachrach 13 Jul 2007 5 Comments