Archive for the 'QM Method' Category

Origin of DFT failures – part II

Here’s one more attempt to discern the failure of DFT to handle simple alkanes (see this earlier post for a previous attempt to answer this question). Tsuneda and co-workers1 have employed long-range corrected (LC) DFT to the problem of the energy associated with “protobranching”, i.e., from the reaction

CH3(CH2nCH3 + n CH4 → (n+1) CH3CH3

They computed the energy of this reaction for the normal alkanes propane through decane using a variety of functionals, and compared these computed values with experimentally-derived energies. Table 1 gives the mean unsigned error for a few of the functionals. The prefix “LC” indicated inclusion of long-range corrections, “LCgau” indicates the LC scheme with a gaussian attenuation, and “LRD” indicates inclusion of long-range dispersion.

Table 1. Mean unsigned errors of the “protobranching”
reaction energy of various functional compared to experiment.

Functional

MUE
(kcal mol-1)


LC2gau-BLYP+LRD

0.09

LC-PBE+LRD

0.17

SVWN5

0.27

LCgau-PBE

1.56

M06

1.98

LC-PBE

2.24

M06-2x

3.40

B3LYP

5.97

HF

6.96


A number of important conclusions can be drawn. First, with both LC and LRD very nice agreement with experiment can be had. If only LC is included, the error increases on average by over 1 kcal mol-1. The MO6-2x functional, touted as a fix of the problem, does not provide complete correction, though it is vastly superior to B3LYP and other hybrid functionals. The authors conclude that the need for LC incorporation points out that the exchange functional lacks the ability to account for this effect. Medium-range correlation is not the main source of the problem as large discrepancies in the reaction energy error occur when different functionals are used that are corrected for LC and LRD. Choice of functional still matters, but LC correction appears to be a main culprit and further studies of its addition to standard functionals would be most helpful.

References

(1) Song, J.-W.; Tsuneda, T.; Sato, T.; Hirao, K., "Calculations of Alkane Energies Using Long-Range Corrected DFT Combined with Intramolecular van der Waals Correlation," Org. Lett. 2010, 12, 1440–1443, DOI: 10.1021/ol100082z

DFT Steven Bachrach 25 May 2010 4 Comments

Benchmarking DFT for the aldol and Mannich Reactions

Houk has performed a very nice examination of the performance of some density functionals.1 He takes a quite different approach than what was proposed by Grimme – the “mindless” benchmarking2 using random molecules (see this post). Rather, Houk examined a series of simple aldol, Mannich and α-aminoxylation reactions, comparing their reaction energies predicted with DFT against that predicted with CBQ-QB3. The idea here is to benchmark DFT performance for simple reactions of specific interest to organic chemists. These reactions are of notable current interest due their involvement in organocatalytic enantioselective chemistry (see my posts on the aldol, Mannich, and Hajos-Parrish-Eder-Sauer-Wiechert reaction). Examples of the reactions studied (along with their enthalpies at CBS-QB3) are Reaction 1-3.

Reaction 1

Reaction 2

Reaction 3

For the four simple aldol reactions and four simple Mannich reactions, PBE1PBE,
mPW1PW91 and MO6-2X all provided reaction enthalpies with errors of about 2 kcal mol-1. The much maligned B3LYP functional, along with B3PW91 and B1B95 gave energies with significant larger errors. For the three α-aminoxylation reactions, the errors were better with B3PW91 and B1B95 than with PBE1PBE or MO6-2X. Once again, it appears that one is faced with finding the right functional for the reaction under consideration!

Of particular interest is the decomposition of these reactions into related isogyric, isodesmic
and homdesmic reactions. So for example Reaction 1 can be decomposed into Reactions 4-7 as shown in Scheme 1. (The careful reader might note that these decomposition reactions are isodesmic and homodesmotic and hyperhomodesmotic reactions.) The errors for Reactions 4-7 are typically greater than 4 kcal mol-1 using B3LYP or B3PW91, and even with MO6-2X the errors are about 2 kcal mol-1.

Scheme 1.

Houk also points out that Reactions 4, 8 and 9 (Scheme 2) focus on having similar bond changes as in Reactions 1-3. And it’s here that the results are most disappointing. The errors produced by all of the functionals for Reactions 4,8 and 9 are typically greater than 2 kcal mol-1, and even MO2-6x can be in error by as much as 5 kcal mol-1. It appears that the reasonable performance of the density functionals for the “real world” aldol and Mannich reactions relies on fortuitous cancellation of errors in the underlying reactions. Houk calls for the development of new functionals designed to deal with fundamental simple bond changing reactions, like the ones in Scheme 2.

Scheme 2

References

(1) Wheeler, S. E.; Moran, A.; Pieniazek, S. N.; Houk, K. N., "Accurate Reaction Enthalpies and Sources of Error in DFT Thermochemistry for Aldol, Mannich, and α-Aminoxylation Reactions," J. Phys. Chem. A 2009, 113, 10376-10384, DOI: 10.1021/jp9058565

(2) Korth, M.; Grimme, S., ""Mindless" DFT Benchmarking," J. Chem. Theory Comput. 2009, 5, 993–1003, DOI: 10.1021/ct800511q

DFT & Houk & Mannich & aldol Steven Bachrach 01 Mar 2010 1 Comment

Predicting aqueous pKa

Predicting the pKa of a compound from first principles remains a challenge, despite all of the many algorithmic and methodological advantages within the sphere of computational chemistry. Predicting the gas-phase deprotonation energy is relatively straightforward as I detail in Section 2.2 of my book. The difficulty is in treating the solvent and the interaction of the acid and its conjugate base in solution. Considering that we are most interested in acidities in water, a very polar solvent, the interactions between water and the conjugate base and the proton are likely to be large and important!

Baker and Pulay report a procedure for determining acidities with the aim of high throughput.1 Thus, computational efficiency is a primary goal. Their approach is to compute the enthalpy change for deprotonation in solution using a continuum treatment and then employ a linear fit to predict the pKa with the equation:

pKa(c) = αcΔH + βc

where c designates a class of compound, such as alcohol, carboxylic acid, amine, etc. Fitting constants αc and βc need to be found then for each unique class of compound, where the fitting is to experimental pKas in water. In their test suite, they employed eleven anilines and amines, seven pyridines, nine alcohols and phenols, and seven carboxylic acids.

They test a number of different computational variants: (a) what functional to employ, (b) what basis set to use for optimizing structures, and (c) what basis set to use for the enthalpy computation. They opt to employ COSMO for treating the solvent and quickly reject the use of gas phase structures (and particularly use of geometries obtained with molecular mechanics. Their ultimate model is OLYP/6-311+G**//3-21G(d) with the COSMO solvation model. Mean deviation is less than 0.4 pK units. They do note that use of HF or PW91 provides similar small errors, but ultimately favor OLYP for its computational performance.

While this procedure offers some guidance for future computation of acidity, I find a couple of issues. First, it relies on fitted parameters for every class of compound. If one is interested in a new class, then one must develop the appropriate parameters – and the experimental values may not be available or perhaps an insufficient number of them are experimentally available. Second, the parameters cover-up a great deal of problematic computational sins, like the solvation energy of the proton, small basis sets, missing correlation energies, missing dispersion corrections etc. A purist might hope for a computational algorithm that allows for systematic correction and improvement in the estimation of pKas. Further work needs to be done to meet this higher goal.

References

(1) Zhang, S.; Baker, J.; Pulay, P., "A Reliable and Efficient First Principles-Based Method for Predicting pKa Values. 1. Methodology," J. Phys. Chem. A 2010, 114 , 425-431, DOI: 10.1021/jp9067069

Acidity & DFT Steven Bachrach 08 Feb 2010 No Comments

Benchmarking DFT for alkane conformers

Another benchmark study of the performance of different functionals – this time looking at the conformations of small alkanes.1 Martin first establishes high level benchmarks: the difference between the trans and gauche conformers of butane: CCSD(T)/cc-pVQZ, 0.606 kcal mol-1 and W1h-val, 0.611 kcal mol-1; and the energy differences of the conformers of pentane, especially the TT and TG gap: 0.586 kcal mol-1 at CCSD(T)/cc-pVTZ and 0.614 kcal mol-1 at W1h-val.

They then examine the relative conformational energies of butane, pentane, hexane and a number of branched alkanes with a slew of functionals, covering the second through fifth rung of Perdew’s Jacob’s ladder. The paper has a whole lot of data – and the supporting
materials
include Jmol-enhanced visualization of the structures! – but the bottom line is the following. The traditionally used functionals (B3LYP, PBE, etc) overestimate conformer energies while the MO6 family underestimates the interaction energies that occur in GG-type conformers. A dispersion correction tends to overcorrect and leads to wrong energy ordering of conformers. But the new double-hybrid functionals (B2GP-PLYP and B2K-PLYP) with the dispersion correction provide quite nice agreement with the CCSD(T) benchmarks.

Also worrisome is that all the functionals have issues in geometry prediction, particularly in the backbone dihedral angles. So, for example, B3LYP misses the τ1 dihedral angle in the GG conformer by 5° and even MO6-2x misses the τ2 angle in the TG conformer by 2.4&deg.

References

(1) Gruzman, D.; Karton, A.; Martin, J. M. L., "Performance of Ab Initio and Density Functional Methods for Conformational Equilibria of CnH2n+2 Alkane Isomers (n = 4-8),"
The Journal of Physical Chemistry A 2009, 113, 11974–11983 , DOI: http://dx.doi.org/10.1021/jp903640h

DFT Steven Bachrach 06 Nov 2009 1 Comment

TD-DFT benchmark study

Here’s another extensive benchmarking study – this time on the use of TD-DFT to predict excitation energies.1 This study looks at the performance of 28 different functionals, and compares the TD-DFT excitation energies against a data set of (a) computed vertical energies and (b) experimental energies. The performance is generally about the same for both data sets, with many functionals (especially the hybrid functionals) giving errors of about 0.3 eV. Performance can be a bit better when examining subclasses of compounds. For example, PBE0 and mPW1PW91 have a mean unsigned error of only 0.14 eV for a set of organic dyes.

References

(1) Jacquemin, D.; Wathelet, V.; Perpete, E. A.; Adamo, C., "Extensive TD-DFT Benchmark: Singlet-Excited States of Organic Molecules," J. Chem. Theory Comput., 2009, 5, 2420-2435, DOI: 10.1021/ct900298e

DFT Steven Bachrach 28 Oct 2009 No Comments

Gaunine tautomers

Here’s another fine paper from the Alonso group employing laser ablation molecular beam Fourier transform microwave spectroscopy coupled with computation to discern molecular structure. In this work they examine the low-energy tautomers of guanine.1 The four lowest energy guanine tautomers are shown in Figure 1. (Unfortunately, Alonso does not include the optimized coordinates of these structures in the supporting information – we need to more vigorously police this during the review process!) These tautomers are predicted to be very close in energy (MP2/6-311++G(d,p), and so one might expect to see multiple signals in the microwave originating from all four tautomers. In fact, they discern all four, and the agreement between the computed and experimental rotational constants are excellent (Table 1), especially if one applies a scaling factor of 1.004. Once again, this group shows the power of combined experiment and computations!


1 (0.0)


2 (0.28)


3 (0.40)


4 (0.99)

Figure 1. Four lowest energy (kcal mol-1, MP2/6-311++G(d,p)) tautomers of guanine.

Table 1. Experimental and computed rotational constants (MHz) of the four guanine tautomers.

 

1

2

3

4

 

Exp

Comp

Exp

Comp

Exp

Comp

Exp

Comp

A

19.22155

1909.0

19.222780

1909.7

1916.080

1908.6

1923.460

1915.6

B

1121.6840

119.2

1116.6710

1113.5

1132.360

1128.2

1136.040

1131.9

C

709.0079

706.6

706.8580

704.2

712.1950

709.5

714.7000

712.0

References

(1) Alonso, J. L.; Peña, I.; López, J. C.; Vaquero, V., "Rotational Spectral Signatures of Four Tautomers of Guanine," Angew. Chem. Int. Ed. 2009, 48, 6141-6143, DOI: 10.1002/anie.200901462

InChIs

Guanine: InChI=1/C5H5N5O/c6-5-9-3-2(4(11)10-5)7-1-8-3/h1H,(H4,6,7,8,9,10,11)/f/h8,10H,6H2
InChIKey=UYTPUPDQBNUYGX-GSQBSFCVCX

MP & nucleic acids Steven Bachrach 05 Oct 2009 No Comments

CD of high-symmetry molecules

I have written a number of blog posts that deal with the computation of optical activity. Trindle and Altun have now reported TD-DFT computations of circular dichroism of high-symmetry molecules.1 The employ either B3LYP (with a variety of basis sets, the largest being 6-311++G(2d,2p)) and SOAP/ATZP. For a number of the high symmetry molecules (two examples are shown in Figure 1), the two methods differ a bit in their predictions of the first excited state, with SOAP typically predicting a red shift relative to the B3LYP. However, both methods general give the same sign of the CD signals and their line shapes are similar.


1


2

Figure 1. B3LYP/6-31G(d) optimized structures of 1 and 2 (again due to incomplete supporting materials, I reoptimized these structures)

References

(1) Trindle, C.; Altun, Z., "Circular dichroism of some high-symmetry chiral molecules: B3LYP and SAOP calculations " Theor. Chem. Acc. 2009, 122, 145-155, DOI: 10.1007/s00214-008-0494-8.

InChIs

1: InChI=1/C18H14O2/c19-15-7-11-3-1-4-12-8-16(20)10-14-6-2-5-13(9-15)18(14)17(11)12/h1-6H,7-10H2
InChIKey=DYZSIUYFWKNLHS-UHFFFAOYAB

2: InChI=1/C20H24/c1-13-9-18-7-8-20-12-15(3)19(11-16(20)4)6-5-17(13)10-14(18)2/h9-12H,5-8H2,1-4H3
InChIKey=JTMLLDPOLFRPGJ-UHFFFAOYAC

DFT & Optical Rotation Steven Bachrach 27 Jul 2009 No Comments

Si-PETN sensitivity explained

PETN C(CH2ONO2)3 is a relatively insensitive explosive. The silicon analogue Si(CH2ONO2)3 is extraordinarily sensitive, exploding at the touch of a spatula. (By the way, this makes it extremely ill-advised as an explosive – it’s way too dangerous!) Goddard employed MO6 computations to explore five different possible decomposition pathways, shown in Scheme 1.1 Reaction 1, the loss of NO2, is a standard decomposition pathway for many explosives, but the barrier for the C and Si analogues are similar and the reaction of the Si compound is not exothermic. The barrier for Reaction 2 is very large, and the barriers for the C and Si analogues for Reactions 3 and 4 are too similar to explain the differences in their sensitivities.

Scheme 1.

Reaction 5, however, does offer an explanation. The barrier for the Si analogue is 32 kcal mol-1, lower than for any other pathway, and almost 50 kcal mol-1 lower than the barrier for the rearrangement of the PETN itself. Furthermore, Reaction 5 is very exothermic for Si-PETN (-44.5 kcal mol-1), while the most favorable pathway for PETN decomposition, Reaction 1, is endothermic. Thus the small barrier and the large amount of energy released for Reaction 5 of Si-PETN suggests its extreme sensitivity.

References

(1) Liu, W.-G.; Zybin, S. V.; Dasgupta, S.; Klapötke, T. M.; Goddard III, W. A., "Explanation of the Colossal Detonation Sensitivity of Silicon Pentaerythritol Tetranitrate (Si-PETN) Explosive," J. Am. Chem. Soc. 2009, 131, 7490-7491, DOI: 10.1021/ja809725p.

InChIs

PETN: InChI=1/C5H8N4O12/c10-6(11)18-1-5(2-19-7(12)13,3-20-8(14)15)4-21-9(16)17/h1-4H2
InChIKey=TZRXHJWUDPFEEY-UHFFFAOYAE

Si-PETN: InChI=1/C4H8N4O12Si/c9-5(10)17-1-21(2-18-6(11)12,3-19-7(13)14)4-20-8(15)16/h1-4H2
InChIKey=FBKTZZKPJKPXMT-UHFFFAOYAL

DFT Steven Bachrach 20 Jul 2009 No Comments

Cysteine conformations revisited

Schaefer, Csaszar, and Allen have applied the focal point method towards predicting the energies and structures of cysteine.1 This very high level method refines the structures that can be used to compare against those observed by Alonso2 in his laser ablation molecular beam Fourier transform microwave spectroscopy experiment (see this post). They performed a broad conformation search, initially examining some 66,664 structures. These reduced to 71 unique conformations at MP2/cc-pvTZ. The lowest 11 energy structures were further optimized at MP2(FC)/aug-cc-pV(T+d)Z. The four lowest energy conformations are shown in Figure 1 along with their relative energies.

I
(0.0)

II
(4.79)

III
(5.81)

IV
(5.95)

Figure 1. MP2(FC)/aug-cc-pV(T+d)Z optimized geometries and focal point relative energies (kJ mol-1) of the four lowest energy conformers of cysteine.1

The three lowest energy structures found here match up with the lowest two structures found by Alonso and the energy differences are also quite comparable: 4.79 kJ and 5.81 mol-1 with the focal point method 3.89 and 5.38 kJ mol-1 with MP4/6-311++G(d,p)// MP2/6-311++G(d,p). So the identification of the cysteine conformers made by Alonso remains on firm ground.

References

(1) Wilke, J. J.; Lind, M. C.; Schaefer, H. F.; Csaszar, A. G.; Allen, W. D., "Conformers of Gaseous Cysteine," J. Chem. Theory Comput. 2009, DOI: 10.1021/ct900005c.

(2) Sanz, M. E.; Blanco, S.; López, J. C.; Alonso, J. L., "Rotational Probes of Six Conformers of Neutral Cysteine," Angew. Chem. Int. Ed. 2008, 4, 6216-6220, DOI: 10.1002/anie.200801337

InChIs

Cysteine:
InChI=1/C3H7NO2S/c4-2(1-7)3(5)6/h2,7H,1,4H2,(H,5,6)/t2-/m0/s1
InChIKey: XUJNEKJLAYXESH-REOHCLBHBU

Schaefer & amino acids & focal point Steven Bachrach 13 Jul 2009 No Comments

CEPA revisited

Back when I was first learning ab initio methods in Cliff Dykstra’s lab, I played a bit with the post-HF method CEPA (couple electron pair approximation). This method fell out of favor over the years with the rise of MP theory and then with DFT. Now, Neese and Grimme and co-workers are resurrecting it.1 Their Accounts article provides a series of tests of CEPA/1 against benchmark computations (typically CCSD(T)) and lo and behold, CEPA performs remarkably well! It bests B3LYP (no surprise there), B2LYP and MP2 in virtually every category, ranging from reaction energies, hydrogen bond energies, van der Waals interaction energies, and activation barrier heights. As an example, for the isomerization energy of toluene to norbornadiene, CCSD(T) estimates the energy is 42.79 kcal mol-1. B3LYP does miserably, with an error of nearly 14 kcal mol-1, but the CEPA/1 estimate is off by only 0.04 kcal mol-1. Since the computational time of CEPA/1 is competitive with MP2, the authors conclude that CEPA/1 is well-worth reinvestigating as an alternative post-HF methodology.

References

(1) Neese, F.; Hansen, A.; Wennmohs, F.; Grimme, S., "Accurate Theoretical Chemistry with Coupled Pair Models," Acc. Chem. Res. 2009, 42, 641-648 DOI: 10.1021/ar800241t.

QM Method Steven Bachrach 18 Jun 2009 No Comments

Next Page »