Q-Chem 5.1 User’s Manual

4.7 Dual-Basis Self-Consistent Field Calculations

The dual-basis approximation[Liang and Head-Gordon(2004a), Steele et al.(2006a)Steele, DiStasio, Jr., Shao, Kong, and Head-Gordon, Steele et al.(2006b)Steele, Shao, DiStasio, Jr., and Head-Gordon, DiStasio, Jr. et al.(2007a)DiStasio, Jr., Steele, and Head-Gordon, Steele and Head-Gordon(2007), Steele et al.(2009)Steele, DiStasio, Jr., and Head-Gordon] to self-consistent field (HF or DFT) energies provides an efficient means for obtaining large basis set effects at vastly less cost than a full SCF calculation in a large basis set. First, a full SCF calculation is performed in a chosen small basis (specified by BASIS2). Second, a single SCF-like step in the larger, target basis (specified, as usual, by BASIS) is used to perturbatively approximate the large basis energy. This correction amounts to a first-order approximation in the change in density matrix, after the single large-basis step:

  \begin{equation}  E_{\mathrm{total}} = E_{\mathrm{small\; basis}} + \mathrm{Tr}[(\Delta \ensuremath{\mathbf{P}})\cdot \textbf{F}]_{\mathrm{large\; basis}} \end{equation}   (4.59)

where $\ensuremath{\mathbf{F}}$ (in the large basis) is built from the converged (small basis) density matrix. Thus, only a single Fock build is required in the large basis set. Currently, HF and DFT energies (SP) as well as analytic first derivatives (FORCE or OPT) are available.

Note: As of version 4.0, first derivatives of unrestricted dual-basis DFT energies—though correct—require a code-efficiency fix. We do not recommend use of these derivatives until this improvement has been made.

Across the G3 set[Curtiss et al.(1991)Curtiss, Raghavachari, Trucks, and Pople, Curtiss et al.(1998)Curtiss, Raghavachari, Redfern, Rassolov, and Pople, Curtiss et al.(2000)Curtiss, Raghavachari, Redfern, and Pople] of 223 molecules, using cc-pVQZ, dual-basis errors for B3LYP are 0.04 kcal/mol (energy) and 0.03 kcal/mol (atomization energy per bond) and are at least an order of magnitude less than using a smaller basis set alone. These errors are obtained at roughly an order of magnitude savings in cost, relative to the full, target-basis calculation.

4.7.1 Dual-Basis MP2

The dual-basis approximation can also be used for the reference energy of a correlated second-order Møller-Plesset (MP2) calculation.[Steele et al.(2006a)Steele, DiStasio, Jr., Shao, Kong, and Head-Gordon, Steele et al.(2009)Steele, DiStasio, Jr., and Head-Gordon] When activated, the dual-basis HF energy is first calculated as described above; subsequently, the MO coefficients and orbital energies are used to calculate the correlation energy in the large basis. This technique is particularly effective for RI-MP2 calculations (see Section 6.6), in which the cost of the underlying SCF calculation often dominates.

Furthermore, efficient analytic gradients of the DB-RI-MP2 energy have been developed[DiStasio, Jr. et al.(2007a)DiStasio, Jr., Steele, and Head-Gordon] and added to Q-Chem. These gradients allow for the optimization of molecular structures with RI-MP2 near the basis set limit. Typical computational savings are on the order of 50% (aug-cc-pVDZ) to 71% (aug-cc-pVTZ). Resulting dual-basis errors are only 0.001 Å in molecular structures and are, again, significantly less than use of a smaller basis set alone.

4.7.2 Dual-Basis Dynamics

The ability to compute SCF and MP2 energies and forces at reduced cost makes dual-basis calculations attractive for ab initio molecular dynamics simulations, which are described in Section 10.7. Dual-basis BOMD has demonstrated[Steele et al.(2010)Steele, Head-Gordon, and Tully] savings of 58%, even relative to state-of-the-art, Fock-extrapolated BOMD. Savings are further increased to 71% for dual-basis RI-MP2 dynamics. Notably, these timings outperform estimates of extended Lagrangian (“Car-Parrinello”) dynamics, without detrimental energy conservation artifacts that are sometimes observed in the latter.[Herbert and Head-Gordon(2004)]

Two algorithm improvements make modest but worthwhile improvements to dual-basis dynamics. First, the iterative, small-basis calculation can benefit from Fock matrix extrapolation.[Herbert and Head-Gordon(2004)] Second, extrapolation of the response equations (“$Z$-vector” equations) for nuclear forces further increases efficiency.[Steele and Tully(2010)] (See Section 10.7.) Q-Chem automatically adjusts to extrapolate in the proper basis set when DUAL_BASIS_ENERGY is activated.

4.7.3 Basis-Set Pairings

We recommend using basis pairings in which the small basis set is a proper subset of the target basis (6-31G into 6-31G*, for example). They not only produce more accurate results; they also lead to more efficient integral screening in both energies and gradients. Subsets for many standard basis sets (including Dunning-style cc-pV$X$Z basis sets and their augmented analogs) have been developed and thoroughly tested for these purposes. A summary of the pairings is provided in Table 4.2; details of these truncations are provided in Figure 4.1.

A new pairing for 6-31G*-type calculations is also available. The 6-4G subset (named r64G in Q-Chem) is a subset by primitive functions and provides a smaller, faster alternative for this basis set regime.[Steele and Head-Gordon(2007)] A case-dependent switch in the projection code (still OVPROJECTION) properly handles 6-4G. For DB-HF, the calculations proceed as described above. For DB-DFT, empirical scaling factors (see Ref. Steele:2007 for details) are applied to the dual-basis correction. This scaling is handled automatically by the code and prints accordingly.

As of Q-Chem version 3.2, the basis set projection code has also been adapted to properly account for linear dependence,[Steele et al.(2009)Steele, DiStasio, Jr., and Head-Gordon] which can often be problematic for large, augmented (aug-cc-pVTZ, etc.) basis set calculations. The same standard keyword (LIN_DEP_THRESH) is used to determine linear dependence in the projection code. Because of the scheme used to account for linear dependence, only proper-subset pairings are now allowed.

Like single-basis calculations, user-specified general or mixed basis sets may be employed (see Chapter 8) with dual-basis calculations. The target basis specification occurs in the standard $basis section. The smaller, secondary basis is placed in a similar $basis2 section; the syntax within this section is the same as the syntax for $basis. General and mixed small basis sets are activated by BASIS2 = BASIS2_GEN and BASIS2 = BASIS2_MIXED, respectively.

BASIS

BASIS2

cc-pVTZ

rcc-pVTZ

cc-pVQZ

rcc-pVQZ

aug-cc-pVDZ

racc-pVDZ

aug-cc-pVTZ

racc-pVTZ

aug-cc-pVQZ

racc-pVQZ

6-31G*

r64G, 6-31G

6-31G**

r64G, 6-31G

6-31++G**

6-31G*

6-311++G(3df,3pd)

6-311G*, 6-311+G*

Table 4.2: Summary and nomenclature of recommended dual-basis pairings
  \[  \begin{array}{cccccc} $H-He:$ &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & &  d \\ &  p \\ s \\ \end{array} &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & &  - \\ &  - \\ s \\ \end{array} &  $Li-Ne:$ &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  d \\ &  p & &  f\\ s & &  d \\ &  p \\ s \\ \end{array} &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  d \\ &  p & &  -\\ s & &  d \\ &  p \\ s \\ \end{array}\\ &  $cc-pVTZ$ &  $rcc-pVTZ$ & &  $cc-pVTZ$ &  $rcc-pVTZ$ \\ \end{array}\\  \]    
  \[  \begin{array}{cccccc} $H-He:$ &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  d \\ &  p & &  f\\ s & &  d \\ &  p \\ s \\ \end{array} &  \scriptsize \begin{array}{cccc} s \\ &  - \\ s & &  - \\ &  p & &  -\\ s & &  - \\ &  - \\ s \\ \end{array} &  $Li-Ne:$ &  \scriptsize \begin{array}{ccccc} s \\ &  p \\ s & &  d \\ &  p & &  f\\ s & &  d & &  g\\ &  p & &  f\\ s & &  d \\ &  p \\ s \\ \end{array} &  \scriptsize \begin{array}{ccccc} s \\ &  p \\ s & &  d \\ &  p & &  -\\ s & &  d & &  -\\ &  p & &  -\\ s & &  d \\ &  p \\ s \\ \end{array}\\ &  $cc-pVQZ$ &  $rcc-pVQZ$ & &  $cc-pVQZ$ &  $rcc-pVQZ$ \\ \end{array}\\  \]    
  \[  \begin{array}{cccccc} $H-He:$ &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & \\ &  p \\ s’ & \\ \end{array} &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & \\ &  - \\ s’ & \\ \end{array} &  $Li-Ne:$ &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & &  d \\ &  p \\ s & &  d’ \\ &  p’ \\ s’ \end{array} &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & &  d \\ &  p \\ s & &  - \\ &  p’ \\ s’ \end{array}\\ &  $aug-cc-pVDZ$ &  $racc-pVDZ$ & &  $aug-cc-pVDZ$ &  $racc-pVDZ$ \\ \end{array} \\  \]    
  \[  \begin{array}{cccccc} $H-He:$ &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & &  d \\ &  p \\ s & &  d’ \\ &  p’ \\ s’ \end{array} &  \scriptsize \begin{array}{ccc} s \\ &  p \\ s & &  - \\ &  p \\ s & &  - \\ &  - \\ s’ \end{array} &  $Li-Ne:$ &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  d \\ &  p & &  f\\ s & &  d \\ &  p & &  f’\\ s & &  d’ \\ &  p’ \\ s’ \end{array} &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  d \\ &  p & &  -\\ s & &  d \\ &  p & &  -\\ s & &  - \\ &  p’ \\ s’ \end{array}\\ &  $aug-cc-pVTZ$ &  $racc-pVTZ$ & &  $aug-cc-pVTZ$ &  $racc-pVTZ$ \\ \end{array} \\  \]    
  \[  \begin{array}{cccccc} $H-He:$ &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  d \\ &  p & &  f\\ s & &  d \\ &  p & &  f’\\ s & &  d’ \\ &  p’ \\ s’ \end{array} &  \scriptsize \begin{array}{cccc} s \\ &  p \\ s & &  - \\ &  p & &  -\\ s & &  - \\ &  p & &  -\\ s & &  - \\ &  - \\ s’ \end{array} &  $Li-Ne:$ &  \scriptsize \begin{array}{ccccc} s \\ &  p \\ s & &  d \\ &  p & &  f\\ s & &  d & &  g\\ &  p & &  f\\ s & &  d & &  g’\\ &  p & &  f’\\ s & &  d’ \\ &  p’ \\ s’ \end{array} &  \scriptsize \begin{array}{ccccc} s \\ &  p \\ s & &  d \\ &  p & &  -\\ s & &  d & &  -\\ &  p & &  -\\ s & &  d & &  -\\ &  p & &  -\\ s & &  - \\ &  p’ \\ s’ \end{array}\\ &  $aug-cc-pVQZ$ &  $racc-pVQZ$ & &  $aug-cc-pVQZ$ &  $racc-pVQZ$ \\ \end{array}  \]    
Figure 4.1: Structure of the truncated basis set pairings for cc-pV(T,Q)Z and aug-cc-pV(D,T,Q)Z. The most compact functions are listed at the top. Primed functions depict $aug$ (diffuse) functions. Dashes indicate eliminated functions, relative to the paired standard basis set. In each case, the truncations for hydrogen and heavy atoms are shown, along with the nomenclature used in Q-Chem.

4.7.4 Job Control

Dual-basis calculations are controlled with the following $rem. DUAL_BASIS_ENERGY turns on the Dual-Basis approximation. Note that use of BASIS2 without DUAL_BASIS_ENERGY only uses basis set projection to generate the initial guess and does not invoke the Dual-Basis approximation (see Section 4.4.5). OVPROJECTION is used as the default projection mechanism for Dual-Basis calculations; it is not recommended that this be changed. Specification of SCF variables (e.g., THRESH) will apply to calculations in both basis sets.

DUAL_BASIS_ENERGY

Activates dual-basis SCF (HF or DFT) energy correction.


TYPE:

LOGICAL


DEFAULT:

FALSE


OPTIONS:

Analytic first derivative available for HF and DFT (see JOBTYPE)

Can be used in conjunction with MP2 or RI-MP2

See BASIS, BASIS2, BASISPROJTYPE


RECOMMENDATION:

Use dual-basis to capture large-basis effects at smaller basis cost. Particularly useful with RI-MP2, in which HF often dominates. Use only proper subsets for small-basis calculation.


4.7.5 Examples

Example 4.38  Input for a dual-basis B3LYP single-point calculation.

$molecule
   0 1
   H
   H   1   0.75
$end

$rem
   JOBTYPE             sp
   METHOD              b3lyp
   BASIS               6-311++G(3df,3pd)
   BASIS2              6-311G*
   DUAL_BASIS_ENERGY   true
$end

Example 4.39  Input for a dual-basis B3LYP single-point calculation with a minimal 6-4G small basis.

$molecule
   0 1
   H
   H   1   0.75
$end

$rem
   JOBTYPE             sp
   METHOD              b3lyp
   BASIS               6-31G*
   BASIS2              r64G
   DUAL_BASIS_ENERGY   true
$end

Example 4.40  Input for a dual-basis RI-MP2 geometry optimization.

$molecule
   0 1
   H
   H   1   0.75
$end

$rem
   JOBTYPE             opt
   METHOD              rimp2
   AUX_BASIS           rimp2-aug-cc-pVDZ
   BASIS               aug-cc-pVDZ
   BASIS2              racc-pVDZ
   DUAL_BASIS_ENERGY   true
$end


Example 4.41  Input for a dual-basis RI-MP2 single-point calculation with mixed basis sets.

$molecule
   0 1
   H
   O   1   1.1
   H   2   1.1  1  104.5
$end

$rem
   JOBTYPE             opt
   METHOD              rimp2
   AUX_BASIS           aux_mixed
   BASIS               mixed
   BASIS2              basis2_mixed
   DUAL_BASIS_ENERGY   true
$end

$basis
 H 1
 cc-pVTZ
 ****
 O 2
 aug-cc-pVTZ
 ****
 H 3
 cc-pVTZ
 ****
$end

$basis2
 H 1
 rcc-pVTZ
 ****
 O 2
 racc-pVTZ
 ****
 H 3
 rcc-pVTZ
 ****
$end

$aux_basis
 H 1
 rimp2-cc-pVTZ
 ****
 O 2
 rimp2-aug-cc-pVTZ
 ****
 H 3
 rimp2-cc-pVTZ
 ****
$end