For a system whose total density can be represented as , i.e., the molecular orbitals on fragments and are orthogonal to each other, its total energy can be expressed as
(11.92) |
where is the core-Hamiltonian matrix, and are the Coulomb and XC energies (exact exchange will also be present if a hybrid functional is employed). In an embedding calculation where the electron density of (denoted as ) is optimized at a high-level theory in the presence of (which is obtained at a low-level theory and then fixed), the total energy can be expressed as
(11.93) |
Differentiating Eq. (11.93) with respect to gives the Fock matrix for the embedding calculation:
(11.94) |
where the embedding potential
(11.95) |
where is the projector formed by ’s orbitals and is a large enough constant (e.g. 10 a.u.) that effectively elevates the energies of ’s orbitals and thereby enforces the orthogonality between MOs on and . In the Q-Chem implementation (with GEN_SCFMAN_EMBED = TRUE), an alternative approach is employed, where one diagonalizes a modified Fock matrix fromwhich the variational degrees of freedom spanned by ’s orbitals are projected out:
(11.96) |
where is the AO overlap matrix.
One disadvantage of using the energy expression given by Eq. (11.93) is that the embedding potential [Eq. (11.95)] depends on , which needs to be updated in every iteration of the embedding calculation. Thus this scheme affords no savings in computational cost for DFT-in-DFT calculations. To address this limitation, a linearized approximation was suggested,659 which is based on the following expansion (to the first order of ):
(11.97) |
where
(11.98) |
Based on this linearized approximation, the total energy of the entire system is approximately given by
(11.99) |
and the Fock matrix for embedding calculation:
(11.100) |
where the embedding potential stays unchanged during the embedding calculation. In Q-Chem 5.4.1 and versions after, this linearized approximation is used by default in projection-based embedding calculations.
For WFT-in-DFT calculations, one can absorb the embedding potential into the Hamiltonian that is used for the correlated WFT calculation, converting Eq. (11.99) into
(11.101) |
An embedding calculation usually starts from an SCF calculation of the full system at the lower level of theory, which yields canonical MOs. Therefore, it is necessary to partition the occupied space and assign orbitals to fragments and (without losing generality, assuming is the embedded fragment). In the original work by Manby et al.,729 this was achieved by
Performing a Pipek-Mezey localization897 of the canonical occupied orbitals;
Assigning a PM-localized orbital to the “active” fragment if its Mulliken population on is greater than 0.4.
f This approach has been adopted as the default occupied space partition method in Q-Chem.
Recently a parameter-free and more robust partition scheme was proposed by Claudino and Mayhall, which is known as the Subsystem Projected AO Decomposition (SPADE) procedure.210 In this approach, one first transforms the occupied orbitals into the symmetrically orthogonalized AO basis:
(11.102) |
and then denotes the rows in that correspond to fragment as . A singular value decomposition (SVD) is then applied to : , and the SPADE orbitals are then obtained by rotating the original :
(11.103) |
The largest gap in the singular value spectrum determines the most appropriate occupied orbital partition under the given fragmentation.
A WFT-in-DFT calculation requires not only the occupied orbitals on the “active” fragment but also the virtual orbitals. Unlike the occupied orbitals, the virtual orbitals obtained from a projection-based embedding calculation are not assigned to fragments but stay delocalized. If the full virtual space is used in the post-SCF calculation, the savings on computational cost will be rather limited since only the number of occupied orbitals is reduced. Therefore, it is desirable to further truncate the virtual space so that one can significantly reduce the computational cost of WFT-in-DFT calculations.
Claudino and Mayhall recently proposed a simple and efficient approach to truncate the virtual space based on concentric localization (CL),211 which shares the same spirit as the SPADE partition scheme for occupied space. As the first step, the original set of delocalized virtual orbitals () represented in the working basis (WB) are projected onto the embedded fragment in a user-specified projection basis (PB):
(11.104) |
where the superscript “” indicates that only the rows corresponding to fragment ’s basis functions are included and denotes the overlap matrix for PB functions on fragment only. One can choose PB to be the same as WB or even a smaller basis set. A particular set of virtual orbitals denoted as can then be selected by performing an SVD on the overlap between and :
(11.105) |
with
(11.106a) | ||||
(11.106b) | ||||
(11.106c) |
By construction, should consist of the virtual valence shell of the WB. In order to achieve higher accuracy for the embedded correlated method, one can select more virtual orbitals from in a stepwise fashion. A recommended way211 is to singular value decompose the matrix , i.e., the coupling between and through the Fock operator
(11.107) |
where
(11.108a) | ||||
(11.108b) | ||||
(11.108c) | ||||
(11.108d) |
As the size of is the same as , going through the procedure given by Eq. (11.107) doubles the number of active virtual orbitals. This procedure can be carried on iteratively, rendering the accuracy of this method tunable:
(11.109) |
where
(11.110a) | ||||
(11.110b) | ||||
(11.110c) | ||||
(11.110d) |
The virtual orbitals that span the null space, , will remain inactive in the post-SCF calculations. In practice, one is often able to obtain sub-kcal/mol accuracy by only including and , which is known as the “double-” CL shell model.