11.6 Projector-Based Density Embedding

11.6.1 Theory

11.6.1.1 Working Equations

For a system whose total density can be represented as 𝜸=𝜸A+𝜸B, i.e., the molecular orbitals on fragments A and B are orthogonal to each other, the total energy can be represented as

E[𝜸A+𝜸B]=tr(𝜸A𝐡)+J[𝜸A]+Exc[𝜸A]Energy of A+tr(𝜸B𝐡)+J[𝜸B]+Exc[𝜸B]Energy of B+J[𝜸A+𝜸B]+Exc[𝜸A+𝜸B]Non-additive terms (11.79)

where 𝐡 is the core-Hamiltonian matrix, J and Exc are the Coulomb and XC energies. To evaluate the energy of subsystem A in the presence of B, one can variationally optimize A’s orbitals by diagonalizing the Fock matrix

𝐟A in B=𝐡A in B+𝐉[𝜸A]+𝐯xc[𝜸A] (11.80)

where the modified (embedded) core-Hamiltonian has the following form:

𝐡A in B=𝐡+(𝐉[𝜸A+𝜸B]-𝐉[𝜸A])+(𝐯xc[𝜸A+𝜸B]-𝐯xc[𝜸A])+μ𝐏, (11.81)

where 𝐏=𝐒𝜸B𝐒 is the projector formed by B’s orbitals and μ is a large enough constant (e.g. 106 a.u.) that enforces the orthogonality between molecular orbitals on fragments A and B. In the Q-Chem implementation, an alternative approach is employed, where one diagonalizes a modified Fock matrix from which the variational degrees of freedom spanned by B’s orbitals are projected out:

𝐟~A in B=(𝐈-𝐒𝜸B)𝐟A in B(𝐈-𝜸B𝐒). (11.82)

For a DFT-in-DFT case, the final energy of the full system is

E[𝜸embA;𝜸A,𝜸B]=ε[𝜸embA]+E[𝜸B]+J[𝜸A+𝜸B]+Exc[𝜸A+𝜸B]+tr[(𝜸embA-𝜸A)(𝐡A in B-𝐡)], (11.83)

and for a WFT-in-DFT case

E[𝚿A;𝜸A,𝜸B]=ΨA|H^A in B|ΨA+E[𝜸B]+J[𝜸A+𝜸B]+Exc[𝜸A+𝜸B]-tr[𝜸A(𝐡A in B-𝐡)] (11.84)

11.6.1.2 Partition of the Occupied Space

An embedding calculation usually starts from an SCF calculation of the full system at the lower level of theory, which yields canonical MOs. Therefore, it is necessary to partition the occupied space and assign orbitals to fragments A and B (without losing generality, assuming A is the embedded fragment). In the original work by Manby et al.,615 this was achieved by

  • Performing a Pipek-Mezey localization 758 of the canonical occupied orbitals;

  • Assigning a PM-localized orbital to the “active” fragment A if its Mulliken population on A is greater than 0.4.

This approach has been adopted as the default occupied space partition method in Q-Chem.

Recently a parameter-free and more robust partition scheme was proposed by Claudino and Mayhall, which is known as the Subsystem Projected AO Decomposition (SPADE) procedure.168 In this approach, one first transforms the occupied orbitals into the symmetrically orthogonalized AO basis:

𝐂¯occ=𝐒1/2𝐂occ (11.85)

and then denotes the rows in 𝐂¯occ that correspond to fragment A as 𝐂¯occA. A singular value decomposition (SVD) is then applied to 𝐂¯occA: 𝐂¯occA=𝐔AΣA(𝐕A)T, and the SPADE orbitals are then obtained by rotating the original 𝐂occ:

𝐂occSPADE=𝐂occ𝐕A. (11.86)

The largest gap in the singular value spectrum determines the most appropriate occupied orbital partition under the given fragmentation.

11.6.1.3 Truncation of the Virtual Space Using Concentric Localization

A WFT-in-DFT calculation requires not only the occupied orbitals on the “active” fragment but also the virtual orbitals. Unlike the occupied orbitals, the virtual orbitals obtained from a projector-based embedding calculation are not assigned to fragments but stay delocalized. If the full virtual space is used in the post-SCF calculation, the savings on computational cost will be rather limited since only the number of occupied orbitals is reduced. Therefore, it is desirable to further truncate the virtual space so that one can significantly reduce the computational cost of WFT-in-DFT calculations.

Claudino and Mayhall recently proposed a simple and efficient approach to truncate the virtual space based on concentric localization (CL),169 which shares the same spirit as the SPADE partition scheme for occupied space. As the first step, the original set of delocalized virtual orbitals (𝐂vir) represented in the working basis (WB) are projected onto the embedded fragment A in a user-specified projection basis (PB):

𝐂virA=(𝐒PB,A-1)(A𝐒PB,WB)𝐂vir (11.87)

where the superscript “A” indicates that only the rows corresponding to fragment A’s basis functions are included and 𝐒PB,A denotes the overlap matrix for PB functions on fragment A only. One can choose PB to be the same as WB or even a smaller basis set. A particular set of virtual orbitals denoted as 𝐂0 can then be selected by performing an SVD on the overlap between 𝐂virA and 𝐂vir:

(A𝐂vir)T(A𝐒PB,WB)𝐂vir=𝐔𝚺𝐕T𝐕=[𝐕span|𝐕null]𝐂0=𝐂vir𝐕span𝐂null,0=𝐂vir𝐕null (11.88)

By construction, 𝐂0 should consist of the virtual valence shell of the WB. In order to achieve higher accuracy for the embedded correlated method, one can select more virtual orbitals from 𝐂null,0 in a stepwise fashion. A recommended way 169 is to singular value decompose the matrix 𝐂T𝐅𝐂null,0, i.e., the coupling between 𝐂0 and 𝐂null,0 through the Fock operator:

𝐂0T𝐅𝐂null,0=𝐔0𝚺0𝐕0T𝐕0=[𝐕span,0|𝐕null,0]𝐂1=𝐂null,0𝐕span,0𝐂null,1=𝐂null,0𝐕null,0𝐂vir, act=[𝐂0|𝐂1] (11.89)

As the size of 𝐂1 is the same as 𝐂0, going through the procedure given by Eq. (11.89) doubles the number of active virtual orbitals. This procedure can be carried on iteratively, rendering the accuracy of this method tunable:

𝐂nT𝐅𝐂null,n=𝐔n𝚺n𝐕nT𝐕n=[𝐕span,n|𝐕null,n]𝐂n+1=𝐂null,n𝐕span,n𝐂null,n+1=𝐂null,n𝐕null,n𝐂vir, act=[𝐂0|𝐂1||𝐂n+1] (11.90)

The virtual orbitals that span the null space, 𝐂null,n+1, will remain inactive in the post-SCF calculations. In practice, one is often able to obtain sub-kcal/mol accuracy by only including 𝐂0 and 𝐂1, which is known as the “double-ζ” CL shell model.