A formula to determine functional derivatives for a common class of functionals can be written as the integral of a function and its derivatives. This is a generalization of the
Euler–Lagrange equation: indeed, the functional derivative was introduced in
physics within the derivation of the
Lagrange equation of the second kind from the
principle of least action in
Lagrangian mechanics (18th century). The first three examples below are taken from
density functional theory (20th century), the fourth from
statistical mechanics (19th century).
Formula Given a functional F[\rho] = \int f( \boldsymbol{r}, \rho(\boldsymbol{r}), \nabla\rho(\boldsymbol{r}) )\, d\boldsymbol{r}, and a function \phi(\boldsymbol{r}) that vanishes on the boundary of the region of integration, from a previous section
Definition, \begin{align} \int \frac{\delta F}{\delta\rho(\boldsymbol{r})} \, \phi(\boldsymbol{r}) \, d\boldsymbol{r} & = \left [ \frac{d}{d\varepsilon} \int f( \boldsymbol{r}, \rho + \varepsilon \phi, \nabla\rho+\varepsilon\nabla\phi )\, d\boldsymbol{r} \right ]_{\varepsilon=0} \\ & = \int \left( \frac{\partial f}{\partial\rho} \, \phi + \frac{\partial f}{\partial\nabla\rho} \cdot \nabla\phi \right) d\boldsymbol{r} \\ & = \int \left[ \frac{\partial f}{\partial\rho} \, \phi + \nabla \cdot \left( \frac{\partial f}{\partial\nabla\rho} \, \phi \right) - \left( \nabla \cdot \frac{\partial f}{\partial\nabla\rho} \right) \phi \right] d\boldsymbol{r} \\ & = \int \left[ \frac{\partial f}{\partial\rho} \, \phi - \left( \nabla \cdot \frac{\partial f}{\partial\nabla\rho} \right) \phi \right] d\boldsymbol{r} \\ & = \int \left( \frac{\partial f}{\partial\rho} - \nabla \cdot \frac{\partial f}{\partial\nabla\rho} \right) \phi(\boldsymbol{r}) \ d\boldsymbol{r} \, . \end{align} The second line is obtained using the
total derivative, where is a
derivative of a scalar with respect to a vector. The third line was obtained by use of a
product rule for divergence. The fourth line was obtained using the
divergence theorem and the condition that \phi=0 on the boundary of the region of integration. Since \phi is also an arbitrary function, applying the
fundamental lemma of calculus of variations to the last line, the functional derivative is \frac{\delta F}{\delta\rho(\boldsymbol{r})} = \frac{\partial f}{\partial\rho} - \nabla \cdot \frac{\partial f}{\partial\nabla\rho} where and . This formula is for the case of the functional form given by at the beginning of this section. For other functional forms, the definition of the functional derivative can be used as the starting point for its determination. (See the example
Coulomb potential energy functional.) The above equation for the functional derivative can be generalized to the case that includes higher dimensions and higher order derivatives. The functional would be, F[\rho(\boldsymbol{r})] = \int f\left( \boldsymbol{r}, \rho(\boldsymbol{r}), \nabla\rho(\boldsymbol{r}), \nabla^{(2)}\rho(\boldsymbol{r}), \dots, \nabla^{(N)}\rho(\boldsymbol{r})\right)\, d\boldsymbol{r}, where the vector , and is a tensor whose components are partial derivative operators of order , \left [ \nabla^{(i)} \right ]_{\alpha_1 \alpha_2 \cdots \alpha_i} = \frac {\partial^{\, i}} {\partial r_{\alpha_1} \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } \qquad \qquad \text{where} \quad \alpha_1, \alpha_2, \dots, \alpha_i = 1, 2, \dots , n \ . An analogous application of the definition of the functional derivative yields \begin{align} \frac{\delta F[\rho]}{\delta \rho} &{} = \frac{\partial f}{\partial\rho} - \nabla \cdot \frac{\partial f}{\partial(\nabla\rho)} + \nabla^{(2)} \cdot \frac{\partial f}{\partial\left(\nabla^{(2)}\rho\right)} + \dots + (-1)^N \nabla^{(N)} \cdot \frac{\partial f}{\partial\left(\nabla^{(N)}\rho\right)} \\ &{} = \frac{\partial f}{\partial\rho} + \sum_{i=1}^N (-1)^{i}\nabla^{(i)} \cdot \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} \ . \end{align} In the last two equations, the components of the tensor \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} are partial derivatives of with respect to partial derivatives of
ρ, \left [ \frac {\partial f} {\partial \left (\nabla^{(i)}\rho \right ) } \right ]_{\alpha_1 \alpha_2 \cdots \alpha_i} = \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} } where \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} \equiv \frac {\partial^{\,i}\rho} {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } , and the tensor scalar product is, \nabla^{(i)} \cdot \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} = \sum_{\alpha_1, \alpha_2, \cdots, \alpha_i = 1}^n \ \frac {\partial^{\, i} } {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } \ \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} } \ .
Examples Thomas–Fermi kinetic energy functional The
Thomas–Fermi model of 1927 used a kinetic energy functional for a noninteracting uniform
electron gas in a first attempt of
density-functional theory of electronic structure: T_\mathrm{TF}[\rho] = C_\mathrm{F} \int \rho^\frac53(\mathbf{r}) \, d\mathbf{r} \, . Since the integrand of does not involve derivatives of , the functional derivative of is, \frac{\delta T_{\mathrm{TF}}}{\delta \rho (\boldsymbol{r}) } = C_\mathrm{F} \frac{\partial \rho^\frac53(\mathbf{r})}{\partial \rho(\mathbf{r})} = \frac{5}{3} C_\mathrm{F} \rho^\frac23(\mathbf{r}) \, .
Coulomb potential energy functional The
electron–nucleus potential energy is V[\rho] = \int \frac{\rho(\boldsymbol{r})} \ d\boldsymbol{r}. Applying the definition of functional derivative, \begin{align} \int \frac{\delta V}{\delta \rho(\boldsymbol{r})} \ \phi(\boldsymbol{r}) \ d\boldsymbol{r} & {} = \left [ \frac{d}{d\varepsilon} \int \frac{\rho(\boldsymbol{r}) + \varepsilon \phi(\boldsymbol{r})} \ d\boldsymbol{r} \right ]_{\varepsilon=0} \\[1ex] & {} = \int \frac {\phi(\boldsymbol{r})} \ d\boldsymbol{r} \, . \end{align} So, \frac{\delta V}{\delta \rho(\boldsymbol{r})} = \frac{1} \ . The functional derivative of the classical part of the
electron–electron interaction (often called Hartree energy) is J[\rho] = \frac{1}{2}\iint \frac{\rho(\mathbf{r}) \rho(\mathbf{r}')}\, d\mathbf{r} d\mathbf{r}' \, . From the
definition of the functional derivative, \begin{align} \int \frac{\delta J}{\delta\rho(\boldsymbol{r})} \phi(\boldsymbol{r})d\boldsymbol{r} & {} = \left [ \frac {d \ }{d\varepsilon} \, J[\rho + \varepsilon\phi] \right ]_{\varepsilon = 0} \\ & {} = \left [ \frac {d \ }{d\varepsilon} \, \left ( \frac{1}{2}\iint \frac {[\rho(\boldsymbol{r}) + \varepsilon \phi(\boldsymbol{r})] \, [\rho(\boldsymbol{r}') + \varepsilon \phi(\boldsymbol{r}')] }\, d\boldsymbol{r} d\boldsymbol{r}' \right ) \right ]_{\varepsilon = 0} \\ & {} = \frac{1}{2}\iint \frac {\rho(\boldsymbol{r}') \phi(\boldsymbol{r}) }\, d\boldsymbol{r} d\boldsymbol{r}' + \frac{1}{2}\iint \frac {\rho(\boldsymbol{r}) \phi(\boldsymbol{r}') }\, d\boldsymbol{r} d\boldsymbol{r}' \\ \end{align} The first and second terms on the right hand side of the last equation are equal, since and in the second term can be interchanged without changing the value of the integral. Therefore, \int \frac{\delta J}{\delta\rho(\boldsymbol{r})} \phi(\boldsymbol{r})d\boldsymbol{r} = \int \left ( \int \frac {\rho(\boldsymbol{r}') } d\boldsymbol{r}' \right ) \phi(\boldsymbol{r}) d\boldsymbol{r} and the functional derivative of the electron-electron Coulomb potential energy functional [
ρ] is, \frac{\delta J}{\delta\rho(\boldsymbol{r})} = \int \frac {\rho(\boldsymbol{r}') } d\boldsymbol{r}' \, . The second functional derivative is \frac{\delta^2 J[\rho]}{\delta \rho(\mathbf{r}')\delta\rho(\mathbf{r})} = \frac{\partial}{\partial \rho(\mathbf{r}')} \left ( \frac{\rho(\mathbf{r}')} \right ) = \frac{1}.
von Weizsäcker kinetic energy functional In 1935
von Weizsäcker proposed to add a gradient correction to the Thomas-Fermi kinetic energy functional to make it better suit a molecular electron cloud: T_\mathrm{W}[\rho] = \frac{1}{8} \int \frac{\nabla\rho(\mathbf{r}) \cdot \nabla\rho(\mathbf{r})}{ \rho(\mathbf{r}) } d\mathbf{r} = \int t_\mathrm{W}(\mathbf{r}) \ d\mathbf{r} \, , where t_\mathrm{W} \equiv \frac{1}{8} \frac{\nabla\rho \cdot \nabla\rho}{ \rho } \qquad \text{and} \ \ \rho = \rho(\boldsymbol{r}) \ . Using a previously derived
formula for the functional derivative, \begin{align} \frac{\delta T_\mathrm{W}}{\delta \rho} & = \frac{\partial t_\mathrm{W}}{\partial \rho} - \nabla\cdot\frac{\partial t_\mathrm{W}}{\partial \nabla \rho} \\ & = -\frac{1}{8}\frac{\nabla\rho \cdot \nabla\rho}{\rho^2} - \left ( \frac {1}{4} \frac {\nabla^2\rho} {\rho} - \frac {1}{4} \frac {\nabla\rho \cdot \nabla\rho} {\rho^2} \right ) \qquad \text{where} \ \ \nabla^2 = \nabla \cdot \nabla \ , \end{align} and the result is, \frac{\delta T_\mathrm{W}}{\delta \rho} = \ \ \, \frac{1}{8}\frac{\nabla\rho \cdot \nabla\rho}{\rho^2} - \frac{1}{4}\frac{\nabla^2\rho}{\rho} \ .
Entropy The
entropy of a discrete
random variable is a functional of the
probability mass function. H[p(x)] = -\sum_x p(x) \log p(x) Thus, \begin{align} \sum_x \frac{\delta H}{\delta p(x)} \, \phi(x) & {} = \left[ \frac{d}{d\varepsilon} H[p(x) + \varepsilon\phi(x)] \right]_{\varepsilon=0}\\ & {} = \left [- \, \frac{d}{d\varepsilon} \sum_x \, [p(x) + \varepsilon\phi(x)] \ \log [p(x) + \varepsilon\phi(x)] \right]_{\varepsilon=0} \\ & {} = -\sum_x \, [1+\log p(x)] \ \phi(x) \, . \end{align} Thus, \frac{\delta H}{\delta p(x)} = -1-\log p(x).
Exponential Let F[\varphi(x)]= e^{\int \varphi(x) g(x)dx}. Using the delta function as a test function, \begin{align} \frac{\delta F[\varphi(x)]}{\delta \varphi(y)} & {} = \lim_{\varepsilon\to 0}\frac{F[\varphi(x)+\varepsilon\delta(x-y)]-F[\varphi(x)]}{\varepsilon}\\ & {} = \lim_{\varepsilon\to 0}\frac{e^{\int (\varphi(x)+\varepsilon\delta(x-y)) g(x)dx}-e^{\int \varphi(x) g(x)dx}}{\varepsilon}\\ & {} = e^{\int \varphi(x) g(x)dx}\lim_{\varepsilon\to 0}\frac{e^{\varepsilon \int \delta(x-y) g(x)dx}-1}{\varepsilon}\\ & {} = e^{\int \varphi(x) g(x)dx}\lim_{\varepsilon\to 0}\frac{e^{\varepsilon g(y)}-1}{\varepsilon}\\ & {} = e^{\int \varphi(x) g(x)dx}g(y). \end{align} Thus, \frac{\delta F[\varphi(x)]}{\delta \varphi(y)} = g(y) F[\varphi(x)]. This is particularly useful in calculating the
correlation functions from the
partition function in
quantum field theory.
Functional derivative of a function A function can be written in the form of an integral like a functional. For example, \rho(\boldsymbol{r}) = F[\rho] = \int \rho(\boldsymbol{r}') \delta(\boldsymbol{r}-\boldsymbol{r}')\, d\boldsymbol{r}'. Since the integrand does not depend on derivatives of
ρ, the functional derivative of
ρ is, \frac {\delta \rho(\boldsymbol{r})} {\delta\rho(\boldsymbol{r}')} \equiv \frac {\delta F} {\delta\rho(\boldsymbol{r}')} = \frac{\partial \ \ }{\partial \rho(\boldsymbol{r}')} \, [\rho(\boldsymbol{r}') \delta(\boldsymbol{r}-\boldsymbol{r}')] = \delta(\boldsymbol{r}-\boldsymbol{r}').
Functional derivative of iterated function The functional derivative of the iterated function f(f(x)) is given by: \frac{\delta f(f(x))}{\delta f(y) } = f'(f(x))\delta(x-y) + \delta(f(x)-y) and \frac{\delta f(f(f(x)))}{\delta f(y) } = f'(f(f(x))(f'(f(x))\delta(x-y) + \delta(f(x)-y)) + \delta(f(f(x))-y) In general: \frac{\delta f^N(x)}{\delta f(y)} = f'\left( f^{N-1}(x) \right) \frac{ \delta f^{N-1}(x)}{\delta f(y)} + \delta\left( f^{N-1}(x) - y \right) Putting in gives: \frac{\delta f^{-1}(x)}{\delta f(y) } = - \frac{ \delta\left(f^{-1}(x)-y \right) }{ f'\left(f^{-1}(x)\right) } ==Using the delta function as a test function==