Both the Schrödinger and Heisenberg approaches to quantum mechanics single out time and are not in the spirit of relativity. For example, the Heisenberg approach requires that scalar field operators obey the commutation relation : [\varphi(x), \partial_t \varphi(y)] = i \delta^3(x - y) for two simultaneous spatial positions and , and this is not a relativistically invariant concept. The results of a calculation
are covariant, but the symmetry is not apparent in intermediate stages. If naive field-theory calculations did not produce infinite answers in the
continuum limit, this would not have been such a big problem – it would just have been a bad choice of coordinates. But the lack of symmetry means that the infinite quantities must be cut off, and the bad coordinates make it nearly impossible to cut off the theory without spoiling the symmetry. This makes it difficult to extract the physical predictions, which require a
careful limiting procedure. The problem of lost symmetry also appears in classical mechanics, where the Hamiltonian formulation also superficially singles out time. The Lagrangian formulation makes the relativistic invariance apparent. In the same way, the path integral is manifestly relativistic. It reproduces the Schrödinger equation, the Heisenberg equations of motion, and the canonical commutation relations and shows that they are compatible with relativity. It extends the Heisenberg-type operator algebra to
operator product rules, which are new relations difficult to see in the old formalism. Further, different choices of canonical variables lead to very different-seeming formulations of the same theory. The transformations between the variables can be very complicated, but the path integral makes them into reasonably straightforward changes of integration variables. For these reasons, the Feynman path integral has made earlier formalisms largely obsolete. The price of a path integral representation is that the unitarity of a theory is no longer self-evident, but it can be proven by changing variables to some canonical representation. The path integral itself also deals with larger mathematical spaces than is usual, which requires more careful mathematics, not all of which has been fully worked out. The path integral historically was not immediately accepted, partly because it took many years to incorporate fermions properly. This required physicists to invent an entirely new mathematical object – the
Grassmann variable – which also allowed changes of variables to be done naturally, as well as allowing
constrained quantization. The integration variables in the path integral are subtly non-commuting. The value of the product of two field operators at what looks like the same point depends on how the two points are ordered in space and time. This makes some naive identities
fail.
Propagator In relativistic theories, there is both a particle and field representation for every theory. The field representation is a sum over all field configurations, and the particle representation is a sum over different particle paths. The nonrelativistic formulation is traditionally given in terms of particle paths, not fields. There, the path integral in the usual variables, with fixed boundary conditions, gives the probability amplitude for a particle to go from point to point in time : : K(x, y; T) = \langle y; T \mid x; 0 \rangle = \int_{x(0)=x}^{x(T)=y} e^{i S[x]} \,Dx. This is called the
propagator. To obtain the final state at we simply apply to the initial state and integrate over resulting in: : \psi_T(y) = \int_x \psi_0(x) K(x, y; T) \,dx = \int^{x(T)=y} \psi_0(x(0)) e^{i S[x]} \,Dx. For a spatially homogeneous system, where is only a function of , the integral is a
convolution, the final state is the initial state convolved with the propagator: : \psi_T = \psi_0 * K(;T). For a free particle of mass , the propagator can be evaluated either explicitly from the path integral or by noting that the Schrödinger equation is a diffusion equation in imaginary time, and the solution must be a normalized Gaussian: : K(x, y; T) \propto e^\frac{i m(x - y)^2}{2T}. Taking the Fourier transform in produces another Gaussian: : K(p; T) = e^\frac{i T p^2}{2m}, and in -space the proportionality factor here is constant in time, as will be verified in a moment. The Fourier transform in time, extending to be zero for negative times, gives Green's function, or the frequency-space propagator: : G_\text{F}(p, E) = \frac{-i}{E - \frac{\vec{p}^2}{2m} + i\varepsilon}, which is the reciprocal of the operator that annihilates the wavefunction in the Schrödinger equation, which wouldn't have come out right if the proportionality factor weren't constant in the -space representation. The infinitesimal term in the denominator is a small positive number, which guarantees that the inverse Fourier transform in will be nonzero only for future times. For past times, the inverse Fourier transform contour closes toward values of where there is no singularity. This guarantees that propagates the particle into the future and is the reason for the subscript "F" on . The infinitesimal term can be interpreted as an infinitesimal rotation toward imaginary time. It is also possible to reexpress the nonrelativistic time evolution in terms of propagators going toward the past, since the Schrödinger equation is time-reversible. The past propagator is the same as the future propagator except for the obvious difference that it vanishes in the future, and in the Gaussian is replaced by . In this case, the interpretation is that these are the quantities to convolve the final wavefunction so as to get the initial wavefunction: : G_\text{B}(p, E) = \frac{-i}{-E - \frac{i\vec{p}^2}{2m} + i\varepsilon}. Given the nearly identical only change is the sign of and , the parameter in Green's function can either be the energy if the paths are going toward the future, or the negative of the energy if the paths are going toward the past. For a nonrelativistic theory, the time as measured along the path of a moving particle and the time as measured by an outside observer are the same. In relativity, this is no longer true. For a relativistic theory the propagator should be defined as the sum over all paths that travel between two points in a fixed proper time, as measured along the path (these paths describe the trajectory of a particle in space and in time): : K(x - y, \Tau) = \int_{x(0)=x}^{x(\Tau)=y} e^{i \int_0^\Tau \sqrt{\dot{x}^2 - \alpha} \,d\tau}. The integral above is not trivial to interpret because of the square root. Fortunately, there is a heuristic trick. The sum is over the relativistic arc length of the path of an oscillating quantity, and like the nonrelativistic path integral should be interpreted as slightly rotated into imaginary time. The function can be evaluated when the sum is over paths in Euclidean space: : K(x - y, \Tau) = e^{-\alpha \Tau} \int_{x(0)=x}^{x(\Tau)=y} e^{-L}. This describes a sum over all paths of length of the exponential of minus the length. This can be given a probability interpretation. The sum over all paths is a probability average over a path constructed step by step. The total number of steps is proportional to , and each step is less likely the longer it is. By the
central limit theorem, the result of many independent steps is a Gaussian of variance proportional to : : K(x - y,\Tau) = e^{-\alpha \Tau} e^{-\frac{(x - y)^2}{\Tau}}. The usual definition of the relativistic propagator only asks for the amplitude to travel from to , after summing over all the possible proper times it could take: : K(x - y) = \int_0^\infty K(x - y, \Tau) W(\Tau) \,d\Tau, where is a weight factor, the relative importance of paths of different proper time. By the translation symmetry in proper time, this weight can only be an exponential factor and can be absorbed into the constant : : K(x - y) = \int_0^\infty e^{-\frac{(x - y)^2}{\Tau} -\alpha \Tau} \,d\Tau. This is the
Schwinger representation. Taking a Fourier transform over the variable can be done for each value of separately, and because each separate contribution is a Gaussian, gives whose Fourier transform is another Gaussian with reciprocal width. So in -space, the propagator can be reexpressed simply: : K(p) = \int_0^\infty e^{-\Tau p^2 - \Tau \alpha} \,d\Tau = \frac{1}{p^2 + \alpha}, which is the Euclidean propagator for a scalar particle. Rotating to be imaginary gives the usual relativistic propagator, up to a factor of and an ambiguity, which will be clarified below: : K(p) = \frac{i}{p_0^2 - \vec{p}^2 - m^2}. This expression can be interpreted in the nonrelativistic limit, where it is convenient to split it by
partial fractions: : 2 p_0 K(p) = \frac{i}{p_0 - \sqrt{\vec{p}^2 + m^2}} + \frac{i}{p_0 + \sqrt{\vec{p}^2 + m^2}}. For states where one nonrelativistic particle is present, the initial wavefunction has a frequency distribution concentrated near . When convolving with the propagator, which in space just means multiplying by the propagator, the second term is suppressed and the first term is enhanced. For frequencies near , the dominant first term has the form : 2m K_\text{NR}(p) = \frac{i}{(p_0 - m) - \frac{\vec{p}^2}{2m}}. This is the expression for the nonrelativistic
Green's function of a free Schrödinger particle. The second term has a nonrelativistic limit also, but this limit is concentrated on frequencies that are negative. The second pole is dominated by contributions from paths where the proper time and the coordinate time are ticking in an opposite sense, which means that the second term is to be interpreted as the antiparticle. The nonrelativistic analysis shows that with this form the antiparticle still has positive energy. The proper way to express this mathematically is that, adding a small suppression factor in proper time, the limit where of the first term must vanish, while the limit of the second term must vanish. In the Fourier transform, this means shifting the pole in slightly, so that the inverse Fourier transform will pick up a small decay factor in one of the time directions: : K(p) = \frac{i}{p_0 - \sqrt{\vec{p}^2 + m^2} + i\varepsilon} + \frac{i}{p_0 - \sqrt{\vec{p}^2+m^2} - i\varepsilon}. Without these terms, the pole contribution could not be unambiguously evaluated when taking the inverse Fourier transform of . The terms can be recombined: : K(p) = \frac{i}{p^2 - m^2 + i\varepsilon}, which when factored, produces opposite-sign infinitesimal terms in each factor. This is the mathematically precise form of the relativistic particle propagator, free of any ambiguities. The term introduces a small imaginary part to the , which in the Minkowski version is a small exponential suppression of long paths. So in the relativistic case, the Feynman path-integral representation of the propagator includes paths going backwards in time, which describe antiparticles. The paths that contribute to the relativistic propagator go forward and backwards in time, and the
interpretation of this is that the amplitude for a free particle to travel between two points includes amplitudes for the particle to fluctuate into an antiparticle, travel back in time, then forward again. Unlike the nonrelativistic case, it is impossible to produce a relativistic theory of local particle propagation without including antiparticles. All local differential operators have inverses that are nonzero outside the light cone, meaning that it is impossible to keep a particle from travelling faster than light. Such a particle cannot have a Green's function that is only nonzero in the future in a relativistically invariant theory.
Functionals of fields However, the path integral formulation is also extremely important in
direct application to quantum field theory, in which the "paths" or histories being considered are not the motions of a single particle, but the possible time evolutions of a
field over all space. The action is referred to technically as a
functional of the field: , where the field is itself a function of space and time, and the square brackets are a reminder that the action depends on all the field's values everywhere, not just some particular value.
One such given function of
spacetime is called a
field configuration. In principle, one integrates Feynman's amplitude over the class of all possible field configurations. Much of the formal study of QFT is devoted to the properties of the resulting functional integral, and much effort (not yet entirely successful) has been made toward making these
functional integrals mathematically precise. Such a functional integral is extremely similar to the
partition function in
statistical mechanics. Indeed, it is sometimes
called a
partition function, and the two are essentially mathematically identical except for the factor of in the exponent in Feynman's postulate 3.
Analytically continuing the integral to an imaginary time variable (called a
Wick rotation) makes the functional integral even more like a statistical partition function and also tames some of the mathematical difficulties of working with these integrals.
Expectation values In
quantum field theory, if the
action is given by the
functional of field configurations (which only depends locally on the fields), then the
time-ordered vacuum expectation value of
polynomially bounded functional , , is given by : \langle F \rangle = \frac{\int\mathcal{D}\varphi F[\varphi]e^{i\mathcal{S}[\varphi]}}{\int\mathcal{D}\varphi e^{i\mathcal{S}[\varphi]}}. The symbol here is a concise way to represent the infinite-dimensional integral over all possible field configurations on all of space-time. As stated above, the unadorned path integral in the denominator ensures proper normalization.
As a probability Strictly speaking, the only question that can be asked in physics is:
What fraction of states satisfying condition also satisfy condition ? The answer to this is a number between 0 and 1, which can be interpreted as a
conditional probability, written as . In terms of path integration, since , this means : \operatorname{P}(B\mid A) = \frac {\sum_{F \subset A \cap B}\left| \int\mathcal{D}\varphi O_\text{in}[\varphi]e^{i\mathcal{S}[\varphi]} F[\varphi]\right|^2} {\sum_{F \subset A} \left|\int\mathcal{D}\varphi O_\text{in}[\varphi] e^{i\mathcal{S}[\varphi]} F[\varphi]\right|^2}, where the functional is the superposition of all incoming states that could lead to the states we are interested in. In particular, this could be a state corresponding to the state of the Universe just after the
Big Bang, although for actual calculation this can be simplified using heuristic methods. Since this expression is a quotient of path integrals, it is naturally normalised.
Schwinger–Dyson equations Since this formulation of quantum mechanics is analogous to classical action principle, one might expect that identities concerning the action in classical mechanics would have quantum counterparts derivable from a functional integral. This is often the case. In the language of functional analysis, we can write the
Euler–Lagrange equations as : \frac{\delta \mathcal{S}[\varphi]}{\delta \varphi} = 0 (the left-hand side is a
functional derivative; the equation means that the action is stationary under small changes in the field configuration). The quantum analogues of these equations are called the
Schwinger–Dyson equations. If the
functional measure turns out to be
translationally invariant (we'll assume this for the rest of this article, although this does not hold for, let's say
nonlinear sigma models), and if we assume that after a
Wick rotation : e^{i\mathcal{S}[\varphi]}, which now becomes : e^{-H[\varphi]} for some , it goes to zero faster than a
reciprocal of any
polynomial for large values of , then we can
integrate by parts (after a Wick rotation, followed by a Wick rotation back) to get the following Schwinger–Dyson equations for the expectation: : \left\langle \frac{\delta F[\varphi]}{\delta \varphi} \right\rangle = -i \left\langle F[\varphi]\frac{\delta \mathcal{S}[\varphi]}{\delta\varphi} \right\rangle for any polynomially-bounded functional . In the
deWitt notation this looks like : \left\langle F_{,i} \right\rangle = -i \left\langle F \mathcal{S}_{,i} \right\rangle. These equations are the analog of the
on-shell EL equations. The time ordering is taken before the time derivatives inside the . If (called the
source field) is an element of the
dual space of the field configurations (which has at least an
affine structure because of the assumption of the
translational invariance for the functional measure), then the
generating functional of the source fields is
defined to be : Z[J] = \int \mathcal{D}\varphi e^{i\left(\mathcal{S}[\varphi] + \langle J,\varphi \rangle\right)}. Note that : \frac{\delta^n Z}{\delta J(x_1) \cdots \delta J(x_n)}[J] = i^n \, Z[J] \, \left\langle \varphi(x_1)\cdots \varphi(x_n)\right\rangle_J, or : Z^{,i_1\cdots i_n}[J] = i^n Z[J] \left \langle \varphi^{i_1}\cdots \varphi^{i_n}\right\rangle_J, where : \langle F \rangle_J = \frac{\int \mathcal{D}\varphi F[\varphi]e^{i\left(\mathcal{S}[\varphi] + \langle J,\varphi \rangle\right)}}{\int\mathcal{D}\varphi e^{i\left(\mathcal{S}[\varphi] + \langle J,\varphi \rangle\right)}}. Basically, if is viewed as a functional distribution (this shouldn't be taken too literally as an interpretation of
QFT, unlike its Wick-rotated
statistical mechanics analogue, because we have
time ordering complications here!), then are its
moments, and is its
Fourier transform. If is a functional of , then for an
operator , is defined to be the operator that substitutes for . For example, if : F[\varphi] = \frac{\partial^{k_1}}{\partial x_1^{k_1}}\varphi(x_1)\cdots \frac{\partial^{k_n}}{\partial x_n^{k_n}}\varphi(x_n), and is a functional of , then : F\left[-i\frac{\delta}{\delta J}\right] G[J] = (-i)^n \frac{\partial^{k_1}}{\partial x_1^{k_1}}\frac{\delta}{\delta J(x_1)} \cdots \frac{\partial^{k_n}}{\partial x_n^{k_n}}\frac{\delta}{\delta J(x_n)} G[J]. Then, from the properties of the
functional integrals : \left \langle \frac{\delta \mathcal{S}}{\delta \varphi(x)} [\varphi] + J(x)\right\rangle_J = 0 we get the "master" Schwinger–Dyson equation: : \frac{\delta \mathcal{S}}{\delta \varphi(x)}\left[-i \frac{\delta}{\delta J}\right]Z[J] + J(x)Z[J] = 0, or : \mathcal{S}_{,i}[-i\partial]Z + J_i Z = 0. If the functional measure is not translationally invariant, it might be possible to express it as the product , where is a functional and is a translationally invariant measure. This is true, for example, for nonlinear sigma models where the
target space is diffeomorphic to . However, if the
target manifold is some topologically nontrivial space, the concept of a translation does not even make any sense. In that case, we would have to replace the in this equation by another functional : \hat{\mathcal{S}} = \mathcal{S} - i\ln M. If we expand this equation as a
Taylor series about
J 0, we get the entire set of Schwinger–Dyson equations. == Localization ==