Energy, momentum, and angular momentum of photons Energy The treatment to this point has been
classical. It is a testament, however, to the generality of
Maxwell's equations for electrodynamics that the treatment can be made
quantum mechanical with only a reinterpretation of classical quantities. The reinterpretation is based on the theories of
Max Planck and the interpretation by
Albert Einstein of those theories and of other experiments. Einstein's conclusion from early experiments on the
photoelectric effect is that electromagnetic radiation is composed of irreducible packets of energy, known as
photons. The energy of each packet is related to the angular frequency of the wave by the relation \epsilon = \hbar \omega where \hbar is an experimentally determined quantity known as the reduced
Planck constant. If there are N photons in a box of volume V , the energy in the electromagnetic field is N \hbar \omega and the energy density is {N \hbar \omega \over V} The
photon energy can be related to classical fields through the
correspondence principle that states that for a large number of photons, the quantum and classical treatments must agree. Thus, for very large N , the quantum energy density must be the same as the classical energy density {N \hbar \omega \over V} = \mathcal{E}_c = \frac{\vert \mathbf{E} \vert^2}{8\pi}. The number of photons in the box is then N = \frac{V }{8\pi \hbar \omega}\vert \mathbf{E} \vert^2 .
Momentum The correspondence principle also determines the momentum and angular momentum of the photon. For momentum \mathcal{P}_z = {N \hbar \omega \over cV} = {N \hbar k_z \over V} where k_z is the wave number. This implies that the momentum of a photon is p_z = \hbar k_z .\,
Angular momentum and spin Similarly for the spin angular momentum \mathcal{L} = \frac{ 1 }{ \omega } \mathcal{E}_c \left ( \vert \psi_{\rm R} \vert^2 - \vert \psi_{\rm L} \vert^2 \right ) = \frac{ N\hbar }{ V } \left ( \vert \psi_{\rm R} \vert^2 - \vert \psi_{\rm L} \vert^2 \right )where \mathcal{E}_c is field strength. This implies that the spin angular momentum of the photon is l_z = \hbar \left ( \vert \psi_{\rm R} \vert^2 - \vert \psi_{\rm L} \vert^2 \right ). the quantum interpretation of this expression is that the photon has a probability of \mid \psi_{\rm R} \mid^2 of having a spin angular momentum of \hbar and a probability of \mid \psi_{\rm L} \mid^2 of having a spin angular momentum of -\hbar . We can therefore think of the spin angular momentum of the photon being quantized as well as the energy. The angular momentum of classical light has been verified. A photon that is linearly polarized (plane polarized) is in a superposition of equal amounts of the left-handed and right-handed states. Upon absorption by an electronic state, the angular momentum is "measured" and this superposition collapses into either right-hand or left-hand, corresponding to a raising or lowering of the angular momentum of the absorbing electronic state, respectively.
Spin operator The
spin of the photon is defined as the coefficient of \hbar in the spin angular momentum calculation. A photon has spin 1 if it is in the | R \rangle state and −1 if it is in the | L \rangle state. The spin operator is defined as the
outer product \hat{S} \ \stackrel{\mathrm{def}}{=}\ |\mathrm{R}\rangle \langle \mathrm{R} | - |\mathrm{L}\rangle \langle \mathrm{L} | = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}. The
eigenvectors of the spin operator are |\mathrm{R}\rangle and |\mathrm{L}\rangle with
eigenvalues 1 and −1, respectively. These values are based on the point of view of the source as the convention to define circular polarization handedness. The expected value of a spin measurement on a photon is then \langle \psi |\hat{S} |\psi\rangle = \vert \psi_{\rm R} \vert^2 - \vert \psi_{\rm L} \vert^2. An operator
S has been associated with an observable quantity, the spin angular momentum. The eigenvalues of the operator are the allowed observable values. This has been demonstrated for spin angular momentum, but it is in general true for any observable quantity.
Spin states We can write the circularly polarized states as |s\rangle where
s = 1 for |\mathrm{R}\rangle and
s = −1 for |\mathrm{L}\rangle. An arbitrary state can be written |\psi\rangle = \sum_{s=-1,1} a_s \exp \left ( i \alpha_x -i s \theta \right ) |s\rangle where \alpha_1 and \alpha_{-1} are phase angles,
θ is the angle by which the frame of reference is rotated, and \sum_{s=-1,1} \vert a_s \vert^2 = 1.
Spin and angular momentum operators in differential form When the state is written in spin notation, the spin operator can be written \hat{S}_d \rightarrow i { \partial \over \partial \theta} \hat{S}_d^{\dagger} \rightarrow -i { \partial \over \partial \theta}. The eigenvectors of the differential spin operator are \exp \left ( i \alpha_x -i s \theta \right ) |s\rangle. To see this, note \hat{S}_d \exp \left ( i \alpha_x -i s \theta \right ) |s\rangle \rightarrow i { \partial \over \partial \theta} \exp \left ( i \alpha_x -i s \theta \right ) |s\rangle = s \left [ \exp \left ( i \alpha_x -i s \theta \right ) |s\rangle \right ]. The spin angular momentum operator is \hat{l}_z = \hbar \hat{S}_d.
Nature of probability in quantum mechanics Probability for a single photon There are two ways in which probability can be applied to the behavior of photons; probability can be used to calculate the probable number of photons in a particular state, or probability can be used to calculate the likelihood of a single photon to be in a particular state. The former interpretation violates energy conservation. The latter interpretation is the viable, if nonintuitive, option. Dirac explains this in the context of the
double-slit experiment: Some time before the discovery of quantum mechanics people realized that the connection between light waves and photons must be of a statistical character. What they did not clearly realize, however, was that the wave function gives information about the probability of
one photon being in a particular place and not the probable number of photons in that place. The importance of the distinction can be made clear in the following way. Suppose we have a beam of light consisting of a large number of photons split up into two components of equal intensity. On the assumption that the beam is connected with the probable number of photons in it, we should have half the total number going into each component. If the two components are now made to interfere, we should require a photon in one component to be able to interfere with one in the other. Sometimes these two photons would have to annihilate one another and other times they would have to produce four photons. This would contradict the conservation of energy. The new theory, which connects the wave function with probabilities for one photon gets over the difficulty by making each photon go partly into each of the two components. Each photon then interferes only with itself. Interference between two different photons never occurs.—
Paul Dirac,
The Principles of Quantum Mechanics, 1930, Chapter 1
Probability amplitudes The probability for a photon to be in a particular polarization state depends on the fields as calculated by the classical Maxwell's equations. The polarization state of the photon is proportional to the field. The probability itself is quadratic in the fields and consequently is also quadratic in the quantum state of polarization. In quantum mechanics, therefore, the state or
probability amplitude contains the basic probability information. In general, the rules for combining probability amplitudes look very much like the classical rules for composition of probabilities: [The following quote is from Baym, Chapter 1] • The probability amplitude for two successive probabilities is the product of amplitudes for the individual possibilities. For example, the amplitude for the x polarized photon to be right circularly polarized
and for the right circularly polarized photon to pass through the y-polaroid is \langle R|x\rangle\langle y|R\rangle, the product of the individual amplitudes. • The amplitude for a process that can take place in one of several
indistinguishable ways is the sum of amplitudes for each of the individual ways. For example, the total amplitude for the x polarized photon to pass through the y-polaroid is the sum of the amplitudes for it to pass as a right circularly polarized photon, \langle y|R\rangle\langle R|x\rangle, plus the amplitude for it to pass as a left circularly polarized photon, \langle y|L\rangle\langle L|x\rangle\dots • The total probability for the process to occur is the absolute value squared of the total amplitude calculated by 1 and 2.
Uncertainty principle Mathematical preparation For any legal operators the following inequality, a consequence of the
Cauchy–Schwarz inequality, is true. \frac{1}{4} \left|\langle (\hat{A} \hat{B} - \hat{B} \hat{A} )x | x \rangle\right|^2\leq \left\| \hat{A} x \right\|^2 \left\| \hat{B} x \right\|^2. If
B A ψ and
A B ψ are defined, then by subtracting the means and re-inserting in the above formula, we deduce \Delta_{\psi} \hat{A} \, \Delta_{\psi} \hat{B} \ge \frac{1}{2} \left|\left\langle\left[{\hat{A}},{\hat{B}}\right]\right\rangle_\psi\right| where \left\langle \hat{X} \right\rangle_\psi = \left\langle \psi \right| \hat{X} \left| \psi \right\rangle is the operator
mean of observable
X in the system state ψ and \Delta_{\psi} \hat{X} = \sqrt{\langle {\hat{X}}^2\rangle_\psi - \langle {\hat{X}}\rangle_\psi ^2}. Here \left[{\hat{A}},{\hat{B}}\right] \ \stackrel{\mathrm{def}}{=}\ \hat{A} \hat{B} - \hat{B} \hat{A} is called the
commutator of
A and
B. This is a purely mathematical result. No reference has been made to any physical quantity or principle. It simply states that the uncertainty of one operator times the uncertainty of another operator has a lower bound.
Application to angular momentum The connection to physics can be made if we identify the operators with physical operators such as the angular momentum and the polarization angle. We have then \Delta_{\psi} \hat{L}_z \, \Delta_{\psi} {\theta} \ge \frac{\hbar}{2}, which means that angular momentum
and the polarization angle cannot be measured simultaneously with infinite accuracy. (The polarization angle can be measured by checking whether the photon can pass through a polarizing filter oriented at a particular angle, or a
polarizing beam splitter. This results in a yes/no answer that, if the photon was plane-polarized at some other angle, depends on the difference between the two angles.)
States, probability amplitudes, unitary and Hermitian operators, and eigenvectors Much of the mathematical apparatus of quantum mechanics appears in the classical description of a polarized sinusoidal electromagnetic wave. The Jones vector for a classical wave, for instance, is identical with the quantum polarization state vector for a photon. The right and left circular components of the Jones vector can be interpreted as
probability amplitudes of spin states of the photon. Energy conservation requires that the states be transformed with a unitary operation. This implies that infinitesimal transformations are transformed with a Hermitian operator. These conclusions are a natural consequence of the structure of Maxwell's equations for classical waves. Quantum mechanics enters the picture when observed quantities are measured and found to be discrete rather than continuous. The allowed observable values are determined by the eigenvalues of the operators associated with the observable. In the case angular momentum, for instance, the allowed observable values are the eigenvalues of the spin operator. These concepts have emerged naturally from
Maxwell's equations and Planck's and Einstein's theories. They have been found to be true for many other physical systems. In fact, the typical program is to assume the concepts of this section and then to infer the unknown dynamics of a physical system. This was done, for instance, with the dynamics of electrons. In that case, working back from the principles in this section, the quantum dynamics of particles were inferred, leading to the
Schrödinger equation, a departure from
Newtonian mechanics. The solution of this equation for atoms led to the explanation of the
Balmer series for atomic spectra and consequently formed a basis for all of atomic physics and chemistry. This is not the only occasion in which Maxwell's equations have forced a restructuring of Newtonian mechanics. Maxwell's equations are relativistically consistent.
Special relativity resulted from attempts to make classical mechanics consistent with Maxwell's equations (see, for example,
Moving magnet and conductor problem). == See also ==