Dirac equation
From Wikipedia, the free encyclopedia
In physics, the Dirac equation is a relativistic quantum mechanical wave equation formulated by British physicist Paul Dirac in 1928 and provides a description of elementary spin½ particles, such as electrons, consistent with both the principles of quantum mechanics and the theory of special relativity. The equation demands the existence of antiparticles and actually predated their experimental discovery, making the discovery of the positron, the antiparticle of the electron, one of the greatest triumphs of modern theoretical physics.
Contents 
[edit] Mathematical formulation
The Dirac equation in the form originally proposed by Dirac is:

 where
 m is the rest mass of the electron,
 c is the speed of light,
 p is the momentum operator,
 is the reduced Planck's constant,
 x and t are the space and time coordinates.
The new elements in this equation are the 4×4 matrices and , and the fourcomponent wavefunction . The matrices are all Hermitian and have squares equal to the identity matrix:
and they all mutually anticommute: {α_{i},α_{j}} = 0 and {α_{i},β} = 0. Explicitly,
where i and j are distinct and range from 1 to 3. These matrices, and the form of the wavefunction, have a deep mathematical significance. The algebraic structure represented by the Dirac matrices had been created some 50 years earlier by the English mathematician W. K. Clifford, which in turn had been based on the mid19th century work of the German mathematician Hermann Grassmann in his "Lineare Ausdehnungslehre" (Theory of Linear Extensions). The latter had been regarded as wellnigh incomprehensible by most of his contemporaries. The appearance of something so seemingly abstract, at such a late date, in such a direct physical manner, amounts to one of the most remarkable chapters in the history of physics.
[edit] Comparison with the Schrödinger equation
The Dirac equation is superficially similar to the Schrödinger equation for a free particle:
The left side represents the square of the momentum operator divided by twice the mass, which is the nonrelativistic kinetic energy. If one wants to get a relativistic generalization of this equation, then the space and time derivatives must enter symmetrically, as they do in the relativistic Maxwell equations—the derivatives must be of the same order in space and time. In relativity, the momentum and the energy are the space and time parts of a geometrical spacetime vector, the 4momentum, and they are related by the relativistically invariant relation
which says that the length of this vector is the rest mass m. Replacing E and p by and as Schrödinger theory requires, we get a relativistic equation:
and the wave function is a relativistic scalar, it is a complex number which has the same numerical value in all frames. Because the equation is second order in the time derivative, one must specify both the initial value of and not just . This is normal for classical waves where the initial conditions are the position and velocity. However, in quantum mechanics, the wavefunction is supposed to be the complete description; just knowing the wavefunction should determine the future.
In the Schrödinger theory, the probability density is given by the positive definite expression
and its current by
and the conservation of probability density has a local form:
In a relativistic theory, the form of the probability density and the current must form a four vector, so the form of the probability density can be found from the current just by replacing by :
Everything is relativistic now, but the probability density is not positive definite, because the initial values of both and can be freely chosen. This expression reduces to Schrödinger's density and current for superpositions of positive frequency waves whose wavelength is long compared to the compton wavelength, that is, for nonrelativistic motions. It reduces to a negative definite quantity for superpositions of negative frequency waves only. It mixes up both signs when forces which have an appreciable amplitude to produce relativistic motions are involved, at which point scattering can produce particles and antiparticles.
Although it was not a successful description of a single particle, this equation is resurrected in quantum field theory, where it is known as the Klein–Gordon equation, and describes a relativistic spin0 complex field. The nonpositive probability density and current are the chargedensity and current, while the particles are described by a modeexpansion.
In order to give the Klein–Gordon equation an interpretation as an equation for the probability amplitude for a single particle to have a given position, the negative frequency solutions need to be interpreted as describing the particle travelling backwards in time, so that they propagate into the past. The equation with this interpretation does not predict the future from the present except in the nonrelativistic limit, rather it places a global constraint on the amplitudes. This can be used to construct a perturbation expansion with particles zipping backwards and forwards in time, the Feynman diagrams, but it does not allow a straightforward wavefunction description, since each particle has its own separate proper time.
[edit] Dirac's coup
What is needed, then, is an equation that is firstorder in both space and time. One could formally take the relativistic expression for the energy , replace p by its operator equivalent, expand the square root in an infinite series of derivative operators, set up an eigenvalue problem, then solve the equation formally by iterations. Most physicists had little faith in such a process, even if it were technically possible.
As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the idea of taking the square root of the wave operator thus:
On multiplying out the right side, we see that in order to get all the crossterms such as to vanish, we must assume
with
Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's matrix mechanics, immediately understood that these conditions could be met if A, B... are matrices, with the implication that the wave function has multiple components. This immediately explained the appearance of twocomponent wave functions in Pauli's phenomenological theory of spin, something that up until then had been regarded as mysterious, even to Pauli himself. However, one needs at least 4×4 matrices to set up a system with the properties desired—so the wave function had four components, not two, as in the Pauli theory.
Given the factorization in terms of these matrices, one can now write down immediately an equation
with to be determined. Applying again the matrix operator on either side yields
On taking we find that all the components of the wave function individually satisfy the relativistic energy–momentum relation. Thus the soughtfor equation that is firstorder in both space and time is
With and , we get the Dirac equation.
[edit] Comparison with the Pauli theory
The necessity of introducing halfintegral spin goes back experimentally to the results of the Stern–Gerlach experiment. A beam of atoms is run through a strong inhomogeneous magnetic field, which then splits into N parts depending on the intrinsic angular momentum of the atoms. It was found that for silver atoms, the beam was split in two—the ground state therefore could not be integral, because even if the intrinsic angular momentum of the atoms were as small as possible, 1, the beam would be split into 3 parts, corresponding to atoms with L_{z} = −1, 0, and +1. The conclusion is that silver atoms have net intrinsic angular momentum of ^{1}⁄_{2}. Pauli set up a theory which explained this splitting by introducing a twocomponent wave function and a corresponding correction term in the Hamiltonian, representing a semiclassical coupling of this wave function to an applied magnetic field, as so:
Here A^{μ} is the applied electromagnetic field, and the three sigmas are Pauli matrices. e is the charge of the particle, e.g. e = − e_{0} for the electron. On squaring out the first term, a residual interaction with the magnetic field is found, along with the usual Hamiltonian of a charged particle interacting with an applied field:
This Hamiltonian is now a 2 × 2 matrix, so the Schrödinger equation based on it,
must use a twocomponent wave function. Pauli had introduced the sigma matrices
as pure phenomenology—Dirac now had a theoretical argument that implied that spin was somehow the consequence of the marriage of quantum theory to relativity.
The Pauli matrices share the same properties as the Dirac matrices—they are all Hermitian, square to 1, and anticommute. This allows one to immediately find a representation of the Dirac matrices in terms of the Pauli matrices:
The Dirac equation now may be written as an equation coupling twocomponent spinors:
Notice that on the diagonal we find the rest energy of the particle. If we set the momentum to zero—that is, bring the particle to rest—then we have
The equations for the individual twospinors are now decoupled, and we see that the "top" and "bottom" twospinors are individually eigenfunctions of the energy with eigenvalues equal to plus and minus the rest energy, respectively. The appearance of this negative energy eigenvalue is completely consistent with relativity.
It should be strongly emphasized that this separation in the rest frame is not an invariant statement—the "bottom" twospinor does not represent antimatter as such in general. The entire fourcomponent spinor represents an irreducible whole—in general, states will have an admixture of positive and negative energy components. If we couple the Dirac equation to an electromagnetic field, as in the Pauli theory, then the positive and negative energy parts will be mixed together, even if they are originally decoupled. Dirac's main problem was to find a consistent interpretation of this mixing. As we shall see below, it brings a new phenomenon into physics—matter/antimatter creation and annihilation.
[edit] Covariant form and relativistic invariance
The explicitly covariant form of the Dirac equation is (employing the Einstein summation convention):
In the above, are the Dirac matrices. is Hermitian, and the are antiHermitian, with the definition
This may be summarized using the Minkowski metric on spacetime in the form
where the bracket expression means , the anticommutator. These are the defining relations of a Clifford algebra over a pseudoorthogonal 4d space with metric signature ( + − − − ). Note that one may also employ the metric form ( − + + + ) by multiplying all the gammas by a factor of i. At an elementary level, the choice may be regarded as conventional, but there are specific reasons^{[clarification needed]} for preferring the former, both mathematically and for convenience in calculation and physical interpretation. In the literature, one almost always finds the convention ( + − − − ) in use. The specific Clifford algebra employed in the Dirac equation is known as the Dirac algebra.
The Dirac equation may be interpreted as an eigenvalue expression, where the rest mass is proportional to an eigenvalue of the 4momentum operator, the proportion being the speed of light in vacuo:
In practice, physicists often use units of measure such that and c are equal to 1, known as "natural" units. The equation then takes the simple form
or, if Feynman slash notation is employed,
A fundamental theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations, then they are connected to each other by a similarity transformation:
If in addition the matrices are all unitary, as are the Dirac set, then S itself is unitary;
The transformation U is unique up to a multiplicative factor of absolute value 1. Let us now imagine a Lorentz transformation to have been performed on the derivative operators, which form a covariant vector. In order that the operator remain invariant, the gammas must transform among themselves as a contravariant vector with respect to their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality of the Lorentz transformation. By the fundamental theorem, we may replace the new set by the old set subject to a unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation will then take the form
If we now define the transformed spinor
then we have the transformed Dirac equation
Thus, once we settle on a unitary representation of the gammas, it is final provided we transform the spinor according the unitary transformation that corresponds to the given Lorentz transformation.
These considerations reveal the origin of the gammas in geometry, hearkening back to Grassmann's original motivation  they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as represent oriented surface elements, and so on. With this in mind, we can find the form the unit volume element on spacetime in terms of the gammas as follows. By definition, it is
In order that this be an invariant, the epsilon symbol must be a tensor, and so must contain a factor of , where g is the determinant of the metric tensor. Since this is negative, that factor is imaginary. Thus

 V = iγ^{0}γ^{1}γ^{2}γ^{3}
This matrix is given the special symbol , owing to its importance when one is considering improper transformations of spacetime, that is, those that change the orientation of the basis vectors. In the representation we are using for the gammas, it is
Also note that we could as easily have taken the negative square root of the determinant of g  the choice amounts to an initial handedness convention.
[edit] Lorentz Invariance of the Dirac equation
The Lorentz invariance of the Dirac equation follows from its covariant nature.
[edit] Comparison with the KleinGordon equation
In Feynman slash notation the KleinGordon equation:
can be factorised as:
The last factor is simply the Dirac equation. Hence any solution to the Dirac equation is automatically a solution to the KleinGordon equation, but the converse is not true: that is, not all solutions to the Klein–Gordon equation solve the Dirac equation.
[edit] Adjoint equation and Dirac current
By defining the adjoint spinor
and noticing that

 ,
we obtain, by taking the Hermitian conjugate of the Dirac equation and multiplying from the right by γ^{0}, the adjoint equation:
where is understood to act to the left. Multiplying the Dirac equation by from the left, and the adjoint equation by ψ from the right, and adding, produces the law of conservation of the Dirac current in covariant form:
Now we see the great advantage of the firstorder equation over the one Schrödinger had tried  this is the conserved probability current density required by relativistic invariance, only now its 0component is positive definite:
The Dirac equation and its adjoint are the Euler–Lagrange equations of the 4d invariant action integral
where the scalar L is the Dirac Lagrangian
and for the purposes of variation, and are regarded as independent fields. The relativistic invariance also follows immediately from the variational principle.
[edit] Coupling to an electromagnetic field
To consider problems in which an applied electromagnetic field interacts with the particles described by the Dirac equation, one uses the correspondence principle, and takes over into the theory the corresponding expression from classical mechanics, whereby the total momentum of a charged particle in an external field is modified as so:
(where q is the charge of the particle; for example, q = − e < 0 for an electron). In natural units, the Dirac equation then takes the form
This validity of this prescription is confirmed experimentally with great precision. It is known as minimal coupling, and is found throughout particle physics. Indeed, while the introduction of the electromagnetic field in this way is essentially phenomenological in this context, it rises to a fundamental principle in quantum field theory.
Now as stated above, the transformation U is defined only up to a phase factor e^{iθ}. Also, the fundamental observable of the Dirac theory, the current, is unchanged if we multiply the wave function by an arbitrary phase. We may exploit this to get the form of the mutual interaction of a Dirac particle and the electromagnetic field, as opposed to simply considering a Dirac particle in an applied field, by assuming this arbitrary phase factor to depend continuously on position:
Notice now that
In order to preserve minimal coupling, we must add to the potential a term proportional to the gradient of the phase. But we know from electrodynamics that this leaves the electromagnetic field itself invariant. The value of the phase is arbitrary, but not how it changes from place to place. This is the starting point of gauge theory, which is the main principle on which quantum field theory is based. The simplest such theory, and the one most thoroughly understood, is known as quantum electrodynamics. The equations of field theory thus have invariance under both Lorentz transformations and gauge transformations.
[edit] Curved spacetime Dirac equation
The Dirac equation can be written in curved spacetime using vierbein fields. Vierbeins describe a local frame that enables to define Dirac matrices at every point. Contracting these matrices with vierbeins give the right transformation properties. This way Dirac's equation takes the following form in curved spacetime ^{[1]}:
Here is the vierbein and D_{μ} is the covariant derivative for fermion fields, defined as follows
where η_{ac} is the Lorentzian metric, σ^{ab} is the commutator of Dirac matrices:
and is the spin connection:
where is the Christoffel symbol. Note that here, Latin letters denote the "Lorentzian" indices and Greek ones denote "Riemannian" indices.
[edit] Physical interpretation
The Dirac theory, while providing a wealth of information that is accurately confirmed by experiments, nevertheless introduces a new physical paradigm that appears at first difficult to interpret and even paradoxical. Some of these issues of interpretation must be regarded as open questions. Here we will see how the Dirac theory brilliantly answered some of the outstanding issues in physics at the time it was put forward, while posing others that are still the subject of debate.
[edit] Identification of observables
The critical physical question in a quantum theory is  what are the physically observable quantities defined by the theory? According to general principles, such quantities are defined by Hermitian operators that act on the Hilbert space of possible states of a system. The eigenvalues of these operators are then the possible results of measuring the corresponding physical quantity. In the Schrödinger theory, the simplest such object is the overall Hamiltonian, which represents the total energy of the system. If we wish to maintain this interpretation on passing to the Dirac theory, we must take the Hamiltonian to be
This looks promising, because we see by inspection the rest energy of the particle and, in case A = 0, the energy of a charge placed in an electric potential qA^{0}. What about the term involving the vector potential? In classical electrodynamics, the energy of a charge moving in an applied potential is
Thus the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and we must take great care to correctly identify what is an observable in this theory. Much of the apparent paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables. Let us now describe one such effect. (cont'd)
[edit] Energy per Oscillator
This section may require cleanup to meet Wikipedia's quality standards. 
In quantum mechanics, the measure E of the average energy per oscillator is given by:
Where

 T is the measure of the temperature
 h is the Planck's constant
 R is the universal gas constant
 F is the constant frequency
 N_{0} is the number of oscillators at the lowest energy
If we let x = e ^{− hF / RT}, then
and then
If and , then E satisfies:
and
[edit] History
Since the Dirac equation was originally invented to describe the electron, we will generally speak of "electrons" in this article. The equation also applies to quarks, which are also elementary spin½ particles. A modified Dirac equation can be used to approximately describe protons and neutrons, which are not elementary particles (they are made up of quarks), but have a net spin of ½. Another modification of the Dirac equation, called the Majorana equation, is thought to describe neutrinos — also spin½ particles.
The Dirac equation describes the probability amplitudes for a single electron. This is a singleparticle theory; in other words, it does not account for the creation and destruction of the particles. It gives a good prediction of the magnetic moment of the electron and explains much of the fine structure observed in atomic spectral lines. It also explains the spin of the electron. Two of the four solutions of the equation correspond to the two spin states of the electron. The other two solutions make the peculiar prediction that there exist an infinite set of quantum states in which the electron possesses negative energy. This strange result led Dirac to predict, via a remarkable hypothesis known as "hole theory," the existence of particles behaving like positivelycharged electrons. Dirac thought at first these particles might be protons. He was chagrined when the strict prediction of his equation (which actually specifies particles of the same mass as the electron) was verified by the discovery of the positron in 1932. When asked later why he hadn't actually boldly predicted the yet unfound positron with its correct mass, Dirac answered "Pure cowardice!" He shared the Nobel Prize anyway, in 1933.
Despite these successes, Dirac's theory is flawed by its neglect of the possibility of creating and destroying particles, one of the basic consequences of relativity. This difficulty is resolved by reformulating it as a quantum field theory. Adding a quantized electromagnetic field to this theory leads to the theory of quantum electrodynamics (QED). Moreover the equation cannot fully account for particles of negative energy but is restricted to positive energy particles.
A similar equation for spin 3/2 particles is called the RaritaSchwinger equation.
[edit] Hole theory
The negative E solutions found in the preceding section are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negativeenergy solutions. Since they exist, we cannot simply ignore them, for once we include the interaction between the electron and the electromagnetic field, any electron placed in a positiveenergy eigenstate would decay into negativeenergy eigenstates of successively lower energy by emitting excess energy in the form of photons. Real electrons obviously do not behave in this way.
To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the manybody quantum state in which all the negativeenergy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from occupying the same state, any additional electron would be forced to occupy a positiveenergy eigenstate, and positiveenergy electrons would be forbidden from decaying into negativeenergy eigenstates.
Dirac further reasoned that if the negativeenergy eigenstates are incompletely filled, each unoccupied eigenstate – called a hole – would behave like a positively charged particle. The hole possesses a positive energy, since energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally discovered by Carl Anderson in 1932.
It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negativeenergy electrons. The infinitely negative contributions from the sea of negativeenergy electrons has to be canceled by an infinite positive "bare" energy and the contribution to the charge density and current coming from the sea of negativeenergy electrons is exactly canceled by an infinite positive "jellium" background so that the net electric charge density of the vacuum is zero. In quantum field theory, a Bogoliubov transformation on the creation and annihilation operators (turning an occupied negativeenergy electron state into an unoccupied positive energy positron state and an unoccupied negativeenergy electron state into an occupied positive energy positron state) allows us to bypass the Dirac sea formalism even though, formally, it is equivalent to it.
In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the chemical potential of the system. An unfilled state in the Fermi sea behaves like a positivelycharged electron, though it is referred to as a "hole" rather than a "positron". The negative charge of the Fermi sea is balanced by the positivelycharged ionic lattice of the material.
[edit] Dirac bilinears
There are five different (neutral) Dirac bilinear terms not involving any derivatives:
 (S)calar: (scalar, Peven)
 (P)seudoscalar: (scalar, Podd)
 (V)ector: (vector, Peven)
 (A)xial: (vector, Podd)
 (T)ensor: (antisymmetric tensor, Peven),
where and .
A Dirac mass term is an S coupling. A Yukawa coupling may be S or P. The electromagnetic coupling is V. The weak interactions are VA.
[edit] See also
 Breit equation
 KleinGordon equation
 Quantum electrodynamics
 RaritaSchwinger equation
 Feynman checkerboard
 Theoretical and experimental justification for the Schrödinger equation
[edit] References
 ^ Lawrie, Ian D.. A Unified Grand Tour of Theoretical Physics.
[edit] Selected papers
 P.A.M. Dirac "The Quantum Theory of the Electron", Proc. R. Soc. A117) link to the volume of the Proceedings of the Royal Society of London containing the article at page 610
 P.A.M. Dirac "A Theory of Electrons and Protons", Proc. R. Soc. A126) link to the volume of the Proceedings of the Royal Society of London containing the article at page 360
 C.D. Anderson, Phys. Rev. 43, 491 (1933)
 R. Frisch and O. Stern, Z. Phys. 85, 4 (1933)
[edit] Textbooks
 Halzen, Francis; Martin, Alan (1984). Quarks & Leptons: An Introductory Course in Modern Particle Physics. John Wiley & Sons. ISBN.
 Dirac, P.A.M., Principles of Quantum Mechanics, 4th edition (Clarendon, 1982)
 Shankar, R., Principles of Quantum Mechanics, 2nd edition (Plenum, 1994)
 Bjorken, J D & Drell, S, Relativistic Quantum mechanics
 Thaller, B., The Dirac Equation, Texts and Monographs in Physics (Springer, 1992)
 Schiff, L.I., Quantum Mechanics, 3rd edition (McGrawHill, 1955)
 Griffiths, D.J., Introduction to Elementary Particles (Wiley, John & Sons, Inc., 1987) ISBN 0471603864.
[edit] External links
 The Dirac Equation at MathPages