Gauge theories
From Scholarpedia
| This article is undergoing 2 initial reviews; It may contain inaccuracies and unapproved changes made by anonymous reviewers. | ||||||||||||||||||||
Author: Dr. Gerard ′t Hooft, Institute for Theoretical
Physics
Universiteit Utrecht
Gauge theories refers to a quite general class of quantum field theories used for the description of elementary particles and their interactions. The theories are characterized by the presence of vector fields, and as such are a generalization of the older theory of Quantum Electrodynamics (QED) that is used to describe the electromagnetic interactions of charged elementary particles with spin 1/2. Local gauge invariance is a very central issue. An important feature is that these theories are often renormalizable when used in 3 space- and 1 time dimension.
1. Maxwell's equations and gauge invariance
The simplest example of a gauge theory is electrodynamics, as
described by the Maxwell equations. The electric field strength
and the magnetic field strength
obey the homogeneous Maxwell equations
(in SI units):
- (1)
- (2)
.
According to Poincaré's Lemma,
Eq. (2) implies that there exists another
vector field
such that
- (3)
.
Since Eq. (1) now reads
- (4)
,
we can also conclude that there is a potential field
such that
- (5)
.
The field
is the electric potential field;
the vector field
is called the vector potential field.
The strengths of these potential fields are
determined by the inhomogeneous Maxwell equations, which are the
equations that relate the strengths of the electromagnetic fields to
the electric charges and currents that generate these fields. The use
of potential fields often simplifies the problem of solving Maxwell's
equations.
What turns this theory into a gauge theory is the fact that the
values of these potential fields are not completely determined by
Maxwell's equations. Consider an electromagnetic field configuration
, and suppose that
it is described by the potential fields
. Then, using any arbitrary scalar function
, one can find a different set of
potential fields describing the same electric and magnetic fields, by
writing
- (6)
.
Inspecting Equations (3) and (5), one
easily observes that
and
. Thus, the set (
) and
(
) describe the same physical situation.
Because of this, we call the transformation (6) a
gauge transformation. Since
may be chosen
to be an arbitrary function of the points
in
space-time, we speak of a local gauge transformation. The fact
that the electromagnetic fields are invariant under these local gauge
transformations turns Maxwell's theory into a gauge theory.
In relativistic quantum field theory, the field
of a non-interacting spinless particle would typically
obey the equation
- (7)
,
where units where used such that the velocity of light
, and Planck's
constant
. This gives the dispersion relation between energy and momentum as
dictated by Special Relativity:
- (8)
,
Suppose now that the particle in question carries an electric charge
. How is its equation then affected by the presence of
electro-magnetic fields? It turns out that one cannot write the
correct equations using the fields
and
directly. Here, one can only choose to add terms depending
on the (vector) potential fields instead:
- (9)
.
It can be verified that this equation correctly produces waves that
are deflected by the electro-magnetic forces in the way one expects.
For instance, the energy
is easily seen to be enhanced
by an amount
, which is the potential
energy of a charged particle in an electric potential field.
However, what happens to this equation when performing a gauge
transformation? It appears as if the equation changes, so that the
solution for the field
should change as well.
Indeed,
changes in the following way:
- (10)
.
Thus, the field
makes a rotation in the complex plane. This is closely related
to a 'scale transformation', which would result if one were to remove the 'i' from Eq. (10).
It was Hermann Weyl who noted that this symmetry transformation simply redefines the scale of the field
, and introduced the word 'gauge' to describe this feature.
The combinations
- (11)
,
are called covariant derivatives, because they are chosen in
such a way that the derivatives of the function
cancel out in a gauge transformation:
and this makes it easy to see that Equation (10)
correctly describes the way
transforms under a
local gauge transformation, obeying the same field equation
(9) both before and after the transformation (all terms
in the equation are multiplied by the same exponential
, so that that factor is immaterial).
The absolute value,
does not change
at all under a gauge transformation, and indeed this is the quantity
that corresponds to something that is physically observable: it is
the probability that a particle can be found at
. A rule of thumb is that local gauge invariance requires
all derivatives in our equations to be replaced by covariant
derivatives.
2. Yang-Mills theory
In the 1950s, it was known that the field equations for the field of
a proton,
, and the field of a neutron,
, are such that one can rotate these fields
in a complex two-dimensional space:
- (14)
,
where the matrix
may contain four arbitrary complex numbers, as long
as it is unitary (
), and usually, the
determinant of
is restricted to be 1. Since these
equations resemble the rotations one can perform in ordinary space,
to describe spin of a particle, the symmetry in question here was
called isospin.
In 1954, C.N. Yang and R.L. Mills published a very important idea.
Could one modify the equations in such a way that these isospin
rotations could be regarded as local gauge rotations? this would
mean that, unlike the case that was known, the matrices
should be allowed to depend on space and time, just
like the gauge generator
in
electromagnetism. Yang and Mills were also inspired by the observation
that Einstein's theory of gravity, General Relativity, also
allows for transformations very similar to local gauge
transformations: the replacement of the coordinate frame by other
coordinates in an arbitrary, space-time dependent way.
To write down field equations for protons and neutrons, one needs the
derivatives of these fields. The way these derivatives transform
under a local gauge transformation implies that there will be terms
containing the gradients
of the matrices
. To make the theory gauge-invariant, these gradients
would have to be cancelled out, and in order to do that, Yang and
Mills replaced the derivatives
by covariant
derivatives
, as
was done in electromagnetism, see Equation (11). Here,
however, the fields
had to be matrix-valued, just
as the isospin
matrices:
- (15)
Since the
matrices contain four coefficients with one constraint (the
determinant has to be 1), one ends up with a set of three new vector
fields (there are 3 independent real vectors in the matrix
(15)). At first sight, they appear to be the fields of a vector
particle with isospin one. In practice, this should correspond to
particles with one unit of spin (i.e., the particle rotates about its
axis), and its electric charge could be neutral or one or minus one
unit. Yang-Mills theory therefore predicts and describes a new type
of particles with spin one that transmit a force not unlike the
electro-magnetic force.
The fields that are equivalent to Maxwell's electric and magnetic fields are obtained by considering the commutator of two covariant derivatives:
- (16)
,
where the indices take the values
,
with 0 referring to the time-component.
Since
, this tensor has 6
independent components, three forming an electric vector field, and
three a magnetic field. Each of these components is also a matrix.
The commutator,
is a new, non-linear
term, which makes the Yang-Mills equations a lot more complicated
than the Maxwell system.
In other respects, the Yang-Mills particles, being the energy quanta of the Yang-Mills fields, are similar to photons, the quanta of light. Yang-Mills particles also carry no intrinsic mass, and travel with the speed of light. Indeed, these features were at first reasons to dismiss this theory, because massless particles of this sort should have been detected long ago, whereas they were conspicuously absent.
3. The Brout-Englert-Higgs mechanism
The theory was revived when it was combined with
spontaneous breakdown of local gauge symmetry, also known as the
Brout-Englert-Higgs mechanism. Consider a scalar (spinless)
particle described by a field
. This field
is assumed to be a vector field, in the sense that it undergoes some
rotation when a gauge transformation is performed. In practice this
means that the particle carries one or several kinds of charges that
make it sensitive to the Yang-Mills force, and often it has several
components, which means there are various species of this particle.
Such particles must obey Bose-Einstein statistics, which
implies that it can undergo Bose-Einstein condensation. In
terms of its field
this means the following:
- In the vacuum the field
takes a non-vanishing value
.
This is usually written as
- (17)
.
After a local gauge transformation, this would look like
- (18)
,
where
is a matrix field representing the
local gauge transformation.
It is often said that, therefore, the vacuum is not gauge-invariant,
but, strictly speaking, this is not correct. The situation described
by Equation (18) is the same vacuum as
(17); it is only described differently. However, this
property of the vacuum does have important consequences. Due to the
fact that the rotated field now describes the same situation as the
previous value, there is no different physical particle associated to
the rotated field. Only the length of the vector
has physical significance. This length is
gauge-invariant. therefore, only the length of the vector
is associated to one type of particle, which must be neutral
for the Yang-Mills forces. This particle is now called the
Higgs particle.
As the Higgs field is a constant source for the Yang-Mills field
strength, the Yang-Mills field equations are modified by it. Due to
the Higgs field, the Yang-Mills "photons" described by the Yang-Mills
field
get a mass. This can also be
explained as follows. Massless photons can only have two helicity
states, that is, they can spin only in two directions. This is
related to the fact that light can be polarized in exactly two
directions. Massive photons (particles with non-vanishing mass and
with one unit of spin), can always spin in three directions. This
third rotation mode is now provided by the Higgs field, which itself
loses several of its physical components. The total number of
physical field components stays the same before and after the Brout-Englert-Higgs
mechanism. A further consequence of this effect on the Yang-Mills
field is that the force transmitted by the massive photons is a
short-range one (the range of the force being inversely proportional
to the mass of the photon).
The weak interactions could now be successfully described
by a Yang-Mills theory. The set of local gauge transformations forms
the mathematical group
. This
group generates 4 species of photons (3 for
and 1
for
). The Brout-Englert-Higgs mechanism breaks this group down
in such a way that a subgroup of the form
remains.
This is the electromagnetic theory, with just one photon. The other
three photons become massive; they are responsible for the weak
interactions, which in practice appear to be weak just because these
forces have a very short range. With respect to electromagnetism, two
of these intermediate vector bosons,
,
are electrically charged, and a third,
, is
electrically neutral. When the latter's existence was derived from
group theoretical arguments, this gave rise to the prediction of a
hitherto unnoticed form of the weak interaction: the
neutral current interaction. This theory, that combines electromagnetism
and the weak force into one, is called the electro-weak theory,
and it was the first fully renormalizable theory for the weak force
(see Chapter 5).
4. Quantum Chromodynamics
When it was understood that the weak interactions, together with the electromagnetic ones, can be ascribed to a Yang-Mills gauge theory, the question was asked how to address the strong force, a very strong force with relatively short range of action, which controls the behavior of the hadronic particles such as the nucleons and the pions. It was understood since 1961 that these particles behave as if built from subunits, called quarks. Three varieties of quarks were known (up, down, and strange), and three more would be discovered later (charm, top, and bottom). These quarks have the peculiar property that they permanently stick together either in triplets, or one quark sticks together with one anti-quark. Yet when they approach one another very closely, they begin to behave more freely as individuals.
These features we now understand as, again, being due to a Yang-Mills
gauge theory. Here, we have the mathematical group
as local gauge group, while now the symmetry is not affected by
any Brout-Englert-Higgs mechanism. Due to the non-linear nature of
the Yang-Mills field, it self-interacts, which forces the fields to
come in patterns quite different from the electromagnetic case:
vortex lines are formed, which form unbreakable bonds
between quarks. At close distances, the Yang-Mills force becomes
weak, and this is a feature that can be derived in an elementary way
using perturbation expansions, but it is a property of the quantized
Yang-Mills system that hitherto had been thought to be impossible for
any quantum field theory, called asymptotic freedom. The
discovery of this feature has a complicated history.
implies that every species of quark comes in three
types, referred to as color: they are "red", "green" or "blue".
The field of a quark is therefore a 3-component vector in an internal
'color' space. Yang-Mills gauge transformations rotate this vector in
color space. The Yang-Mills fields themselves form 3 by 3 matrices,
with one constraint (since the determinant of the Yang-Mills gauge
rotation matrices must be kept equal to one). Therefore, the
Yang-Mills field has 8 colored photon-like particles, called
gluons. Anti-quarks carry the conjugate colors ("cyan",
"magenta" or "yellow"). The theory is now called Quantum chromodynamics
(QCD). It is also a renormalizable theory.
The gluons effectively keep the quarks together in such a way that their colors add up to a total that is color-neutral ("white" or a "shade of gray"). This is why either three quarks or one quark and one anti-quark can sit together to form a physically observable particle (a hadron). This property of the theory is called permanent quark confinement. Because of the strongly non-linear nature of the fields, quark confinement is in fact quite difficult to prove, whereas the property of asymptotic freedom can be demonstrated exactly. Indeed, a mathematically air-tight demonstration of confinement, with the associated phenomenon of a mass gap in the theory (the absence of strictly massless hadronic objects) has not yet been given, and is the subject of a $1,000,000,- prize, issued by the Clay Mathematics Institute of Cambridge, Massachusetts.
5. The Lagrangian
One cannot choose all field equations at will. They must obey conditions such as energy conservation.
This implies that there is an action principle (action = reaction), and this principle is most
conveniently expressed by writing the Lagrangian for the theory. The
Lagrangian (more precisely, Lagrange density)
is an expression in
terms of the fields of the system. For a real scalar field
it is
- (19)
and for the Maxwell fields it is
- (20)
where the summation is the Lorentz covariant summation over the Lorentz indices
.
The field equations can all be derived from this expression by demanding that the action integral,
- (21)
where
is the sum of the Lagrangians of all fields in the system,
be stationary under all infinitesimal variations of these fields. This is called the Euler-Lagrange principle,
and the equations are the Euler-Lagrange equations.
For gauge theories this generalizes directly: one writes
- (22)
using the expression (16) for the gauge fields
, and adds
all terms associated to the other fields that are introduced. All symmetries of the theory are the symmetries of the Lagrangian, and the dimensionality of all coupling strengths can easily be read off from the Lagrangian as well,
which is of importance for the renormalization procedure (see next chapter).
6. Renormalization and Anomalies
According to the laws of quantum mechanics, the energy in a field consists of energy packets, and these energy packets are in fact the particles associated to the field. Quantum mechanics gives extremely precise prescriptions on how these particles interact, as soon as the field equations are known and can be given in the form of a Lagrangian. The theory is then called quantum field theory (QFT), and it explains not only how forces are transmitted by the exchange of particles, but it also states that multiple exchanges should occur. In many older theories, these multiple exchange gave rise to difficulties: their effects seem to be unbounded, or infinite. In a gauge theory, however, the small distance structure is very precisely prescribed by the requirement of gauge-invariance. In such a theory one can combine the infinite effects of the multiple exchanges with redefinitions of masses and charges of the particles involved. This procedure is called renormalization. In 3 space and 1 time dimension, most gauge theories are renormalizable. This allows us to compute the effects of multiple particle exchanges to high accuracy, thus allowing for detailed comparison with experimental data.
Renormalization requires that masses and coupling strengths of particles be defined very carefully. If all coupling parameters of a theory are given a mass-dimensionality that is zero or positive, the number of divergent expressions stays under control. Usually, requiring the theory to remain gauge invariant throughout the renormalization procedure leaves no ambiguity for the definitions. However, it is not obvious that unambiguous, gauge invariant definitions exist at all, since gauge invariance has to hold for all interactions, whereas only a few infinite expressions can be replaced by finite ones.
The proof that showed how and why unambiguous renormalized
expressions can be obtained, could be most elegantly obtained by
realizing that gauge theories can be formulated in any number of
space-time dimensions. It was even possible to define all Feynman
diagrams unambiguously for theories in spaces where the dimensions
are
, where
is an
infinitesimal quantity. Taking the limit
requires the subtraction of poles of the form
from the original, "bare" mass and coupling
parameters. The result is a set of unique, finite and gauge invariant
expressions. In practice, it was found that this procedure, called
dimensional regularization and renormalization is also
convenient for carrying out technically complicated calculations of
loop diagrams.
However, there is a special case where extension to dimensions
different from the canonical one is impossible. This is when
fermionic particles exhibit chiral symmtry. Chiral symmetry is a
symmetry that distinguishes left-rotating from right rotating
particles, and indeed it plays a crucial role in the Standard Model.
Chiral symmetry is only possible if space is 3 dimensional, and so
does not allow for dimensional renormalization. Indeed, sometimes chiral
symmetry cannot be preserved when renormalizing the theory.
An anomaly occurs, called chiral anomaly.
It was first discovered when a calculation of the
decay amplitude gave answers that did not
follow the expected symmetry pattern.
Since the gauge symmetries of the Standard Model do distinguish left rotating from right rotating particles (in particular, only left-rotating neutrinos are produced in a weak interaction), anomalies were a big concern. It so happens, however, that all anomalous amplitudes that would jeopardize gauge invariance and hence the self consistency of our equations, all cancel out. This is related to the fact that certain "grand unified" extensions of the Standard Model are based on anomaly free gauge groups (see Chapter 7).
The anomaly has a direct physical implication. A topologically twisted field configuration called the instanton (because it represents an event at a given instant in time), represents exactly the gauge field configuration where the anomaly is maximal. It causes a violation of the conservation of some of the gauge charges. When there is an anomaly, at least one of the charges involved cannot be a gauge charge, but must be a charge to which no gauge field is coupled, like baryonic charge. Indeed, in the electroweak theory, instantons trigger the violation of the conservation laws of baryons. It is now believed that this might explain the imbalance between matter and antimatter that must have arisen during early phases of the Universe.
7. Standard Model
Apart from the weak force, the electromagnetic force and the strong
force, there is the gravitational force acting upon elementary
particles. No other elementary forces are known. At the level of
individual particles, gravity is so weak that it can be ignored in
most cases. Suppose now that we take the
Yang-Mills system, together with the Higgs field, to
describe electromagnetism and the weak force, and add to this the
Yang-Mills theory for the strong force, and we
include all known elementary matter fields, being the quarks and
the leptons, with their appropriate transformation rules under a
gauge transformation; suppose we add to this all possible ways these
fields can mix, a feature observed experimentally, which can be
accounted for as a basic type of self-interaction of the fields. Then
we obtain what is called the Standard Model. It is one great
gauge theory that literally represents all our present understanding
of the subatomic particles and their interactions.
The Standard Model owes its strength to the fact that it is renormalizable. It has been subject of numerous experimental experiments and observations. It has withstood all these tests remarkably well. One important modification became inevitable around the early 1990's: in the leptonic sector, also the neutrinos carry a small amount of mass, and their fields mix. This was not totally unexpected, but highly successful neutrino experiments (in particular the Japanese Kamiokande experiment) now had made it clear that these effects are really there. They actually implied a further reinforcement of the Standard Model.
One ingredient has not yet been confirmed: the Higgs particle. Observation of this object is expected in the near future, notably by the Large Hadron Collider at CERN, Geneva. The simplest versions of the Standard model only require one single, electrically neutral Higgs particle, but the 'Higgs sector' could be more complicated: the Higgs could be much heavier than presently expected, or there could exist more than one variety, in which case also electrically charged scalar particles would be found.
The Standard Model is not perfect from a mathematical point of view. At extremely high energies (energies much higher than what can be attained today in the particle accelerators), the theory becomes unnatural. In practice, this means that we do not believe anymore that everything will happen exactly as prescribed in the theory; new phenomena are to be expected. The most popular scenario is the emergence of a new symmetry called supersymmetry, a symmetry relating bosons with fermions (particles such as electrons and quarks, which require Dirac fields for their description).
8. Grand Unified Theories
It is natural to suspect that the electroweak forces and the strong
forces should also be connected by gauge rotations. This would imply
that all forces among the subatomic particles are actually related by
gauge transformations. There is no direct evidence for this, but
there are several circumstances that appear to point in this
direction. In the present version of the Standard Model, the
Yang-Mills fields, describing the strong force, indeed
exhibit very large coupling strengths, whereas the
sector, describing the electric (and part of the weak) sector, has a
tiny coupling strength. One can now use the mathematics of
renormalization, in particular the so-called renormalization group,
to calculate the effective strengths of these forces at
much higher energies. It is found that the
forces
decrease in strength, due to asymptotic freedom, but that the
coupling strength increases. The
force varies more slowly. At extremely high energies, corresponding
to ultra short distance scales, around
cm, the
three coupling strengths appear to approach one another, as if that
is the place where the forces unite.
It was found that
and
fir quite nicely in a group called
. They indeed form a subgroup of
. One may then assume that a Brout-Englert-Higgs
mechanism breaks this group down to a
subgroup. One obtains a so-called Grand Unified Field theory.
In this theory, one assumes three generations of
fermions, each transforming in the same way under
transformations (mathematically, they form a
and a
representation).
The
theory, however, predicts that the proton can
decay, extremely slowly, into leptons and pions. The decay has been
searched for but not found. Also, in this model, it is not easy to
account for the neutrino mass and its mixings. A better theory was
found where
is enlarged into
.
The
and the
representations of
together with a single right handed neutrino field,
combine in to a
representation of
(one for each of the three generations).
This grand unified model puts the neutrinos at the same level as the
charged leptons. Often, it is extended to a supersymmetric version.
9. Final remarks
Any gauge theory is constructed as follows. First, choose the gauge
group. This can be the direct product of any number of irreducible,
compact Lie groups, either of the series
,
or
, or the exceptional groups
or
. Then, choose
fermionic (spin 1/2) and scalar (spin 0) fields forming
representations of this local gauge group. The left helicity and
the right helicity components of the fermionic fields may be in
different representations, provided that the anomalies cancel out.
Besides the local gauge group, we may impose exact and/or approximate
global symmetries as well. Finally, choose mass terms and interaction
terms in the Lagrangian, described by freely adjustable coupling
parameters. There will be only a finite number of such parameters,
provided that all interactions are chosen to be of the renormalizable
type (this can now be read off easily from the theory's Lagrangian).
There are infinitely many ways to construct gauge theories along these lines. However, it seems that the models that are most useful to describe observed elementary particles, are the relatively simple ones, based on fairly elementary mathematical groups and representations. One may wonder why Nature appears to be so simple, and whether it will stay that way when new particles and interactions are discovered. Conceivably, more elaborate gauge theories will be needed to describe interactions at energies that are not yet attainable in particle accelerators today.
Related subjects are Supersymmetry and Superstring theory. They are newer ideas about particle structure and particle symmetries, where gauge invariance also plays a very basic role.
| Invited by: | Dr. Riccardo Guida, Institut de Physique Théorique; CEA, IPhT; CNRS; Gif-sur-Yvette, France |

,
,


that finds an asymmetric value
.




