Axial anomaly

Roman W. Jackiw (2008), Scholarpedia, 3(10):7302. doi:10.4249/scholarpedia.7302, revision #136939

Curator: Roman W. Jackiw

The axial anomaly is a quantum term that violates the classical conservation of the axial current.


Quantum Symmetry Anomalies

A mathematical model for physical phenomena may possess a symmetry when its dynamics is analyzed in terms of unquantized, commuting variables, but the symmetry may disappear when the dynamics is quantized and the analysis is performed in terms of non-commuting quantum variables. Such a tenuous symmetry is said to be "anomalous," beset by a "quantum symmetry anomaly." Correspondingly, constants of motion of the unquantized theory are no longer conserved when quantum effects are taken into account \(^1\). In greater detail, the effect arises for the following reason. Quantized dynamics frequently involves an infinite number of degrees of freedom, even when the classical, unquantized version has only a finite number. This infinity leads to various divergences, especially in quantum field theory (but also in some quantum mechanical systems) \(^2\), and these divergences have to be regularized and "renormalized" to make the quantum theory well defined. Symmetry anomalies arise when the regularization and renormalization procedures needed to define the theory do not respect the putative symmetries.

The first instances of quantum symmetry anomalies were identified for models which appear to possess symmetries associated with masslessness: scale symmetry and, for Dirac-Fermions, axial symmetry. We shall here discuss the anomalies in axial symmetries of Dirac Fermions, also called the "Adler-Bell-Jackiw anomalies."\(^1\)

A massless, non-interacting Dirac-Fermi field satisfies the equation

\[\tag{1} i\hbar\, \gamma^\mu\ \frac{\partial}{\partial x^\mu} \ \psi (x) = 0. \]

Summation over a repeated index is implied. \(\psi\) is a 4-component column spinor; \(x\) stands for the space-time variables \(x^0 = c t \ \mbox{and}\ x^i = r^i \ (i = 1, 2, 3)\ ;\) the index \(\mu\) ranges over temporal \((0)\) and spatial \((i)\) components, and \(\gamma^\mu \ (\mu = 0, 1, 2, 3)\) comprise a set of \(4 \times 4\) Dirac matrices, whose explicit form will not concern us, beyond noting that they satisfy the Clifford algebra.

\[\tag{2} \begin{array}{lcl} \gamma^\mu\gamma^\nu + \gamma^\nu \gamma^\mu = 2 g^{\mu \nu} \, I\\ g^{\mu \nu} = \mbox{diag}\ (1, -1, -1, -1) \ \mbox{(Lorentz signature)} \end{array} \]
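The Clifford algebra (2) is easy to confirm numerically for any explicit choice of matrices. The sketch below assumes the Dirac representation, which the text does not specify (any equivalent representation would serve); the helper names are illustrative only.

```python
# Check gamma^mu gamma^nu + gamma^nu gamma^mu = 2 g^{mu nu} I
# in the Dirac representation (an assumed, conventional choice).

def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

def neg(a):
    """Entrywise negative of a matrix."""
    return [[-x for x in row] for row in a]

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]  # Pauli matrices

# gamma^0 = diag(I, -I), gamma^i = [[0, sigma_i], [-sigma_i, 0]]
GAMMA = [block(I2, Z2, Z2, neg(I2))] + [block(Z2, s, neg(s), Z2) for s in SIGMA]
METRIC = [1, -1, -1, -1]  # diagonal of g^{mu nu}

def clifford_holds(tol=1e-12):
    """Return True if every anticommutator equals 2 g^{mu nu} times the identity."""
    for mu in range(4):
        for nu in range(4):
            ab = matmul(GAMMA[mu], GAMMA[nu])
            ba = matmul(GAMMA[nu], GAMMA[mu])
            for i in range(4):
                for j in range(4):
                    expected = 2 * METRIC[mu] if (mu == nu and i == j) else 0
                    if abs(ab[i][j] + ba[i][j] - expected) > tol:
                        return False
    return True
```

Calling `clifford_holds()` should return `True` for this representation; swapping in another set of matrices is a quick way to test whether they are an acceptable choice of \(\gamma^\mu\).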

For massive fields the equation reads

\[\tag{3} i\hbar\, \gamma^\mu\ \frac{\partial}{\partial x^\mu} \ \psi(x) - m\, c \ \psi (x) = 0, \]

where \(m\) is the mass. (Henceforth we set Planck's constant \(\hbar\) and the velocity of light \(c\) to unity.)

The equations (1) and (3) possess a gauge symmetry.

\[\tag{4} \psi (x) \to e^{i\theta}\, \psi(x) \]

If \(\psi(x)\) is a solution, so is \(e^{i\theta}\, \psi(x)\) where \(\theta\) is an arbitrary constant. And this symmetry is present whether \(\psi\) is a classical field or a quantum field operator. As a consequence of this symmetry, the charge

\[\tag{5} Q \equiv \int d^3 \, r\ \psi^\dagger \psi \]

is time independent, or equivalently a charge current 4-vector \(J^\mu\)

\[\tag{6} J^\mu \equiv \psi^\dagger\, \gamma^0\, \gamma^\mu\, \psi \]

satisfies a continuity equation

\[\tag{7} \frac{\partial}{\partial x^\mu}\ J^{\mu} (x) = 0. \]

[\(\psi^\dagger\) is a 4-component, row spinor, with entries that are complex conjugates of \(\psi\) (in the unquantized theory) or Hermitian conjugates of \(\psi\) (in the quantized theory).]

It is interesting to delve deeper into the matrix structure of these equations. Upon defining the Hermitian \(\gamma_5\) matrix, which squares to the identity, by

\[\tag{8} \gamma_5 = i \gamma^0 \, \gamma^1\, \gamma^2\, \gamma^3, \quad \gamma_5\, \gamma_5 = I, \]

we verify that \(\gamma_5\) anti-commutes with the Dirac matrices \(\gamma^\mu\ .\) Next we construct chiral projection matrices

\[\tag{9} P_\pm = \frac{1}{2}\ (I \pm\gamma_5), \ \ P_\pm + P_\mp = I,\ \ P_\pm P_\pm = P_\pm, \ \ P_\pm P_\mp = 0, \]

which select chiral components of \(\psi\ .\)

\[\tag{10} \psi_\pm \equiv P_\pm \, \psi, \quad \gamma_5\, \psi_\pm = \pm\, \psi_\pm \]
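The algebraic statements in (8)-(10) can be checked mechanically. The following is a minimal sketch, again assuming the Dirac representation (not specified in the text) and using illustrative helper names. It also checks the relation \(P_\mp \gamma^\mu = \gamma^\mu P_\pm\), which is what allows a projector to be moved through the Dirac operator.

```python
# Numerical check of the gamma_5 and chiral-projector identities.
# The Dirac representation is an assumption made for this sketch.

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def dagger(a):
    """Hermitian conjugate (conjugate transpose)."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) <= tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def block(a, b, c, d):
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
GAMMA = [block(I2, Z2, Z2, scale(-1, I2))] + \
        [block(Z2, s, scale(-1, s), Z2) for s in SIGMA]

IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
ZERO = [[0] * 4 for _ in range(4)]

# gamma_5 = i gamma^0 gamma^1 gamma^2 gamma^3, as in Eq. (8)
G5 = scale(1j, matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3])))
P_PLUS = scale(0.5, add(IDENTITY, G5))
P_MINUS = scale(0.5, add(IDENTITY, scale(-1, G5)))

def chirality_checks():
    """Return a dict mapping each identity to True/False."""
    return {
        "gamma5 squares to I": close(matmul(G5, G5), IDENTITY),
        "gamma5 is Hermitian": close(dagger(G5), G5),
        "gamma5 anticommutes with gamma^mu": all(
            close(add(matmul(G5, g), matmul(g, G5)), ZERO) for g in GAMMA),
        "P+ + P- = I": close(add(P_PLUS, P_MINUS), IDENTITY),
        "P+- are projectors": close(matmul(P_PLUS, P_PLUS), P_PLUS)
        and close(matmul(P_MINUS, P_MINUS), P_MINUS),
        "P+ P- = 0": close(matmul(P_PLUS, P_MINUS), ZERO),
        "P- gamma^mu = gamma^mu P+": all(
            close(matmul(P_MINUS, g), matmul(g, P_PLUS)) for g in GAMMA),
    }
```

Every entry of `chirality_checks()` should come out `True`; the identities themselves are representation independent, so the choice of matrices only affects the explicit entries.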

By action of \(P_\pm\) on the equations (1) and (3) we obtain decoupled equations for the chiral components \(\psi_\pm\) in the massless case

\[\tag{11} i \, \gamma^\mu\ \frac{\partial}{\partial x^\mu}\ \psi_\pm (x) = 0, \]

but a mixing remains in the massive case,

\[\tag{12} i \, \gamma^\mu\ \frac{\partial}{\partial x^\mu}\ \psi_\pm (x) - m \, \psi_\mp (x) = 0, \]

while the charge (5) and the current (6) become summed expressions of the \((+)\) variables and the \((-)\) variables.

\[\tag{13} Q = \int d^3 r \left(\psi^\dagger_+ \psi_+ + \psi^\dagger_- \psi_-\right) = Q_+ + Q_- \ :\]
\[\tag{14} J^\mu = \psi^\dagger_+ \, \gamma^0 \gamma^\mu\, \psi_+ + \psi^ \dagger_-\, \gamma^0 \gamma^\mu \, \psi_- = J^\mu_+ + J^\mu_- \]
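The splitting in (13) and (14) rests on the vanishing of the cross terms between \(\psi_+\) and \(\psi_-\). A minimal numerical sketch of this, for a randomly chosen classical spinor, is given below; the Dirac representation, the random seed and all helper names are assumptions of the example, not part of the article.

```python
import random

# Check J^mu = J^mu_+ + J^mu_- for a random classical 4-component spinor.
# Dirac representation assumed; helper names are illustrative.

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def block(a, b, c, d):
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]

def neg(a):
    return [[-x for x in row] for row in a]

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
GAMMA = [block(I2, Z2, Z2, neg(I2))] + [block(Z2, s, neg(s), Z2) for s in SIGMA]

G5 = [[1j * x for x in row]
      for row in matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3]))]
P_PLUS = [[0.5 * ((i == j) + G5[i][j]) for j in range(4)] for i in range(4)]
P_MINUS = [[0.5 * ((i == j) - G5[i][j]) for j in range(4)] for i in range(4)]

def apply(m, v):
    """Matrix acting on a column spinor."""
    return [sum(m[i][j] * v[j] for j in range(4)) for i in range(4)]

def current(phi, psi, mu):
    """Bilinear phi^dagger gamma^0 gamma^mu psi."""
    w = apply(matmul(GAMMA[0], GAMMA[mu]), psi)
    return sum(phi[i].conjugate() * w[i] for i in range(4))

def decomposition_error(seed=0):
    """Largest violation of J = J_+ + J_- and of 'cross terms vanish'."""
    rng = random.Random(seed)
    psi = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(4)]
    psi_p, psi_m = apply(P_PLUS, psi), apply(P_MINUS, psi)
    worst = 0.0
    for mu in range(4):
        total = current(psi, psi, mu)
        chiral_sum = current(psi_p, psi_p, mu) + current(psi_m, psi_m, mu)
        cross = current(psi_p, psi_m, mu)
        worst = max(worst, abs(total - chiral_sum), abs(cross))
    return worst
```

The returned value should be zero up to floating-point rounding. The \(\mu = 0\) component reproduces the integrand of (13), since \(\gamma^0 \gamma^0 = I\).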

Since in the massless model there is no mixing between \((+)\) and \((-)\) components, it follows that \(Q_+\) and \(Q_-\) are separately conserved, and that \(J^\mu_+\) and \(J^\mu_-\) separately obey continuity equations. Alternatively and equivalently one can state that in the massless case the axial vector current

\[\tag{15} J^\mu_5 = \psi^\dagger \gamma^0\, \gamma^\mu \, \gamma_5 \, \psi = J^\mu_+ - J^\mu_- \]

satisfies a continuity equation,

\[\tag{16} \frac{\partial}{\partial x^\mu}\ J^\mu_5\, (x) = 0 \]

and that the axial charge

\[\tag{17} Q_5 \equiv \int d^3 r \, \psi^\dagger \, \gamma_5\, \psi \]

is time independent. The additional constant of motion arises as a consequence of the axial gauge symmetry. The transformation

\[\tag{18} \psi \to e^{i \gamma_5 \theta}\, \psi = (\cos \theta + i\, \gamma_5\, \sin \theta) \ \psi, \ \psi_\pm \to e^{\pm\, i\, \theta}\, \psi_\pm \]

maps solutions into solutions of the massless equation, and this is true whether \(\psi\) is a classical field or a quantum field operator.
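The role of masslessness can be made explicit by a short, standard computation, sketched here in the conventions of (3) with \(\hbar = c = 1\). The shorthand \(\bar\psi \equiv \psi^\dagger \gamma^0\) and the Hermiticity property \(\gamma^{\mu\dagger} = \gamma^0 \gamma^\mu \gamma^0\) are assumptions introduced only for this aside. Eq. (3) and its conjugate read

\[ \gamma^\mu\, \frac{\partial \psi}{\partial x^\mu} = -i\, m\, \psi, \qquad \frac{\partial \bar\psi}{\partial x^\mu}\, \gamma^\mu = i\, m\, \bar\psi, \]

so that, using the anti-commutation of \(\gamma_5\) with \(\gamma^\mu\ ,\)

\[ \frac{\partial}{\partial x^\mu}\, \left(\bar\psi\, \gamma^\mu\, \gamma_5\, \psi\right) = \frac{\partial \bar\psi}{\partial x^\mu}\, \gamma^\mu\, \gamma_5\, \psi - \bar\psi\, \gamma_5\, \gamma^\mu\, \frac{\partial \psi}{\partial x^\mu} = 2\, i\, m\, \bar\psi\, \gamma_5\, \psi. \]

The right side vanishes identically only for \(m = 0\ .\) The same steps applied to the vector current give \(i\, m\, \bar\psi \psi - i\, m\, \bar\psi \psi = 0\) for any mass.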

To encounter anomalies, we enlarge the massless model by introducing a coupling to a vector gauge field \(A_\mu\ ,\) treated for the moment as an externally prescribed quantity, without dynamics. Eq. (1) is now replaced by

\[\tag{19} i \, \gamma^\mu\ \left(\frac{\partial}{\partial x^\mu} + i A_\mu (x) \right) \ \psi(x) = 0. \]

A superficial examination of the system leads to the conclusion that the previous symmetries, (4) and (18), continue to hold; indeed (4) can be generalized to a "local" gauge symmetry with \(\theta (x)\) acquiring a space-time dependence, provided \(A_\mu\) is also transformed.

\[\tag{20} A_\mu (x) \to A_\mu (x) - \frac{\partial}{\partial x^\mu}\ \theta (x) \]

[When the transformation parameter \(\theta\) is position independent, as previously in (4) and (18), the symmetry is a "global" gauge symmetry.]
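That (4), with position-dependent \(\theta (x)\ ,\) and (20) together leave (19) form invariant can be seen in one line. The following is a sketch of the standard argument, applied to the combination appearing in (19):

\[ \left(\frac{\partial}{\partial x^\mu} + i A_\mu - i\, \frac{\partial \theta}{\partial x^\mu}\right) e^{i \theta}\, \psi = e^{i \theta} \left(\frac{\partial}{\partial x^\mu} + i\, \frac{\partial \theta}{\partial x^\mu} + i A_\mu - i\, \frac{\partial \theta}{\partial x^\mu}\right) \psi = e^{i \theta} \left(\frac{\partial}{\partial x^\mu} + i A_\mu\right) \psi. \]

The derivative of the phase is cancelled by the shift of \(A_\mu\ ,\) so the transformed field again solves (19).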

Correspondingly one would conclude that, even in the presence of \(A_\mu\ ,\) the chiral charges \(Q_\pm\) remain time-independent and the vector (6) and axial vector (15) currents still satisfy the continuity equations (7) and (16).

But these conclusions are valid only if the \(\psi\) fields are classical functions and not quantum field operators. For the latter, the problem resides in the fact that the fundamental quantization condition for Dirac-Fermi fields

\[\tag{21} \begin{array}{lll} \psi^\dagger_m (t, \mathbf{r}) \, \psi_n (t, \mathbf{r}^\prime) + \psi_n (t, \mathbf{r}^\prime)\, \psi^\dagger_m (t, \mathbf{r})\\[1ex] \qquad \quad = \delta_{mn}\, \delta^3\, (\mathbf{r} - \mathbf{r}^\prime) \end{array} \]

implies that the product of \(\psi^\dagger \ \mbox{and}\ \psi\) at the same space-time point is necessarily singular. [In the above (\(m, n\)) label the components of \(\psi^\dagger \ \mbox{and}\ \psi\ .\)] Since the charges and currents involve bilinears of the Dirac-Fermi fields at the same space-time point, they are necessarily ill-defined in the quantum theory. As mentioned previously, a regularization and renormalization is needed to render the currents well-defined. But it turns out that every regularization/renormalization method in the presence of the vector field \(A_\mu\) violates the symmetries that are present in the unquantized theory. It is possible to preserve (4) or (18) [or a linear combination of the two] but not both.

Since the preservation of both symmetries is impossible, one must choose which of them to preserve. The choice is dictated by the physical context of the theory under examination. Since local gauge symmetries, as in (4) and (20), are frequently needed for consistency of the theory (as in the standard model of particle physics), they are the ones that are preserved, while global axial gauge symmetries, as in (18), are abandoned: they become beset by anomalies.

Physical Consequences of Axial Symmetry Anomalies

For the example (19) given above, preserving the local gauge symmetry has the consequence that in the regulated/renormalized quantum field theory the charge (5) remains conserved and the vector current (6) continues to satisfy the continuity equation (7). Correspondingly the axial charge (17) acquires a time dependence and the axial vector current (15) obeys an anomalous continuity equation. Its form is

\[\tag{22} \frac{\partial}{\partial x^\mu}\ J^\mu_5\, (x) = \frac{N}{8 \pi^2}\ {^\ast F}^{\mu\nu} (x)\, F_{\mu\nu}\, (x), \]

where \(F_{\mu\nu}\) is the field strength constructed from \(A_\mu\ :\)

\[\tag{23} F_{\mu\nu} (x) \equiv \frac{\partial}{\partial x^\mu} \ A_\nu (x) - \frac{\partial}{\partial x^\nu}\ A_\mu (x) \]

and \({^\ast F}^{\mu\nu}\) is its dual.

\[\tag{24} {^\ast F}^{\mu\nu} \equiv \frac{1}{2}\ \varepsilon^{\mu\nu\alpha\beta}\, F_{\alpha\beta} \]

\(N\) is a numerical constant which is determined by the number and strength of Dirac-Fermi fields coupling to \(A_\mu\ .\) For the single field of our example, \(N=1\ .\) While we have taken \(A_\mu\) to be externally prescribed, it has been shown that the result (22) holds with dynamical \(A_\mu\ .\) The occurrence of the symmetry anomalies leads to a variety of effects in the standard particle physics model.
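For an Abelian field the right side of (22) can be rewritten in terms of electric and magnetic fields. The sketch below assumes the conventions \(\varepsilon^{0123} = +1\ ,\) \(F_{0i} = E_i\) and \(F_{ij} = -\varepsilon_{ijk} B_k\) (none of which are fixed by the text; other conventions flip the overall sign) and checks numerically that \({^\ast F}^{\mu\nu} F_{\mu\nu} = -4\, \mathbf{E} \cdot \mathbf{B}\ .\) Function names are illustrative.

```python
# Contract the dual field strength with the field strength for given E and B.
# Assumed conventions: epsilon^{0123} = +1, F_{0i} = E_i, F_{ij} = -eps_{ijk} B_k.

def levi_civita(indices):
    """Sign of a permutation of distinct integers; 0 if any index repeats."""
    if len(set(indices)) != len(indices):
        return 0
    sign = 1
    for i in range(len(indices)):
        for j in range(i + 1, len(indices)):
            if indices[i] > indices[j]:
                sign = -sign  # each inversion flips the parity
    return sign

def field_strength(E, B):
    """Lower-index F_{mu nu} as a 4x4 nested list."""
    F = [[0.0] * 4 for _ in range(4)]
    for i in range(3):
        F[0][i + 1] = E[i]
        F[i + 1][0] = -E[i]
        for j in range(3):
            F[i + 1][j + 1] = -sum(levi_civita((i, j, k)) * B[k] for k in range(3))
    return F

def dual_contraction(E, B):
    """Compute *F^{mu nu} F_{mu nu} with *F^{mu nu} = (1/2) eps^{mu nu a b} F_{a b}."""
    F = field_strength(E, B)
    total = 0.0
    for mu in range(4):
        for nu in range(4):
            for a in range(4):
                for b in range(4):
                    total += 0.5 * levi_civita((mu, nu, a, b)) * F[a][b] * F[mu][nu]
    return total

def e_dot_b(E, B):
    return sum(e * b for e, b in zip(E, B))
```

For example, `dual_contraction((1, 0, 0), (1, 0, 0))` evaluates to `-4.0`, while perpendicular fields give zero. In these conventions the anomalous divergence is therefore nonzero only where the electric and magnetic fields have a component along each other.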

On the one hand, the standard model appears to possess symmetries that are not present in Nature, not even approximately. These classical, global gauge symmetries, if present in the quantized theory, would forbid the decay of a (massless) neutral pion to two photons. But the physical pion's mass can be accurately described as (approximately) vanishing, yet the decay width is not negligible.

\[\tag{25} \Gamma \, (\pi^0 \to 2\gamma) \approx 8.4 \, \mbox{eV} \]

Also the same symmetries predict the existence of a neutral pseudoscalar meson, approximately degenerate with the pion. But no such particle has been observed. It is fortunate that the anomalies in the quantized standard model remove the offending global gauge symmetries. Indeed because the strength of the axial anomaly is known, one can calculate the width for neutral pion decay (for massless pions). One finds \(7.725\, \mbox{eV}\ ,\) or \(8.1 \, \mbox{eV}\ ,\) when mass corrections are included. Moreover, this excellent agreement with (25) requires that there be three colors of Fermions. Thus the axial anomaly in the global gauge symmetry not only determines neutral pion decay and cancels the prediction of an unwanted partner meson, but also gives an indirect determination of the number of color degrees of freedom. Furthermore, the standard model possesses an anomaly in the continuity equation for the fermion number current, thereby allowing proton decay. While this startling result establishes that in our present theory stability of matter is not absolute, there is no practical significance because the predicted decay rate is negligible \(^3\ .\)
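The size of the anomaly-determined width and its sensitivity to the number of colors can be illustrated with the standard leading-order expression \(\Gamma = (N_c/3)^2\, \alpha^2 m_\pi^3 / (64 \pi^3 f_\pi^2)\ .\) This formula and the numerical inputs below are not taken from the article; the inputs are assumed, commonly quoted values, and the \((N_c/3)^2\) scaling assumes the quark charges are held fixed at \(2/3\) and \(-1/3\ .\)

```python
import math

# Leading-order anomaly estimate of the neutral pion width.
# All inputs are assumed, commonly quoted values (not from the article).
ALPHA = 1 / 137.036   # fine-structure constant
M_PI0_MEV = 134.977   # neutral pion mass in MeV
F_PI_MEV = 92.4       # pion decay constant in MeV (convention with f_pi near 92 MeV)

def pi0_width_ev(n_colors=3):
    """Width in eV; scales as (n_colors / 3)**2 at fixed quark charges."""
    base_mev = ALPHA ** 2 * M_PI0_MEV ** 3 / (64 * math.pi ** 3 * F_PI_MEV ** 2)
    return (n_colors / 3) ** 2 * base_mev * 1e6  # convert MeV to eV

if __name__ == "__main__":
    for n in (1, 3):
        print(n, "colors:", round(pi0_width_ev(n), 2), "eV")
```

With these inputs three colors give a width between 7 and 8 eV, while a single color would give a value nine times smaller, far from (25).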

On the other hand, local gauge symmetries must be preserved for consistency of the standard model. This is achieved by adjusting the Fermion content (quarks and leptons) so that possible anomalies cancel. This requirement is met if quarks are matched with leptons, and thus the heaviest "top" quark was predicted to exist once the "bottom" quark was discovered, in order that in the third family of Fermions quarks matched the tau leptons. A similar anomaly cancellation requirement was found in string theory and led to the revival of that subject.

These physically important effects vividly demonstrate that quantum symmetry anomalies are not obscure pathologies of the quantum mechanical formalism, but describe in a paradoxical-anomalous fashion aspects of natural phenomena.

Mathematical Connections to Axial Symmetry Anomalies

The discovery of the field theoretic structures associated with axial anomalies seeded an intense interaction between physicists and mathematicians, who for their own purposes had been working with related quantities. The connection arises when the previously described formulas are generalized to incorporate a non-Abelian Lie algebra and group; this is the Yang-Mills theory. To this end, we remain with the massless Dirac equation (19), but replace the function \(i A_\mu\) by a Lie-algebra valued matrix quantity \(A_\mu \equiv \sum\limits_a\, A^{\ a}_{\mu}\, T_a\ ,\) where \(T_a\) are anti-Hermitian representation matrices satisfying the Lie algebra commutators with structure constants \(f _{a b} ^{\ \ c}\ :\)

\[\tag{26} [T_a, T_b] = \sum\limits_c\, f _{a b} ^{\ \ c}\ T_c, \]

and are normalized by \(t r\ T_a \, T_b = - \delta_{a b}/2\ .\) (For \(SU(2)\ ,\) \(T_a = \sigma_a/2i\ ,\) where \(\sigma_a\) are the Pauli matrices.) The Dirac spinors \(\Psi\) acquire additional components, which are acted upon by the representation matrices.

\[\tag{27} i\, \gamma^\mu\, \left(\frac{\partial}{\partial x^\mu} + A_\mu (x)\right) \, \Psi (x) = 0 \]
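The \(SU(2)\) example quoted above can be verified directly: with \(T_a = \sigma_a/2i\) the generators are anti-Hermitian, satisfy (26) with structure constants equal to the Levi-Civita symbol, and obey the stated trace normalization. The following is a minimal sketch; the function names are illustrative.

```python
# Check the SU(2) generators T_a = sigma_a / (2i) against Eq. (26)
# and the normalization tr(T_a T_b) = -delta_ab / 2.

SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]  # Pauli matrices
T = [[[x / 2j for x in row] for row in s] for s in SIGMA]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def eps3(a, b, c):
    """Levi-Civita symbol for indices drawn from 0, 1, 2."""
    return (a - b) * (b - c) * (c - a) // 2

def su2_checks(tol=1e-12):
    """Return a dict mapping each property to True/False."""
    anti_hermitian = all(
        abs(T[a][j][i].conjugate() + T[a][i][j]) <= tol
        for a in range(3) for i in range(2) for j in range(2))
    commutators, traces = True, True
    for a in range(3):
        for b in range(3):
            ab, ba = matmul(T[a], T[b]), matmul(T[b], T[a])
            for i in range(2):
                for j in range(2):
                    rhs = sum(eps3(a, b, c) * T[c][i][j] for c in range(3))
                    if abs(ab[i][j] - ba[i][j] - rhs) > tol:
                        commutators = False
            trace = ab[0][0] + ab[1][1]
            if abs(trace - (-0.5 if a == b else 0.0)) > tol:
                traces = False
    return {"anti-Hermitian": anti_hermitian,
            "[T_a, T_b] = eps_abc T_c": commutators,
            "tr T_a T_b = -delta_ab / 2": traces}
```

All three entries of `su2_checks()` should be `True`. The negative trace reflects the anti-Hermitian convention used here.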

The singlet axial vector current \(J^\mu_5\) obeys the anomalous continuity equation

\[\tag{28} \frac{\partial}{\partial x^\mu}\ J^\mu_5\ (x) = \frac{1}{8 \pi^2}\ \ t r \, {^\ast F^{\mu\nu}} (x)\, F_{\mu\nu} (x) , \]

where \(F_{\mu\nu}\) is now the non-Abelian field strength (Yang-Mills curvature).

\[\tag{29} F_{\mu\nu} (x) \equiv \frac{\partial}{\partial x^\mu}\ A_\nu (x) - \frac{\partial}{\partial x^\nu} \ A_\mu (x) + [A_\mu (x), A_\nu (x)] \]

(Anomalies also beset non-singlet currents \(J^\mu_{5 \, a}= \Psi^\dagger\, \gamma^0\, \gamma^\mu\, \gamma_5\, T_a\, \Psi\ ,\) but these will not be discussed here.) Also, for the mathematical discussion we pass from Lorentzian to Euclidean signature, \(g_{\mu\nu} = \mbox{diag}\ (1, 1, 1, 1)\ .\)

The mathematical connection is made evident by (28), the generalization of (22), where on the right side occurs the Pontryagin density \(\mathcal{P}\ ,\)

\[\tag{30} \mathcal{P} \equiv -\frac{1}{16 \pi^2}\ t r\ {^\ast F^{\mu\nu}}\, F_{\mu\nu}, \]

whose 4-dimensional integral measures the topological properties of the Yang-Mills gauge potentials \(A_\mu\) (connections) and fields \(F_{\mu\nu}\) (curvatures) that enter in \(\mathcal{P}\ .\) For the integral to converge, \(F_{\mu\nu}\) must tend to zero at infinite argument. This means that \(A_\mu\) must tend to a pure gauge \(g\ ,\) which is group valued,

\[\tag{31} A_\mu (x) \to g^{-1} (x)\ \frac{\partial}{\partial x^\mu} \ g (x) \]

and \(g\) is restricted to tend to the identity. Gauge functions \(g\) with this restriction fall into equivalence (homotopy) classes labeled by integers, and gauge functions in different classes cannot be deformed into each other. That integer \(n\) is given by the Pontryagin number

\[\tag{32} n = \int d^4 x \mathcal{P} \]

While \(\mathcal{P}\) is gauge invariant, it can also be presented as the divergence of a gauge variant 4-vector \(K^\mu\ ,\) called the topological current or the Chern-Simons current.

\[\tag{33} \mathcal{P} (x) = \frac{\partial}{\partial x^\mu}\ K^\mu (x) \ :\]

\[\tag{34} K^\mu (x) = -\frac{1}{4\pi^2} \ \varepsilon^{\mu\alpha\beta\gamma}\ t r \ \left[\frac{1}{2}\ A_\alpha (x)\ \frac{\partial}{\partial x^\beta} \ A_\gamma (x) + \frac{1}{3}\ A_\alpha (x) \, A_\beta (x)\, A_\gamma (x)\right] \]

Consequently, the 4-dimensional volume integral of \(^\ast F^{\mu\nu} F_{\mu\nu}\) in (32) can be written as an integral of \(K^\mu\) over the 3-dimensional surface (at infinity) bounding the 4-dimensional volume. There the vector potentials in \(K^\mu\) are replaced by their asymptotic form (31), and the resulting integration gives the integer \(n\) that characterizes the winding number, the homotopy class, of \(g\ .\)

The Pontryagin quantity is a topological entity for various reasons. We have seen already that it is determined by the asymptotic behavior of gauge functions, which fall into distinct classes labeled by integers. Also the integral (32) does not require specifying the geometry of the integration volume --- even with non-trivial geometries no metric tensor is required in (32). Finally one can check that (32) is invariant against local variations of \(A_\mu\ .\)

While gauge field configurations with non-vanishing Pontryagin number are easily constructed, especially interesting is a class of connections that satisfy

\[\tag{35} ^\ast F^{\mu\nu} = \pm\, F^{\mu\nu}. \]

These are called instantons, and by virtue of the Bianchi identity,

\[\tag{36} D_\mu\, {^\ast F^{\mu\nu}} = 0, \]

they satisfy the Yang-Mills equation of motion.

\[\tag{37} \begin{array}{lll} D_\mu\, F^{\mu\nu} = 0\\[1ex] \left[D_\mu \ldots \equiv \frac{\partial}{\partial x^\mu} + [A_\mu, \ldots]\right] \end{array} \]
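The step from (35) and (36) to (37) is immediate and can be written out as follows (a sketch, using only the equations quoted above): inserting the (anti-)self-duality condition (35) into the left side of (37) gives

\[ D_\mu\, F^{\mu\nu} = \pm\, D_\mu\, {^\ast F^{\mu\nu}} = 0, \]

where the last equality is the Bianchi identity (36). Thus the first-order condition (35) implies the second-order equation of motion, although not every solution of (37) need satisfy (35).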

The physical interpretation of instantons is that they provide a semi-classical signal for the occurrence of quantum tunneling; here it is the tunneling between homotopy classes of gauge fields. Indeed the previously mentioned proton instability is understood as arising from such tunneling; that is why its magnitude is exponentially small and therefore negligible. In detail, the homotopy structure in the gauge theory is analogous to the periodicity of a crystal, and the Yang-Mills theory acquires an unexpected \(\theta\)-parameter, analogous to the Bloch momentum of a Bloch wave. Equivalently, one recognizes that the quantum Yang-Mills action possesses the contribution \(\theta \int d^4 x \mathcal{P} (x)\ .\) Since \(\mathcal{P}\) is a total divergence, this does not affect classical equations of motion, but influences the quantum theory. Since \(\mathcal{P}\) is odd under CP transformation, the new term is a source of CP violation, which is only a very weak effect in Nature. This leads to an outstanding puzzle about the standard model: what determines the tiny magnitude of \(\theta\ ?\)

The Pontryagin index also carries information about the Dirac equation (27) (in Euclidean space). For generic \(A_\mu\ ,\) solutions of (27) are not normalizable. However, for particular forms of \(A_\mu\ ,\) normalizable solutions may exist; they possess definite chirality, say there are \(n_+ (n_-)\) of positive (negative) chirality. The celebrated Atiyah-Singer index theorem gives a formula for the "index" of the Dirac operator, i.e. for \(n_+ - n_-\ .\)

\[\tag{38} n_+ - n_- = \int d^4 x \mathcal{P} = n \]

We thus recognize that the anomaly equation (28) is a local version of the Atiyah-Singer index theorem.

The topological Chern-Simons current (34) also enjoys a physical role. By selecting a single, definite component to be a contribution to a physical Lagrangian in 4-dimensional space-time, one constructs a theory that violates Lorentz invariance. These days there is great interest in the possibility of (feeble) Lorentz invariance violation, and the topological entities arising from axial anomalies provide an attractive realization of the idea, for which thus far there is no experimental evidence.

For another application of the Chern-Simons term with Lorentzian signature, one chooses a single, definite component, say the third, \(z\ ,\) component, and suppresses dependence of the vector potentials on that variable, \(x^3 = z\) in the example. One then has in hand a quantity defined on (2+1)-dimensional space-time, which can be used as an addition to any (2+1)-dimensional Lagrange density, describing physics on a plane. The new term is interesting in that it is not gauge invariant, but its variation is gauge covariant. So the equations of motion remain gauge covariant, and the Chern-Simons contribution provides a mass term for the gauge field, while retaining gauge invariance. These structures (mainly in their Abelian version) have been used in analyses of the quantum Hall effect.

The discussion has been concerned with gauge fields and Dirac-Fermi fields. Analogous effects are found with gravitational fields, with the gravitational connection (Christoffel or spin) taking the role of the gauge potential and the Riemann tensor replacing the gauge field strength. Again one finds anomalies involving the gravitational Chern-Pontryagin term. There is a gravitational Chern-Simons current, which may be used to build a Lorentz symmetry violating gravity model, or may be a contribution to a (2+1)-dimensional gravity theory, where the gravitons preserve diffeomorphism invariance, but are massive. (2+1)-dimensional gravity has a physical realization in descriptions of planar motion in the presence of cosmic strings.

The unexpected mathematical properties of the axial anomaly exhibit deep mathematical features in our description of Nature, in its fundamental workings. It is remarkable that these features find their realization in anomalies of the quantum mechanical formalism.


1. J.S. Bell and R. Jackiw, "A PCAC Puzzle: π0→γγ in the σ-model," Nuovo Cim. A 51, 47 (1969); S.L. Adler, "Axial vector vertex in spinor electrodynamics," Phys. Rev. 177, 2426 (1969).

2. B. Holstein, "Anomalies for Pedestrians," Amer. Jnl. Phys. 61, 142 (1993).

3. G. 't Hooft, "Symmetry breaking through Bell-Jackiw anomalies," Phys. Rev. Lett. 37, 8 (1976).

Further Reading

  • R. Bertlmann, Anomalies in quantum field theory, (Oxford, New York, 1996).
  • K. Fujikawa and H. Suzuki, Path integrals and quantum anomalies, (Oxford, New York, 2004).
  • S. Weinberg, The quantum theory of fields, (Cambridge, New York, 1995).
  • S.L. Adler, "Anomalies to all orders" in Fifty Years of Yang-Mills Theory, G.'t Hooft ed. (World Scientific, Singapore, 2005, p. 187).
  • R. Jackiw, "Fifty Years of Yang-Mills theory and our moments of triumph," (idem, p. 229).
