Magnetism: mathematical aspects
From Scholarpedia
| This article has not been published yet; It may be unfinished, contain inaccuracies or unapproved changes. | |||||||||||||||||||||
Author: Prof. Daniel C. Mattis, Dept. of Physics, University of Utah,Salt Lake City, UT 84112-0830
Already as an infant, Albert Einstein (Nobel prize winner, 1921) wondered about the physics of magnetic “action at a distance”. His was not the only brilliant mind entranced by magnetic phenomena. Among other Nobelists who have made incidental or even major contributions to our understanding of this field we note (along with the year of their prize) the names of H. Lorentz and P. Zeeman (1902), P. Curie (1903), W. Heisenberg (1932), W. Pauli (1945), F. Bloch (1952), C.N. Yang (1957), H. Bethe (1967), L. Néel (1970) and P.W. Anderson (1977). The list grows longer if we include the ancillary topics of magnetic resonance, Hall effect, and superconductivity or developments in unrelated fields, such as the concept of the Goldstone mode in high-energy physics inspired by the spin waves of the theory of magnetism. It is not a coïncidence that in "statistical mechanics", which comprises the study of physics at finite temperature, contemporary concerns such as phase transitions and their critical exponents evolved out of the corresponding microscopic properties of magnetic materials at the Curie point. In short, much of the mathematics that was originally developed to unveil the sources of magnetism found subsequent applications in other branches of theoretical physics and – in the case of the Ising model – far afield in cryptology, epidemiology, economics, political science and even sociology!
In this article we show how theories of magnetism are classified according to their internal and external symmetries, spatial dimensionality and various other physical properties. But from the outset, it is important to clarify to the reader that we are only seeking to understand the material origins of magnetic phenomena, insofar as they might originate in the many-body comportment of electrons in atoms, molecules and solids. This is quite distinct from studies of the resulting electromagnetic field, whether this last is treated in the classical version due to Maxwell, dating back to the mid-19th Century, or in the quantized (QED) version developed in the middle of the 20th‚ the more so when we turn to antiferromagnets in which the concomitant magnetic fields cancel already at a microscopic level.
What we present below is a small survey of a few idealized models of magnetism culled from an incredibly large array; we discuss the motivation behind them together with the interesting mathematics that arises in the course of solving these many-body problems.
Effects of spatial dimensionality and of various symmetries
Setting aside Dirac's hypothetical magnetic monopole and the fragile current loop of Ampère, the leading physical source (and the unit) of magnetism has to be the permanent magnetic dipole‚ such as the elementary "Bohr magneton" carried by each and every electron. It is sometimes useful to idealize magnetic materials as periodic arrays of unit cells each of which contains one or more atoms or molecules sporting one or more Bohr magnetons.
As example of this periodicity imagine a family of "hypercubic solids" in arbitrary dimensions, ranging from
(just 1 cell), to
cells in
, and up to unphysically high dimensions
. Each cell has
nearest-neighbors and
is large. The lattices are called the linear chain (
) in 1D, the square (or simple quadratic
) in 2D, the simple cubic (
) in 3D, etc. In the
, identical cells lie along a given axis at points
with
their label and
the lattice parameter. Thus
applies to an hypothetical, ideal, polymer of great length. Cells of the
lattice are located at
and those of the
lattice are at
, etc.
It is found that the leading term of various correlation functions of particles confined to such lattices, when expanded in powers of
, yield formulas identical to those obtained in the self-consistent "mean-field", aka "molecular field" approximation. "Critical properties", that is, the singular contributions to thermodynamic functions (specific heat, susceptibility, etc.) near a phase transition, can sometimes be extrapolated to 3D from
using the renormalization group (RG) rigged for
‚ upon expanding in powers of
. This has justified the study of leading large
approximations, even though physical limitations restrict magnetic systems as well as other materials to
.
To further define a physical model it is necessary to specify the contents of each cell and the interactions among neighboring cells. For many models it is possible to solve the multi-cell model in
dimensions (or at the very least, to analyze its thermodynamical properties) by a "transfer matrix approach" on a
dimensional lattice. More precisely, the partition function in d dimensions is related to the largest eigenvalue of a transfer operator on a lattice in
dimensions. The eigenvalue problem that needs be solved is usually trivial in
but is, with some exceptions, too cumbersome to carry out for
.
On the other hand and at the opposite extreme, for
or 5 or even greater (
), the statistical mechanics and phase diagram of many models of magnetism become easily solvable and predictable in leading
approximation, just as is the case in quantum field theories.
Clearly, whether
is large or small is important. Just as single strand polymers differ from 3D solids even when the constituent atoms are identical, the properties of magnetic polymers differ from those of magnetic solids. Once
is specified, the dynamics of the individual magnetic moments (point-group symmetry) together with the symmetries of their interactions (bonds) are what determine the collective properties in the ground state and at finite
. These symmetries and dynamics fall into various classes or categories.
Typically, once the class or category of the model is given, it is in the three-dimensional world
in which we live that it is most difficult to find precise or merely reliable mathematical solutions. That is what keeps theoretical physicists in business. As an example let us next examine arguably the simplest model of anisotropic nearest-neighbor interactions among quantized spins on lattices in various dimensions
, the one named after E. Ising who first studied it in the 1920's.
The one-dimensional Ising model vs. other possibilities
Ising’s model was based in the “old” quantum theory in which spins, affixed to pounts on a space lattice, can only point “up” or “down.” This discrete algebra is denoted Z(2). In
the Hamiltonian is
(where each
).
The Ising model is better described in terms of Pauli matrices
with
absorbing the factor
as follows: each spin at
is assumed to interact with nearest-neighbors at
via a highly anisotropic
“exchange” matrix characterizing the nearest-neighbor bond, which then assumes the appearance:
. The total interaction with an external or self-consistently generated magnetic field is given by
if it is “longitudinal” or
if “transverse.” Also
is the Hamiltonian of an individual nearest-neighbor bond in Heisenberg’s model, with
the interaction with any orientation external field (as here, by symmetry, there can be no distinction between parallel and transverse.)
Finally, the interaction
describes the “X-Y” model in which we again need distinguish “in-plane” from “out-of-plane” external fields. These three models are discussed separately below.
The most general bilinear Hamiltonian for a bond connecting two sites is given by an Hermitean
matrix
. This generalization has 9 independent parameters, with the Ising version in its most anisotropic limit, all the way to the totally isotropic Heisenberg model,
, representing isotropic limit, given that it is explicitly invariant under arbitrary spatial rotations. (
if
and 0 otherwise is Kronecker’s delta.) Additionally, quartic forms for two-site interactions and interactions that involve three sites, have sometimes been considered in the literature in connection with various physical applications, but they are not discussed further here.
Ground states, elementary excitations and Tc
The spin
Ising
of
sites connected by
(ferromagnetic) bonds
was described above. The combined ground state (lowest energy) solution of the Hamilton or Schrödinger equation
is
for either of two ground state configurations: all spins “up” (+1) or all “down” (–1). Either configuration manifests perfect long-range order (LRO).
The next higher energy levels are associated with a single “domain wall” defined as follows: the first
spins are all parallel, say “up,” followed by spins numbered
, that are all “down.” Because the
bond is promoted from energy
to energy
, the energetic cost of this defect is
. There are
possible values of
, that is,
distinct positions on which to place the break. A second break can occur at
(
where
; a third one at some third distinct place, etc. Because a bond can only be broken once a sort of “exclusion principle” prevents any two breaks from sharing the same bond. Thus it appears that the domain walls are fermions, unusual only in the sense that each such fermions adds a constant amount
to the total energy – regardless how many others are present. The corresponding Boltzmann factor of each is
. It follows that the entropy
(
= Boltzmann’s constant natural logarithm of the number of allowed configurations) is the sum of
(recall: initially there were two configurations) and of
. The free energy
becomes:
The last term can be neglected in the large
limit. From this one can infer that at temperature
the thermodynamic average of the number
of domain walls is,
in which the second factor is the “Fermi-Dirac distribution function” (being the average number of fermions that lives on each of the
bonds in thermal equilibrium at temperature
.) Both
and
are analytic in
except in the limit
, signaling there is an order-disorder thermodynamic phase transition at
. The model can be “solved” more formally and the same results obtained more directly following a nonlinear transformation to bond variables. Let
, … , each
independent of the others. Then
. This is just an Hamiltonian of
noninteracting pseudo-spins in a pseudo external magnetic field
. Because
has two values but does not explicitly enter
, each configuration
is two-fold degenerate. This accounts for
in (1).
In
, Ising’s model exhibits a genuine phase transition; the second derivative of
is discontinuous at a finite temperature
identified as the “Curie point”
, a quantity proportional to
that is also a function of
. Table I lists
to three decimal places in terms of
of nearest-neighbor sites on hypercubic lattices.
|
More on Tc
The critical temperature of zero in
shown in Table I was obtained trivially. The nonvanishing values of
on any of the three standard lattices in
(the
, honeycomb and triangular) can also be found exactly using the duality relations of Kramers and Wannier. Duality is what relates
on a spin lattice to
on a dual lattice that is constructed on the bonds of said lattice. The
lattice is self-dual with coördination number
and the triangular and honeycomb lattices are duals of each other with
and 3 respectively. The duality relations
and
were originally derived by comparing high temperature series expansions of the partition function with low
expansions, without needing to actually evaluate either sum. Armed with just this sort of information, Onsager showed that
is given by
for Ising ferromagnets on any one of the 3 principal lattices in 2D.
There is no such formula for the values of
listed in Table I for the Ising ferromagnet in
and
dimensions, so there the values of
have to be obtained numerically.
Symmetry breaking
The exact eigenvalue of the transfer matrix of the Ising model on a
lattice, hence the exact evaluation of the partition function and of the free energy in this model, including the particulars of the singular phase transition (both specific heat and magnetic susceptibility diverge at
,) were first obtained by L. Onsager in the early 1940’s by the use of spinor algebra. A subsequent version based on a more familiar fermion field theory was constructed by T.D. Schultz, D.C. Mattis and E.H. Lieb and will be sketched below. But it seems that some properties of these exact solutions generalize to all model ferromagnets in arbitrary dimensions, viz.:
Above
in zero external magnetic field, the “up-down” symmetry is maintained perfectly. When cooling below
this symmetry is spontaneously broken by the onset of LRO. It is noteworthy that the application of an homogeneous external magnetic field (B), which also breaks the symmetry of the ferromagnet, also creates LRO in any dimension
. It follows that a finite real external magnetic field pushes the critical temperature all the way up to
.
Stated otherwise: regardless of the ground state and of the nature of the low-temperature phase of a magnetic substance(and regardless of spatial dimension
,) in the presence of a finite, real, external field
there will be created LRO at all finite
. The only interesting question is, how does this magnetic order behave as function of
in the limit
? If it remains finite in that limit, we have spontaneous ferromagnetism; if in that limit it vanishes, the information to be sought is in the magnetic susceptibility, a quantity related to the short-range order.
Transfer matrix in d = 1 Ising model
In Gibbsian statistical mechanics, the partition function
is related to the free energy
by
, hence knowledge of the one yields the other. The temperature is given by
. The trace (abbreviated:
) is defined as the sum over all diagonal elements of the argument, treated as a matrix. Because the number of such terms is exponential in
this sum cannot be performed efficiently in the limit
, especially near
. (Otherwise one could obtain, for example, the energy as a function of temperature
and other thermodynamic quantities, numerically.) The following calculation, carried out explicitly for the Ising
, shows how to get around this difficulty in 1D.
In the 1D model with
, the quantity inside the
operation can be written as
, upon ordering interactions consecutively. We note that each factor has exactly the same form,
. (Here
is the unit
matrix.) It follows that,
where a dot "
" indicates ordinary matrix multiplication. The
's are the two eigenvalues of the
“transfer matrix” V, viz.,
and
. Then,
. Thus only the larger eigenvalue survives, given that the contribution of the smaller one to the partition function (and to F) is exponentially smaller when we proceed to the thermodynamic limit
.
The largest eigenvalue also acquires a special significance if we identify the spinor
as the probability of the jth spin being “up” (
) and “down”
. With
in the interval 0,1 each of these probabilities is positive and they add up to 1. The corresponding probability of the
spin being “up” or “down” is then
. We present this as a proportionality and not as an equation because the sum of the probabilities at
also need to be normalized. (In probability theory, by normalization is meant that each entry is positive and the sum of all entries is unity.) Therefore the correct equation is:
with
to be determined. In the present example it is found that the right-hand vector tends to (½,½ ) asymptotically at large
, regardless of the initial
. If the initial state belonged to the largest eigenvalue of
, which is
, then
= ½ and the preceding result holds for all
and not just asymptotically. It follows that the function
is
and that
. The other eigenvalue of
,
, belongs to a spinor (½ , –½ ) that cannot be interpreted in terms of probabilities.
Thus, the evaluation of
can be reduced to an ordinary eigenvalue problem subject to the following famous, if obvious, theorem:
- Frobenius’ Theorem: “The largest eigenvalue of a matrix of arbitrary dimension, all elements of which are positive, belongs to an eigenvector that has only non-negative elements.”
For want of a better name we shall call this the “largest eigenvector”. Because all other eigenvectors must have one or more changes of sign in order to be orthogonal to the “largest eigenvector,” it is the only eigenvector that can be normalized according to the following rule: each of its entries must lie in the interval 0,1 and the sum of all its entries must = 1. Thus normalized, the “largest eigenvector” becomes the “reduced density matrix” and its entries are probabilities. Only this “largest eigenvector” is ever needed in the calculation of
and of free energy F, whereas all eigenvectors are required in the evaluation of any nontrivial correlation function.
Application to Ising model on Sq lattice
In 2D statistical mechanics, the matrix “transferring” the
column of spins to
is a sort of “quantum”
. The rows are labeled
. All bonds
that connect spins on rows
and
within a single vertical
column must be included. Combining these with horizontal transfers of the
column into the
we obtain a complete transfer operator
. The horizontal contributions are given by the
defined just above Eq. (3). Because all references in this operator are to the
column we can omit the column index
for the sake of notational simplicity.
The second factor in the
shown above is exponentiated as follows:
after defining
, using a trivial Pauli operator identity, and defining
as before. After similarly defining
for vertical bonds, we obtain the full 2D transfer matrix (all
) in the form:
where
(5A)
It “transfers” all the spins on the
column to
. The exponent is congruent to a
Ising
with nearest-neighbor bonds
in a transverse magnetic field
, an exactly solvable model. We therefore seek to solve the eigenvalue problem:
for the largest possible value of
. This can be done following a sequence of simplifying transformations. The first of these is a global rotation about the y-axis by 90º, i.e.,
and
. After this (5A) becomes:
(5B)
It is possible to express both the spin operators
and
entirely in terms of the spinor raising/lowering operators, in such a way that the exponents are homogeneously quadratic in the
. But because these operators are neither fermions (which anticommute) nor bosons (which commute), these exponents cannot be readily diagonalized. The eigenvalues of the exponentiated quadratic forms have no obvious significances.
In fact, the
satisfy the following mixed commutation relations:
, whereas for
,
. (6)
To proceed we make use of a highly nonlinear “Jordan-Wigner” transformation. Such a mapping of fermions onto spins was originally invented in the 1920’s to prove that it was mathematically possible to construct a fermionic field theory out of an array of spins ½. Here we invert the construction, expressing each spin by a fermion operator
that carries an exponential wake made up of “earlier” fermion operators. The algebra of the fermions is postulated to be pure anticommutation:
(7)
We note the following trivial identities:
and
.
We construct the Pauli spin operators in (5B) out of such fermion field operators.
and
. (8)
The reader will want to verify that this representation of the
operators satisfies the mixed commutation relations in (6). Then, inserting (8) into (5B) with the aid of the above identities yields the transfer operator as a product of two exponential forms, each quadratic in fermion operators. Thus, the eigenvalues of the quadratic forms in the following expressions are useful in the evaluation of
.
(9A)
We expand the local operators
in plane waves,
. (For didactic reasons we have imposed periodic boundary conditions on the
of fermions, setting
, but with little additional effort solutions can be found for more general or even for arbitrary boundary conditions, or for periodic boundary conditions on the original spin operators.) Because the Fourier expansion takes the form of a unitary transformation it preserves the algebra. Therefore the a’s satisfy the same set of anticommutation relations as the c’s in Eq. (7), viz.,
with all other anticommutators = 0.
By translational invariance this procedure breaks the transfer matrix up into
noninteracting sectors, each labeled by
and containing a form bilinear in fermions:
(9B)
Each factor on the rhs takes the form,
Because factors in different k-sectors commute, the individual 4X4
’s can be diagonalized individually. We need to find the largest solution
of the equation
in each separate k-sector. Their product yields the partition function
. Consequently the free energy
, the logarithm of
, is explicitly a sum which turns into an integral in
,
The thermodynamic properties are obtained from
by successive differentiations. These derivatives of
can be calculated in closed form in terms of elliptic functions. Note that
is (correctly) extensive (proportional to the area
.) It is easily shown that the identical integral would have been obtained in the large
limit, had we transferred the rows instead of the columns. So, even though the procedure might have seemed asymmetric, actually all symmetries are preserved.
Without going into details of the evaluation, we find the free energy
has singular derivatives at
(the actual value of
is easily calculated and agrees with the earlier estimates.) Above
there is no LRO and the magnetization is zero. At or below
there develops LRO because of a two-fold degeneracy of the ground state of the transfer operator. Correlations can be calculated with the aid of Toeplitz matrix theory. Both the magnetization and the LRO increase with decreasing
until, at
, all spins are precisely parallel, all “up” or all “down”.
The two-dimensional transfer matrix of the
dimensional Ising model can also be written in the form of Eq. (5B) provided the column label
is replaced by a planar label
, with bonds to
and
. That is, the transfer matrix for the 3D Ising model can be mapped onto a two-dimensional Ising model in a perpendicular field. Because only the largest eigenvalue is needed in the calculation of
or
, a variational approximation is useful, as is the renormalization group (RG). Unfortunately the Jordan-Wigner transformation itself fails to be of help, because the exponential tails fail to cancel for half the bonds; therefore the quadratic form in spins on a 2D plane cannot be transformed into a quadratic form in either fermions or bosons. The transfer operator can, however, be reexpressed as a quartic form in fermions. Thus the 3D Ising model falls into the realm of problems (
field theories) that are generally well understood yet have not yet found an exact mathematical solution outside of approximate or RG procedures.
Other seemingly simple model ferromagnets that remain unsolved at the present time include the 2D Ising model in a real, finite, external magnetic field (whether homogeneous or staggered,) as well as most three-dimensional models of any kind.
More symmetry considerations
In the above, the spontaneous breaking of discrete up/down symmetry in the ground state at
was of no particular consequence. But what if the symmetry had been continuous? Let us consider spins that can point into any direction according to either the O(2) (circular) or O(3) (spherical) symmetries. Then the excitation spectrum above the ground state becomes gapless. Consider the following magnetic polymer (
) whose dynamics are described by the following Hamiltonian:
(10)
in which we assume the individual spins are themselves classical two- or three-dimensional vectors of unit length (all
) and not operators. The dot product (
) ensures the bond energies are scalar under rotations. This
is known as the “classical” Heisenberg Hamiltonian if the spin vectors are three-dimensional or as the “classical” “X-Y” model if the spins are constrained to lie in the x,y plane. We distinguish the classical spins here from any of the quantum versions discussed supra, in which the components of the individual spins fail to commute.
(The distinction between ferromagnetism in the extreme quantum limit of s=½ and the classical models of ferromagnetism may, in fact, be academic, as all interesting properties, correlations, etc. are qualitatively independent of the magnitudes of the spins; not so, the difference between models with Z(2), O(2) and O(3) symmetries. Such rotational symmetries, and not the quantum mechanics, seem to be a determining factor in the thermodynamics.)
The classical O(2) X-Y model is also known as the “plane rotator” model, given that bonds connecting nearest-neighbor sites
take the form
of rigid coupled pendulums.
On a
lattice, neither the O(2) nor the O(3) model can sustain LRO at any
. The lack of spontaneous symmetry breaking at any finite
in both models is the result of a rigorous no LRO theorem first proved by Hohenberg and later generalized by Mermin and Wagner to all systems having a continuous symmetry in
spatial dimensions. This theorem clearly does not apply to the Ising model because of its discrete symmetry; nor does it address the issues of the existence or nonexistence of a phase transition at finite
.
In fact, the X-Y model (but not the Heisenberg model!) does exhibit an unusual phase transition on a two-dimensional lattice, at a finite
approximately equal to
separating two disordered phases. This, the well-known “Kosterlitz-Thouless” phase transition, is a two-dimensional version of a liquid <–> vapor transition. There are many ways to examine its critical properties.
In one of them, the transfer matrix of the classical model is mapped onto a one-dimensional anisotropic Heisenberg
of spins ½ in which the J-matrix takes the form
, with the effective parameter
varying as
. The critical point is thus at
. Internal excitations (here physically interpreted as the spectrum of quantized clockwise or anticlockwise vortices) are gapped (bound) at low temperatures but have a continuous spectrum above
.
On the same two-dimensional lattice, the O(3) model remains in its high-temperature phase at all finite
without undergoing any phase transition whatever. The cause is, presumably, the high density of low-energy hedgehog-like excitations called skyrmions that are allowed in this model but not in the other.
Both models
support a gapless spectrum of spin waves at low
. In dimensions
, both O(2) and O(3) models exhibit rather ordinary order-disorder second-order phase transitions at a finite
. Now let us examine some details in
as the simplest example.
Here again the ground state energy is the same
but the excitation spectrum can now be vanishingly small, as a small twist
in the orientation
of each spin relative to its neighbor (say,
with
) only costs an energy
per bond. The imposition of any boundary conditions – say, requiring that the first and last spins be parallel – causes
to become discretized, e.g.
, with
an integer
. The ratio
is related to the wavelength of the excitation, in units of the lattice parameter
. We call this excitation a spin wave; its energy forms a quasi-continuum and vanishes as
. The lack of an energy gap in the large
limit is a feature of many field theories with continuous symmetries, as was first remarked in the 1950’s by Nambu, Goldstone, et al, by analogy with the spin waves that Bloch found 20 years earlier. Where the “Goldstone mode” relates to the spin wave spectrum, the Goldstone boson relates to the “magnon”, which is an elementary bosonic particle that results from further quantization of spin dynamics. It carries 1 unit of angular momentum
.
But just as there is an exception in particle physics, e.g. for the Higgs boson, there is one in magnetism for integer quantum spins in a 1D antiferromagnetic Heisenberg chain, where the lowest excitations have to surmount a “mass” gap. We turn to this interesting anomaly next.
Role of spin in d = 1 antiferromagnets
The antiferromagnetic Heisenberg Hamiltonian, i.e., the lc of Eq. (10) with
, in which spins are
operators, as opposed to classical vectors considered earlier, has a magnon spectrum exhibiting a finite excitation gap even at the longest wavelengths. This is quite unlike the gapless spin wave spectrum
of the ferromagnet,
, and from the excitations of the S=½ antiferromagnet – the spectrum of which,
, is calculated exactly by “Bethe’s ansatz” discussed at the end of this article. The spin wave excitations of
antiferromagnets and of classical spin antiferromagnets also are all gapless. So what happens to integer spin operators of magnitude
, to make them that different?
We start with the proof that the excitation spectrum for all half-odd-integer spins
is continuous and gapless, regardless whether the sign of
is positive or negative.
The proof in 1D is as follows: take the ground state wave function
for a
subject to periodic boundary conditions and operate on it by
in a way that distorts the
individual bond only by a small amount
, thus each of the
bonds sees its energy rise in an amount
. If
is orthogonal to
we have constructed an excited state of total energy
above the ground state. One may consider this as the variational calculation of the energy of a 1-magnon state. (The proof in
and
just extends the proof for the
to finite-width strips or cylinders.) If the spins are higher but of the form half-odd-integer, an operator
having these properties is easily constructed. In the case of integer spins, however, the corresponding operator, when applied to the ground state wave function, fails to yield a state orthogonal to the ground state and the proof fails.
This energy gap in the spectrum of integer spin antiferromagnetic
’s was first conjectured by D. Haldane and it is named after him; it turns out to exist only in 1D antiferromagnets and only if
is an integer:
. The magnitude of Haldane’s gap goes to zero as
is increased. (This is expected if the model is to approach its gapless correspondence limit smoothly.) The gap also vanishes in any dimension
. Thus we are dealing with a feature that is both interesting and fragile, which is optimum for spins
in
and will be extensively revisited at the end of this article.
We can discern the dichotomy in a
of as few as 3 spins
arrayed in a triangle. For 3 spins the Heisenberg Hamiltonian is diagonalizable:
, where
= the total combined spin.
has a maximum value 3/2 for spins S=½ and a maximum 3 for spins 1. In the ferromagnet (
) the ground states for both these values of
belongs to their respective maxima and are similar in all aspects.
In the antiferromagnetic (
) triangle of 3 spins, the ground states belong to a total spin minimum. The minima are
for spins 1, and
= ½ for spins ½. After some back-of-the-envelope exact calculations, one determines that the ground state of the three spins ½ consist of two degenerate doublets, i.e., that it is 4-fold degenerate. Thus there is no energy gap separating the two lowest-lying states.
For the antiferromagnetic triangle made of spins
, however, the ground state belongs to a unique
singlet state. All other eigenstates in this model lie at energies that are at least
higher so that here, the Haldane gap is
.
Exercise for the reader: contrast the eigenstates of
= ½ and
=1 Heisenberg antiferromagnetic chains of 4 spins when laid out on a single square plaquette, with or without diagonal linkages. (Either geometry can be done analytically in closed form.) The conclusions are quite similar. Discussion of the Haldane gap in the limit of large
is reprised near the end of this article. References to early and pertinent literature are given in ref. 2.
The antiferromagnetic
of spins
exhibits an additional idiosyncracy at large
: the two ends, at
and
respectively, act like free spins ½ in their response to external fields, as in paramagnetic resonance (EPR.) The characteristics of EPR allow one to determine the spin; the ends of a chain of spins 1 display spins ½ ! This surprising behavior was, in fact, predicted theoretically – and confirmed by experiment – almost simultaneously. Chains of
spins would, presumably, have ends that exhibit the properties of spins 1, etc. The situation is similar to that in the defective antiferromagnets discussed below, given that dangling ends of the chain at
and
can be viewed as breaks in a longer chain, or as a symmetry-breaking disruption of translational invariance, caused by cutting the bond connecting the last spin at
to the first, in a chain with periodic boundary conditions.
No magnetism at finite T in 1D
In 1D one can also obtain the free energy of the classical O(3) Heisenberg model without separate calculations of energy and entropy, using a transfer matrix for the partition function
. To within boundary terms its largest eigenvalue yields,
(11)
an expression that translates to a (thermal averaged) angle between neighboring spins of
at low
. The correlations of two spins separated by a macroscopic distance
fall off
. Thus, in this gapless one-dimensional model with continuous symmetry the ground state LRO disappears exponentially at any finite
. Following the Mermin-Wagner theorem some such result was to be expected. It is also notable that many years earlier, L.D. Landau had already observed that no model with finite range interactions can sustain LRO in 1D at any finite
.
The nature of local moments
Given all these choices of models, what are the most physically plausible magnetic contents of a unit cell? Typically, spins and angular momenta that characterize an atom or ion disappear (are quenched) in solids. The earth elements are counter-examples, in that they have unfilled f-shells that can accommodate up to 7 electrons in an orbital that is somewhat smaller than typical inter-atomic distances and which are, therefore, not much disturbed by the crystal symmetry. A factor of 7 Bohr magnetons puts these spin magnitudes largely in the classical limit, such as was assumed in the classical Heisenberg model treated above.
Actually, the Hamiltonian in Heisenberg’s 1928 model of magnetism, or what is commonly understood today to be the Heisenberg Hamiltonian, is similar to Eq. (4) in form but uses operators for the components of individual vector spins. As we know, in quantum theory all angular momenta satisfy an algebra
with 3 operator components
. In the extreme quantum limit, for a single electron, the S’s have an irreducible representation in
Pauli spin matrices that anticommute with one another but commute with operators at all other sites. For higher spins the irreducible representations are
dimensional, where
; the various components are operators that satisfy
, which is equivalent to the generic
.
Despite the introduction of operators...
Nature of magnetic interactions in cells and in metals
Antiferromagnetism and ferrimagnetism
Ferrimagnets
The Kondo lattice
Nagaoka mechanism of ferromagnetism
Frustration
Defective antiferromagnets
Decomposition of 1D quantum antiferromagnets into fermions
Conclusion
Physics and mathematics have always had a close relation, but none closer than the calculation of magnetism using concepts in analysis, number theory, algebra and group theory. We have tried to show this in the present article by concentrating on just a few of the topics that have come up in an evolving theory of magnetism. Hopefully, an even broader understanding of magnetic phenomena will follow new mathematics and mathematical concepts.
References
A. Levinovitz and N. Ringertz, Eds., The Nobel Prize, the first 100 Years, Imperial College Press, London, 2001
Daniel C. Mattis, The Theory of Magnetism Made Simple: an introduction to physical concepts and to some useful mathematical methods, World Scientific Publ. Co., Singapore, 2006; the development of various aspects of magnetism and of its associated theories is the subject of chapters 1 and 2 and an extensive bibliography leads to the original documentation.
much of this material is discussed in greater depth in the various chapts. 3 – 9 of ref. 2.
P.A.M. Dirac, Proc. Roy. Soc. (London) A123, 60 (1931), also see pp. 76,78, ref. 2. The magnetic monopole has never been observed, but the existence of just one such monopole would necessitate quantization of all electric charges in the universe, a known fact of nature – and one that is otherwise unexplained.
ref. 2, p. 31 recounts the 1920’s history of this concept that culminated in the constant , in which me is the mass of the electron
ref. 2, chapt. 8.
H. Kramers and G. Wannier, Phys. Rev. 60, 252, 263 (1941)
L. Onsager, Phys. Rev. 65, 117 (1944)
T. Schultz, D. Mattis and E. Lieb, Rev. Mod. Phys. 36, 856 (1964)
see ref. 2, chapt. 3, §3.12
As we shall see later, this statement does not apply to the special case of one-dimensional antiferromagnets, in which the magnitude of the individual spins plays an important role.
N. Mermin and H. Wagner, Phys. Rev. Lett. 17, 1133 and 1307 (1966)
D. Mattis, Phys. Lett. 104, 357 (1984)
H. Bethe, Zeit. f. Physik 71, 205 (1931), reprinted in English translation in D. Mattis, The Many-Body Problem, an encyclopedia of exactly solved models in one dimension, World Scientific Publ., Singapore, 2009 (3rd Printing with revisions and corrections.)
E. Lieb, T. Schultz and D. Mattis, Ann. Phys. (NY) 16, 407 (1961), also reprinted in its entirety in The Many-Body Problem cited above. Among recent applications, note J. Jing and H. Ma, Level Crossing and Quantum Phase Transition of XY Ring, Mod. Phys. Lett. B22, 535 (2008)
W. Heisenberg, H. Wagner and K. Yamazaki, Nuov. Cim. LIX A, (1 Feb. 1969)
We list some early papers: P. W. Anderson, Heavy-electron superconductors, spin fluctuations and triplet pairing, Phys. Rev. B30, 1549 (1984), G. Baskaran and P.W. Anderson, Gauge theory of high-temperature superconductors and strongly correlated Fermi systems, Phys. Rev. B37, 580 (1988), P. Fazekas and E. Muller-Hartmann, Magnetic and nonmagnetic ground states of the Kondo lattice, Zeit. f. Phys. B85, 285 (1991), M. Sigrist, H. Tsunetsuga and K. Ueda, Rigorous results for the one-electron Kondo lattice model, Phys. Rev. Lett. 67, 2211 (1991), J.A. White, Numerical exact diagonalization of the one-dimensional symmetric Kondo lattice, Phys. Rev. B46, 13905 (1992), P.Paul and D. Mattis, Exctinction of spin interactions in the 2D Kondo lattice, Int. J.Mod. Phys. B24, 3199 (1995), H. Tsunetsuga, M. Sigrist and K. Ueda, The ground state phase diagram of the one dimensional Kondo lattice model, Rev. Mod. Phys. 69, 809-864 (1997), S. Capponi and F.F. Assaad, Spin and charge dynamics of the ferromagnetic and antiferromagnetic two-dimensional half-filled Kondo model, Phys. Rev. B63, 155114 (2001)
Z. Gulacsi and D. Vollhardt find this in a similar model (the periodic Anderson model) that can be solved exactly; see arXiv:cond-mat/0504174v1 (7 April, 2005)
G. Forgacs, Phys. Rev. B22, 4473 (1980). See also D. Mattis and R. Swendsen, Statistical Mechanics Made Simple, 2nd Edition, World Scientific Publ. Co, Singapore, 2008, §8.12, for the explicit solution of the transfer matrix in this example and for a discussion of unfrustrated (separable model) spin glasses.
N. Nagosa, Y. Hatsugai and M. Imada, J. Phys. Soc. Jpn. 58, 978 (1989), with further refs. given in ref 2., chapt. 5.
F.D.M. Haldane, Phys. Lett. A93, 454 (1983) and Phys. Rev. Lett. 50, 1153 (1983), see also: D. Controzzi and E. Hawkins,
See also
| Invited by: | Prof. Vieri Mastropietro, Mathematics Department, Univ. Roma Tor Vergata, Italy |
| Assistant editor: | Mr. Abdellatif Nemri, Department of biological sciences, University of Montreal, Canada |
