Principle of least action
From Scholarpedia
| This article is undergoing 2 initial reviews; It may contain inaccuracies and unapproved changes made by anonymous reviewers. | ||||||||||||||||||||
Author: Dr. Chris G. Gray, Department of Physics University of Guelph
The principle of least action is the basic variational principle of particle and continuum systems. The true dynamical trajectories of a system are found by imagining all possible trajectories that the system could conceivably take, computing the action (a functional of the trajectory) for each of these trajectories, and selecting the one (or more) that makes the action "least" (actually stationary). The true trajectories are those that have least action.
Statements of Hamilton and Maupertuis Principles
There are two major versions of the action, due to Hamilton and Maupertuis, and two corresponding action principles. The Hamilton principle is nowadays the most used. The Hamilton action
is defined as an integral along an actual or virtual (trial) space-time trajectory
connecting two specified space-time events, initial event
and final event
,
- (1)
where
is the Lagrangian, and
. For most of what follows we will assume the simplest case where
, where
and
are the kinetic and potential energies, respectively; an exception occurs in the relativistic section below. In general, q stands for the complete set of independent generalized coordinates,
,
, where
is the number of degrees of freedom. Any holonomic (coordinate) geometric constraints are assumed to be taken into account by the choice of the
,
,
. Nonholonomic (velocity) geometric constraints are excluded (see below). Hamilton's principle states that among all conceivable trajectories
that could connect the given end points
and
in the given time
, the true trajectories are those that make
stationary. As we shall see, if the trajectory is sufficiently short, the action
is a minimum. In general, for long trajectories,
is a saddle point (and is never a maximum). To emphasize the particular constraint on the varied trajectories, we write Hamilton's principle as
- (2)
where the constraint of fixed time
is written explicitly, and the constraint of fixed end-positions
and
is left implicit. We will consider other variational principles below, but all will have fixed
and
(quantities other than
will also be constrained) so we will always leave the constraint of fixed
and
implicit. It is clear from (1) that S is a functional of the trial trajectory q(t), and in (2)
denotes the first-order variation in S corresponding to the
small variation
in the trial trajectory.
The second major version of the action is Maupertuis' action
, where
- (3)
where the first (time-independent) form is the general definition, with
the canonical momentum, and pdq stands for
in general. The second (time-dependent) form for
in (3) is valid for normal systems in which the kinetic energy
is quadratic in the velocity components
. The Maupertuis action principle states that for true trajectories
is stationary on trajectories with fixed end positions
and
and fixed energy
. Following our earlier conventions, we write this principle as
- (4)
Note that
is fixed but
is not in Maupertuis' principle (4), the reverse of the conditions in Hamilton's principle (2).
The two principles (2) and (4) are related by a Legendre transformation, as discussed below. Hamilton's principle is applicable to both conservative and nonconservative systems, with
then explicitly time-dependent (e.g. due to a time-dependent potential), whereas the form (4) of Maupertuis' principle is restricted to conservative systems (it can be generalized – see Gray et al 2004).
History
Various aspects of the history of action principles and variational principles in general are discussed in the historical references at the end of this article. Maupertuis' principle is older than Hamilton's principle by about a century (1744 vs 1834). Over the years, and even recently, a number of reformulations and generalizations of the two basic action principles have been given (see Gray et al 1996, 2004 for extensive discussions and references).
Euler-Lagrange Equation
Solution of the variational problem posed by Hamilton's principle (2) yields the true trajectories
. Solution of Maupertuis' variational equation (4) using the time-dependent (second) form of
in (3) also yields the the true trajectories, whereas using the time-independent (first) form of
in (3) yields true orbits, i.e. spatial shape of the true paths. The solutions can be obtained directly from the variational principles (see below), or from the solution of the corresponding Euler-Lagrange differential equations which are equivalent to the variational principles. For Hamilton's principle, the Euler-Lagrange equation of motion (usually called simply Lagrange's equation) is (see, e.g., Brizard 2008, Goldstein et al 2002)
- (5)
where we have assumed one degree of freedom for simplicity of notation; if there is more than one, an equation of the form (5) holds for each of the
,
,
. The time-dependent version of Maupertuis' principle yields the same equation of motion for the space-time trajectories
. The time-independent version of Maupertuis' principle (often called Jacobi's principle) yields (Lanczos 1970, Landau and Lifshitz 1969) corresponding equations for the spatial paths (orbits).
Strictly speaking, because the action principles are formulated as boundary value problems (q is specified at two points
and
) and not as initial value problems, there may be more than one solution: there can in fact be zero, one, two, ..., up to an infinite number of solutions in particular problems. For example, applying the Hamilton principle to the one dimensional harmonic oscillator and specifying q = 0 at t = 0 and at t = T (one period
) gives an infinite number of solutions, i.e.
with one solution for each value of the amplitude A, which is arbitrary. The same system with the constraints q = 0 at t = 0 and q = A at t = T/4 has the unique solution
, and for the constraints q = 0 at t = 0 and at t = T/4 no solution exists. In practice, one usually has initial conditions in mind, and selects the appropriate solution of the corresponding boundary value problem, or imposes the initial conditions directly on the solution of the Euler-Lagrange equation of motion.
Restrictions to Holonomic and Nondissipative Systems
The action principles (2) and (4) are restricted to holonomic systems, i.e. systems whose geometrical constraints (if any) involve only the coordinates and not the velocities. Simple examples of holonomic and nonholonomic systems are a particle confined to a spherical surface, and a wheel confined to rolling without slipping on a horizontal plane, respectively. Attempts to extend the action principles to nonholonomic systems have been controversial, and do not appear to have been successful (Papastavridis 2002).
In general, the action principles do not apply to dissipative systems, i.e. systems with frictional forces. However, for a few cases, Lagrangians for dissipative systems have been found, and Hamilton's principle then applies (for a brief review, see Gray et al 2004).
More generally, the question of whether a Lagrangian and corresponding action principle exist for a particular dynamical system, given the equation of motion and the nature of the forces acting on the system, is referred to as the "inverse problem of the calculus of variations" (Santilli 1978).
for the quartic oscillator
starting at
with
. For this particular oscillator the kinetic focus occurs at a fraction 0.646 of the half-period
, illustrated here for trajectory
. The kinetic foci of all true trajectories of this family lie along the heavy gray line, the caustic, which is a hyperbolic curve for this oscillator. Squares indicate recrossing events of true trajectory
with the other two true trajectories. (From Gray and Taylor 2007.)When Action is a Minimum
The action (either
or
) is stationary for true trajectories; it is either a local minimum, or a saddle point (the action is larger for some trial trajectories and smaller for others, compared to the true trajectory action). Action is never a maximum (nonrelativistically). We discuss here only the case of the Hamilton action S for one-dimensional (1D) systems, and refer to Gray and Taylor (2007) for discussions of Maupertuis' action
, and 2D etc. systems. For some 1D potentials
(those with
everywhere), e.g.
,
, and
, all true trajectories have minimum
. For most potentials, however, only sufficiently short true trajectories have minimum action; the others have an action saddle point. "Sufficiently short" means that the final space-time event occurs before the so-called kinetic focus event of the trajectory. The latter is defined as the earliest event along the trajectory, following the initial event, where the second variation
, for some trajectory variation.
A more intuitive definition of kinetic focus can be given. As an example, consider a family of true trajectories for the quartic oscillator with
, all starting at
at
, and with initial velocity
. Three trajectories of the family, denoted
,
, and
, are shown in Fig. 1. These true trajectories intersect each other – note the squares in Fig. 1 showing intersections of trajectories
and
with trajectory
. The kinetic focus event
of the true trajectory
, with starting event
, is the event closest to
at which a second true trajectory, with slightly different initial velocity at
, intersects trajectory
, in the limit for which the two trajectories coalesce as their initial velocities at
are made equal. Based on this definition a simple prescription for finding the kinetic focus can be given (Gray and Taylor 2007), and for a quartic oscillator trajectory starting at P(0,0) the kinetic focus Q occurs at time
, where T is the period, as shown in Fig.1 for trajectory 0.
The other trajectories shown in Fig. 1 have their own kinetic foci, i.e.
for trajectory
and
for trajectory
. The locus of all the kinetic foci of the family is called the caustic (it is an envelope), and is shown as the heavy gray line in Fig. 1.
Thus, for trajectory
in Fig. 1, if the trajectory terminates before kinetic focus
, the action
is a minimum; if the trajectory terminates beyond
, the action is a saddle point.
Relation of Hamilton and Maupertuis Principles
The Hamilton and Maupertuis principles are related by a Legendre transformation (Gray et al 1996, 2004). Recall first that the Lagrangian
and Hamiltonian
are so-related, i.e.
- (6)
If we integrate (6) with respect to
along an arbitrary virtual or trial trajectory between two points
and
, and use the definitions (1) and (3) of
and
we get
, or
- (7)
where
is the mean energy along the trial trajectory. (Along a true trajectory, with
const, (7) reduces to the well-known relation (Goldstein et al 2002)
.) From the Legendre transformation relation (7) between
and
, for conservative systems one can derive Hamilton's principle from Maupertuis' principle, and vice-versa (Gray et al, 1996, 2004). The two action principles are thus equivalent for conservative systems, and related by a Legendre transformation whereby one changes between energy and time as independent variables.
The existence in mechanics of two actions and two corresponding variational principles with a Legendre transformation between them is analogous to the situation in thermodynamics (Gray et al 2004). There, as established by Gibbs, one introduces two free energies related by a Legendre transformation, i.e. the Helmholtz and Gibbs free energies, with each free energy satisfying a variational principle which determines the thermal equilibrium state of the system.
Generalizations
If we vary the trial trajectory
in (7), with no variation in end positions
and
, the corresponding variations
,
,
and
are seen to be related by
- (8)
Next one can show (Gray et al 1996) that the two sides of (8) separately vanish for variations around a true trajectory. The left side of (8) then gives
, since
(a constant) on a true trajectory, which is called the unconstrained Hamiltonian principle. This can be written in standard form
, where
is a constant Lagrange multiplier, here determined as
(negative of energy of the true trajectory). If one constrains
to be fixed for all trial trajectories, then
and we have (
, the usual Hamilton principle. If instead we constrain
to be fixed we get (
, the so-called reciprocal Hamilton principle.
The right side of (8) gives
, which is called the unconstrained Maupertuis principle, which can also be written as
where
(duration of true trajectory) is a constant Lagrange multiplier. If one constrains
to be fixed for the trial trajectories, one gets (
, which is a generalization of Maupertuis' principle (4); we see that the constraint of fixed energy in (4) can be relaxed to one of fixed mean energy. If instead we constrain W to be fixed, we get (
, which is called the reciprocal Maupertuis principle. In these generalizations of Maupertuis' principle, conservation of energy is a consequence of the principle for time-invariant systems (just as it is for Hamilton's principle), whereas conservation of energy is an assumption of the original Maupertuis principle.
In all the variational principles discussed here, we have held the end-points
and
fixed. It is possible to derive additional generalized principles (Gray et al 2004) which allow variations
and
in the end-points.
As we shall see, these alternative formulations of the action principles, particularly the reciprocal Maupertuis principle, have advantages when using action principles to solve practical problems, and also in making the connection to quantum variational principles. We note that reciprocal variational principles are common in geometry and in thermodynamics (see Gray et al 2004 for discussion and references), but their use in mechanics is relatively recent.
Practical Use of Action Principles
Just as in quantum mechanics, variational principles can be used directly to solve a dynamics problem, without employing the equations of motion. This is termed the the direct variational or Rayleigh-Ritz method. The solution may be exact (in simple cases) or essentially exact (using numerical methods), or approximate and analytic (using a restricted and simple set of trial trajectories). We illustrate the approximation method with a simple pedagogical example and refer the reader elsewhere for more complicated examples dealing with research problems (Gray et al 1996, 2004). Consider a one-dimensional quartic oscillator, with Hamiltonian
- (9)
Unlike a harmonic oscillator, the frequency
will depend on the amplitude or energy of motion. We wish to estimate this dependence. As a trial trajectory we take
- (10)
,
where the amplitude A is regarded as known (given W and
-see eq.(12)) and where we treat
as a variational parameter; we will vary
such that an action principle is satisfied. For definiteness, we use the reciprocal Maupertuis principle
discussed in the previous section, but the other action principles can be employed similarly. From the definitions, we find the mean energy
and action
over a cycle of the trial trajectory (10) to be
Treating
as a variational parameter in (11) and applying
gives
- (13)
Substituting (13) in (11) gives for
- (14)
which can be combined with (13) to give
- (15)
i.e. a variational estimate of the frequency as a function of the energy. The approximation (15) is accurate to 0.75% and can be improved systematically by including terms
,
, etc., in the trial trajectory
.
Relativistic Systems
The Hamilton and Maupertuis principles, and the generalizations discussed above, can be put in Lorentz invariant form (Gray et al 2004). As an example of the relativistic Hamilton principle, consider a particle of mass m and charge e in an external electromagnetic field with a potential having contravariant components
, and covariant components
, where
and
(for
) are the usual scalar and vector potentials. A Lorentz invariant form for the Hamilton action for this system is (Jackson 1999, Landau and Lifshitz 1962, Lanczos 1970)
- (16)
The signs of the Lagrangian and corresponding action can be chosen arbitrarily; here we choose the sign of Lanczos (1970) in (16). The four-dimensional path in (16) runs from the initial space-time point
to the final space-time point
, with corresponding proper times
and
. Here
is the infinitesimal interval of the path (or of the proper time),
, the metric has signature (
,
,
,
), and we use the summation convention and take
(speed of light )
. S itself is not gauge invariant, but a gauge transformation
(for arbitrary
) adds only constant boundary points terms to
, so that
is unchanged. The Hamilton principle is thus gauge invariant.
If we introduce a parameter
along the four-dimensional path (a valid choice is proper time s along the true or any virtual path), we can write
in standard form,
, where
is the Lagrangian and
. The Euler-Lagrange equation yields the covariant Lorentz equation of motion
- (17)
where
, with
, and we have chosen the parameter
, the true path proper time. Specific examples, such as an electron in a uniform magnetic field, are discussed in the references (Gray et al 2004, Jackson 1999). As discussed briefly below, the equations for the field (Maxwell equations) can also be derived from an action principle.
Action principles are important also in general relativity. Note from (16) that for a special relativistic free particle the action principle
can be interpreted as a "principle of stationary proper time" (Rohrlich 1965), or more colloquially as a "principle of maximal aging" (Taylor and Wheeler 1992). The proper time is stationary, here a maximum, for the true trajectory (which is straight in a Lorentz frame) compared to the proper time for all virtual trajectories. The principle of maximal aging is also valid, for "short" trajectories, in general relativity for the motion of a particle in a gravitational field (Taylor and Wheeler 2000). For "long" true trajectories (defined above in the fifth section) the proper time is a saddle point (Misner et al 1973, Wald 1984). In general relativity the Einstein gravitational field equations can also be derived from an action principle, using the so-called Einstein-Hilbert action (Landau and Lifshitz 1962, Misner et al 1973). The latter is perhaps the best example of a case where new physics was derived from action principles, since Einstein and Hilbert were both motivated by action principles, at least partly, in establishing the field equations. There was a near miss in the case of wave mechanics, as described in the next section.
Relation to Quantum Variational Principles
We discuss here only the Schrödinger time-independent quantum variational principle; for discussion and references to various quantum time-dependent principles, we refer to Gray et al (2004). As is well known (e.g. Merzbacher 1998), the time-independent Schrödinger equation
- (18)
for the stationary states
, with energies
, is equivalent to the variational principle
- (19)
where
is the Hamiltonian operator corresponding to the classical Hamiltonian
. The subscript in (19), quantum number
, indicates a constrained variation of
such that
is the particular stationary solution selected; for example, to obtain the ground state, one could restrict the search to nodeless trial functions
. As mentioned earlier, (19) is the basis of a very useful approximation scheme in quantum mechanics (Epstein 1974), analogous to the direct use of classical action principles to solve classical dynamics problems (see above).
The reader will notice the similarity of (19) to one of the classical variational principles discussed above, i.e. the reciprocal Maupertuis principle applied to the case of stationary motions:
- (20)
The classical mean energy
is clearly analogous to the quantum mean energy
. The constraints (
in (20), n in (19)) are also analogous because at large quantum numbers we have for stationary bound motions
(Bohr-Sommerfeld), where
is Planck's constant. Thus fixed
and fixed
are equivalent, at least for large quantum numbers.
The above heuristic arguments can be tightened up. First, (20) can be derived (in simple cases) in the classical limit
from (19) (Gray et al 1996). Conversely, one can "derive" quantum mechanics (i.e. (19)) by applying quantization rules to (20) (Gray et al 1999). Schrödinger, in his first paper on wave mechanics (Schrödinger 1926a), tried to derive the quantum variational principle from a classical variational principle. Unfortunately he did not have available the formulation (20) of the classical action principle, and abandoned this route to quantum mechanics. Very quickly, in his second paper (Schrödinger 1926b), he found the route which is now in the text books.
A semiclassical variational principle can be based on the reciprocal Maupertuis principle (20) (Gray et al 2004). Thus, for bound states, one first determines the classical energy as a function of the action W by solving (20) as described earlier (e.g. see eq.(14) for the quartic oscillator), and then imposes the Bohr-Sommerfeld quantization condition (or one of its refinements) on action W. This gives the allowed energies semiclassically as a function of the quantum number.
Continuum Mechanics and Field Theory
Action principles can be applied to field-like quantities
, both classically (Goldstein et al 2002, Landau and Lifshitz 1962, Soper 1976, Burgess 2002, Jackson 1999, Morse and Feshbach 1953, Brizard 2008) and quantum-mechanically (Dyson 2007, Wentzel 1949). The systems can be nonrelativistic or relativistic. We have already mentioned above the application of action principles to the electromagnetic and gravitational fields, and to the Schrödinger wave function. These methods are also widely applied in classical continuum mechanics, e.g., to strings, membranes, elastic solids and fluids (Yourgrau and Mandelstam 1968, Lanczos 1970, Reddy 2002).
We illustrate with one simple example, the one-dimensional vibrating string, following Brizard (2008). The nonrelativistic classical equation of motion for the transverse displacement
is
- (21)
where
is the density and
the tension. Eq. (21) is the well known wave equation. It is assumed that
is zero at the two ends,
and
, and that
is given at two times,
and
. One easily verifies that the equation of motion (21) follows from the action principle
, with the given constraints, where
- (22)
with
- (23)
the Lagrangian density
. Because of the simple quadratic Lagrangian density (23), the variation of (22) can readily be done directly; alternatively, we can use the Euler-Lagrange equation for 1D fields
, a generalization of (5),
- (24)
which also gives (21).
Conservation Laws
Conservation laws are a consequence of symmetries of the Lagrangian or action. For example, conservation of energy follows from invariance under time translation. The link between symmetries and conservation laws holds for particle and continuum systems (Noether's theorem). The conservation laws can be derived either from the Lagrangian and equations of motion (Goldstein et al 2002), or directly from the action and the variational principle (Goldstein et al 2002, Lanczos 1970, Oliver 1994, Schwinger et al 1998). Since Noether's theorem is to be discussed elsewhere in Scholarpedia, we refer to that article for details.
References (historical)
- Goldstine, H.H. (1980). A History of the Calculus of Variations from the 17th Through the 19th Century, Springer, New York.
- Hankins, T.L. (1980). Sir William Rowan Hamilton, Johns Hopkins U.P., Baltimore.
- Lanczos, C. (1970). The Variational Principles of Mechanics, 4th edition, University of Toronto Press, Toronto.
- Terrall, M. (2002). The Man who Flattened the Earth, University of Chicago Press, Chicago. (biography of Maupertuis)
- Todhunter, I. (1861). A History of the Progress of the Calculus of Variations During the Nineteenth Century, Cambridge U.P., Cambridge.
- Yourgrau, W. and S. Mandelstam (1968). Variational Principles in Dynamics and Quantum Theory, 3rd edition, Saunders, Philadelphia.
References
- Brizard, A.J. (2008). An Introduction to Lagrangian Mechanics, World Scientific, Singapore.
- Burgess, M. (2002). Classical Covariant Fields, Cambridge U.P., Cambridge.
- Dyson, F. (2007). Advanced Quantum Mechanics, World Scientific, Singapore.
- Epstein, S.T. (1974). The Variation Method in Quantum Chemistry, Academic, New York.
- Goldstein, H., C. Poole and I. Safko (2002). Classical Mechanics, 3rd edition, Addison-Wesley, New York.
- Gray, C.G., G. Karl and V.A. Novikov (1996). "The Four Variational Principles of Mechanics", Ann. Phys. 251, 1-25.
- Gray, C.G., G. Karl and V.A. Novikov (1999). "From Maupertuis to Schrödinger. Quantization of Classical Variational Principles", Am. J. Phys. 67, 959-961.
- Gray, C.G., G. Karl and V.A. Novikov (2004). "Progress in Classical and Quantum Variational Principles", Rep. Prog. Phys. 67, 159-208.
- Gray, C.G. and E.F. Taylor (2007). "When Action is Not Least", Am. J. Phys. 75, 434-458.
- Jackson, J.D. (1999). Classical Electrodynamics, 3rd edition, Wiley, New York.
- Landau, L.D. and E.M. Lifshitz (1962). The Classical Theory of Fields, 2nd edition, Pergamon, New York.
- Landau, L.D. and E.M. Lifshitz (1969). Mechanics, 2nd edition, Pergamon, Oxford.
- Merzbacher, E. (1998). Quantum Mechanics, 3rd edition, Wiley, New York.
- Misner, C. W., K. S. Thorne and J. A. Wheeler (1973). Gravitation, Freeman, San Francisco.
- Morse, P.M. and H. Feshbach (1953). Methods of Theoretical Physics, Vol.1, McGraw Hill, New York.
- Oliver, D. (1994). The Shaggy Steed of Physics, Springer, New York.
- Papastavridis, J.G. (2002). Analytical Mechanics, Oxford U.P., New York.
- Reddy, J.N. (2002). Energy Principles and Variational Methods in Applied Mechanics, Wiley, New York.
- Rohrlich, F. (1965). Classical Charged Particles, Addison-Wesley, Reading.
- Santilli, R. M. (1978). Foundations of Theoretical Mechanics I, Springer, New York.
- Schrödinger, E. (1926a). "Quantisierung als eigenwert problem I", Ann. Phys. 79, 361-376; (1926b). "Quantisierung als eigenwert problem II", Ann. Phys. 79, 489-527.
- Schwinger, J., L.L. DeRaad Jr, K.A. Milton and W-Y Tsai (1998). Classical Electrodynamics, Perseus Books, Reading.
- Soper, D.E. (1976). Classical Field Theory, Wiley, New York.
- Taylor, E.F. and J.A. Wheeler (1992). Spacetime Physics, 2nd edition, Freeman, New York.
- Taylor, E.F. and J.A. Wheeler (2000). Exploring Black Holes: Introduction to General Relativity, Addison-Wesley Longman, San Francisco.
- Wald, R.M. (1984). General Relativity, University of Chicago Press, Chicago.
- Wentzel, G. (1949). Quantum Theory of Fields, Interscience, New York.
Further reading
- Brown, L. M. (2005). editor, Feynman's Thesis, World Scientific, Singapore.
- Doughty, N. A. (1990). Lagrangian Interaction, Addison-Wesley, Reading.
- Feynman, R.P., R.B. Leighton and M. Sands (1963). The Feynman Lectures on Physics, Vol.II, Ch.19, Addison-Wesley, Reading.
- Greiner, W. and J. Reinhardt (1996). Field Quantization, Springer, Berlin.
- Hildebrandt, S. and A. Tromba (1996). The Parsimonious Universe, Springer, New York.
- Moiseiwitsch, B.L. (1966). Variational Principles, Interscience, New York.
- Nesbet, R.K. (2003). Variational Principles and Methods in Theoretical Physics and Chemistry, Cambridge U.P.,Cambridge.
- Tabarrok, B. and F. P. J. Rimrott (1994). Variational Methods and Complementary Formulations in Dynamics, Kluwer, Dordrecht.
- Toms, D. J. (2007). The Schwinger Action Principle and Effective Action, Cambridge U.P., Cambridge.
See Also
Dynamical systems, gauge invariance, Hamilton-Jacobi equation, Hamiltonian Systems, Lagrangian Mechanics, Noether's Theorem.
