Dr. Ian Gladwell
From Scholarpedia
Department of Mathematics, Southern Methodist University, Dallas, TX
Contents |
Introduction
An explicit ordinary differential two-point boundary value problem
(BVP) of total order
on a finite interval
may be written in first order system form as
- (1)
Here,
and the
boundary conditions defined by
must
be independent.
In practice, most BVPs do not arise directly in the form
(1) but instead as a combination of equations
defining various orders of derivatives of the variables which sum
up to
. In an explicit BVP, the boundary conditions
and the right hand sides of the ordinary differential equations
(ODEs) can involve the derivatives of each solution variable up to
an order one less than the highest derivative of that variable
appearing on the left hand side of the ODE defining the variable.
To write a general system of ODEs of different orders in the form
(1), we define
to be the vector made
up of all the solution variables and their derivatives up to one
less than the highest derivative of each variable, then add
trivial ODEs to define these derivatives. See the section on
initial value problems for an example of how this is achieved.
The words "two-point" refer to the fact that the boundary
condition function
is evaluated at the solution at
the two interval endpoints
and
unlike for initial value problems (IVPs) where the
initial conditions are all evaluated at a single point.
Occasionally, problems arise where the function
is
also evaluated at the solution at other points in
. In these cases, we have a multipoint BVP. A
multipoint problem may be converted to a two-point problem by
defining separate sets of variables for each subinterval between
the points and adding boundary conditions which ensure continuity
of the variables across the whole interval. Like rewriting the
original BVP in the compact form (1), rewriting a
multipoint problem as a two-point problem may not lead to a
problem with the most efficient computational solution.
Most practically arising two-point BVPs have separated boundary
conditions where the function
may be split into two
parts (one for each endpoint):
Here,
and
for some value
with
and where each of the vector functions
and
are independent. However,
there are well known, commonly arising, boundary conditions which
are non separated; for example, consider periodic boundary
conditions which, for a problem written in the form of equation
(1), take the form
Existence and Uniqueness
Questions of existence and uniqueness for BVPs are much more difficult than for IVPs, Indeed, there is no general theory of existence and uniqueness. However, there is a vast literature on individual cases; see Bernfeld and Lakshmikantham (1974) for a survey of a variety of techniques that may be used. If we restrict attention to cases where the corresponding IVPs have a unique solution then we can make some general comments. Consider the IVP
- (2)
corresponding to the ODE in (1). If this IVP has a unique solution for
all choices of initial vectors
then the existence
of a solution to (1) hinges on the solvability of the
nonlinear system of equations
- (3)
where
is the solution of the IVP
(2) at
for the initial value
. If there is a solution then it is the unique
solution (among solutions of this type) if the system
has just one solution
.
For linear BVPs, where the ODEs and boundary conditions are both
linear, the equation
is a linear
system of algebraic equations. Hence, generally there will be
none, one or an infinite number of solutions, analogously to the
situation with systems of linear algebraic equations.
In addition to the possibilities for linear problems, nonlinear problems can also have a finite number of solutions. Consider the following simple model of the motion of a projectile with air resistance:
- (4)
These equations may be viewed as describing the planar
motion of a shot fired from a cannon. Here,
is the
height of the shot above the level of the cannon,
is the velocity of the shot, and
is the angle
(in radians) of the trajectory of the shot with the horizontal.
The independent variable
measures the horizontal
distance from the cannon. The constant
represents
air resistance (friction) and
is the
appropriately scaled gravitational constant. This model neglects
three–dimensional effects such as cross winds and the rotation of
the shot. The initial height is
and the muzzle
velocity
for the cannon is fixed. The standard
projectile problem is to choose the initial angle of the cannon
and hence of the shot,
, so that the shot will
hit a target at the same height as the cannon at a distance
; that is, we require
. Altogether the boundary conditions are
Does this BVP have a solution? Physical intuition suggests that it
certainly does not for
beyond the range of
the cannon for the fixed muzzle velocity
. On the
other hand, if
is small enough, we do expect
a solution, but is there only one? To see that there is not,
consider the case when the target is very close to the cannon. We
can hit the target by shooting with an almost flat trajectory or
by shooting high and dropping the shot mortar-like on the target.
That is, there are (at least) two solutions that correspond to
initial angles
and
. It turns out that there are exactly two solutions;
see Figure 1 for an example.
Now, let
increase. There are still two
solutions, but the larger the value of
, the
smaller the angle
and the larger the
angle
. If we keep increasing
, eventually we reach the maximum range with
the given muzzle velocity. At this distance there is just one
solution;, that is,
. In
summary, there is a critical value of
for
which there is exactly one solution. If
is
smaller than this critical value, there are exactly two solutions
and if it is larger, there is no solution.
Shooting or Marching Methods
The simplicity of the projectile model suggests a computational method
of solution. This is to compute the unknown initial value
to satisfy the nonlinear equation
. This approach requires the
(computational) solution of an IVP for the ODEs for each value of
the angle
attempted. The nonlinear equation
may be solved by any suitable method. Physical intuition suggests
exploiting the relationship between the angle chosen and the range
achieved in a bisection-like algorithm but, in more complex cases,
such simple "physical" relationships are usually not available and
a general purpose iteration such as a Newton iteration is often
used. The shooting method can be very successful on simple
problems such as the projectile problem. It can be extended easily
to suggest a method of solution for almost any boundary value
problem based on solving equation (3) and it has
been automated in many pieces of mathematical software. However,
its success depends on a number of factors the most important of
which is the stability of the initial value problem that must be
solved at each iteration. Unfortunately, it is the case that for
many stable boundary value problems the corresponding initial
value problems (beginning from either endpoint and integrating
towards the other endpoint) are insufficiently stable for shooting
to succeed. So, shooting methods are not computationally suitable
for the whole range of practical boundary value problems,
particularly those on very long or infinite intervals. A second
difficulty, sometimes interconnected with the aforementioned
stability problem, is that iteration methods such as Newton for
solving equation (3) may require a far more accurate
initial estimate for the initial value
than is
readily available.
Infinite Intervals
Many ODE BVPs arising from the analysis of partial differential
equations through the computation of similarity solutions or via
perturbation methods. These problems are often defined on
semi-infinite ranges. For example the Blasius equation
- (5)
arises from a similarity solution of the
partial differential equations describing fluid flow over a flat
plate. Of course, the boundary condition at infinity is
asymptotic, it should be read as
as
, and it implies that
as
where
the constant
is a priori unknown.
This problem is easy to solve computationally — shooting from the
origin and using a standard nonlinear equation solver works
without difficulty. Of course, we can't integrate the equations to
"infinity" but we can replace the boundary condition at infinity
by a corresponding one a finite point,
, and that
point
need not be chosen very large because the
asymptotic expansion of the solution has
exponentially as
. So, for example,
using the boundary condition
with
provides a quite accurate solution. There are no
fast increasing solutions to the equation "near" the desired
solution so there is no unstable growth of computational solutions
on quite long ranges of integration as long as the unknown initial
value
is not chosen too far away from the
correct value of approximately
, for which we
find that the "unknown" constant
.
In the Blasius problem the location and type of boundary conditions is determined "physically" and gives us a stable (well-conditioned) problem. In general, matters are more complicated though physical principles remain an essential guide. For simplicity of exposition (and understanding) consider the linear problem
- (6)
Its general solution is
Note that there are three components of the solution, two that decay as
increases from the origin towards positive infinity and one that
grows. Suppose that we solve this equation on the interval
with boundary conditions
The last boundary condition interpreted as
as
, implies that
.
Then, the other boundary conditions imply that
and
. So, there is a unique solution of this BVP.
On the other hand, if the boundary conditions are
- (7)
the boundary condition
again implies that
, but now the third condition places no constraint
on the coefficients, and the remaining condition tells us only
that
, so any value of
results in a solution, i.e., this BVP has infinitely many
solutions. This problem provides an example of exponential
dichotomy; Ascher et al (1995) and Mattheij and Molenaar (2002)
discuss the requirements exponential dichotomy in detail. For a
problem to be well-posed the boundary conditions must be set
appropriately. For the simple equation (6), if
the boundary conditions are separated, essentially we must have
two boundary conditions at the origin and one at infinity matching
the two decaying and one increasing (towards infinity) basis
functions in the solution.
If a BVP is not well–posed with boundary conditions at infinity,
it is natural to expect numerical difficulties when those boundary
conditions are imposed at a large but finite point
even though, in this case, a solution is always defined. Suppose
then that we solve the equation (6) with
boundary conditions
replacing (7). For large values of
, the system of linear equations for the
coefficients
,
, and
in the general solution is extremely ill–conditioned reflecting
the poor conditioning of equation (6) with
boundary conditions (7); see Shampine et al (2003)
for the details.
Numerical Methods
We described shooting methods in section and
we explained there that there are inherent problems in that
approach. These problems may be overcome, at least partially,
using variants on the shooting method which broadly come under the
heading of multiple shooting; see Ascher and Petzold (1998)
for a detailed discussion.
Most general purpose software packages for BVPs are based on
global methods which fall into two related categories. The
first is finite differences where a mesh is defined on the
interval
and the derivative in (1)
is replaced by a difference approximation at each mesh point; see
Keller (1992) for details. The resulting difference equations plus the
boundary conditions give a set algebraic equations for the
solution on the mesh. These equations are generally nonlinear but
are linear when the differential equations and boundary conditions
are both linear. To achieve a user specified error the software
generally adjusts the mesh placement using local error estimates
based on higher order differencing involving techniques such as
deferred correction; see Ascher and Petzold (1998) and Shampine et
al (2003) for a discussion of error estimation and mesh placement.
A second global approach is to approximate the solution by defining a basis for a linear space of functions usually defined piecewise on a mesh and to collocate this approximate solution. (Collocation is the process of substituting the approximate solution in the ODE then requiring the ODE to be satisfied exactly at a number of collocation points. The number of collocation points plus the number of boundary conditions must equal the number of unknown coefficients in the approximate solution; that is, they must equal the number of basis functions.) The commonest choice of approximation is a linear space of splines. The error is again controlled by adjusting the mesh spacing using local error estimates involving approximate solutions of varying orders of accuracy; see Ascher et al (1995), Mattheij and Molenaar (2002), or Ascher and Petzold (1998) for a detailed discussion. Choosing a spline basis (or more or less equivalently using certain types of Runge-Kutta formulas on the mesh) leads to a nonlinear system which must be solved iteratively. This iterative approach involves solving linear systems of equations which are structured (they are almost block diagonal when the boundary conditions are separated) and which can be solved inexpensively; the survey paper Amodio et al (2001) discusses a variety of methods for these systems and for the related systems arising when the boundary conditions are nonseparated. Similarly structured systems arise from finite difference approximations and also from multiple shooting techniques.
Sturm–Liouville Eigenproblems
Another type of BVP that arises in the analytical solution of certain linear partial differential equations is the Sturm–Liouville eigenproblem. In its simplest form this is a scalar self-adjoint linear second order ODE BVP
- (8)
Here the parameter
, an eigenvalue, is to be determined such that
the BVP (8) has a nontrivial (not identically zero)
solution. There are broad analogies here with the generalized
algebraic eigenproblem
where, depending
on the properties of the matrices
and
, various distributions of the finite number of
eigenvalues
are possible. In the case of the
BVP (8), in simple cases there are a countable number
of number of eigenvalues each with a corresponding solution
. So, for example, as shown in \cite{z}, if
and
are sufficiently
smooth and
on
then
the eigenvalues are real and distinct, and may be ordered
defining a
discrete spectrum. The eigenfunction
corresponding to
has
zeros
in
. The set of eigenfunctions
is linearly independent. If
we relax the smoothness conditions on the coefficients
and
, and/or permit these functions
to take on a wider range of values, many different phenomena are
observed from doubling of the eigenvalues to the occurrence of
continuous spectra; see Zettl (2005) for a comprehensive
discussion of the various possibilities.
Numerical methods for Sturm–Liouville eigenproblems that have been
implemented in software include finite difference and finite
element discretizations which both lead to a generalized algebraic
eigenproblem. Pruess methods approximate the eigenproblem by
another where the coefficients
and
are replaced by piecewise constants resulting in a
set of problems which may each be solved analytically. Finally,
shooting methods are usually implemented using a scaled Pr\"{u}fer
transformation,
where
is a
scaling function;
gives the standard Pr\"{u}fer
transformation. The transformation leads to a pair of nonlinear
ODEs for
and
where the ODE for
does not depend on
so may be
solved alone. More directly important, the boundary conditions in
problem (8) are replaced by
which provide the basis for a shooting method where each
eigenvalue may be determined by the solution of a single nonlinear
algebraic equation. See Pryce (1993) for a general discussion of
the various types of numerical methods for Sturm–Liouville
eigenproblems
References
- P. Amodio, J.R. Cash, G. Roussos, R.W. Wright, G. Fairweather, I. Gladwell, G.L. Kraut and M. Paprzycki, Almost block diagonal linear systems: sequential and parallel solution techniques, and applications, Numer. Lin. Algebra Applics. 7 (2000) 275-317.
- U.M. Ascher, R.M.M. Mattheij, R.D. Russell, Numerical Solution of Boundary Value Problems for Ordinary Differential Equations, SIAM Classics in Applied Mathematics 13, SIAM, Philadelphia, PA, 1995.
- U.M. Ascher and L.R. Petzold, Computer Methods for Ordinary Differential Equations and Differential Algebraic Equations, SIAM, Philadelphia, PA, 1998.
- S.R. Bernfeld and V. Lakshmikantham, An Introduction to Nonlinear Boundary Value Problems, Mathematics in Science and Engineering 109, Academic Press, New York, NY, 1974.
- H.B. Keller, Numerical Methods for Two-Point Boundary-Value Problems, Dover, New York, NY, 1992.
- R.M.M. Mattheij and J. Molenaar, Ordinary Differential Equations in Theory and Practice, SIAM Classics in Applied Mathematics 43, SIAM, Philadelphia, PA, 2002.
- J.D. Pryce, Numerical Solution of Sturm-Liouville Problems, Clarendon Press, Oxford, UK, 1993.
- L.F. Shampine, I. Gladwell and S. Thompson, Solving ODEs with MATLAB, Cambridge University Press, Cambridge, UK, 2003.
- A. Zettl, Sturm-Liouville Theory, Mathematical Surveys and Monographs 121, AMS, Providence, RI, 2005.
when %
and 
