Robert L. Warnock

Revision as of 23:51, 27 April 2009

History

<label>history</label>

Hamilton made one of the earliest studies of geometrical optics in an arbitrary medium with varying index of refraction (Hamilton, 1827), (Synge, 1937), (Carathe'odory, 1937). He found an eloquent summary of the topic in a "characteristic function", which is the optical path length of a ray, regarded as a function of initial and final positions and times of the ray. This and related functions satisfy partial differential equations, and directly determine infinite families of rays (or particle orbits in the extension to mechanics).

Jacobi (Jacobi,1842) sharpened Hamilton's formulation, clarified mathematical issues, and made significant applications. The resultant Hamilton-Jacobi theory and later developments are presented in several famous texts (Arnol'd, 1974), (Landau-Lifshitz, 1969), (Gantmacher, 1970), (Born-Wolf, 1965), (Lanczos, 1949), (Carathe'odory, 1982), (Courant-Hilbert, 1962). For studies using modern PDE theory see (Benton, 1977). The theory embodies a wave-particle duality, which figured in the advent of the de Broglie - Schroedinger wave mechanics (Butterfield, 2005).

In a broader view than that of the original work, a solution of the Hamilton-Jacobi equation is the generator of a canonical transformation, a symplectic change of variables intended to simplify the equations of motion. In this framework (as applied to mechanics) there are solutions of a type different from that of Hamilton, which not only determine orbits but also invariant tori in phase space on which the orbits lie. These solutions, which are known to exist only under special circumstances, are the subject of the celebrated work of Kolmogorov, Arnol'd, and Moser (Gallavotti, 1983}, which has implications for stability of motion. Even approximate invariants have implications for stability over finite times (Nekhoroshev, 1977), (Warnock-Ruth, 1992), and also find applications in semi-classical quantum theory (Einstein-Brillouin-Keller quantization) (Percival, 1977). Various forms and generalizations of the Hamilton-Jacobi equation occur widely in contemporary applied mathematics, for instance in optimal control theory (Bellman, 1957).

Canonical Transformation <label>canon_sect</label>

A mechanical system with \(n\) degrees of freedom is described by generalized coordinates \(q=(q_1,\cdots, q_n)\) and corresponding generalized momenta \( p=(p_1,\cdots,p_n)\); we write \(z=(q,p)\). The motion of the system is governed by Hamilton's ordinary differential equations,

             \dot q= H_p(z,t)\ ,\quad \dot p=-H_q(z,t)\ ,

</math> where \( \dot{}\ \) denotes the time derivative and subscripts indicate vectors of partial derivatives; thus \(H_q=(\partial H/\partial q_1,\cdots,\partial H/\partial q_n)\). The Hamiltonian function \(H:\mathbb{R}^{2n}\times\mathbb{R}\rightarrow\mathbb{R}\) is here assumed to be \(C^2\) in \(z\) and continuous in \(t\). The solution of the initial value problem for the Hamiltonian system (<ref>hameq</ref>) is denoted by \({\mathbf z}(t,z_0)=({\mathbf q}(t,z_0),{\mathbf p}(t,z_0))\) for initial value \(z_0={\mathbf z}(0,z_0)\). This solution, denoted by the bold faced letter \(\mathbf z\) to distinguish it from a general point \(z\) in phase space, will be called an ``orbit". If \(H\) depends on the time, specification of an orbit requires the initial time \(t_0\) (not just the elapsed time) as well as the initial condition \(z_0\); for convenience the origin of time is chosen so that \(t_0=0\).

We seek a change of variable, \(Z=(Q,P)=\Phi(z,t)=(\Phi_1(z,t),\Phi_2(z,t))\), in general time dependent, so that the equations of motion retain their form, but with a new Hamiltonian \(K\), namely

   \dot Q= K_P(Z,t)\ ,\quad \dot P=- K_Q(Z,t)\ .
 </math>

If it can be arranged that \(K\) is independent of \(Q\), then \(\mathbf P\) is constant and the solution of (<ref>newhameq</ref>) is given simply as

  {\mathbf Q}(t,Z_0)=Q_0+\int_0^t  K_P(P_0,\tau)d\tau\ ,\quad {\mathbf P}(t,Z_0)=P_0\ .

</math> The solution of (<ref>hameq</ref>) is retrieved by the inverse transformation \(z=\Psi(Z,t)\).

We write \({\mathbf Z}(t,Z_0)=({\mathbf Q}(t,Z_0),{\mathbf P}(t,Z_0))=\Phi({\mathbf z}(t,z_0),t)\) for an orbit in the new coordinates, where \(Z_0=\Phi(z_0,0)\). We often suppress reference to initial conditions. A canonical transformation will be determined through the equation

 {\mathbf p}(t)\cdot\dot{\mathbf q}(t)-H({\mathbf z}(t),t)=-{\mathbf Q}(t)\cdot\dot{\mathbf P}(t)-K({\mathbf Z}(t),t)

+\frac{d}{dt}F({\mathbf q}(t),{\mathbf P}(t),t)\ , </math> where \(\cdot\) indicates the scalar product and the given function \(F(q,P,t)\) is \(C^2\) in its first two arguments, \(C^1\) in \(t\), and such that

 \det F_{qP}=\det\{\partial^2 F/\partial q_i\partial P_j\}\ne 0\ ,
 </math>

in some open region \(\Omega\in \mathbb{R}^{2n+1}\) of \((q,P,t)\)-space. This function \(F\) is called the generator or generating function of the transformation. Writing out \(dF/dt\), we see that (<ref>canon</ref>) is satisfied if

{\mathbf p}(t)=F_q({\mathbf q}(t),{\mathbf P}(t),t)\ ,

</math>

  {\mathbf Q}(t)=F_P({\mathbf q}(t),{\mathbf P}(t),t)\ ,

</math>

  K({\mathbf Z}(t),t)=H({\mathbf z}(t),t)+ F_t({\mathbf q}(t),{\mathbf P}(t),t)\ .

</math> This leads us to define the canonical transformation the equations

  p=F_q(q,P,t)\ ,

</math>

  Q=F_P(q,P,t)\ .

</math> Owing to condition (<ref>nonsing</ref>) and the inverse function theorem we can solve (<ref>canona</ref>) for \(P=\Phi_2(z,t)\) (at least locally in \(\Omega\)) and then substitute in (<ref>canon2</ref>) to get \(Q=\Phi_1(z,t)\) as well. Similarly, to get the inverse transformation \(z=\Psi(Z,t)\) we solve (<ref>canonb</ref>) for \(q=\Psi_1(Z,t)\) then substitute in (<ref>canona</ref>) to find \(p=\Psi_2(Z,t)\). Then the new Hamiltonian is defined by <math canonc>

  K(Z,t)=H(z,t)+F_t(q,P,t)=H(\Psi(Z,t),t)+F_t(\Psi_1(Z,t),P,t)\ .

</math>

Take \(\partial/\partial P\) of (<ref>explicit</ref>), evaluate on orbits, and then subtract \(d/dt\) of (<ref>canon2</ref>). Similarly, take \(\partial/\partial q\) of (<ref>explicit</ref>), evaluate on orbits, and subtract \(d/dt\) of (<ref>canon1</ref>). This leads to the informative equations

   F_{qP}(\dot{\mathbf q}-H_p)-(\dot{\mathbf Q}-K_P)+F_{PP}(\dot{\mathbf P}+K_Q)=0\ ,

</math>

   F_{qP}(\dot{\mathbf P}+K_Q)-(\dot{\mathbf p}+H_q)+F_{qq}(\dot{\mathbf q}-H_p)=0\ .

</math> In view of (<ref>nonsing</ref>), this shows that (<ref>hameq</ref>) implies (<ref>newhameq</ref>) and vice versa, as long as \((q,P,t)\) lies in \(\Omega\).

There are other possible choices of the old and new variables on which the generating function depends (Goldstein,1981). Following notation of Goldstein, we are concerned with \(F_1(q,Q,t)\) and \(F= F_2(q,P,t)\), for which the equations are as follows:

    {\mathbf p }\dot{\mathbf q}-H={\mathbf P}\dot{\mathbf Q}-K+dF_1/dt, \quad p= F_{1q},\quad P=-F_{1Q}\ ,

</math>

    {\mathbf p}\dot{\mathbf q}-H=-{\mathbf Q}\dot{\mathbf P}-K+dF_2/dt, \quad p=F_{2q},\quad Q=F_{2P}\ .

</math> In each case \(K=H+F_{it}\) and the determinant of second derivatives of \(F_i\) should be non-zero, as in (<ref>nonsing</ref>).

One can show that the transformation defined by any generator with requisite smoothness is symplectic, which means that its Jacobian matrix \(M=\{ \partial \Phi_i(z,t)/\partial z_j \}\) is symplectic for all \(z\). An alternative viewpoint is to take symplecticity as the defining property of a canonical transformation.

Hamilton-Jacobi Equation and Invariant Tori <label>hjsect</label>

We now wish to determine \(F\) in such a way that \(K\) will indeed be independent of \(Q\), thus giving (<ref>soln</ref>) as the solution of the transformed equations. Demanding that form of \(K\), and substituting (<ref>canona</ref>) in (<ref>canonc</ref>) we have <math hj>

    H(q, F_q(q,P,t),t)+ F_ t(q,P,t)=K(P,t)\ ,

</math> which is the Hamilton-Jacobi equation for the type-2 generator. Here \(P\) is regarded as a parameter; the independent variables of the PDE are \(q\) and \(t\). A solution of (<ref>hj</ref>) depending on \(n\) parameters \(P_i\) and such that \(\det F_{qP}\ne 0\) was called a complete solution (Vollstaendige Loesung) by Jacobi; in his case \(K=0\). As we have seen, it determines a canonical transformation.

Equation (<ref>hj</ref>) is clearly a necessary condition on the generator of a transformation for which the new Hamiltonian is independent of \(Q\). But can it be used to construct such a generator? For this a first question is the status of the function \(K(P,t)\). Can it be chosen arbitrarily or is it somehow determined in the course of the solution of (<ref>hj</ref>)? There is one choice of \(K\) which can be made freely at the start, and which leads to an important solution, namely \(K= 0\). This is the case considered by Hamilton and Jacobi, which we shall discuss in the following section.

To illustrate the situation with non-zero \(K\), take the case of a time-independent Hamiltonian \(H(z)\) and look for a solution in which \(K\) and \(F\) are also time-independent. Take polar coordinates \((q,p)=(\phi,I), \ \ (Q,P)=(\psi,J),\ \ \phi,\psi\in [0,2\pi],\ \ I,J\in [0,\infty)\). Also, define \(G\) so that \(F(\phi,J)=\phi\cdot J+G(\phi,J)\), where the first term on the right gives the identity transform. Then the Hamilton-Jacobi equation to solve for \(G\) is

    H(\phi,J+G_\phi(\phi,J))=K(J)\ ,

</math> and the equations (\ref{canona}) and (\ref{canonb}) defining the transformation are

      I=J+G_\phi(\phi,J)\ ,

</math>

     \psi=\phi+G_J(\phi,J)\ .

</math> If \(G\) satisfies (<ref>hj</ref>) for some function \(K(J)\), then \(J\) is constant and (\ref{polara}) represents an invariant torus in phase space. The new angle variable \(\psi\) advances linearly in time, according to (<ref>soln</ref>).

Now consider a perturbed integrable system with Hamiltonian

     H(\phi,I)=H_0(I)+\epsilon V(\phi,I)\ ,
     </math>

which satisfies a condition of non-degeneracy

     \det\ \nu_I(I)\ne 0,\quad \nu(I)= H_{0I}(I) .
     </math>

Next rearrange (<ref>hjpolar</ref>) to subtract the first terms of the Taylor series of \(H_0(J+G_\phi)\):

     -\nu(J)\cdot G_\phi=\epsilon V(\phi,J+G_\phi)+\big[ H_0(J+G_\phi)-H_0(J)-\nu(J)\cdot G_\phi\big]      +\big[ H_0(J)-K(J)\big] \ .
     </math>

The sum of the terms in the first square bracket is \(\mathcal{O}(G_\phi^2)\) and therefore small if the transformation (<ref>polara</ref>,<ref>polarb</ref>) is close to the identity. Introducing the Fourier series

     G_\phi(\phi,J)= \sum_{m\in Z^n} im\ g_m(J)\exp(im\cdot\phi)\ ,
     </math>

and taking the Fourier transform of (<ref>{pertform}) we have

      g_m(J)=\frac{i}{m\cdot\nu(J)}\frac{1}{(2\pi)^n}\int_{T^n}
     \exp(-im\cdot\phi)\big[\epsilon V(\phi,J+G_\phi) +       
        H_0(J+G_\phi)-H_0(J)-\nu(J)\cdot G_\phi\big]d\phi,\quad m\ne {\mathbf 0}\ .
     </math>

Since \(G_\phi\) does not contain the zero mode, the set of equations (<ref>hjfourier</ref}) for all \(m\ne {\bf 0}\) is a closed system for the Fourier coefficients \(g_m,\ m\ne{\mathbf 0}\). If a solution of this system is known for some \(J\), then we have solved the projection of (<ref>hjpolar</ref>) onto every mode except the zero mode. We can then solve also the zero mode projection simply by defining \(K\) as the average of the l.h.s.:

:\[
      K(J)=\frac{1}{(2\pi)^n}\int_{T^n}d\phib\big[H_0(J+G_\phi)+
      \epsilon V(\phi,J+G_\phi)\big]\ .
      \]

The zero mode amplitude \(g_{\mathbf 0}\) can be chosen arbitrarily, for instance put equal to zero. Thus we get some understanding of how the PDE (<ref>hjpolar</ref>) could be solved without a prior knowledge of its right hand side.

At first sight Eq.(<ref>hjfourier</ref>) would seem to be a straightforward fixed point problem that might be solved by some kind of iteration, provided that the divisor \(m\cdot\nu(J)\) could be bounded away from zero through an appropriate choice of \(J\). The iteration might be started by keeping only the term \(\epsilon V\), which gives lowest order perturbation theory. If the series (<ref>fourier</ref>) is truncated, then the problem can indeed be approached in that way, and (<ref>hjfourier</ref>) provides a practical method for computing approximate invariant tori (WArnock-Ruth,). The exact problem requires the refined method of KAM theory to control small divisors (Gallavotti, Poeschel). The theory ensures the existence of invariant tori for sufficiently small \(\epsilon\) but they are not continuous functions of \(J\). Rather, they exist only on a Cantor set in \(J\)-space, and the concept of complete solution does not apply in a literal sense (Poeschel).

Robert L. Warnock

Revision as of 23:51, 27 April 2009

History

Canonical Transformation <label>canon_sect</label>

Hamilton-Jacobi Equation and Invariant Tori <label>hjsect</label>

Personal tools

Namespaces

Variants

Views

Actions

Search

Navigation

Focal areas

Activity

Tools

@@ Line 1: / Line 1: @@
-==History==
+==History== <label>history</label>
-Hamilton made one of the earliest studies of geometrical optics in  an arbitrary medium with varying index of refraction (Hamilton 1827), (Synge 1937), (Carathe'odory 1937).
+Hamilton made one of the earliest studies of geometrical
-He found an eloquent summary of the topic in
+optics in an arbitrary medium with varying index of refraction (Hamilton,
-a "characteristic function", which is the optical path length of a ray,
+), (Synge, 1937), (Carathe'odory, 1937).  He found an eloquent summary
-regarded as a function of initial and final positions and times of the ray. This
+of the topic in a "characteristic function", which is the optical path
-and related functions satisfy partial differential equations, and directly
+length of a ray, regarded as a function of initial and final positions
-determine infinite families of rays (or particle orbits in the extension to mechanics).
+and times of the ray.  This and related functions satisfy partial
+differential equations, and directly determine infinite families of rays
+(or particle orbits in the extension to mechanics).
-Jacobi (Jacobi 1842) sharpened Hamilton's formulation, clarified mathematical issues, and made significant applications.
+Jacobi (Jacobi,1842) sharpened Hamilton's formulation, clarified
-The resultant
+mathematical issues, and made significant applications.  The resultant
-Hamilton-Jacobi theory and  later developments  are presented in several famous texts (Arnol'd
+Hamilton-Jacobi theory and later developments are presented in several
-), (Landau-Lifshitz 1969), (Gantmacher 1970), (Born-Wolf 1965), (Lanczos 1949),
+famous texts (Arnol'd, 1974), (Landau-Lifshitz, 1969), (Gantmacher, 1970),
-(Carathe'odory 1982), (Courant-Hilbert 1962). For studies using modern PDE theory see (Benton 1977).
+(Born-Wolf, 1965), (Lanczos, 1949), (Carathe'odory, 1982), (Courant-Hilbert,
-The theory embodies a wave-particle duality, which figured in the advent of the de Broglie - Schroedinger
+).  For studies using modern PDE theory see (Benton, 1977).  The
-wave  mechanics (Butterfield 2005).
+theory embodies a wave-particle duality, which figured in the advent of
+the de Broglie - Schroedinger wave mechanics (Butterfield, 2005).
-In a broader view than that of the original work, a solution of the Hamilton-Jacobi (H-J) equation is the generator of a canonical transformation,
+In a broader view than that of the original work, a solution of the
-a symplectic change of variables intended to simplify the equations of motion. In this framework
+Hamilton-Jacobi  equation is the generator of a canonical
-(as applied to mechanics) there are solutions of a type different from that of Hamilton, which not only determine orbits
+transformation, a symplectic change of variables intended to simplify
-but also invariant tori in phase space on which the orbits lie. These solutions, which are known to exist only
+the equations of motion.  In this framework (as applied to mechanics)
-under special circumstances, are the subject of the celebrated work of
+there are solutions of a type different from that of Hamilton, which not
-Kolmogorov, Arnol'd, and Moser (Gallavotti 1983}, which has implications for stability of motion.
+only determine orbits but also invariant tori in phase space on which
-Even approximate invariants have implications for stability over finite times (Nekhoroshev 1977), (Warnock-Ruth 1992),
+the orbits lie.  These solutions, which are known to exist only under
-and also find applications in semi-classical quantum theory (Einstein-Brillouin-Keller quantization)  (Percival 1977).
+special circumstances, are the subject of the celebrated work of
-Various forms and generalizations of the H-J equation occur widely in contemporary applied mathematics,
+Kolmogorov, Arnol'd, and Moser (Gallavotti, 1983}, which has implications
-for instance in optimal control theory (Bellman 1957).
+for stability of motion.  Even approximate invariants have implications
+for stability over finite times (Nekhoroshev, 1977), (Warnock-Ruth, 1992),
+and also find applications in semi-classical quantum theory
+(Einstein-Brillouin-Keller quantization) (Percival, 1977).  Various forms
+and generalizations of the Hamilton-Jacobi equation occur widely in contemporary
+applied mathematics, for instance in optimal control theory (Bellman,
+).
 ==Canonical Transformation <label>canon_sect</label>==
-A mechanical system with <math>n</math> degrees of freedom is described by generalized coordinates <math>q=(q_1,\cdots, q_n)</math> and corresponding
+A mechanical system with <math>n</math> degrees of freedom is described
-generalized momenta <math> p=(p_1,\cdots,p_n)</math>; we write <math>z=(q,p)</math>. The motion of the system is governed by
+by generalized coordinates <math>q=(q_1,\cdots, q_n)</math> and
+corresponding generalized momenta <math> p=(p_1,\cdots,p_n)</math>; we
+write <math>z=(q,p)</math>.  The motion of the system is governed by
 Hamilton's ordinary differential equations,
-<math hameq>
+::<math hameq>
-                            \dot q= H_p(z,t)\ ,\quad \dot p=-H_q(z,t)\ ,
+              \dot q= H_p(z,t)\ ,\quad \dot p=-H_q(z,t)\ ,
-  </math>
+</math>
-where  <math> \dot{}\ </math> denotes the time derivative and subscripts indicate vectors of partial derivatives; thus  <math>H_q=(\partial H/\partial q_1,\cdots,\partial H/\partial q_n)</math>.
+where <math> \dot{}\ </math> denotes the time derivative and subscripts
-The Hamiltonian function <math>H:\mathbb{R}^{2n}\times\mathbb{R}\rightarrow\mathbb{R}</math>
+indicate vectors of partial derivatives; thus <math>H_q=(\partial
-is  here assumed to be <math>C^2</math> in <math>z</math> and  continuous in <math>t</math>. The solution of the initial value problem for the Hamiltonian system
+H/\partial q_1,\cdots,\partial H/\partial q_n)</math>.  The Hamiltonian
-(<ref>hameq</ref>) is denoted by <math>{\mathbf z}(t,z_0)=({\mathbf q}(t,z_0),{\mathbf p}(t,z_0))</math> for initial value <math>z_0={\mathbf z}(0,z_0)</math>.
+function
-This solution,  denoted by the bold faced letter <math>\mathbf z</math> to distinguish it from
+<math>H:\mathbb{R}^{2n}\times\mathbb{R}\rightarrow\mathbb{R}</math> is
-a general point <math>z</math> in phase space, will be called an ``orbit". If <math>H</math> depends on the time, specification
+here assumed to be <math>C^2</math> in <math>z</math> and continuous in
-of an orbit requires the initial time <math>t_0</math> (not just the elapsed time) as well as the initial condition <math>z_0</math>; for convenience we choose the origin
+<math>t</math>.  The solution of the initial value problem for the
-of time so that <math>t_0=0</math>.
+Hamiltonian system (<ref>hameq</ref>) is denoted by <math>{\mathbf
+z}(t,z_0)=({\mathbf q}(t,z_0),{\mathbf p}(t,z_0))</math> for initial
+value <math>z_0={\mathbf z}(0,z_0)</math>.  This solution, denoted by
+the bold faced letter <math>\mathbf z</math> to distinguish it from a
+general point <math>z</math> in phase space, will be called an ``orbit".
+If <math>H</math> depends on the time, specification of an orbit
+requires the initial time <math>t_0</math> (not just the elapsed time)
+as well as the initial condition <math>z_0</math>; for convenience the
+origin of time is chosen so that <math>t_0=0</math>.
+We seek a change of variable,
+<math>Z=(Q,P)=\Phi(z,t)=(\Phi_1(z,t),\Phi_2(z,t))</math>, in general
+time dependent, so that the equations of motion retain their form, but
+with a new Hamiltonian <math>K</math>, namely
+::<math newhameq>
+    \dot Q= K_P(Z,t)\ ,\quad \dot P=- K_Q(Z,t)\ .
+  </math>
+If it can be arranged that <math>K</math> is independent of
+<math>Q</math>, then <math>\mathbf P</math> is constant and the solution of
+(<ref>newhameq</ref>) is given simply as
+::<math soln>
+   {\mathbf Q}(t,Z_0)=Q_0+\int_0^t  K_P(P_0,\tau)d\tau\ ,\quad {\mathbf P}(t,Z_0)=P_0\ .
+</math>
+The solution of (<ref>hameq</ref>) is retrieved by the inverse
+transformation <math>z=\Psi(Z,t)</math>.
+We write <math>{\mathbf Z}(t,Z_0)=({\mathbf Q}(t,Z_0),{\mathbf
+P}(t,Z_0))=\Phi({\mathbf z}(t,z_0),t)</math> for an orbit in the new
+coordinates, where <math>Z_0=\Phi(z_0,0)</math>.  We often suppress reference to
+initial conditions.  A '''canonical transformation''' will be determined
+through the equation
+::<math canon>
+  {\mathbf p}(t)\cdot\dot{\mathbf q}(t)-H({\mathbf z}(t),t)=-{\mathbf Q}(t)\cdot\dot{\mathbf P}(t)-K({\mathbf Z}(t),t)
++\frac{d}{dt}F({\mathbf q}(t),{\mathbf P}(t),t)\ ,
+</math>
+where <math>\cdot</math> indicates the scalar product and the given
+function <math>F(q,P,t)</math> is <math>C^2</math> in its first two
+arguments, <math>C^1</math> in <math>t</math>, and such that
+::<math nonsing>
+  \det F_{qP}=\det\{\partial^2 F/\partial q_i\partial P_j\}\ne 0\ ,
+  </math>
+in some open region <math>\Omega\in \mathbb{R}^{2n+1}</math> of <math>(q,P,t)</math>-space.
+This function <math>F</math> is  called the  '''generator''' or generating function of the transformation. Writing out <math>dF/dt</math>, we see that (<ref>canon</ref>) is satisfied if
+::<math canon1>
+ {\mathbf p}(t)=F_q({\mathbf q}(t),{\mathbf P}(t),t)\ ,
+</math>
+::<math canon2>
+   {\mathbf Q}(t)=F_P({\mathbf q}(t),{\mathbf P}(t),t)\ ,
+</math>
+::<math canon3>
+   K({\mathbf Z}(t),t)=H({\mathbf z}(t),t)+ F_t({\mathbf q}(t),{\mathbf P}(t),t)\ .
+</math>
+This leads us to define the canonical transformation the equations
+::<math canona>
+   p=F_q(q,P,t)\ ,
+</math>
+::<math canonb>
+   Q=F_P(q,P,t)\ .
+</math>
+Owing to condition (<ref>nonsing</ref>) and the inverse function theorem
+we can solve (<ref>canona</ref>) for <math>P=\Phi_2(z,t)</math> (at least
+locally in <math>\Omega</math>) and then substitute in
+(<ref>canon2</ref>) to get <math>Q=\Phi_1(z,t)</math> as well.
+Similarly, to get the inverse transformation <math>z=\Psi(Z,t)</math> we
+solve (<ref>canonb</ref>) for <math>q=\Psi_1(Z,t)</math> then substitute
+in (<ref>canona</ref>) to find <math>p=\Psi_2(Z,t)</math>.  Then the new
+Hamiltonian is defined by
+<math canonc>
+   K(Z,t)=H(z,t)+F_t(q,P,t)=H(\Psi(Z,t),t)+F_t(\Psi_1(Z,t),P,t)\ .
+</math>
+Take <math>\partial/\partial P</math> of (<ref>explicit</ref>), evaluate
+on orbits, and then subtract <math>d/dt</math> of (<ref>canon2</ref>).
+Similarly, take <math>\partial/\partial q</math> of
+(<ref>explicit</ref>), evaluate on orbits, and subtract <math>d/dt</math> of
+(<ref>canon1</ref>).  This leads to the informative equations
+::<math jac20_1>
+    F_{qP}(\dot{\mathbf q}-H_p)-(\dot{\mathbf Q}-K_P)+F_{PP}(\dot{\mathbf P}+K_Q)=0\ ,
+</math>
+::<math jac20_2>
+    F_{qP}(\dot{\mathbf P}+K_Q)-(\dot{\mathbf p}+H_q)+F_{qq}(\dot{\mathbf q}-H_p)=0\ .
+</math>
+In view of (<ref>nonsing</ref>), this shows that (<ref>hameq</ref>) implies (<ref>newhameq</ref>)
+and vice versa, as long as <math>(q,P,t)</math> lies in <math>\Omega</math>.
+There are other possible choices of the old and new variables on which the generating function depends (Goldstein,1981).
+Following notation of Goldstein, we are concerned with <math>F_1(q,Q,t)</math> and <math>F= F_2(q,P,t)</math>,
+for which the  equations are as follows:
+::<math F1>
+     {\mathbf p }\dot{\mathbf q}-H={\mathbf P}\dot{\mathbf Q}-K+dF_1/dt, \quad p= F_{1q},\quad P=-F_{1Q}\ ,
+</math>
+::<math F2>
+     {\mathbf p}\dot{\mathbf q}-H=-{\mathbf Q}\dot{\mathbf P}-K+dF_2/dt, \quad p=F_{2q},\quad Q=F_{2P}\ .
+</math>
+In each case <math>K=H+F_{it}</math> and the determinant of second derivatives of <math>F_i</math>
+should be non-zero, as in (<ref>nonsing</ref>).
+One can show that the transformation defined by any generator with requisite smoothness is symplectic, which means that its
+Jacobian matrix <math>M=\{ \partial \Phi_i(z,t)/\partial z_j \}</math> is symplectic for all <math>z</math>. An alternative
+viewpoint is to take symplecticity as the defining property of a canonical transformation.
+==Hamilton-Jacobi Equation and Invariant Tori <label>hjsect</label>==
+We now wish to determine <math>F</math> in such a way that
+<math>K</math> will indeed be independent of <math>Q</math>, thus giving
+(<ref>soln</ref>) as the solution of the transformed equations.
+Demanding that form of <math>K</math>, and substituting
+(<ref>canona</ref>) in (<ref>canonc</ref>) we have
+<math hj>
+     H(q, F_q(q,P,t),t)+ F_ t(q,P,t)=K(P,t)\ ,
+</math>
+which is the Hamilton-Jacobi equation for the type-2 generator.  Here
+<math>P</math> is regarded as a parameter; the independent variables of the PDE are
+<math>q</math> and <math>t</math>.  A solution of (<ref>hj</ref>) depending on <math>n</math> parameters
+<math>P_i</math> and such that <math>\det F_{qP}\ne 0</math> was called a
+'''complete solution''' (Vollstaendige Loesung) by Jacobi; in his case <math>K=0</math>.  As we
+have seen, it determines a canonical transformation.
+Equation (<ref>hj</ref>) is clearly a necessary condition on the
+generator of a transformation for which the new Hamiltonian is
+independent of <math>Q</math>.  But can it be used ''to construct''
+such a generator?  For this a first question is the status of the
+function <math>K(P,t)</math>.  Can it be chosen arbitrarily or is it
+somehow determined in the course of the solution of (<ref>hj</ref>)?
+There is one choice of <math>K</math> which can be made freely at the
+start, and which leads to an important solution, namely <math>K=
+</math>.  This is the case considered by Hamilton and Jacobi, which we
+shall discuss in the following section.
+To illustrate the situation with non-zero <math>K</math>, take the case
+of a time-independent Hamiltonian <math>H(z)</math> and look for a
+solution in which <math>K</math> and <math>F</math> are also
+time-independent.  Take polar coordinates <math>(q,p)=(\phi,I),
+\ \ (Q,P)=(\psi,J),\ \ \phi,\psi\in [0,2\pi],\ \ I,J\in [0,\infty)</math>. Also,
+define <math>G</math> so that <math>F(\phi,J)=\phi\cdot
+J+G(\phi,J)</math>, where the first term on the right gives the identity
+transform.  Then the Hamilton-Jacobi equation to solve for
+<math>G</math> is
+::<math hjpolar>
+     H(\phi,J+G_\phi(\phi,J))=K(J)\ ,
+</math>
+and the equations (\ref{canona}) and (\ref{canonb}) defining the transformation are
+::<math polara>
+       I=J+G_\phi(\phi,J)\ ,
+</math>
+::<math polarb>
+      \psi=\phi+G_J(\phi,J)\ .
+</math>
+If <math>G</math> satisfies (<ref>hj</ref>) for some function
+<math>K(J)</math>, then <math>J</math> is constant and (\ref{polara})
+represents an invariant torus in phase space.  The new angle variable
+<math>\psi</math> advances linearly in time, according to (<ref>soln</ref>).
+Now consider a perturbed integrable system with Hamiltonian
+::<math pertint>
+      H(\phi,I)=H_0(I)+\epsilon V(\phi,I)\ ,
+      </math>
+which satisfies a condition of non-degeneracy
+::<math nondegen>
+      \det\ \nu_I(I)\ne 0,\quad \nu(I)= H_{0I}(I) .
+      </math>
+Next rearrange (<ref>hjpolar</ref>) to subtract the first terms of the Taylor series of <math>H_0(J+G_\phi)</math>:
+::<math pertform>
+      -\nu(J)\cdot G_\phi=\epsilon V(\phi,J+G_\phi)+\big[ H_0(J+G_\phi)-H_0(J)-\nu(J)\cdot G_\phi\big]      +\big[ H_0(J)-K(J)\big] \ .
+      </math>
+The sum of the terms in the first square bracket  is <math>\mathcal{O}(G_\phi^2)</math> and therefore small if the transformation (<ref>polara</ref>,<ref>polarb</ref>) is close to the identity.  Introducing the
+Fourier series
+::<math fourier>
+      G_\phi(\phi,J)= \sum_{m\in Z^n} im\ g_m(J)\exp(im\cdot\phi)\ ,
+      </math>
+and taking the Fourier transform of (<ref>{pertform}) we have
+::<math hjfourier>
+       g_m(J)=\frac{i}{m\cdot\nu(J)}\frac{1}{(2\pi)^n}\int_{T^n}
+      \exp(-im\cdot\phi)\big[\epsilon V(\phi,J+G_\phi) +
+         H_0(J+G_\phi)-H_0(J)-\nu(J)\cdot G_\phi\big]d\phi,\quad m\ne {\mathbf 0}\ .
+      </math>
+Since <math>G_\phi</math> does not contain the zero mode, the
+set of equations (<ref>hjfourier</ref}) for all <math>m\ne {\bf 0}</math> is a closed system for
+the Fourier coefficients <math>g_m,\ m\ne{\mathbf 0}</math>. If a solution of this system
+is known for some <math>J</math>, then we have solved the projection of (<ref>hjpolar</ref>) onto
+every mode except the zero mode. We can then solve also the zero mode projection
+simply by defining <math>K</math> as the average of the l.h.s.:
+ ::<math>
+      K(J)=\frac{1}{(2\pi)^n}\int_{T^n}d\phib\big[H_0(J+G_\phi)+
+      \epsilon V(\phi,J+G_\phi)\big]\ .
+      </math>
+The zero mode amplitude <math>g_{\mathbf 0}</math> can be chosen arbitrarily,
+for instance put equal to zero. Thus we get some understanding  of how
+the PDE (<ref>hjpolar</ref>) could be solved without a prior knowledge of its right hand side.
+At first sight Eq.(<ref>hjfourier</ref>) would seem to be a straightforward fixed
+point problem that might be solved by some kind of iteration, provided that
+the divisor <math>m\cdot\nu(J)</math> could be bounded away from zero through an appropriate
+choice of <math>J</math>. The iteration might be started by keeping only the term <math>\epsilon V</math>, which
+gives lowest order perturbation theory. If the series (<ref>fourier</ref>) is truncated, then the problem
+can indeed be approached in that way, and (<ref>hjfourier</ref>) provides a practical
+method for computing approximate invariant tori (WArnock-Ruth,). The exact
+problem requires the refined method of KAM theory to control small divisors
+(Gallavotti, Poeschel). The theory ensures the existence of invariant tori for sufficiently
+small <math>\epsilon</math> but they are not continuous functions of <math>J</math>. Rather, they exist only
+on a Cantor set in <math>J</math>-space, and the concept of complete solution does not apply
+in a literal sense (Poeschel).