The central question of this lesson is short to ask: what is the state of a mechanical system, the smallest amount of information we must give, at one instant, to fix its entire future (and past) motion? Since Newton's law is second order, a position alone is not enough: we also need a velocity. Making that pair a single point in a space of states turns the laws of motion into a flow and, when energy is conserved, every possible motion (dynamics) can be read off as a level curve (geometry).
The goal. We want the state of a mechanical system: the least information we must give, at one instant, to fix its whole future (and past). Newton's law is second order, so a position alone is not enough; we also need a velocity. We will first answer this question in the simplest case of one particle in a one-dimensional system; its generalization to many particles and higher spacial dimension is the subject of a later lesson.
From last time. The previous lesson built up conservative forces, which admit a potential \(F(q)=-dU/dq\) and the conservation of energy \(E=\half m\dot q^2+U(q)\), with two concrete examples we carry straight over: the simple pendulum, whose gravitational (height) potential is \(U(\theta)=mgL\,(1-\cos\theta)\), and the harmonic oscillator, \(U(q)=\half m\omega^2 q^2\). Both will return below in the discussion of phase portraits.
Take a single particle on a line. Pairing its position \(q\) with the momentum \(p=m\dot q\) puts position and velocity on the same footing, and the second-order law \(m\ddot q=F(q)\) splits into a pair of first-order equations,
Nothing has been solved: we have merely traded one second-order equation for two first-order ones. The gain is conceptual.
We keep the momentum \(p=m\dot q\) rather than the velocity because it is what is conserved in collisions and the natural partner of \(q\) in every later formulation of mechanics; here the two differ only by the factor \(m\).
A useful freedom. The coordinate \(q\) need not be a Cartesian position: any variable that fixes the configuration will do (an angle, a separation, a normal-mode amplitude), with \(p\) the momentum conjugate to that choice. The pendulum below is the first example, its natural coordinate being the angle \(\theta\). Choosing such coordinates well is the point of the Lagrangian formulation later in the course.
Examples. Already in one dimension the phase space need not be a flat plane; its shape is fixed by the configurations the system possesses. For the harmonic oscillator the position \(q\in\RR\) and \(p\in\RR\) are unrestricted, so \(\GG=\RR^2\); for the pendulum the angle \(\theta\) and \(\theta+2\pi\) are the same configuration, so the positions close into a circle and \(\GG=S^1\times\RR\), a cylinder.
Finally, write the equations of motion compactly as
At every point of phase space the dynamics attaches a vector \(\vb V(x)\): a vector field, the phase velocity, like a stationary wind across \(\GG\). A solution is a curve \(x(t)\) everywhere tangent to this field, a phase curve (also called an orbit, or a trajectory). We now read this off in two examples; what makes the field so powerful, the existence–uniqueness theorem, we keep for the end.
Last lesson we solved the oscillator outright: with \(U=\half m\omega^2 q^2\), Newton's law \(\ddot q=-\omega^2 q\) has the solution
\[ q(t)=q(0)\cos\omega t+\frac{\dot q(0)}{\omega}\sin\omega t. \]Let us read that solution in the phase plane. Let us first take the simplest situation, release from rest at \(q(0)=q_0\): then \(\dot q(0)=0\), implying the initial momentum is \(p(0)=0\), and
\[ q(t)=q_0\cos\omega t,\qquad p(t)=m\dot q(t)=-m\omega q_0\,\sin\omega t. \]As \(t\) advances, the phase point \(x(t)=(q(t),p(t))\) visits in turn
\[ \begin{array}{c|cccc} t & 0 & \pi/2\omega & \pi/\omega & 3\pi/2\omega\\ \hline x=(q,p) & (q_0,\,0) & (0,\,-m\omega q_0) & (-q_0,\,0) & (0,\,+m\omega q_0) \end{array} \]right, bottom, left, top: the state runs clockwise around an ellipse and closes at \(t=2\pi/\omega\). This is no accident, and it holds for any initial state \(x_0=(q_0,p_0)\). In the generic case, the solution is \(q(t)=q_0\cos\omega t+\tfrac{p_0}{m\omega}\sin\omega t\) and \(p(t)=p_0\cos\omega t-m\omega q_0\sin\omega t\); substituting into the energy and using \(\cos^2(\omega t)+\sin^2(\omega t)=1\), the time cancels and
\[ E\big(q(t),p(t)\big)=\frac{p(t)^2}{2m}+\half m\omega^2 q(t)^2=\frac{p_0^{\,2}}{2m}+\half m\omega^2 q_0^{\,2}, \]the same at every instant: the energy is conserved along every solution, for every \((q_0,p_0)\). Each trajectory is therefore confined to a level set \(E(q,p)=\text{const}\), the equation of an ellipse,
\[ \frac{p^2}{2m}+\half m\omega^2 q^2=E\quad\Longleftrightarrow\quad \frac{q^2}{2E/(m\omega^2)}+\frac{p^2}{2mE}=1. \]And here is the point we keep: we did not really need the solution. Conservation of energy alone forces every trajectory onto a level curve \(E(q,p)=\text{const}\); the whole nested family of them, one ellipse per energy, run clockwise about the origin, is the phase portrait (i.e., the family of all possible trajectories). Its semi-axes are \(\sqrt{2E/(m\omega^2)}\) along \(q\) and \(\sqrt{2mE}\) along \(p\); where the ellipse meets the \(q\)-axis the particle is momentarily at rest (the turning points, \(U(q)=E\)), and at the bottom of the well the motion is all kinetic. The centre \((0,0)\) is the equilibrium: the particle at rest at the minimum.
Explore it. Pick a system and click in the lower (phase) plane to release a trajectory. The upper panel is the potential \(U(q)\) with the orbit's energy as a dashed line; the panel below the phase plane shows the actual physical motion of a spring following the harmonic motion. Two additional examples are considered: the pendulum and a particlein a double potential well potential.
A phase portrait carries less information than an explicit solution (it has discarded the clock, so we no longer know when the particle is where) but in another sense far more, for it displays all motions at once, one curve per energy.
The pendulum shows the method at its best. The plane pendulum of length \(\ell\) obeys the nonlinear equation
\[ \ddot\theta+\omega_0^2\sin\theta=0,\qquad \omega_0^2=\frac{g}{\ell}, \]which has no solution in elementary functions. An exact solution does exist in terms of Jacobi elliptic functions, but it is the portrait, not the formula, that we are after. With \(U(\theta)=\omega_0^2(1-\cos\theta)\) (and \(p=\dot\theta\), the momentum conjugate to \(\theta\) in units where the moment of inertia \(m\ell^2=1\)) the conserved energy gives the level curves
\[ p(\theta)=\pm\sqrt{2\big(E-\omega_0^2(1-\cos\theta)\big)}, \]read straight off the corrugated potential (select Pendulum in the explorer). Three kinds of motion appear:
Finally, \(\theta\) and \(\theta+2\pi\) are the same configuration, so the left and right edges of the portrait are identified: the phase plane becomes the cylinder \(\GG=S^1\times\RR\) of the Examples in §1. On it a rotation is a loop that wraps around and closes, a libration a small loop that does not, and the two saddles at \(\theta=\pm\pi\) are a single point on the back seam.
In both examples we quietly relied on something: distinct trajectories never meet, and the pendulum's separatrix only approaches the upright point without ever crossing. This is no accident; it is the content of one theorem, the foundation under every portrait we drew (see Appendix A for a proof).
The structural fact that makes a phase portrait legible is that orbits never cross:
Two further remarks round this out. Determinism: the present state fixes the entire future and, running the flow backward, the entire past. Equilibria: a point \(x^\star\) with \(\vb V(x^\star)=0\) is a motionless state, a trajectory by itself, which for the setup at hand means \(p=0\) and \(F(q)=0\), i.e. \(q\) at a critical point of the potential.
More precisely, the theorem gives a unique solution \(x(t)\). Adjoining the time axis to phase space forms the extended phase space \(\RR_t\times\GG\), in which the solution sits as its graph \(t\mapsto(t,x(t))\), an integral curve; distinct integral curves never meet. The orbit the portrait draws is this graph's projection onto \(\GG\), and orbits avoid crossing only because the field is autonomous; for a time-dependent \(\vb V(x,t)\) the projections may cross while the graphs stay disjoint, the clean picture being restored there by the autonomous extension \((\dot x,\dot t)=(\vb V,1)\).
Next lesson we will show that the same reading works for any one-dimensional conservative system, with no solving: every trajectory lies on a level curve of the energy,
so the whole portrait is read straight off the graph of \(U\): the motion confined to \(U(q)\le E\), every minimum of \(U\) a centre (expanded, a hidden harmonic oscillator of frequency \(\omega=\sqrt{U''/m}\)), every maximum a saddle with its separatrix. The double well already waiting in the explorer above is the first example, two centres and the saddle between them. Turning that qualitative picture into numbers, the period \(T(E)\) of a closed orbit, the energy method for an arbitrary \(U(q)\), and the way several wells partition the plane, is where we will pick up.
We asked what the state of a system is, and we ended able to read every motion of a one-dimensional system off the geometry of a single picture, without solving a thing: equilibria, oscillations, and the boundary between swinging and rotating, all visible at a glance. That picture is not merely convenient; it is the natural home of mechanics and of much of physics beyond it. The phase space will lead later in the course to:
Appendix
We prove the existence–uniqueness theorem of §4 in any dimension, \(x\in\RR^{n}\). In fact, the hypothesis can be relaxed: it is necessary that \(\vb V\) is Lipschitz near \(x_0\), \(\|\vb V(x)-\vb V(y)\|\le L\|x-y\|\); this is automatic when \(\vb V\) is \(C^1\), since a bounded derivative supplies the constant \(L\).
When \(\GG\) is a manifold rather than \(\RR^n\), as for the pendulum's cylinder \(S^1\times\RR\), the statement still holds: it is local, every patch of the cylinder looks like an open piece of \(\RR^2\), the argument below runs there unchanged, and the identification \(\theta\sim\theta+2\pi\) only governs how trajectories close up, never their local existence or uniqueness.
Proof of the theorem. Throughout, \(x_0\in\RR^n\) denotes the fixed initial point, and curves are compared in the sup-norm \(\|x\|=\max_{t\in I}\|x(t)\|\). A continuous \(x(t)\) solves the IVP iff it solves the integral equation
\[ x(t)=x_0+\int_{t_0}^{t}\vb V(x(s))\,\dd s =: (Tx)(t). \]For one direction, integrate \(\dot x=\vb V(x)\) from \(t_0\) to \(t\): the left side is \(x(t)-x(t_0)\) by the fundamental theorem of calculus, which gives the equation. For the other, differentiate the equation to recover \(\dot x=\vb V(x)\), and put \(t=t_0\) to make the integral vanish and get back \(x(t_0)=x_0\). The integral form is the convenient one, since it folds the initial condition into a single equation and makes sense for any merely continuous \(x\). A solution is therefore exactly a fixed point of the Picard operator \(T\). Fix a closed ball of radius \(r\) about \(x_0\) on which \(\vb V\) is bounded, \(\|\vb V\|\le\mu\), and Lipschitz, \(\|\vb V(x)-\vb V(y)\|\le L\|x-y\|\); the curves \(M=\{x\in C(I;\RR^n):\|x-x_0\|\le r\}\) on \(I=[t_0-h,t_0+h]\), measured in the sup-norm \(\|x\|=\max_{t\in I}\|x(t)\|\), form a complete space (because \(\RR^n\) is). Choosing \(h\le r/\mu\) makes \(T:M\to M\).
Contraction. Here, and only here, the Lipschitz hypothesis is used:
\[ \big\|(Tx)(t)-(Ty)(t)\big\|\le\int_{t_0}^{t} L\,\|x(s)-y(s)\|\,\dd s\le L\,h\,\|x-y\|, \]so for \(h<1/L\), \(\kappa:=Lh<1\) and \(T\) is a contraction. The contraction-mapping lemma below (Banach–Caccioppoli) then gives the unique fixed point, the unique local solution.
A remark. The Picard iterates are explicit and in fact converge on the whole interval, not merely a short one: from the constant curve \(u_0(t)\equiv x_0\), \(\|u_{n+1}(t)-u_n(t)\|\le \mu L^n|t-t_0|^{n+1}/(n+1)!\), and the factorial (the series that builds \(e^{Lt}\)) converges for every \(t\). ∎
It remains to establish the tool we invoked.
This lemma has a more general form, valid for contractions on any complete metric space; here we use only the Euclidean case.
Proof of the lemma. Fix any curve \(u_0\) and iterate \(u_{n+1}=Tu_n\). Each step contracts, \(\|u_{n+1}-u_n\|=\|Tu_n-Tu_{n-1}\|\le\kappa\,\|u_n-u_{n-1}\|\le\kappa^{n}\,\|u_1-u_0\|\), so for \(m>n\) the triangle inequality and the geometric series give
\[ \|u_m-u_n\|\le\sum_{k=n}^{m-1}\|u_{k+1}-u_k\|\le\|u_1-u_0\|\sum_{k=n}^{m-1}\kappa^{k}\le\frac{\kappa^{n}}{1-\kappa}\,\|u_1-u_0\|, \]which tends to \(0\) as \(n\to\infty\). The iterates therefore form a Cauchy sequence (their terms get arbitrarily close to one another), and by completeness of the curve space they converge to some curve \(u_\star\). A contraction is continuous, \(\|Tu_n-Tu_\star\|\le\kappa\,\|u_n-u_\star\|\to0\), so letting \(n\to\infty\) in \(u_{n+1}=Tu_n\) gives \(u_\star=Tu_\star\): a fixed point. It is the only one, since two fixed points would obey \(\|u_\star-v_\star\|=\|Tu_\star-Tv_\star\|\le\kappa\,\|u_\star-v_\star\|\), impossible for \(\kappa<1\) unless \(u_\star=v_\star\). ∎