The stable manifold theorem is one of the most important in the theory of non-linear ODEs and dynamical systems. Unfortunately, some of the standard introductory texts (Hirsch–Smale, Perko) either do not give a proof, or do not motivate the proof, while more advanced texts (Katok–Hasselblatt, Barreira–Pesin) are too high-powered to be appropriate in the setting of an introductory graduate course in ODEs, such as the one I am teaching now. So I’m taking this opportunity to turn my hastily scrawled hand-written notes on a (hopefully) properly-motivated proof into something that will be legible to myself and others on a more permanent basis.
The form of the theorem we will prove is this: let be an open domain and the flow of a vector field . Suppose that is an equilibrium point for and let be the stable subspace for (the span of all generalised eigenvectors corresponding to eigenvalues with negative real part). Let be the centre-unstable subspace for (corresponding to eigenvalues with zero or positive real part). Then there exists and a function such that the set
has the following properties:
- is positively invariant;
- given an initial condition , we have .
(Note that if has an equilibrium point , we can make the change of coordinates and proceed as stated above. The choice simplifies the notation.)
Proof: We present a proof using Perron’s method — this is essentially the proof given in Perko’s book, but with significantly more motivation and a more abstract viewpoint. The idea is to use the same general strategy that worked well for us in the proof of the Picard–Lindelöf theorem on local existence and uniqueness of solutions: prove existence and uniqueness of something by obtaining as the unique fixed point of a contraction on a complete metric space. In the Picard–Lindelöf theorem, the space was the space of approximate solutions, and the contraction was the Picard operator that transformed a candidate solution into something even closer to being a solution. Here we need to consider a slightly different space and operator.
- We will consider the space of curves that approach the fixed point with a certain exponential rate, as shown below. In particular, for each we will consider the set of such curves that begin at for some . As before, we do not assume a priori that these curves are trajectories of the ODE.
- We will define an integral operator on this space whose fixed points are solutions of the ODE. Then an application of the Banach fixed point theorem completes the proof.
To this end, let be the linear part of at the fixed point, and let and be the restrictions of to the stable subspace and centre-unstable subspace, respectively. Let be the maximum value of for eigenvalues of , and fix constants , as shown below. Then all eigenvalues of lie strictly to the left of and all eigenvalues of lie strictly to the right of . In particular, we can fix a norm on with the property that
for all . (Note that the second of these does not give decay to , but does give some control on growth.)
This is a complete metric space with distance given by
(The reader familiar with Banach spaces will note that is in fact a Banach space.)
Fixing , we consider the subspace . Here and throughout we will write and for the parts of lying in the stable subspace and centre-unstable subspace, respectively, so that . The goal is to find, for each sufficiently small , some that is a trajectory of the ODE . Then we can put and conclude that has the properties claimed in the theorem. (In fact we will not prove that is , which takes some more work, but positive invariance and exponential stability will follow from what we show.)
Having defined our metric space , we need to define an operator on it whose fixed points are trajectories of the system. As in the Picard–Lindelöf theorem we would like to define an integral operator. We cannot simply use the one defined there () for at least two reasons:
- we need to preserve the property , which this does not;
- we will need to have the possibility that , otherwise the operator will preserve the subsets of on which is constant, and in particular each of these subsets could have a solution of the ODE, but the whole point is that should only contain one value of corresponding to a solution.
Recall the idea from variation of constants: treat the ODE as a perturbation of a linear system, and write down an expression that is constant for the linear system. Then differentiating this expression gives the effect of the perturbation at each time, and integrating yields an expression for the solution of the full non-linear system. In other words, we write , where , so that the ODE becomes
If then the solution is , and in particular is constant. So we differentiate this expression and use (3) to get
Integrating, we see that (3) is equivalent to
Now by decomposing everything into the part lying in and the part lying in , we see that is a solution of (3) if and only if we have
for some (and hence every) . The reason for using non-zero values of in the second equation is that we have good control on when but not when , and so by sending we may observe that
then if and only if solves (3). One may also understand the motivation behind sending as follows: for a fixed , we expect the first term from (6) to grow exponentially as , which makes it difficult to verify the property . By eliminating this term (and the corresponding growth in the integral), we obtain an expression that is more tractable in the limit .
Now we show that maps to itself, and that it is a contraction, which will complete the proof by an application of the Banach fixed point theorem. First we observe that because the non-linear part of the vector field is with , it is Lipschitz on small neighbourhoods of , and the Lipschitz constant can be made arbitrarily small by making the neighbourhood small enough. More precisely, for every there exists such that if then .
Now we make a computation that shows both claims in the previous paragraph. Fix and let be as above. Recall that by the definition of we have for all , and write . Then
Using the fact that and , we obtain
Writing and recalling the definition of in (2), this gives
Given , by putting we see that
By choosing small enough we can guarantee that , and hence is a contraction. Thus it has a unique fixed point . This is a trajectory of the ODE (3) which approaches exponentially (with rate at least ) and has , so we put and conclude that has the properties claimed in the statement of the theorem.
A few remarks are in order. The proof here follows the proof in Perko’s book (p. 107–111), but includes more abstract language and more complete motivation. We have shown that exists and its graph is positively invariant with trajectories converging exponentially to . One can also show that is and , so that is tangent to . This proof is not in Perko’s book — he refers to the text of Coddington and Levinson (p. 332–333). If I feel ambitious I may try to include this part of the proof at a later point.
The proof given above also shows that is the only trajectory with that converges exponentially to with rate at least . In particular, any such trajectory lies in the stable manifold . However, it is possible that there are trajectories lying outside of that converge to more slowly — for example, there may be trajectories converging to with subexponential speed along the centre direction .
Ultimately, the only property of the subspace that we used in the proof was the fact that all eigenvalues of have real part strictly greater than . In fact, the entire proof above is valid if we replace with a subspace such that
- ; and
- there exists such that every eigenvalue of has real part strictly less than , and every eigenvalue of has real part strictly greater than .
In this setting, we still get a stable manifold, which is often called a strong stable manifold and denoted , because it corresponds to the directions in which the contraction is the strongest. There may be trajectories lying outside of that approach exponentially, but the rate at which they do so is less than .