Sharkovsky’s theorem

This post is based on a talk given by Keith Burns in the UH dynamical systems seminar yesterday, in which he presented a streamlined proof of Sharkovsky’s theorem due to him and Boris Hasselblatt.

1. Background and statement of the theorem

We recall the statement of Sharkovsky’s theorem. The setting is a continuous map {f\colon {\mathbb R}\rightarrow {\mathbb R}}, and one is interested in what dynamical properties {f} must necessarily have if we know that {f} has a periodic orbit of some given least period {p} — for example, does existence of such an orbit force the existence of periodic points with least period {q} for various values of {q}? Or does it force the map to have positive entropy?

The assumptions of continuity and one-dimensionality are vital: for discontinuous one-dimensional maps, or for continuous higher-dimensional maps, it is possible to have a point of least period {p} without having points of any other period or having positive entropy.

In one dimension, on the other hand, one can begin with the following simple observation: if {f} has a periodic point with least period 2, then there is {p} such that {f(p) > p=f^2(p)}, and so applying the intermediate value theorem to {f(x)-x}, the map must have a fixed point.

What if {f} has a periodic point {p} with least period 3? Suppose that {p<f(p)<f^2(p)} (the other configurations are similar), and let {I_1=[p,f(p)]} and {I_2=[f(p),f^2(p)]}. Once again using the intermediate value theorem, we see that {f(I_1) \supset I_2} and {f(I_2) \supset I_1 \cup I_2}. Thus we may consider the topological Markov chain {(X,\sigma)} given by the graph in Figure 1, and observe that it embeds into the dynamics of {f}, so that in particular, {f} has periodic points of every order.

Fig 1

Sharkovsky’s theorem generalises the above observations. Consider the following unorthodox total order on the integers:

\displaystyle  \begin{aligned} 3\prec 5&\prec 7\prec \cdots \\ 6\prec 10&\prec 14\prec \cdots \\ 12\prec 20&\prec 28\prec \cdots \\ &\vdots \\ \cdots \prec 16 &\prec 8 \prec 4 \prec 2 \prec 1 \end{aligned}

Then if {f} has a periodic point of least period {p}, it has a point of least period {q} for every {q\succ p}.

The paper by Burns and Hasselblatt includes a discussion of the history of the theorem and the various proofs that have been given. Here we outline the most streamlined proof available at the present time.

2. Sketch of the proof

Given two intervals {I,J\subset {\mathbb R}}, write {I\rightarrow J} if {f(I)\supset J}. As in the case of a period 3 orbit above, the key will be to construct a collection of intervals {I_j} (whose endpoints are certain points of the period {p} orbit) such that the covering relations {I_j\rightarrow I_k} yield a directed graph with periodic paths of all the desired periods.

To deduce existence of various periodic orbits for {f} from these covering relations, one makes the following observations.

  1. If {f^p(I)\supset I}, then {I} contains a fixed point of {f^p}.
  2. If {I_0 \rightarrow I_1 \rightarrow \cdots \rightarrow I_k}, then {f^k(I_0)\supset I_0}.
  3. The above two observations guarantee existence of {x\in I_0} with {f^k(x)=x}. To guarantee that {x} can be chosen with least period {k}, it suffices to know that some {I_j} to have disjoint interior from all the others, and that the endpoints of this {I_j} do not follow the itinerary given by the sequence of covering relations above.

Detailed proofs of the above observations are in the Burns-Hasselblatt paper. With these tools in hand, the proof comes down to constructing the directed graph of covering relations referred to above.

Fig 2

To this end, consider a periodic orbit {P} with {p} points — the green points in Figure 2. These are permuted by {f}; some are moved to the left, some are moved to the right. Of all the points in the orbit that are moved to the right, let {z} be the rightmost, so that all points in the orbit to the right of {z} are moved to the left. Let {z'} be the next point to the right of {z}, and let {\Delta=[z,z']} as in the figure.

Now consider the intervals {I_k = f^k(\Delta)}. Note that {I_0} has the following property: every point in {P} lying to the left of {\Delta} moves to the right of {\Delta}, and vice versa. Let {I_m} be the first interval that does not have this property, and let {x\in I_m\cap P} be a point that does not switch sides of {\Delta} under the action of {f}. For concreteness we consider the case where {x} is to the left of {\Delta}, as in Figure 2.

By construction, all the points between {x} and {\Delta} are mapped to the right of {\Delta}. Of these, let {y} be the point that is mapped furthest to the right.

Write {R_y} for the interval {[y,\infty)}, and consider the intervals {J_k = I_k \cap R_y} for {0\leq k < m}, together with {J_m = [x,y]}. We claim that {J_0\rightarrow J_0}, {J_m\rightarrow J_0}, and {J_k \rightarrow J_{k+1}} for each {0\leq k<m}, so that we have the covering relations shown in Figure 3, which can be used to establish existence of periodic points with all least periods greater than {m}.

Fig 3

So why do the intervals {J_k} have the covering relations shown?

  1. The fact that {J_0\rightarrow J_0} follows from the fact that {J_0=I_0=\Delta} and the endpoints of {\Delta} switch sides of {\Delta}.
  2. The definition of {I_k} gives {I_k\rightarrow I_{k+1}}.
  3. Writing {I_k^R} for the part of {J_k} lying in or to the right of {\Delta} and {I_k^L} for the part lying in or to the left, we see that {I_k^R\rightarrow I_{k+1}^L} and {I_k^L\rightarrow I_{k+1}^R}; this is because for {0\leq k < m}, all points of {P\cap I_k} switch sides of {\Delta} under the action of {f}. (This is the definition of {m}.)
  4. Using similar notation for {J_k^L} and {J_k^R}, we see that {I_k^R = J_k^R} and {J_k^L \rightarrow I_k^R}; this is because {y} was chosen to be the point that maps furthest to the right.
  5. This establishes that {J_k^L \rightarrow J_k^R}, and similarly {J_k^R \rightarrow J_k^L}.

For {k=m} we see that {f(x)} lies to the left of {\Delta}, and {f(y)} to the right, so that {f(J_m)\supset \Delta=I_0}.

The extra piece of information in the final item of the list above shows that the graph in Figure 3 can be replaced by the one in Figure 4.

Fig 4

Let us sum up what we know. One of the following two cases holds. Case 1: There is some point {x} of the orbit such that {x} and {f(x)} both lie on the same side of {\Delta}. Case 2: Every point of the orbit is mapped to the other side of {\Delta} by {f}.

In Case 1, the arguments above show that for every {q>p} and every even {q<p}, there is an orbit of least period {q}, which is at least as strong as the conclusion in Sharkovsky’s theorem.

In Case 2, one may observe that since {f} permutes the elements of {P} and every element is moved from one side of {\Delta} to the other, {p} must be even. Moreover, by considering {P^L} and {P^R} separately, as invariant sets for {f^2}, we can repeat the argument inductively to complete the proof of the theorem.

About Vaughn Climenhaga

I'm an assistant professor of mathematics at the University of Houston. I'm interested in dynamical systems, ergodic theory, thermodynamic formalism, dimension theory, multifractal analysis, non-uniform hyperbolicity, and things along those lines.
This entry was posted in theorems and tagged . Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s