The Extreme Value Theorem: a bootstrap approach

At the very end of last week I was teaching the first-year mathematics students about the Extreme Value Theorem.

Here is a reminder of what the theorem says.

Theorem (Extreme Value Theorem)
Let a,b be real numbers with a<b, and let f:[a,b]R be continuous. Then there exist points c and d in [a,b] such that, for all x[a,b], we have f(c)f(x)f(d).

Here we have

f(c)=min{f(x):x[a,b]}=inf{f(x):x[a,b]}

and 

f(d)=max{f(x):x[a,b]}=sup{f(x):x[a,b]}.

Unfortunately I only had a few minutes left for this, ran out of time, and didn't finish the proof. As I am on a very tight schedule this year, I had to refer the students to the the typed notes for the rest of the proof. On the other hand, I did finish the first part of the proof, showing that a continuous real-valued function f on a closed and bounded interval [a,b] is always bounded above, i.e., the set f([a,b]) is bounded above.

In this post I want to see how you can use just this fact to prove the rest of the Extreme Value Theorem without having to repeat aspects of the proof.

I will take the following lemma as my starting point.

Lemma
Let a,b be real numbers with a<b, and let f:[a,b]R be continuous. Then there exists M>0 such that, for all x[a,b], we have f(x)M.

Our first step is to see that we can use this lemma to show that f is bounded below.

Corollary
Let a,b be real numbers with a<b, and let f:[a,b]R be continuous. Then there exists M>0 such that, for all x[a,b], we have f(x)M.

We can prove this Corollary by applying the Lemma to either of the functions f or |f|, where the latter function is the function x|f(x)|.

Proof of the Extreme Value Theorem

We already know that the set f([a,b])={f(x):x[a,b]} is bounded (i.e., both bounded below and bounded above). We now prove the existence of a suitable point d. (The proof for c is the same, but flipped as usual.)

Set S=supf([a,b])=sup{f(x):x[a,b]}R. We need to show that there exists a point d[a,b] such that f(d)=S. Assume, towards a contradiction, that there is no such point d. In that case, since S is an upper bound for f([a,b]), we must actually have f(x)<S for all x[a,b]

Consider the function g:[a,b]R defined by g(x)=1/(Sf(x)). Then, since f(x)<S for all x[a,b], g is a well-defined, continuous, real-valued function on [a,b], and moreover g(x)>0 for all x[a,b].

By our Lemma, g is bounded above, i.e., there exists M>0 such that, for all x[a,b], we have g(x)M.

Since these real numbers are strictly positive, we can take reciprocals and obtain, for all x[a,b], that 1/g(x)1/M, i.e., Sf(x)1/M. Note that 1/M>0. Rearranging we have, for all x[a,b], that

f(x)S1M.

But then S1/M is an upper bound for f([a,b]), and S1/M<S, which contradicts the fact that S is the least upper bound for f([a,b]). (That is the definition of supremum!)

We have obtained a contradiction, and so there must be a point d[a,b] with f(d)=S, as required.

The proof for c is similar, or you can apply the result about maximum to the function f to find a point c where f attains its maximum value, and then f attains its minimum value there.

Comments

Popular posts from this blog

A rule of thumb for when to use the Ratio Test

An introduction to the Hahn-Banach extension theorem: Part VI

An introduction to the Weierstrass M-test: Part I