(Optional) Cantor’s first proof of the uncountability of the reals

Section 12.6 (Optional) Cantor’s first proof of the uncountability of the reals

For completeness we include Cantor’s first proof that \(|\mathbb{N}| \lt |\mathbb{R}|\text{.}\) This, proved in 1874, is more involved than his very famous diagonal argument which was published in 1891. Despite this it is still accessible to the tools developed in this text.

🔗

Like the diagonal argument, this first proof also is a proof by contradiction. We assume the existence of a bijection from \(\mathbb{N}\) to an interval of the real-line and then show that this leads to a contradiction. However, rather than relying on decimal expansions of real numbers, it instead relies on the supremum — the least upper bound property. The reader should take a moment to examine Exercise 8.6.15 and Exercise 8.6.16, and also revise Section 6.4 before continuing.

🔗

Axiom 12.6.1. Least upper bound property of the reals.

Let \(A \subseteq \mathbb{R}\) be bounded above. That is, there is some \(M \in \mathbb{R}\) so that \(a \leq M\) for all \(a \in A\text{.}\) Then the supremum of \(A\) exists and is a real number.

🔗

This is the critical difference between the reals and the rationals — the rationals do not satisfy the least upper bound property. For example, one can take the set of truncated decimal expansions of \(\sqrt{2}\text{:}\)

\begin{align*} \set{1.0, 1.4, 1.41, 1.414, 1.4142, \dots} \amp = \set{\frac{1}{1}, \frac{14}{10}, \frac{141}{100}, \frac{1414}{1000}, \frac{14142}{10000}, \dots}\\ \amp= \set{ \lfloor 10^n \sqrt{n} \rfloor \cdot 10^{-n} \st n \in \mathbb{N}}\\ \amp \subseteq \mathbb{Q} \end{align*}

The least upper bound of this set is \(\sqrt{2}\) which is not a rational number 11.2.6.

🔗

We can use the least upper bound property to prove

¹⁷¹

This is a nice and intuitive result; in fact it is such a nice result we set it as an exercise 8.6.17.

that increasing sequences of real numbers that are bounded above must converge to a real limit.

🔗

Lemma 12.6.2.

Let \((a_n)\) be a sequence of real numbers that

is bounded above by some real number \(M\text{,}\) and

🔗
satisfies \(a_k \leq a_{k+1}\) for all \(k\text{.}\)

🔗

Then the sequence \(a_n\) converges as \(n \to \infty\text{.}\)

🔗

Proof.

See Exercise 8.6.17.

🔗

Now we are ready to get into the proof. Let \([a,b]\) be an interval of the real line with \(a \lt b\text{.}\) Then assume to the contrary, that there is a bijection

\begin{equation*} g: \mathbb{N} \to [a,b] \end{equation*}

This bijection defines an infinite sequence via \(x_n = g(n)\) for any \(n \in \mathbb{N}\text{.}\) Notice that all the terms of this sequence must be distinct since \(g\) is injective.

🔗

The proof works by finding some \(q \in [a,b]\) so that \(q\) is not part of the sequence. Hence there is no \(n \in \mathbb{N}\) so that \(g(n)=q\text{,}\) and so \(g\) is not surjective and hence not bijective.

🔗

We construct two new sequences \((a_n)\) and \((b_n)\) from the sequence \((x_n)\text{.}\)

Find the first two \(x_n\) lying strictly inside the interval, that is \(a \lt x_n \lt b\text{.}\) These two terms must be distinct and let \(a_1\) be the smaller and \(b_1\) be the larger.

🔗
Similarly, find the first two terms of the sequence that lie inside \((a_1,b_1)\) — call the smaller \(a_2\) and the larger \(b_2\text{.}\)

🔗
Keep repeating this to define \((a_3,b_3)\text{,}\) \((a_4,b_4)\) and so on.

🔗

Notice that

for any \(n\) we must have \(a_n \lt b_n\text{,}\) and

🔗
the sequences of \(a\)’s and \(b\)’s satisfy

\begin{equation*} a \lt a_1 \lt a_2 \lt a_3 \lt \dots \lt b_3 \lt b_2 \lt b_1 \lt b \end{equation*}

and so the sequence \((a_n)\) is increasing and bounded above by \(b\text{,}\) while the sequence \((b_n)\) is decreasing and bounded below by \(a\text{.}\)

🔗

🔗
the sequences of \(a\)’s and \(b\)’s may or may not be infinite.

🔗
for any \(n\text{,}\) the terms \(x_1,x_2,\dots, x_{2n}\) do not lie inside the interval \((a_n,b_n)\text{.}\)

🔗

This last point can be proved quite readily using induction on \(n\text{.}\)

🔗

Lemma 12.6.3.

The interval \((a_n,b_n)\) does not contain the sequence terms \(x_1,x_2,\dots,x_{2n}\text{.}\)

🔗

Proof.

We prove the result by induction on \(n\text{.}\)

Notice that \(x_1,x_2\) must belong to \(\set{a,b,a_1,b_1}\) (by construction). Thus \(x_1,x_2 \not\in (a_1,b_1)\) since the open interval exludes the endpoints.

🔗
Now assume that the result holds for \(n=k\text{,}\) and so \(x_1,x_2,\dots x_{2n}\) are lie outside the interval \((a_n,b_n)\text{.}\) Then either \(x_{2n+1}, x_{2n+2}\) lie outside this interval, or, if they lie inside the interval they are one or both of \(a_{n+1}, b_{n+1}\text{.}\) Consequently they lie outside \((a_{n+1},b_{n+1})\text{.}\)

🔗

Hence the result holds for all \(n \in \mathbb{N}\text{.}\)

🔗

The sequences \((a_n), (b_n)\) may or may not be infinite. So initially assume that they are finite. That is, the final interval is \((a_N, b_N)\text{.}\) Now notice that there cannot be two or more \(x_n\) inside the interval \((a_N,b_N)\) because otherwise we would use those to define another interval \((a_{N+1}, b_{N+1})\text{.}\) Thus there are either zero or one more term of the sequence \((x_n)\) lying inside \((a_N,b_N)\text{.}\) But this means that any other number inside \((a_N,b_N)\) is not part of the sequence \((x_n)\text{.}\) That is, there is some \(q \in (a_N,b_N)\) so that \(q \neq x_n = g(n)\) for any \(n \in \mathbb{N}\text{.}\)

🔗

Now instead, assume that the sequences of \(a\)’s and \(b\)’s are infinite. Since the sequence of \(a\)’s is increasing and bounded above by \(b\text{,}\) Lemma 12.6.2 implies that it converges to a limit. The same argument (applied to \(-b_n\)) shows that the sequence of \(b\)’s also converges to a limit. So define

\begin{equation*} \lim_{n \to \infty} a_n = \alpha \qquad \lim_{n \to \infty} b_n = \beta. \end{equation*}

Notice that we cannot have \(\beta \lt \alpha\) (otherwise we would have \(a_k \gt b_k\) for some \(k\)), and so either \(\alpha \lt \beta\) or \(\alpha = \beta\text{.}\)

If \(\alpha \lt \beta\text{,}\) then every \(q \in (\alpha, \beta)\) is not in the sequence of \(x\)’s. If \(q = x_n\) for some \(n\text{,}\) then this would contradict Lemma 12.6.3 above, since it would imply that \(q = x_n \in (\alpha,\beta) \subseteq (a_n,b_n)\text{.}\)

🔗
On the other hand, if \(\alpha=\beta\text{,}\) then \(q = \alpha\) is not in the sequence of \(x\)’s. If \(q=\alpha=x_n\) for some \(n\text{,}\) then again this would contradict Lemma 12.6.3 since it would imply that \(q=x_n =\alpha \in (a_n,b_n)\text{.}\)

🔗

Thus even when the sequences of \(a\)’s and \(b\)’s are infinite, there is always some \(q \in [a,b]\) that is not in the sequence of \(x\)’s. Hence the function \(g:\mathbb{N} \to [a,b]\) is not a surjection, contradicting our initial assumption that it is bijective.

🔗

Prev Top Next