Skip to main content

Subsection B.5.1 Where Does the Formula for the Area of a Circle Come From?

Typically when we come across \(\pi\) for the first time it is as the ratio of the circumference of a circle to its diameter

\begin{align*} \pi &= \frac{C}{d} = \frac{C}{2r} \end{align*}

Indeed this is typically the first definition we see of \(\pi\text{.}\) It is easy to build an intuition that the area of the circle should be propotional to the square of its radius. For example we can draw the largest possible square inside the circle (an inscribed square) and the smallest possible square outside the circle (a circumscribed square):

The smaller square has side-length \(\sqrt{2} r\) and the longer has side-length \(2r\text{.}\) Hence

\begin{align*} 2 r^2 & \leq A \leq 4r^2 & \text{ or } 2 & \leq \frac{A}{r^2} \leq 4 \end{align*}

That is, the area of the circle is between 2 and 4 times the square of the radius. What is perhaps less obvious (if we had not been told this in school) is that the constant of propotionality for area is also \(\pi\text{:}\)

\begin{align*} \pi &= \frac{A}{r^2}. \end{align*}

We will show this using Archimedes' proof. He makes use of these inscribed and circumscribed polygons to make better and better approximations of the circle. The steps of the proof are somewhat involved and the starting point is to rewrite the area of a circle as

\begin{align*} A &= \frac{1}{2} C r \end{align*}

where \(C\) is (still) the circumference of the circle. This suggests that this area is the same as that of a triangle of height \(r\) and base length \(C\)

\begin{align*} T &= \frac{1}{2} C r \end{align*}

Archimedes' proof then demonstrates that indeed this triangle and the circle have the same area. It relies on a “proof by contradiction” — showing that \(T \lt A\) and \(T \gt A\) cannot be true and so the only possibility is that \(A=T\text{.}\)

We will first show that \(T \lt A\) cannot happen. Construct an \(n\)-sided “inscribed” polygon as shown below:

Let \(p_n\) be the inscribed polygon as shown.

We need 4 steps.

  • The area of \(p_n\) is smaller than that of the circle — this follows since we can construct \(p_n\) by cutting slices from the circle.
  • Let \(E_n\) be the difference between the area of the circle and \(p_n\text{:}\) \(E_n = A - A(p_n)\) (see the left of the previous figure). By the previous point we know \(E_n \gt 0\text{.}\) Now as we increase the number of sides, this difference becomes smaller. To be more precise

    \begin{align*} E_{2n} & \leq \frac{1}{2} E_n. \end{align*}

    The error \(E_n\) is made up of \(n\) “lobes”. In the centre-left of the previous figure we draw one such lobe and surround it by a rectangle of dimensions \(a \times 2b\) — we could determine these more precisely using a little trigonometry, but it is not necessary.

    This diagram shows the lobe is smaller than the rectangle of base \(2b\) and height \(a\) Since there are \(n\) copies of the lobe, we have

    \begin{align*} E_n & \leq n \times 2ab & \text{rewrite as } \frac{E_n}{2} & \leq nab \end{align*}

    Now draw in the polygon \(p_{2n}\) and consider the associated “error” \(E_{2n}\text{.}\) If we focus on the two lobes shown then we see that the area of these two new lobes is equal to that of the old lobe (shown in centre-left) minus the area of the triangle with base \(2b\) and height \(a\) (drawn in purple). Since there are \(n\) copies of this picture we have

    \begin{align*} E_{2n} &= E_n - nab & \text{now use that $nab \geq E_n/2$}\\ & \leq E_n - \frac{E_n}{2} = \frac{E_n}{2} \end{align*}
  • The area of \(p_n\) is smaller than \(T\text{.}\) To see this decompose \(p_n\) into \(n\) isosceles triangles. Each of these has base shorter than \(C/n\text{;}\) the straight line is shorter than the corresponding arc — though strictly speaking we should prove this. The height of each triangle is shorter than \(r\text{.}\) Thus
    \begin{align*} A(p_n) &= n \times \frac{1}{2} \text{(base)}\times \text{(height)}\\ & \leq n \times \frac{Cr}{2n} = T \end{align*}
  • If we assume that \(T \lt A\text{,}\) then \(A-T = d\) where \(d\) is some positive number. However we know from point 2 that we can make \(n\) large enough so that \(E_n \lt d\) (each time we double \(n\) we halve the error). But now we have a contradiction to step 3, since we have just shown that
    \begin{align*} E_n = A-A(p_n) & \lt A-T & \text{which implies that}\\ A(p_n) & \gt T. \end{align*}

Thus we cannot have \(T \lt A\text{.}\)

If we now assume that \(T \gt A\) we will get a similar contradiction by a similar construction. Now we use regular \(n\)-sided circumscribed polygons, \(P_n\text{.}\)

The proof can be broken into 4 similar steps.

  1. The area of \(P_n\) is greater than that of the circle — this follows since we can construct the circle by trimming the polygon \(P_n\text{.}\)
  2. Let \(E_n\) be the difference between the area of the polygon and the circle: \(E_n = A(P_n)-A\) (see the left of the previous figure). By the previous point we know \(E_n \gt 0\text{.}\) Now as we increase the number of sides, this difference becomes smaller. To be more precise we will show

    \begin{align*} E_{2n} & \leq \frac{1}{2} E_n. \end{align*}

    The error \(E_n\) is made up of \(n\) “lobes”. In the centre-left of the previous figure we draw one such lobe. Let \(L_n\) denote the area of one of these lobes, so \(E_n = nL_n\text{.}\) In the centre of the previous figure we have labelled this lobe carefully and also shown how it changes when we create the polygon \(P_{2n}\text{.}\) In particular, the original lobe is bounded by the straight lines \(\vec{ad}, \vec{af}\) and the arc \(\widehat{fbd}\text{.}\) We create \(P_{2n}\) from \(P_n\) by cutting away the corner triangle \(\triangle aec\text{.}\) Accordingly the lines \(\vec{ec}\) and \(\vec{ba}\) are orthogonal and the segments \(|bc|=|cd|\text{.}\)

    By the construction of \(P_{2n}\) from \(P_n\text{,}\) we have

    \begin{align*} 2L_{2n} &= L_n - A( \triangle aec) & \text{or equivalently } L_{2n} &= \frac{1}{2} L_n - A( \triangle abc) \end{align*}

    And additionally

    \begin{align*} L_{2n} & \leq A( \triangle bcd) \end{align*}

    Now consider the triangle \(\triangle abd\) (centre-right of the previous figure) and the two triangles within it \(\triangle abc\) and \(\triangle bcd\text{.}\) We know that \(\vec{ab}\) and \(\vec{cb}\) form a right-angle. Consequently \(\vec{ac}\) is the hypotenuse of a right-angled triangle, so \(|ac| \gt |bc| = |cd|\text{.}\) So now, the triangles \(\triangle abc\) and \(\triangle bcd\) have the same heights, but the base of \(\vec{ac}\) is longer than \(\vec{cd}\text{.}\) Hence the area of \(\triangle abc\) is strictly larger than that of \(\triangle bcd\text{.}\)

    Thus we have

    \begin{align*} L_{2n} & \leq A(\triangle bcd) \lt A(\triangle abc) \end{align*}

    But now we can write

    \begin{align*} L_{2n} &= \frac{1}{2} L_n - A( \triangle abc) \lt \frac{1}{2} L_n - L_{2n} & \text{rearrange}\\ 2L_{2n} & \lt \frac{1}{2} L_n & \text{there are $n$ such lobes, so }\\ 2n L_{2n} & \lt \frac{n}{2} L_n & \text{since $E_n = n L_n$, we have}\\ E_{2n} & \lt \frac{1}{2} E_n & \text{which is what we wanted to show.} \end{align*}
  3. The area of \(P_n\) is greater than \(T\text{.}\) To see this decompose \(P_n\) into \(n\) isosceles triangles. The height of each triangle is \(r\text{,}\) while the base of each is longer than \(C/n\) (this is a subtle point and its proof is equivalent to showing that \(\tan \theta \gt \theta\)). Thus
    \begin{align*} A(P_n) &= n \times \frac{1}{2} \text{(base)}\times \text{(height)}\\ & \geq n \times \frac{Cr}{2n} = T \end{align*}
  4. If we assume that \(T \gt A\text{,}\) then \(T-A = d\) where \(d\) is some positive number. However we know from point 2 that we can make \(n\) large enough so that \(E_n \lt d\) (each time we double \(n\) we halve the error). But now we have a contradiction since we have just shown that
    \begin{align*} E_n = A(P_n) - A & \lt T-A & \text{which implies that}\\ A(p_n) & \gt T. \end{align*}

Thus we cannot have \(T \gt A\text{.}\) The only possibility that remains is that \(T=A\text{.}\)