There is a pledge of the big and of the small in the infinite.
In the next two posts, we are going to look at two interesting geometric ideas of the 19th century involving circles. Next time, we will consider Poincaré’s disk model for hyperbolic geometry. Today, though, we immerse ourselves in the universe of inversive geometry.
Consider a circle in the infinite 2-dimensional plane:
This circle divides the plane into two regions: the bounded region inside the circle and the unbounded region outside the circle (let’s say that the points on the circle belong to both regions). A natural thing to want to do, now, especially in the context of this blog, would be to try to exchange these two regions, to map the infinite space outside the circle into the bounded space of the circle, and vice versa, in a “natural” way.
I could be bounded in a nutshell, and count myself a king of infinite space.
-William Shakespeare, Hamlet
Upon first reflection, one might be tempted to say that we want to “reflect” points across the circle. And this is sort of right, but reflection already carries a meaning in geometry. Truly reflecting points across the circle would preserve their distance from the circle, so the inside of the circle could only be mapped onto a finite ring whose outer radius is twice that of the circle two. Moreover, it would not be clear how to reflect points from outside this ring into the circle.
Instead, we want to consider a process known as “inversion.” Briefly speaking, we want to arrange so that points arbitrarily close to the center of the circle get sent to points arbitrarily far away from the center of the circle, and vice versa. For simplicity, let us suppose that the circle is centered at the origin of the plane and has a radius of 1. The most natural way to achieve our aim is to send a point to a point that lies in the same direction from the origin as and whose distance from the origin is the reciprocal of the distance from to the origin. Here’s an example:
One can check that, algebraically, this inversion sends a point with coordinates to a point with coordinates . Points inside the circle are sent to points outside the circle, points outside the circle are sent to points inside the circle, and points on the circle are sent to themselves. Moreover, as one might expect from the name, the inversion map is its own inverse: applying it twice, we end up where we started. Perfect!
Wait a second, though. We’re being a little too hasty. What about the origin? Where is it sent? Our procedure doesn’t seem to tell us, and if we try to use our algebraic expression, we end up dividing by zero. Since the origin is inside the circle, it should certainly be sent to a point outside the circle, but all of those points are already taken. Also, since points arbitrarily close to the origin get mapped to points arbitrarily far from the origin, we want to send the origin to a point as far away from itself as possible. At first glance, we might seem to be in a quandary here, but longtime readers of this blog will see an obvious solution: the origin gets mapped to a point at infinity! (And the point at infinity, in turn, gets mapped to the origin.)
(Technical note: Since we’ve added a point at infinity, the inversion map should be seen not as a map on the plane , but on its one-point compactification (or Alexandroff compactification), . In fact, the inversion map is a topological homeomorphism of with itself.)
Let’s examine what the inversion map does to simple geometric objects. We have already seen what happens to points. It should also be obvious that straight lines through the origin get mapped to themselves. For example, in the image above, the line connecting and gets mapped to itself. (Here we are specifying, of course, that every line contains the point at infinity.)
A bit of thought and calculation will convince you that lines not passing through the origin get sent to circles that do pass through the origin.
Since the inversion map is its own inverse, circles passing through the origin get mapped to lines that don’t pass through the origin. Circles that don’t pass through the origin, on the other hand, get mapped to other circles that don’t pass through the origin.
There’s an important special case of this phenomenon: a circle that is met perpendicularly by the circle through which we are inverting gets mapped to itself.
We thus have a sort of duality between lines and circles that has been revealed through the process of circle inversion. Lines, when seen in the right light, are simply circles with an infinite radius. We’re going to move on to some applications of circle inversion in just a sec, but, first, a pretty picture of an inverted checkerboard.
The introduction of the method of circle inversion is widely attributed to the Swiss mathematician Jakob Steiner, who wrote a treatise on the matter in 1824. When combined with the more familiar rigid transformations of rotation, translation, and reflection, the decidedly non-rigid transformation of inversion gives rise to inversive geometry, which became a major topic of study in nineteenth geometry. It was perhaps most notably applied by William Thomson (later to become 1st Baron Kelvin, immortalized in the name of a certain temperature scale), at the age of 21, to solve problems in electrostatics. Circle inversion also allows for extremely elegant proofs of classical geometric facts. We end today’s post with an example.
Consider three half-circles, all tangent to one another and centered on the same horizontal line, with two placed inside the third, as follows:
This figure (or, more precisely, the grey region enclosed by the semicircles) is known as an arbelos, and its first known appearance dates back to The Book of Lemmas by Archimedes. A remarkable fact about the arbelos is that, starting with the smallest of the semicircles in the figure, one can nestle into it an infinite sequence of increasingly small circles, each tangent to the two larger semicircles and the circle appearing before it, thus creating the striking Pappus chain, named for Pappus of Alexandria, who investigated the figure in the 3rd century AD:
Let us label the circles in the Pappus chain (starting with the smallest semicircle in the arbelos) , etc. (So, in the picture above, is the center of , is the center of , and so on.) Clearly, the size of decreases as increases, but it is natural to ask how quickly it decreases. It is also natural to ask how the position of the point changes as increases. In particular, what is the height of above the base of the figure? It turns out that the answers to these two questions are closely related, a fact discovered by Pappus through a long and elaborate derivation in Euclidean geometry, and which we will derive quickly and elegantly through circle inversion.
Let denote the diameter of the circle , and let denote the height of the point above the base of the Pappus chain (i.e., the line segment ). We will prove the remarkable formula:
For all , .
For concreteness, let us demonstrate the formula for . The same argument will work for each of the circles in the Pappus chain. As promised, we are going to use circle inversion. Our first task is to find a suitable circle across which to invert our figure. And that circle, it turns out, will be the circle centered at and perpendicular to :
Now, what happens when we invert our figure? First, consider the two larger semicircles in the arbelos, with diameters and . The circles of which these form the upper half pass through the center of our circle of inversion and thus, as discussed above, are mapped to straight lines by our inversion. Moreover, since the centers of these circles lie directly to the right of , a moment’s thought should convince you that they are mapped to vertical lines.
Now, what happens to the circles in the Pappus chain? Well, none of them pass through , so they will all get mapped to circles. is perpendicular to the circle of inversion, so it gets mapped to itself. But, in the original diagram, is tangent to the larger semicircles in the arbelos. Since circle inversion preserves tangency, in the inverted diagram, is tangent to the two vertical lines that these semicircles are mapped to. And, of course, the same is true of all of the other circles in the Pappus chain. Finally, note that, since the center of lies on the base of the figure, which passes through the center of our inversion circle, it also gets mapped to a point on the base of the figure. Putting this all together, we end up with the following striking figure:
The circle with diameter gets mapped to the vertical line through , and the circle with diameter gets mapped to the vertical line through . Our Pappus chain, meanwhile, is transformed by inversion into an infinite tower of circles, all of the same size, bounded by these vertical lines. Moreover, the circle and the point are left in place by the inversion. It is now straightforward to use this tower to calculate the height of in terms of the diameter of . To get from down to the base, we must first pass through half of , which has a height of . We then must pass through the image of under the inversion, which has a height of . Then the image of , which also has a height of . And, finally, the image of the smallest semicircle of the arbelos, which has a height of . All together, we get:
For further reading on circle inversion, see Harold P. Boas’ excellent article, “Reflections on the Arbelos.”
Cover image: René Magritte, The false mirror