One question which comes up quite frequently regarding the complex numbers is: why do we define them and their operations the way we do? Everyone knows the old adage about solving quadratic and cubic equations and setting i2=−1, but this doesn't really capture the why of complex numbers in my opinion. In a previous post, I established the complex numbers as the field R[x]/(x2+1). This is a beautiful way to view complex numbers if you happen to know algebra quite well; if you do not, then this is not a very satisfactory characterization. This begs the question: what is the most natural characterization for the complex numbers with the least technical overhead? Recently, I found a characterization that I think captures the essence of complex numbers quite well that should appeal to many people's sensibilities.
In this post, I will go through the characterization and derivation. I have not seen the approach I've taken here as an axiomatic approach to the complex numbers. While it is not novel, it does have a nice simplicity and elegance to it.
Characterizing C
We know from experience that the complex numbers are elements of the form x+iy. Alternatively, we can - and often do - view complex numbers as points in the R2 plane. We associate x+iy with the coordinate pair (x,y). In the coordinate pair representation of complex numbers, complex multiplication is given by (x1,y1)(x2,y2)=(x1x2−y1y2,x1y2+x2y1). Our goal is to derive this from reasonable axioms. The coordinate pair representation (i.e. the R2 representation) will be the starting point.
The coordinate plane R2 is in fact a vector space over R. It is a standard exercise for first time linear algebra students to prove this. What this means is that R2 comes equipped with two operations: vector addition and scalar multiplication. These operations must satisfy the usual vector space axioms.
Vector addition is defined as so: (x1,y1)+(x2,y2)=(x1+x2,y1+y2). Scalar multiplication is defined by the expression α(x,y)=(αx,αy). These two serve as part of the platform for defining the complex numbers.
Before we start deriving C, we must first consider how different vector spaces and fields are. Every field is a vector space over itself, so we know that there is some non-trivial relationship between the two. However not every vector space can be turned into a field. What separates vector spaces and fields is that fields carry an additional structure: multiplication. You can of course define multiplication on vector spaces (e.g. (x1,…,xn)(y1,…,yn)=(x1y1,…,xnyn)) but it may not always give a meaningful or useful structure. Every element in a field must have a multiplicative inverse, multiplication must be commutative, and multiplication must distribute over addition. Since we already know that C is somehow constructed from R2, what we are tasked with is deriving the multiplication on C, i.e. C is a fieldification of R2.
Exercise: Check that the multiplication given in the preceding paragraph does not give a field on Rn except when n=1.
Let us now list some observations/assumptions:
- We can identify R with the x axis in R2. Meaning that if we multiplied (x1,0) with (x2,0), then we should get (x1x2,0) since this is how we multiply real numbers.
- Since R2 is a vector space over R, we can think of the scalar multiplication equation α(x,y)=(αx,αy) as being equivalent to (α,0)(x,y)=(αx,αy).
- Since we want a field structure, we need that multiplication is commutative, that is to say that (x1,y1)(x2,y2)=(x2,y2)(x1,y1). Moreover, every nonzero element has a multiplicative inverse.
- It's natural to want that if (x1,y1) is a unit vector (in the Euclidean metric, meaning it lies on the unit circle), then (x1,y1)(x2,y2) has the same length as (x2,y2). This is a generalization of the fact that |1⋅x|=|x|=|−1⋅x|, i.e. if you multiply x by a unit vector (1 or −1 in R), then its length is unchanged.
The first two properties can be thought of as pushing the algebraic structure of R to R2; the third is a necessary condition for a field structure; the fourth however is a geometric condition which really captures the essence of the complex numbers. Any field extension of R (that is a field which contains R) is going to satisfy the first three properties, but there are many fields that do this. It turns out that they're all isomorphic, but these other fields lack the geometry of complex numbers. Our fairly modest fourth condition is what clinches the deal.
Deriving Complex Multiplication
Since we need our multiplication to distribute over addition, it must be of the form
(x1y1)(x2y2)=(α1x1x2+α2x1y2+α3y1x2+α4y1y2β1x1x2+β2x1y2+β3y1x2+β4y1y2).
The reason for this is that since the map (x2,y2)↦(x1,y1)(x2,y2) distributes over addition and scalar multiplication (by the field axioms), it is actually a linear map. Every linear map must be of the above form. Prove this yourself if you are not convinced. The constants α1,…,α4,β1,…,β4 must be independent of (x1,y1) and (x2,y2) in order for multiplication to be well-defined. Thus they are universal to our problem at hand.
Let us consider then what happens when we enforce our first property. If we multiply two "real" numbers (i.e. numbers of the form (x,0)), then our multiplication equation becomes
(x10)(x20)=(α1x1x2β1x1x2)
since y1=0=y2. However we stated that we should get (x1x2,0) when we multiply (x1,0) and (x2,0). This gives that α1=1 and β1=0 by equating our components.
Let us now consider what happens when we impose our second property. Multiplying (x1,0) and (x2,y2) gives
(x10)(x2y2)=(x1x2+α2x1y2β2x1y2).
Our second condition stated that this should give us (x1x2,x1y2). Equating components forces α2=0 and β2=1. With these two properties, we've significantly simplified our general expression for multiplication to the following:
(x1y1)(x2y2)=(x1x2+α3y1x2+α4y1y2x1y2+β3y1x2+β4y1y2).
By imposing our third condition of (x1,y1)(x2,y2)=(x2,y2)(x1,y1), we must have that the following is true:
(x1x2+α3y1x2+α4y1y2x1y2+β3y1x2+β4y1y2)=(x1x2+α3x1y2+α4y1y2x1y2+β3x1y2+β4y1y2).
Again by equating our components, it follows that α3=0 and β3=1 since x1y2 appears on the left hand side so we are left with the following expression for our multiplication:
(x1y1)(x2y2)=(x1x2+α4y1y2x1y2+y1x2+β4y1y2).
At this point, all that is left to determine are α4 and β4. Before narrowing these two down to - hopefully - get C out of the mix, let us see where we stand. The second part of our third condition is that every nonzero element has a multiplicative inverse. Particularly, this means that (x1,y1)(x2,y2)=(0,0) if and only if one or both of (x1,y1) and (x2,y2) are (0,0).
Viewing multiplication by (x1,y1)≠(0,0) as a linear mapping again, the above argument gives that the kernel of the induced linear map is trivial, i.e. its determinant is nonzero. The linear map in question is of the form
(x1α4y1y1x1+β4y1).
The determinant of this matrix is x21+β4x1y1−α4y21. If the matrix is to be invertible, then the determinant can never be zero, i.e. setting x21+β4x1y1−α4y21=0 gives no solution. The only way for this to happen is if the discriminant is negative (which corresponds to non-real solutions though this is putting the cart before the horse). Considering x1 to be the dependent variable, he discriminant in this case is β24y21−4α4y21.
Since y1≠0, we have that β24y21−4α24y21<0 if and only if β24−4α24<0. Equivalently, β24<4α24. Picking any such α4 and β4 which satisfy this condition would necessarily give rise to a field structure as we desired. However as stated above, these fields do not capture the geometry we would like to associate to the complex numbers. As such, it is not obvious from the first three axioms what we should pick as the definition of the complex numbers. Enter our fourth property.
Our fourth property - while seemingly very modest - is actually a very strong condition. It forces a lot of geometry into our field. Let us now invoke our fourth property in two different ways. Consider (0,1). This is a unit vector so (0,1)(0,1) should have length one. When we multiply these what we get is (α4,β4). The only way for this vector to have length one is if α24+β24=1, i.e. α4=±√1−β24.
With a concrete relationship between α4 and β4, we can now sort out what the values must be. Let us multiply (0,1) and (cosθ,sinθ). Both of these are unit vectors so the length of the resulting vector must be one as above. Multiplying the vectors, we get (±√1−β24sinθ,cosθ+β4sinθ).
The length of this vector is (1−β24)sin2θ+cos2θ+2β4sinθcosθ+β24sin2θ which simplifies to 1+β4sin2θ. This can only possibly be 1 if β4=0 or θ=0,π2,3π2. Picking any θ other than those three values, it follows that β4 must be zero since β4 must be universal. Thus α4=±1. It is quite straightforward to show that the only choice for α4 which preserves lengths is α4=−1.
Thus the multiplication on R2 we posited reduces to
(x1,y1)(x2,y2)=(x1y1−x2y2,x1y2+y1x2)
which is the usual multiplication on C! We also get that (0,1)(0,1)=(−1,0) which is a restatement of i2=−1. Moreover, in this procedure the matrix representation of complex numbers fell right out quite naturally in our considerations.
The other choices of α4 and β4 give rise to "different" fields, though really they are equivalent to C and the reason is that by virtue of having a negative discriminant, we assured that the polynomial equation that they solve is not solvable over R. Quotienting R[x] by the polynomial would give a degree two field extension which is isomorphic to C.
No comments:
Post a Comment