# 32.1: Complex Numbers

## Real Numbers

Let us think of the ordinary numbers as set out on a line which goes to infinity in both positive and negative directions. We could start by taking a stretch of the line near the origin (that is, the point representing the number zero) and putting in the integers as follows:

Next, we could add in rational numbers, such as ½, 23/11, etc., then the irrationals like \(\sqrt{2}\), then numbers like \(\pi\), and so on, so any number you can think of has its place on this line. Now let’s take a slightly different point of view, and think of the numbers as represented by a *vector* from the origin to that number, so 1 is

and, for example, –2 is represented by:

Note that if a number is multiplied by –1, the corresponding vector is turned through 180 degrees. In pictures,

The “vector” 2 is turned through \(\pi\), or 180 degrees, when you multiply it by –1.

## Solving Quadratic Equations

In solving the standard quadratic equation

\[ax^2 + bx + c = 0 \label{A.1}\]

we find the solution to be:

\[ x =\dfrac{-b \pm \sqrt{b^2-4ac}}{2a} \label{A.2}\]

The problem with this is that sometimes the expression inside the square root is negative. What does that signify? For some problems in physics, it means there is no solution. For example, if I throw a ball directly upwards at 10 meters per second and ask when it will reach a height of 20 meters, taking *g* = 10 m/s^{2}, the quadratic equation for the time *t* has a negative number inside the square root. That means the ball never gets to 20 meters, so the question didn’t really make sense.
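The ball example can be checked with a few lines of Python. This is only an illustrative sketch: the height is taken as \(h = vt - \tfrac{1}{2}gt^2\), rearranged into the standard form \(\tfrac{1}{2}gt^2 - vt + h = 0\), and the helper name `discriminant` is an assumption of this example rather than anything defined in the text.

```python
def discriminant(a, b, c):
    """Return b^2 - 4ac for the quadratic a*x^2 + b*x + c = 0."""
    return b * b - 4 * a * c

# Ball thrown straight up at v = 10 m/s, with g = 10 m/s^2, target height h = 20 m.
# h = v*t - (g/2)*t^2  rearranges to  (g/2)*t^2 - v*t + h = 0.
v, g, h = 10.0, 10.0, 20.0
a, b, c = g / 2, -v, h

disc = discriminant(a, b, c)
max_height = v**2 / (2 * g)  # highest point the ball actually reaches

print(disc)        # -300.0, negative, so there is no real time t
print(max_height)  # 5.0, well short of 20 m
```

The negative discriminant and the 5-meter maximum height say the same thing: the ball never reaches 20 meters.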

We shall find, however, that there are other problems, in wide areas of physics, where negative numbers inside square roots have an important physical significance. For that reason, we need to come up with a scheme for interpreting them.

The simplest quadratic equation that gives trouble is:

\[x^2 + 1 = 0 \label{A.3}\]

the solutions being

\[x = \pm \sqrt{-1}\label{A.4}\]

What does that mean? We’ve just seen that the square of a positive number is positive, and the square of a negative number is also positive, since multiplying one negative number, which points backwards, by another, which turns any vector through 180 degrees, gives a positive vector. Another way of saying the same thing is to regard the minus sign itself, -, as an operator which turns the number it is applied to through 180 degrees. Now \((-2)\times (-2)\) has two such rotations in it, giving the full 360 degrees back to the positive axis.

*To make sense of the square root of a negative number, we need to find something which when multiplied by itself gives a negative number*. Let’s concentrate for the moment on the square root of –1, from the quadratic equation above. Think of –1 as the operator – acting on the vector 1, so the – turns the vector through 180 degrees. We need to find the square root of this operator, the operator which applied *twice* gives the rotation through 180 degrees. Put like that, it is pretty obvious that the operator we want rotates the vector 1 through 90 degrees.

But if we take a positive number, such as 1, and rotate its vector through 90 degrees only, it isn’t a number at all, at least in our original sense, since we put all known numbers on one line, and we’ve now rotated 1 away from that line. The new number created in this way is called a pure imaginary number, and is denoted by \(i\).
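The idea that \(i\) is a quarter turn applied to the vector 1 can be tried out directly, since Python has a built-in complex type in which `1j` plays the role of \(i\). The variable names below are illustrative only.

```python
i = 1j  # Python's built-in imaginary unit

v = 1 + 0j          # the "vector" 1 on the real line
quarter = i * v     # one rotation through 90 degrees: off the real line
half = i * quarter  # a second 90 degree rotation: 180 degrees in total

print(quarter)  # 1j
print(half)     # (-1+0j)
```

Applying the quarter turn twice lands on –1, which is exactly the property required of a square root of –1.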

Once we’ve found the square root of –1, we can use it to write the square root of any other negative number; for example, \(2i\) is the square root of \(-4\). Putting together a real number from the original line with an imaginary number (a multiple of \(i\)) gives a *complex number*. Evidently, complex numbers fill the entire two-dimensional plane. Taking ordinary Cartesian coordinates, any point \(P\) in the plane can be written as \((x, y)\), where the point is reached from the origin by going \(x\) units in the direction of the positive real axis, then \(y\) units in the direction defined by \(i\), in other words, along the \(y\) axis.

Thus the point *P* with coordinates (*x*, *y*) can be identified with the complex number *z*, where

\[z = x + iy. \label{A.5}\]

The plane is often called the *complex plane*, and representing complex numbers in this way is sometimes referred to as an Argand Diagram.

Visualizing the complex numbers as two-dimensional vectors, it is clear how to *add* two of them together. If *z*_{1} = *x*_{1} + *iy*_{1}, and *z*_{2} = *x*_{2} + *iy*_{2}, then *z*_{1} + *z*_{2} = (*x*_{1} + *x*_{2}) + *i*(*y*_{1} + *y*_{2}). The real parts and imaginary parts are added separately, just like vector components.

Multiplying two complex numbers together does not have quite such a simple interpretation. It is, however, quite straightforward—ordinary algebraic rules apply, with *i*^{2} replaced where it appears by -1. So for example, to multiply *z*_{1} = *x*_{1} + *iy*_{1} by *z*_{2} = *x*_{2} + *iy*_{2},

\[z_1z_2 = (x_1 + iy_1)( x_2 + iy_2) = (x_1x_2 - y_1y_2) + i(x_1y_2 + x_2y_1). \label{A.6}\]
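As a quick check of Equation \ref{A.6}, the sketch below multiplies two complex numbers stored as \((x, y)\) pairs and compares the result with Python's built-in complex arithmetic. The function name `multiply` and the sample values are assumptions made for this illustration.

```python
def multiply(z1, z2):
    """Multiply two complex numbers given as (x, y) pairs, following Equation A.6."""
    x1, y1 = z1
    x2, y2 = z2
    return (x1 * x2 - y1 * y2, x1 * y2 + x2 * y1)

product = multiply((1.0, 2.0), (3.0, 4.0))
builtin = complex(1, 2) * complex(3, 4)

print(product)  # (-5.0, 10.0)
print(builtin)  # (-5+10j)
```

By hand: \((1 + 2i)(3 + 4i) = 3 + 4i + 6i + 8i^2 = -5 + 10i\), in agreement with both results.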

## Polar Coordinates

Some properties of complex numbers are most easily understood if they are represented by using the polar coordinates \((r, \theta)\) instead of \((x, y)\) to locate \(z\) in the complex plane.

Note that \(z = x + iy\) can be written \(r(\cos \theta + i \sin \theta)\) from the diagram above. In fact, this representation leads to a clearer picture of multiplication of two complex numbers:

\[\begin{align} z_1z_2 &= r_1 ( \cos \theta_1 + i\sin \theta_1)\, r_2( \cos \theta_2 + i\sin \theta_2) \label{A.7} \\[4pt] & = r_1r_2 \left[ (\cos \theta_1 \cos \theta_2 - \sin \theta_1 \sin \theta_2) + i (\sin \theta_1 \cos \theta_2 + \cos \theta_1 \sin \theta_2) \right] \label{A.8} \\[4pt] & = r_1r_2 \left[ \cos(\theta_1+\theta_2) + i\sin (\theta_1+\theta_2) \right] \label{A.9} \end{align}\]

So, if

\[ z = r(\cos \theta + i\sin \theta ) = z_1z_2 \label{A.10}\]

then

\[r = r_1r_2 \label{A.11}\]

and

\[\theta=\theta_1+\theta_2 \label{A.12}\]

That is to say, to multiply together two complex numbers, we *multiply* the *r*’s, called the *moduli*, and *add* the phases, the \(\theta\)’s. The modulus \(r\) is often denoted by \(|z|\) and called **mod z**; the phase \(\theta\) is sometimes referred to as **arg z**. For example, \(|i| = 1\), \(\text{arg}\; i = \pi/2\).

We can now see that, although we had to introduce these complex numbers to have a \(\sqrt{-1}\), we do not need to bring in new types of numbers to get \(\sqrt{\sqrt{-1}}\), or \(\sqrt{i}\). Clearly, \(|\sqrt{i}|=1\) and \(\text{arg} \sqrt{i} = 45^\circ\). It is on the circle of unit radius centered at the origin, at 45°, and squaring it just doubles the angle.
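The rule "multiply the moduli, add the phases" can be checked numerically with Python's standard `cmath` module, where `cmath.rect(r, theta)` builds a complex number from polar coordinates and `cmath.polar(z)` returns its modulus and phase. The sample moduli and angles are arbitrary choices for this sketch. Note that `cmath.polar` reports the phase in the range \((-\pi, \pi]\), so a sum of angles outside that range would come back wrapped.

```python
import cmath
import math

# Two complex numbers given by modulus and phase (angles in radians).
z1 = cmath.rect(2.0, math.pi / 6)  # r1 = 2, theta1 = 30 degrees
z2 = cmath.rect(3.0, math.pi / 3)  # r2 = 3, theta2 = 60 degrees

r, theta = cmath.polar(z1 * z2)
print(r, math.degrees(theta))  # close to 6 and 90: moduli multiplied, phases added

# The square root of i: modulus 1, phase 45 degrees.
root_i = cmath.rect(1.0, math.pi / 4)
print(root_i * root_i)  # close to 1j, up to rounding error
```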

## The Unit Circle

In fact this circle—called the *unit circle*—plays an important part in the theory of complex numbers and every point on the circle has the form

\[ z = \cos \theta + i \sin \theta = Cis(\theta) \label{A.13}\]

Since all points on the unit circle have \(|z| = 1\), by definition, multiplying any two of them together just amounts to adding the angles, so our new function \(Cis(\theta)\) satisfies

\[ Cis(\theta_1)Cis(\theta_2)=Cis(\theta_1+\theta_2). \label{A.14}\]

But that is just how multiplication works for exponents! That is,

\[a^{\theta_1}a^{\theta_2} = a^{\theta_1+\theta_2} \label{A.15}\]

for \(a\) any constant, which strongly suggests that maybe our function \(Cis(\theta)\) is nothing but some constant \(a\) raised to the power \(\theta\), that is,

\[ Cis(\theta) = a^{\theta}\label{A.16}\]

It turns out to be convenient to write \(a^{\theta} = e^{(\ln a)\theta} = e^{A \theta}\), where \(A = \ln a\). This line of reasoning leads us to write

\[\cos \theta + i\sin \theta = e^{A\theta} \label{A.17}\]

Now, for the above “addition formula” to work for multiplication, \(A\) must be a constant, *independent* of \(\theta\). Therefore, we can find the value of *A* by choosing \(\theta\) for which things are simple. We take \(\theta\) to be very small—in this limit:

\[ \cos \theta \approx 1\]

\[ \sin \theta \approx \theta\]

\[ e^{A\theta} \approx 1+ A\theta\]

where we drop terms of order \(\theta^2\) and higher.

Substituting these values into Equation \ref{A.17} gives \(1 + i\theta = 1 + A\theta\), so \(A = i\).

So we find:

\[ \cos \theta + i \sin \theta = e ^{i \theta} \label{A.18}\]

To test this result, we expand \(e^{i \theta}\):

\[ \begin{align} e^{i \theta} &= 1 + i\theta + \dfrac{(i\theta)^2}{2!} + \dfrac{(i\theta)^3}{3!} + \dfrac{(i\theta)^4}{4!} + \dfrac{(i\theta)^5}{5!} + \cdots \label{A.19a} \\[4pt] &= 1 + i\theta - \dfrac{\theta^2}{2!} - \dfrac{i\theta^3}{3!} +\dfrac{\theta^4}{4!} +\dfrac{i\theta^5}{5!} - \cdots \label{A.19b} \\[4pt] &= \left( 1 - \dfrac{\theta^2}{2!} + \dfrac{\theta^4}{4!} - \cdots \right) + i \left(\theta - \dfrac{\theta^3}{3!}+\dfrac{\theta^5}{5!} - \cdots \right) \label{A.19c} \\[4pt] &= \cos \theta + i\sin \theta \label{A.19d} \end{align}\]

We write \(= \cos \theta + i\sin \theta\) in Equation \ref{A.19d} because the series in the brackets are precisely the Taylor series for \(\cos \theta\) and \(\sin \theta\), confirming our equation for \(e^{i\theta}\). Changing the sign of \(\theta\), it is easy to see that
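The series argument can also be tested numerically. The following sketch sums the first few terms of the power series for \(e^{i\theta}\) and compares the partial sum with \(\cos \theta + i \sin \theta\). The function name `exp_i_theta_series` and the default of 20 terms are choices made for this example.

```python
import cmath
import math

def exp_i_theta_series(theta, terms=20):
    """Partial sum of 1 + (i*theta) + (i*theta)^2/2! + ... with `terms` terms."""
    total = 0j
    term = 1 + 0j  # the n = 0 term, (i*theta)^0 / 0!
    for n in range(terms):
        total += term
        term *= 1j * theta / (n + 1)  # turn (i*theta)^n/n! into the next term
    return total

theta = 1.0
series = exp_i_theta_series(theta)
trig = complex(math.cos(theta), math.sin(theta))

print(series)
print(trig)
print(abs(series - trig))  # tiny: the two agree to rounding error
```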

\[ e^{-i \theta} = \cos \theta - i\sin \theta \label{A.20}\]

so the two trigonometric functions can be expressed in terms of exponentials of complex numbers:

\[\cos (\theta) = \dfrac{1}{2} \left( e^{i\theta} + e^{-i \theta} \right)\]

\[\sin (\theta) = \dfrac{1}{2i} \left( e^{i\theta} - e^{-i \theta} \right)\]
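These two identities are easy to verify with `cmath.exp`. In the sketch below, the helper names `cos_from_exp` and `sin_from_exp` are illustrative. The exponential expressions are complex in general, with an imaginary part that vanishes up to rounding, so only the real part is returned.

```python
import cmath
import math

def cos_from_exp(theta):
    """cos(theta) written as (e^{i theta} + e^{-i theta}) / 2."""
    return ((cmath.exp(1j * theta) + cmath.exp(-1j * theta)) / 2).real

def sin_from_exp(theta):
    """sin(theta) written as (e^{i theta} - e^{-i theta}) / (2i)."""
    return ((cmath.exp(1j * theta) - cmath.exp(-1j * theta)) / 2j).real

theta = 0.7  # an arbitrary test angle in radians
print(cos_from_exp(theta), math.cos(theta))
print(sin_from_exp(theta), math.sin(theta))
```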

## Contributors

Michael Fowler (Beams Professor, Department of Physics, University of Virginia)