13.1: A- Complex Numbers



Real Numbers

Let us think of the ordinary numbers as set out on a line which goes to infinity in both positive and negative directions. We could start by taking a stretch of the line near the origin (that is, the point representing the number zero) and putting in the integers as follows:

Figure \(\PageIndex{1}\): Integers on number line

Next, we could add in rational numbers, such as ½, 23/11, etc., then the irrationals like \(\sqrt{2}\), then numbers like \(\pi\), and so on, so any number you can think of has its place on this line. Now let’s take a slightly different point of view, and think of the numbers as represented by a vector from the origin to that number, so 1 is

and, for example, –2 is represented by:

Note that if a number is multiplied by –1, the corresponding vector is turned through 180 degrees. In pictures,

The “vector” 2 is turned through \(\pi\), or 180 degrees, when you multiply it by –1.

Example \(\PageIndex{1}\)

What are the square roots of 4?

Answer: Well, 2, obviously, but also –2, because multiplying the backwards pointing vector –2 by –2 not only doubles its length, but also turns it through 180 degrees, so it is now pointing in the positive direction. We seem to have invented a hard way of stating that multiplying two negatives gives a positive, but thinking in terms of turning vectors through 180 degrees will pay off soon.

Solving Quadratic Equations

In solving the standard quadratic equation

\[ax^2 + bx + c = 0 \label{A.1}\]

we find the solution to be:

\[ x =\dfrac{-b \pm \sqrt{b^2-4ac}}{2a} \label{A.2}\]

The problem with this is that sometimes the expression inside the square root is negative. What does that signify? For some problems in physics, it means there is no solution. For example, if I throw a ball directly upwards at 10 meters per sec, and ask when will it reach a height of 20 meters, taking g = 10 m per sec², the solution of the quadratic equation for the time t has a negative number inside the square root, and that means that the ball doesn’t get to 20 meters, so the question didn’t really make sense.
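To make the ball example concrete, here is a sketch of the arithmetic, assuming the usual constant-acceleration formula \(h = v_0 t - \dfrac{1}{2}gt^2\) (the symbols \(h\) and \(v_0\) are introduced here only for illustration) with \(v_0 = 10\) m per sec and \(g = 10\) m per sec². Setting \(h = 20\) meters gives

\[\begin{align*} 5t^2 - 10t + 20 &= 0 \\[5pt] t &= \dfrac{10 \pm \sqrt{(-10)^2 - 4(5)(20)}}{2(5)} = 1 \pm \sqrt{-3} \end{align*}\]

The number inside the square root is negative, consistent with the fact that the ball only rises to \(v_0^2/2g = 5\) meters.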

We shall find, however, that there are other problems, in wide areas of physics, where negative numbers inside square roots have an important physical significance. For that reason, we need to come up with a scheme for interpreting them.

The simplest quadratic equation that gives trouble is:

\[x^2 + 1 = 0 \label{A.3}\]

the solutions being

\[x = \pm \sqrt{-1}\label{A.4}\]

What does that mean? We’ve just seen that the square of a positive number is positive, and the square of a negative number is also positive, since multiplying one negative number, which points backwards, by another, which turns any vector through 180º, gives a positive vector. Another way of saying the same thing is to regard the minus sign itself, -, as an operator which turns the number it is applied to through 180º. Now \((-2)\times (-2)\) has two such rotations in it, giving the full 360º back to the positive axis.

To make sense of the square root of a negative number, we need to find something which when multiplied by itself gives a negative number. Let’s concentrate for the moment on the square root of –1, from the quadratic equation above. Think of –1 as the operator – acting on the vector 1, so the – turns the vector through 180º. We need to find the square root of this operator, the operator which applied twice gives the rotation through 180º. Put like that, it is pretty obvious that the operator we want rotates the vector 1 through 90º.

But if we take a positive number, such as 1, and rotate its vector through 90º only, it isn’t a number at all, at least in our original sense, since we put all known numbers on one line, and we’ve now rotated 1 away from that line. The new number created in this way is called a pure imaginary number, and is denoted by \(i\).

Once we’ve found the square root of \(-1\), we can use it to write the square root of any other negative number; for example, \(2i\) is a square root of \(-4\). Putting together a real number from the original line with an imaginary number (a multiple of \(i\)) gives a complex number. Evidently, complex numbers fill the entire two-dimensional plane. Taking ordinary Cartesian coordinates, any point \(P\) in the plane can be written as \((x, y)\), where the point is reached from the origin by going \(x\) units in the direction of the positive real axis, then \(y\) units in the direction defined by \(i\), in other words, along the \(y\) axis.
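As a quick check of the square root of \(-4\), using only the rule \(i^2 = -1\):

\[\begin{align*} (2i)^2 &= 2^2 \, i^2 = 4 \times (-1) = -4 \\[5pt] (-2i)^2 &= (-2)^2 \, i^2 = 4 \times (-1) = -4 \end{align*}\]

So, just as 4 has the two square roots \(\pm 2\), the number \(-4\) has the two square roots \(\pm 2i\).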

Thus the point \(P\) with coordinates \((x, y)\) can be identified with the complex number \(z\), where

\[z = x + iy. \label{A.5}\]

The plane is often called the complex plane, and representing complex numbers in this way is sometimes referred to as an Argand Diagram (Figure \(\PageIndex{2}\)).

Figure \(\PageIndex{2}\): A complex number can be visually represented as a pair of numbers \((a, b)\) forming a vector on a diagram called an Argand diagram, representing the complex plane. "Re" is the real axis, "Im" is the imaginary axis, and \(i\) satisfies \(i^2 = -1\). (CC BY-SA 4.0; Wolfkeeper).

Visualizing the complex numbers as two-dimensional vectors, it is clear how to add two of them together. If \(z_1 = x_1 + iy_1\), and \(z_2 = x_2 + iy_2\), then \(z_1 + z_2 = (x_1 + x_2) + i(y_1 + y_2)\). The real parts and imaginary parts are added separately, just like vector components.

Multiplying two complex numbers together does not have quite such a simple interpretation. It is, however, quite straightforward—ordinary algebraic rules apply, with \(i^2\) replaced where it appears by -1. So for example, to multiply \(z_1 = x_1 + iy_1\) by \(z_2 = x_2 + iy_2\),

\[ \begin{align} z_1z_2 &= (x_1 + iy_1)( x_2 + iy_2) \\[5pt] &= (x_1x_2 - y_1y_2) + i(x_1y_2 + x_2y_1). \label{A.6} \end{align} \]
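As an illustration, with numbers chosen arbitrarily here, take \(z_1 = 1 + 2i\) and \(z_2 = 3 + 4i\):

\[\begin{align*} z_1z_2 &= (1 + 2i)(3 + 4i) \\[5pt] &= (1 \cdot 3 - 2 \cdot 4) + i(1 \cdot 4 + 3 \cdot 2) \\[5pt] &= -5 + 10i \end{align*}\]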

Polar Coordinates

Some properties of complex numbers are most easily understood if they are represented by using the polar coordinates \(r, \theta\) instead of \((x, y)\) to locate \(z\) in the complex plane (Figure \(\PageIndex{3}\)).

Figure \(\PageIndex{3}\): Polar Argand diagram. (CC BY-SA 4.0; Svjo).

Note that \(z = x + iy\) can be written \(r(\cos \theta + i \sin \theta)\) from Figure \(\PageIndex{3}\). In fact, this representation leads to a clearer picture of multiplication of two complex numbers:

\[\begin{align} z_1z_2 &= [ r_1 ( \cos \theta_1 + i\sin \theta_1) ][r_2( \cos \theta_2 + i\sin \theta_2)] \label{A.7} \\[5pt] & = r_1r_2 \left[ (\cos \theta_1 \cos \theta_2 - \sin \theta_1 \sin \theta_2) + i (\sin \theta_1 \cos \theta_2 + \cos \theta_1 \sin \theta_2) \right] \label{A.8} \\[5pt] &= r_1r_2 \left[ \cos(\theta_1+\theta_2) + i\sin (\theta_1+\theta_2) \right] \label{A.9} \end{align}\]

So, if

\[\begin{align} z &= r(\cos \theta + i\sin \theta ) \\[5pt] &= z_1z_2 \label{A.10}\end{align}\]

then

\[r = r_1r_2 \label{A.11}\]

and

\[\theta=\theta_1+\theta_2 \label{A.12}\]

That is to say, to multiply together two complex numbers, we multiply the r’s – called the moduli – and add the phases, the \(\theta\)’s. The modulus \(r\) is often denoted by \(|z|\), and called mod z, the phase \(\theta\) is sometimes referred to as arg z. For example, \(|i| = 1\), \(\text{arg}\; i = \pi/2\).
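For a quick numerical check (the number is chosen here only as an illustration), take \(z_1 = z_2 = 1 + i\), which has modulus \(\sqrt{2}\) and phase \(\pi/4\). Multiplying the moduli and adding the phases gives a product with modulus 2 and phase \(\pi/2\), that is, \(2i\). Direct multiplication agrees:

\[(1 + i)^2 = 1 + 2i + i^2 = 2i \nonumber \]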

We can now see that, although we had to introduce these complex numbers to have a \(\sqrt{-1}\), we do not need to bring in new types of numbers to get \(\sqrt{\sqrt{-1}}\), that is, \(\sqrt{i}\). Clearly, \(|\sqrt{i}|=1\) and \(\arg \sqrt{i} = 45^\circ\). It is on the circle of unit radius centered at the origin, at \(45^\circ\), and squaring it just doubles the angle.
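Written out explicitly, the point on the unit circle at 45° has real and imaginary parts both equal to \(1/\sqrt{2}\), and squaring it with the ordinary algebraic rules does give \(i\):

\[\left( \dfrac{1 + i}{\sqrt{2}} \right)^2 = \dfrac{1 + 2i + i^2}{2} = \dfrac{2i}{2} = i \nonumber \]

The point diametrically opposite, \(-(1 + i)/\sqrt{2}\), squares to \(i\) as well, so \(i\) has two square roots, just as 4 does.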

The Unit Circle

In fact this circle—called the unit circle—plays an important part in the theory of complex numbers and every point on the circle has the form

\[ z = \cos \theta + i \sin \theta = Cis(\theta) \label{A.13}\]

Since all points on the unit circle have \(|z| = 1\), by definition, multiplying any two of them together just amounts to adding the angles, so our new function \(Cis(\theta)\) satisfies

\[ Cis(\theta_1)Cis(\theta_2)=Cis(\theta_1+\theta_2). \label{A.14}\]

But that is just how multiplication works for exponents! That is,

\[a^{\theta_1}a^{\theta_2} = a^{\theta_1+\theta_2} \label{A.15}\]

for \(a\) any constant, which strongly suggests that maybe our function \(Cis(\theta)\) is nothing but some constant \(a\) raised to the power \(\theta\), that is,

\[ Cis(\theta) = a^{\theta}\label{A.16}\]

It turns out to be convenient to write \(a^{\theta} = e^{(\ln a)\theta} = e^{A \theta}\), where \(A = \ln a\). This line of reasoning leads us to write

\[\cos \theta + i\sin \theta = e^{A\theta} \label{A.17}\]

Now, for the above “addition formula” to work for multiplication, \(A\) must be a constant, independent of \(\theta\). Therefore, we can find the value of \(A\) by choosing \(\theta\) for which things are simple. We take \(\theta\) to be very small—in this limit

\[\begin{align*} \cos \theta &\approx 1 \\[5pt] \sin \theta &\approx \theta \end{align*}\]

and

\[e^{A\theta} \approx 1 + A\theta\]

after dropping terms of order \(\theta^2\) and higher.

Substituting these values into Equation \ref{A.17} gives \(1 + i\theta = 1 + A\theta\), so \(A = i\).

So we find:

\[ \cos \theta + i \sin \theta = e ^{i \theta} \label{A.18}\]

To test this result, we expand \(e^{i \theta}\):

\[ \begin{align} e^{i \theta} &= 1 + i\theta + \dfrac{(i\theta)^2}{2!} + \dfrac{(i\theta)^3}{3!} + \dfrac{(i\theta)^4}{4!} + \dfrac{(i\theta)^5}{5!} + \ldots \label{A.19a} \\[5pt] &= 1 + i\theta - \dfrac{\theta^2}{2!} - \dfrac{i\theta^3}{3!} +\dfrac{\theta^4}{4!} +\dfrac{i\theta^5}{5!} + \ldots \label{A.19b} \\[5pt] & = \left( 1 - \dfrac{\theta^2}{2!} + \dfrac{\theta^4}{4!} - \ldots \right) + i \left(\theta - \dfrac{\theta^3}{3!}+\dfrac{\theta^5}{5!} - \ldots \right) \label{A.19c} \\[5pt] &= \cos \theta + i\sin \theta \label{A.19d} \end{align}\]

We write \(\cos \theta + i\sin \theta\) in the last line because the two series in the brackets are precisely the Taylor series for \(\cos \theta\) and \(\sin \theta\), confirming our equation for \(e^{i\theta}\). Changing the sign of \(\theta\), it is easy to see that

\[ e^{-i \theta} = \cos \theta - i\sin \theta \label{A.20}\]

so the two trigonometric functions can be expressed in terms of exponentials of complex numbers:

\[\cos (\theta) = \dfrac{1}{2} \left( e^{i\theta} + e^{-i \theta} \right)\]

\[\sin (\theta) = \dfrac{1}{2i} \left( e^{i\theta} - e^{-i \theta} \right)\]
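As a consistency check on these two expressions (a short sketch using only \(e^{i\theta}e^{-i\theta} = 1\) and \(i^2 = -1\)), they reproduce the familiar identity \(\cos^2 \theta + \sin^2 \theta = 1\):

\[\begin{align*} \cos^2 \theta + \sin^2 \theta &= \dfrac{1}{4} \left( e^{i\theta} + e^{-i \theta} \right)^2 - \dfrac{1}{4} \left( e^{i\theta} - e^{-i \theta} \right)^2 \\[5pt] &= \dfrac{1}{4} \left( 4 e^{i\theta} e^{-i \theta} \right) \\[5pt] &= 1 \end{align*}\]

The minus sign in the first line comes from \((1/2i)^2 = -1/4\).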

Euler Formula

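The Euler formula (Equation \ref{A.19d}) states that, for any real angle \(\theta\),

\[e^{i \theta} = \cos \theta + i\sin \theta \nonumber \]

Combined with the polar form \(z = r(\cos \theta + i \sin \theta)\), this means that any complex number can be written as \(z = re^{i\theta}\).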
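For instance, evaluating the formula at a few special angles (chosen here purely as illustrations) recovers points already met on the unit circle:

\[\begin{align*} e^{i\pi/4} &= \cos \dfrac{\pi}{4} + i \sin \dfrac{\pi}{4} = \dfrac{1 + i}{\sqrt{2}} \\[5pt] e^{i\pi/2} &= \cos \dfrac{\pi}{2} + i \sin \dfrac{\pi}{2} = i \\[5pt] e^{i\pi} &= \cos \pi + i \sin \pi = -1 \end{align*}\]

The last line is the rotation through 180° that multiplication by \(-1\) produces, and the middle line is the rotation through 90° that defined \(i\).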

Contributors and Attributions

Michael Fowler (Beams Professor, Department of Physics, University of Virginia)
