# 13: Elements of Dirac Notation

In the early days of quantum theory, P. A. M. (Paul Adrian Maurice) Dirac created a powerful and concise formalism for it which is now referred to as Dirac notation or bra-ket (bracket $$\langle \, | \, \rangle$$) notation.

Two major mathematical traditions emerged in quantum mechanics: Heisenberg’s matrix mechanics and Schrödinger’s wave mechanics. These distinctly different computational approaches to quantum theory are formally equivalent, each with its particular strengths in certain applications. Heisenberg’s variation, as its name suggests, is based on matrix and vector algebra, while Schrödinger’s approach requires integral and differential calculus. Dirac’s notation can be used in a first step in which the quantum mechanical calculation is described or set up. After this is done, one chooses either matrix or wave mechanics to complete the calculation, depending on which method is computationally the most expedient.

## Kets

In Dirac’s notation what is known is put in a ket, $$| \, \rangle$$. So, for example, $$| p \rangle$$ expresses the fact that a particle has momentum $$p$$. It could also be more explicit: $$| p=2 \rangle$$, the particle has momentum equal to 2; $$| x=1.23 \rangle$$, the particle has position 1.23. $$| \psi \rangle$$ represents a system in the state $$\psi$$ and is therefore called the state vector. The ket can also be interpreted as the initial state in some transition or event.

## Bras

The bra $$\langle \, |$$ represents the final state or the language in which you wish to express the content of the ket $$| \, \rangle$$. For example, $$\langle x=0.25 | \psi \rangle$$ is the probability amplitude that a particle in state $$\psi$$ will be found at position $$x = 0.25$$. In conventional notation we write this as $$\psi(x=0.25)$$, the value of the function $$\psi$$ at $$x = 0.25$$. The absolute square of the probability amplitude, $$\left| \langle x=0.25| \psi \rangle \right|^2$$, is the probability density that a particle in state $$\psi$$ will be found at $$x = 0.25$$. Thus, we see that a bra-ket pair can represent an event, the result of an experiment. In quantum mechanics an experiment consists of two sequential observations - one that establishes the initial state (ket) and one that establishes the final state (bra).

## Bra-Ket Pairs

If we write $$\langle x| \psi \rangle$$, we are expressing $$\psi$$ in coordinate space without being explicit about the actual value of $$x$$. $$\langle 0.25 | \psi \rangle$$ is a number, but the more general expression $$\langle x | \psi \rangle$$ is a mathematical function of $$x$$, or we could say a mathematical algorithm for generating all possible values of $$\langle x| \psi \rangle$$, the probability amplitude that a system in state $$| \psi \rangle$$ has position $$x$$.

Example

For the ground state of the well-known particle-in-a-box of unit dimension,

$\langle x | \psi \rangle = \psi(x) = 2^{1/2} \sin (\pi x)$

However, if we wish to express $$\psi$$ in momentum space we would write (in atomic units, with $$\langle p | x \rangle = (2\pi)^{-1/2} \exp(-ipx)$$)

$\langle p | \psi \rangle = \psi(p) = \pi^{1/2} \dfrac{e^{-ip} +1}{\pi^2 - p^2}$

How one finds this latter expression will be discussed later.
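The distinction between a single amplitude and the whole function $$\langle x | \psi \rangle$$ can be illustrated numerically. The following is a minimal sketch, assuming NumPy is available; the function name `psi_x` and the grid size are illustrative choices, not part of the original text. It evaluates the ground-state amplitude at $$x = 0.25$$, squares it to obtain the probability density, and checks that the density integrates to one.

```python
import numpy as np

def psi_x(x):
    """Ground-state amplitude <x|psi> for a box of unit length (illustrative name)."""
    return np.sqrt(2.0) * np.sin(np.pi * x)

# <0.25|psi> is a single number; |<0.25|psi>|^2 is a probability density.
amplitude = psi_x(0.25)
density = abs(amplitude) ** 2

# <x|psi> evaluated on a grid is the whole function; its squared modulus
# should integrate to 1 (trapezoid rule written out by hand).
x = np.linspace(0.0, 1.0, 2001)
rho = np.abs(psi_x(x)) ** 2
norm = np.sum((rho[1:] + rho[:-1]) * np.diff(x)) / 2.0

print(amplitude, density, norm)
```

Because $$\sin(\pi/4) = 2^{-1/2}$$, the amplitude at $$x = 0.25$$ works out to 1 for this particular state.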

The major point here is that there is more than one language in which to express $$| \psi \rangle$$. The most common language for chemists is coordinate space ($$x$$, $$y$$, and $$z$$, or $$r$$, $$\theta$$, and $$\phi$$, etc.), but we shall see that momentum space offers an equally important view of the state function. It is important to recognize that $$\langle x| \psi \rangle$$ and $$\langle p| \psi \rangle$$ are formally equivalent and contain the same physical information about the state of the system. One of the tenets of quantum mechanics is that if you know $$| \psi \rangle$$, you know everything there is to know about the system, and if, in particular, you know $$\langle x| \psi \rangle$$, you can calculate all of the properties of the system and transform $$\langle x| \psi \rangle$$, if you wish, into any other appropriate language such as momentum space.

A bra-ket pair can also be thought of as a vector projection (i.e., a dot product) - the projection of the content of the ket onto the content of the bra, or the “shadow” the ket casts on the bra. For example, $$\langle \Phi | \psi \rangle$$ is the projection of state $$\psi$$ onto state $$\Phi$$. It is the amplitude (probability amplitude) that a system in state $$| \psi \rangle$$ will be subsequently found in state $$| \Phi \rangle$$. It is also what we have come to call an overlap integral.

The $$| \psi \rangle$$ state vector can be a complex function (that is, have the form $$a + ib$$ or $$\exp(-ipx)$$, for example, where $$i = \sqrt{-1}$$). Given the relation of amplitudes to probabilities mentioned above, it is necessary that $$\langle \psi | \psi \rangle$$, the projection of $$| \psi \rangle$$ onto itself, is real. This requires that

$\langle \psi | = | \psi \rangle^*$

where $$| \psi \rangle^*$$ is the complex conjugate of $$| \psi \rangle$$. So if $$| \psi \rangle = a + ib$$ then $$\langle \psi | = a - ib$$, which yields $$\langle \psi | \psi \rangle = a^2 + b^2$$ , a real number.
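The same rule carries over to states with several complex components. As a small sketch (NumPy assumed; the two-component state below is a made-up example, not one from the text), the bra is formed by complex conjugation and the self-projection comes out real:

```python
import numpy as np

# A hypothetical two-component state with complex amplitudes.
ket = np.array([1.0 + 2.0j, 3.0 - 1.0j])

# The bra is the complex conjugate (transpose) of the ket.
bra = ket.conj()

# <psi|psi>: sum of conj(a_k) * a_k, which has zero imaginary part.
inner = bra @ ket

# np.vdot conjugates its first argument, so it gives the same number.
inner_vdot = np.vdot(ket, ket)

print(inner, inner_vdot)
```

Here each component contributes $$a^2 + b^2$$, so the result is $$(1 + 4) + (9 + 1) = 15$$ with no imaginary part.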

## The Linear Superposition

The analysis above can be approached in a less direct, but still revealing way by writing $$| \psi \rangle$$ and $$\langle \Phi |$$ as linear superpositions in the eigenstates of the position operator as is shown below.

$| \psi \rangle = \int | x \rangle \langle x | \psi \rangle \, dx$

$\langle \Phi | = \int \langle \Phi | x' \rangle \langle x' | \, dx'$

Combining these as a bra-ket pair yields,

$\langle \Phi | \psi \rangle = \iint \langle \Phi | x' \rangle \langle x' | x \rangle \langle x | \psi \rangle \; dx' \,dx = \int \langle \Phi | x \rangle \langle x | \psi \rangle \; dx$

The $$x'$$ disappears because the position eigenstates are an orthogonal basis set: $$\langle x' | x \rangle = \delta(x' - x)$$, which is zero unless $$x' = x$$ and integrates to 1 over $$x'$$.
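A discretized version of this collapse can be sketched in a few lines. This is only an illustration under stated assumptions: NumPy, a midpoint grid on the unit box, the ground state for $$\psi$$, a hypothetical polynomial $$\sqrt{30}\,x(1-x)$$ for $$\Phi$$, and a Kronecker delta divided by the grid spacing as the stand-in for $$\langle x' | x \rangle$$.

```python
import numpy as np

N = 400
x = (np.arange(N) + 0.5) / N          # midpoint grid on [0, 1]
dx = 1.0 / N

psi = np.sqrt(2.0) * np.sin(np.pi * x)    # <x|psi>, ground state of the box
phi = np.sqrt(30.0) * x * (1.0 - x)       # <x|Phi>, illustrative choice

# Discrete stand-in for <x'|x>: Kronecker delta divided by dx.
delta = np.eye(N) / dx

# Double "integral": sum over x' and x of <Phi|x'><x'|x><x|psi> dx' dx.
double_sum = (phi.conj() * dx) @ delta @ (psi * dx)

# Single integral that remains after x' has been removed.
single_sum = np.sum(phi.conj() * psi) * dx

print(double_sum, single_sum)
```

Both sums give the same overlap, which is the numerical counterpart of replacing the double integral by a single one.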

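$$| \psi \rangle = \sum_n | n \rangle \langle n | \psi \rangle$$ is a linear superposition in the discrete (rather than continuous) basis set $$\{|n\rangle \}$$. A specific example of this type of superposition is easy to demonstrate using matrix mechanics. For example,

$\dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = \dfrac{1}{\sqrt{2}} \left[ \begin{pmatrix} 1 \\ 0 \end{pmatrix} + \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right]$

The vector on the left represents spin-up in the x-direction, while the vectors on the right side are spin-up and spin-down in the z-direction, respectively. This can also be expressed symbolically in Dirac notation as

$| S_{xu} \rangle = \dfrac{ | S_{zu} \rangle + | S_{zd} \rangle}{2^{1/2}}$

It is easy to show that all three vector states are normalized, and that $$| S_{zu} \rangle$$ and $$| S_{zd} \rangle$$ form an orthonormal basis set. In other words, $$\langle S_{zu} | S_{zu} \rangle = \langle S_{zd} | S_{zd} \rangle = 1$$, and $$\langle S_{zu} | S_{zd} \rangle = \langle S_{zd} | S_{zu} \rangle = 0$$.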

It cannot be stressed too strongly that a linear superposition is not a mixture. For example, when the system is in the state $$| S_{xu} \rangle$$ every measurement of the x-direction spin yields the same result: spin-up. However, measurement of the z-direction spin yields spin-up 50% of the time and spin-down 50% of the time. The system has a well-defined value for the spin in the x-direction, but an indeterminate spin in the z-direction. It is easy to calculate the probabilities for the z-direction spin measurements:

$\left| \langle S_{zu} | S_{xu} \rangle \right|^2 = \dfrac{1}{2}$

and

$\left| \langle S_{zd} | S_{xu} \rangle \right|^2 = \dfrac{1}{2}.$

The reason $$| S_{xu} \rangle$$ cannot be interpreted as a 50-50 mixture of $$| S_{zu} \rangle$$ and $$| S_{zd} \rangle$$ is that $$| S_{zu} \rangle$$ and $$| S_{zd} \rangle$$ are themselves linear superpositions of $$| S_{xu} \rangle$$ and $$| S_{xd} \rangle$$:

$| S_{zu} \rangle = \dfrac{ | S_{xu} \rangle + | S_{xd} \rangle}{2^{1/2}}$

and

$| S_{zd} \rangle = \dfrac{ | S_{xu} \rangle - | S_{xd} \rangle}{2^{1/2}}$

Thus, if $$| S_{xu} \rangle$$ were a mixture of $$| S_{zu} \rangle$$ and $$| S_{zd} \rangle$$, it would yield an indefinite measurement of the spin in the x-direction, in spite of the fact that it is an eigenfunction of the x-direction spin operator.
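These statements are easy to check in matrix mechanics. The sketch below assumes NumPy and the usual two-component column vectors for the four spin states, with $$| S_{xd} \rangle$$ taken as $$2^{-1/2}(1, -1)$$; the helper name `probability` is illustrative.

```python
import numpy as np

s = 1.0 / np.sqrt(2.0)
S_zu = np.array([1.0, 0.0])        # spin-up along z
S_zd = np.array([0.0, 1.0])        # spin-down along z
S_xu = s * np.array([1.0, 1.0])    # spin-up along x
S_xd = s * np.array([1.0, -1.0])   # spin-down along x (assumed representation)

def probability(bra_state, ket_state):
    """|<bra|ket>|^2 for two state vectors."""
    return abs(np.vdot(bra_state, ket_state)) ** 2

# z-measurements on |S_xu>: two possible outcomes.
p_z = (probability(S_zu, S_xu), probability(S_zd, S_xu))

# x-measurements on |S_xu>: a single certain outcome.
p_x = (probability(S_xu, S_xu), probability(S_xd, S_xu))

# The z states rebuilt as superpositions of the x states.
zu_from_x = s * (S_xu + S_xd)
zd_from_x = s * (S_xu - S_xd)

print(p_z, p_x)
```

The z-direction probabilities come out equal while the x-direction result is certain, which is the signature of a superposition with a definite x-spin rather than a mixture.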

Example

Just one more example of the linear superposition. Consider a trial wave function for the particle in the one-dimensional, one-bohr box such as:

$\Phi(x) = \sqrt{105} (x^2-x^3)$

Because the eigenfunctions for the particle-in-a-box problem form a complete basis set, $$\Phi(x)$$ can be written as a linear combination (i.e., a linear superposition) of these eigenfunctions.

$| \Phi \rangle = \sum_n |n \rangle \langle n| \Phi \rangle = \sum_n | n \rangle \int \langle n |x \rangle \langle x | \Phi \rangle dx$

In this notation $$\langle n | \Phi \rangle$$ is the projection of $$| \Phi \rangle$$ onto the eigenstate $$|n\rangle$$. This projection or shadow of $$| \Phi \rangle$$ onto $$| n \rangle$$ can be written as $$c_n$$. It is a measure of the contribution $$| n \rangle$$ makes to the state $$| \Phi \rangle$$. It is also an overlap integral. Therefore we can write

$| \Phi \rangle = \sum_n | n \rangle c_n$

Using numerical software such as Matlab, it is easy to show that the first ten coefficients in this expansion are:

| $$c_1$$ | $$c_2$$ | $$c_3$$ | $$c_4$$ | $$c_5$$ | $$c_6$$ | $$c_7$$ | $$c_8$$ | $$c_9$$ | $$c_{10}$$ |
|---|---|---|---|---|---|---|---|---|---|
| 0.935 | -0.351 | 0.035 | -0.044 | 0.007 | -0.013 | 0.003 | -0.005 | 0.001 | -0.003 |

These expansion coefficients show that the trial wavefunction strongly resembles the lowest energy eigenstate ($$|n=1 \rangle$$) of the particle in the box system.
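The overlap integrals $$c_n = \int \langle n | x \rangle \langle x | \Phi \rangle \, dx$$ can be reproduced with a few lines of code. This is a sketch rather than the calculation used for the original numbers: it assumes NumPy, the box eigenfunctions $$\sqrt{2} \sin(n \pi x)$$, and a hand-written trapezoid rule; the names `trapezoid` and `box_state` are illustrative.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoid-rule integral of samples y over grid x."""
    return np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0

def box_state(n, x):
    """<x|n> for the particle in a box of unit length."""
    return np.sqrt(2.0) * np.sin(n * np.pi * x)

x = np.linspace(0.0, 1.0, 20001)
phi = np.sqrt(105.0) * (x**2 - x**3)      # trial function <x|Phi>

# c_n = <n|Phi> as an overlap integral, for n = 1..10.
c = np.array([trapezoid(box_state(n, x) * phi, x) for n in range(1, 11)])

print(np.round(c, 3))
print(np.sum(c**2))   # approaches 1 as more terms are included
```

The sum of the squared coefficients stays just below one because only the first ten terms of the expansion are kept.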

## Operators, Eigenvectors, Eigenvalues, and Expectation Values

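In matrix mechanics operators are matrices and states are represented by vectors. The matrices operate on the vectors to obtain useful physical information about the state of the system. According to quantum theory there is an operator for every physical observable and a system is either in a state with a well-defined value for that observable or it is not. The operators associated with spin in the $$x$$- and $$z$$-direction are shown below in units of $$h/4\pi$$.

$\hat{S}_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \qquad \hat{S}_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$

When $$\hat{S}_x$$ operates on $$| S_{xu} \rangle$$ the result is

$\hat{S}_x | S_{xu} \rangle = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = \dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = | S_{xu} \rangle$

$$| S_{xu} \rangle$$ is an eigenfunction or eigenvector of $$\hat{S}_x$$ with eigenvalue 1 (in units of $$h/4\pi$$). However, $$| S_{xu} \rangle$$ is not an eigenfunction of $$\hat{S}_z$$ because

$\hat{S}_z | S_{xu} \rangle = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = \dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ -1 \end{pmatrix} = | S_{xd} \rangle$

where $$| S_{xd} \rangle$$ is the spin-down state in the x-direction. This means, as mentioned in the previous section, that $$| S_{xu} \rangle$$ does not have a definite value for spin in the z-direction. Under these circumstances we can’t predict with certainty the outcome of a z-direction spin measurement, but we can calculate the average value for a large number of measurements. This is called the expectation value and in Dirac notation it is represented as follows:

$\langle S_z \rangle = \langle S_{xu} | \hat{S}_z | S_{xu} \rangle$

In matrix mechanics it is calculated as follows.

$\langle S_{xu} | \hat{S}_z | S_{xu} \rangle = \dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \dfrac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ 1 \end{pmatrix} = 0$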

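This result is consistent with the previous discussion which showed that $$| S_{xu} \rangle$$ is a 50-50 linear superposition of $$| S_{zu} \rangle$$ and $$| S_{zd} \rangle$$ with eigenvalues of +1 and -1, respectively. In other words, half the time the result of the measurement is +1 and the other half -1, yielding an average value of zero.

Now we will look at the calculation for the expectation value of position for a system in the state $$| \psi \rangle$$, which is set up as follows:

$\langle x \rangle = \langle \psi | \hat{x} | \psi \rangle$

To make this calculation computationally friendly we expand $$| \psi \rangle$$ in the eigenstates of the position operator.

$\langle x \rangle = \int \langle \psi | \hat{x} | x \rangle \langle x | \psi \rangle \, dx = \int \langle \psi | x \rangle \, x \, \langle x | \psi \rangle \, dx$

Note the simplification that occurs because $$\hat{x} | x \rangle = x | x \rangle$$.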
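Both kinds of expectation value can be sketched numerically. The block below assumes NumPy, the standard $$2 \times 2$$ spin matrices in units of $$h/4\pi$$, and, for the continuous case, the particle-in-a-box ground state with position as the observable; these choices and the variable names are assumptions made for illustration.

```python
import numpy as np

# Spin operators in units of h/(4*pi).
Sx = np.array([[0.0, 1.0], [1.0, 0.0]])
Sz = np.array([[1.0, 0.0], [0.0, -1.0]])

s = 1.0 / np.sqrt(2.0)
S_xu = s * np.array([1.0, 1.0])     # spin-up along x
S_xd = s * np.array([1.0, -1.0])    # spin-down along x

# Eigenvalue equation: Sx|S_xu> = (+1)|S_xu>.
is_eigen_x = np.allclose(Sx @ S_xu, 1.0 * S_xu)

# Sz|S_xu> gives |S_xd>, which is not a multiple of |S_xu>.
sz_on_xu = Sz @ S_xu

# Expectation value <S_xu|Sz|S_xu>: row vector, matrix, column vector.
expect_z = np.vdot(S_xu, Sz @ S_xu)

# Continuous case: <psi|x|psi> as the integral of psi*(x) x psi(x) dx.
x = np.linspace(0.0, 1.0, 2001)
psi = np.sqrt(2.0) * np.sin(np.pi * x)
integrand = psi.conj() * x * psi
expect_x = np.sum((integrand[1:] + integrand[:-1]) * np.diff(x)) / 2.0

print(is_eigen_x, sz_on_xu, expect_z, expect_x)
```

The spin average vanishes, and the position average lands at the center of the box, as the symmetry of the ground state suggests.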

## The Variation Method

We have had a preliminary look at the variation method, an approximate method used when an exact solution to Schrödinger’s equation is not available. Using

$\Phi(x) = \sqrt{30} x (1 -x)$

as a trial wave function for the particle-in-the-box problem, we evaluate the expectation value for the energy as

$\langle E \rangle = \langle \Phi | \hat{H} | \Phi \rangle$

However, employing Dirac’s formalism we can expand $$\Phi$$, as noted above, in terms of the eigenfunctions of $$\hat{H}$$ as follows.

$\langle E \rangle = \langle \Phi | \hat{H} | \Phi \rangle = \sum _n \langle \Phi | \hat{H} | n \rangle \langle n | \Phi \rangle$

However,

$\hat{H} | n \rangle = E_n | n \rangle$

because the states

$\langle x | n \rangle = \sqrt{2} \sin (n\pi x)$

are eigenfunctions of the energy operator $$\hat{H}$$. Thus, the energy expression becomes

$\langle E \rangle = \sum_n \langle \Phi | n \rangle E_n \langle n | \Phi \rangle = \sum_n | c_n |^2E_n$

with

$E_n = \dfrac{n^2\pi^2}{2}$

Because $$\Phi$$ is not an eigenfunction of $$\hat{H}$$, the energy operator, this system does not have a well-defined energy and all we can do is calculate the average value for many experimental measurements of the energy. Each individual energy measurement will yield one of the eigenvalues of the energy operator, $$E_n$$, and the $$|c_n|^2$$ values tell us the probability of this result being achieved. Using Mathcad it is easy to show that

| $$c_1^2$$ | $$c_3^2$$ | $$c_5^2$$ | $$c_7^2$$ |
|---|---|---|---|
| 0.9986 | 0.0014 | 0.00006 | 0.00001 |

All other coefficients are zero or vanishingly small. These results say that if we make an energy measurement on a system in the state represented by $$\Phi$$ there is a 99.86% chance we will get $$E_1 = 4.935$$, a 0.14% chance we will get $$E_3 = 44.413$$, and so on. We might say then that the state $$\Phi$$ is a linear combination of the first four odd eigenfunctions, with the first eigenfunction making by far the biggest contribution.
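The probabilities and the resulting average energy can be checked with a short calculation. The sketch below assumes NumPy, atomic units, a truncation of the sum at $$n = 99$$, and a hand-written trapezoid rule; none of these choices come from the original text.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoid-rule integral of samples y over grid x."""
    return np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0

x = np.linspace(0.0, 1.0, 20001)
phi = np.sqrt(30.0) * x * (1.0 - x)           # trial function <x|Phi>

n = np.arange(1, 100)                          # truncated basis, n = 1..99
c = np.array([trapezoid(np.sqrt(2.0) * np.sin(k * np.pi * x) * phi, x)
              for k in n])                     # c_n = <n|Phi>
E = n**2 * np.pi**2 / 2.0                      # eigenvalues E_n

prob = c**2                                    # probability of measuring E_n
E_avg = np.sum(prob * E)                       # <E> = sum_n |c_n|^2 E_n
E_1 = E[0]                                     # exact ground-state energy

print(np.round(prob[[0, 2, 4, 6]], 5))
print(E_avg, E_1)
```

The even-numbered coefficients vanish by symmetry, and the weighted sum converges toward 5, slightly above $$E_1 = \pi^2/2 \approx 4.935$$, in line with the variational principle.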

The variational theorem says that no matter how hard you try in constructing trial wave functions you cannot do better than the ‘true’ ground state value for the energy, and the energy expression above captures that important principle. The only way $$\Phi$$ can give the correct result for the ground state of the particle in the box, for example, is if $$c_1 = 1$$, that is, if $$\Phi$$ is the eigenfunction itself. If this is not true, then $$|c_1| < 1$$, some of the other $$c_n$$ are non-zero, and the energy has to be greater than $$E_1$$. Taking another look at the last two energy expressions reveals that a measurement operator can always be written as a projection operator involving its eigenstates:

$\hat{H} = \sum_n | n \rangle E_n \langle n |$

## Momentum Operator in Coordinate Space

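Wave-particle duality is at the heart of quantum mechanics. A particle with wavelength $$\lambda$$ has wave function (un-normalized) $$\exp(i 2 \pi x / \lambda)$$. However, according to de Broglie’s wave equation the particle’s momentum is $$p = h/\lambda$$. Therefore the momentum wave function of the particle in coordinate space is $$\langle x | p \rangle = \exp(ipx/\hbar)$$. In momentum space the following eigenvalue equation holds: $$\hat{p} | p \rangle = p | p \rangle$$. Operating on the momentum eigenfunction with the momentum operator in momentum space returns the momentum eigenvalue times the original momentum eigenfunction. In other words, in its own space the momentum operator is a multiplicative operator (the same is true of the position operator in coordinate space). To obtain the momentum operator in coordinate space this expression can be projected onto coordinate space by operating on the left by $$\langle x |$$.

$\langle x | \hat{p} | p \rangle = p \langle x | p \rangle = p \exp \left( \dfrac{ipx}{\hbar} \right) = \dfrac{\hbar}{i} \dfrac{d}{dx} \exp \left( \dfrac{ipx}{\hbar} \right) = \dfrac{\hbar}{i} \dfrac{d}{dx} \langle x | p \rangle$

Comparing the first and last terms reveals that $$\langle x | \hat{p} = \dfrac{\hbar}{i} \dfrac{d}{dx} \langle x |$$ and that $$\dfrac{\hbar}{i} \dfrac{d}{dx}$$ is the momentum operator in coordinate space. $$\langle p | x \rangle = \exp(-ipx/\hbar)$$ is the position wave function in momentum space. Using the method outlined above it is easy to show that the position operator in momentum space is $$-\dfrac{\hbar}{i} \dfrac{d}{dp}$$.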
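The coordinate-space form of the momentum operator can be tested on a plane wave with a finite-difference derivative. This is a rough sketch assuming NumPy, atomic units ($$\hbar = 1$$), an arbitrary momentum value of 3, and central differences on a uniform grid; all of these are illustrative choices.

```python
import numpy as np

p = 3.0                                    # arbitrary momentum, hbar = 1
x = np.linspace(0.0, 1.0, 10001)
plane_wave = np.exp(1j * p * x)            # <x|p>, un-normalized

# d/dx by central differences at the interior grid points.
derivative = (plane_wave[2:] - plane_wave[:-2]) / (x[2:] - x[:-2])

# Momentum operator in coordinate space: (1/i) d/dx.
p_hat_on_wave = derivative / 1j

# For an eigenfunction the ratio is the same number, p, at every point.
ratio = p_hat_on_wave / plane_wave[1:-1]
max_dev = np.max(np.abs(ratio - p))

print(ratio[:3], max_dev)
```

The ratio is constant across the grid up to the discretization error of the finite-difference derivative, which is what an eigenvalue equation requires.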

## Fourier Transform

Quantum chemists work mainly in position ($$x$$, $$y$$, $$z$$) space because they are interested in electron densities, that is, in how the electrons are distributed in space in atoms and molecules. However, quantum mechanics has an equivalent formulation in momentum space, which answers the question: what does the distribution of electron velocities look like? The two formulations are equivalent, that is, they contain the same information, and are connected formally through a Fourier transform. The Dirac notation shows this connection very clearly.

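$\langle p | \psi \rangle = \int \langle p | x \rangle \langle x | \psi \rangle \, dx$

Starting from the right we have the amplitude that a system in state $$\psi$$ has position $$x$$. Then, if it has position $$x$$, the amplitude that it has momentum $$p$$. We then sum over all values of $$x$$ to find all the ways a system in the state $$\psi$$ can have momentum $$p$$. As a particular example we can choose the particle-in-a-box problem with eigenfunctions noted above. It is easy to show that the momentum eigenstates in position space in atomic units (see previous section) are $$\langle x | p \rangle = (2\pi)^{-1/2} \exp(ipx)$$. This, of course, means that the complex conjugate is $$\langle p | x \rangle = (2\pi)^{-1/2} \exp(-ipx)$$. Therefore, the Fourier transform of $$\psi_n(x)$$ into momentum space is

$\langle p | \psi_n \rangle = \dfrac{1}{\sqrt{2\pi}} \int_0^1 \exp(-ipx) \, \sqrt{2} \sin(n \pi x) \, dx$

This integral can be evaluated analytically and yields the following momentum space wavefunctions for the particle-in-a-box.

$\psi_n(p) = n \sqrt{\pi} \, \dfrac{1 - \cos(n\pi) \exp(-ip)}{n^2 \pi^2 - p^2}$

A graphical display of the momentum distribution function, $$|\psi_n(p)|^2$$, for several states is shown below.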
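The transform can also be carried out numerically and compared with a closed form. The sketch below makes several assumptions: NumPy, atomic units, the normalization $$\langle p | x \rangle = (2\pi)^{-1/2} \exp(-ipx)$$, and the closed form $$n \sqrt{\pi} \, [1 - \cos(n\pi) \exp(-ip)] / (n^2\pi^2 - p^2)$$ for the box eigenfunctions $$\sqrt{2} \sin(n \pi x)$$. The function names and sample momenta are illustrative, and the momenta avoid $$p = \pm n\pi$$, where the closed form has a removable singularity.

```python
import numpy as np

def trapezoid(y, x):
    """Trapezoid-rule integral of samples y over grid x."""
    return np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0

x = np.linspace(0.0, 1.0, 4001)

def psi_p_numeric(n, p):
    """<p|psi_n> by direct integration of <p|x><x|psi_n> over the box."""
    integrand = np.exp(-1j * p * x) * np.sqrt(2.0) * np.sin(n * np.pi * x)
    return trapezoid(integrand, x) / np.sqrt(2.0 * np.pi)

def psi_p_closed(n, p):
    """Closed-form momentum-space wavefunction (assumed normalization)."""
    numerator = 1.0 - np.cos(n * np.pi) * np.exp(-1j * p)
    return n * np.sqrt(np.pi) * numerator / (n**2 * np.pi**2 - p**2)

momenta = np.array([-7.5, -2.0, 0.0, 1.0, 5.0])
states = (1, 2, 3)
numeric = np.array([[psi_p_numeric(n, p) for p in momenta] for n in states])
closed = np.array([[psi_p_closed(n, p) for p in momenta] for n in states])

max_diff = np.max(np.abs(numeric - closed))
distribution = np.abs(closed) ** 2     # momentum distribution |psi_n(p)|^2

print(max_diff)
print(np.round(distribution, 4))
```

Evaluating `distribution` on a finer grid of momenta gives the data needed to plot the momentum distribution for each state.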

## Summary

J. L. Martin (see references below) has identified four virtues of Dirac notation.

1. It is concise. There are a small number of basic elements to Dirac’s notation: bras, kets, bra-ket pairs, ket-bra products, and the completeness relation (continuous and discrete). With these few building blocks you can construct all of quantum theory.
2. It is flexible. You can use it to say the same thing in several ways; translate with ease from one language to another. Perhaps the insight that the Dirac notation offers to the Fourier transform is the best example of this virtue.
3. It is general. It is a syntax for describing what you want to do without committing yourself to a particular computational approach. In other words, you use it to set up a problem and then choose the most expeditious way to execute the calculation.
4. While it is not exactly the industry standard, it should be for the reasons listed in 1-3 above. It is widely used, so if you want to read the literature in quantum chemistry and physics, you need to learn Dirac notation. In addition, most of the best quantum textbooks in chemistry and physics use it.
5. I would like to add a 5th virtue. Once you get the “hang of it” you will find that it is simple to use and very enlightening. It facilitates the understanding of all the fundamental quantum concepts.

## References

1. Chester, M. Primer of Quantum Mechanics; Krieger Publishing Co.: Malabar, FL, 1992.
2. Das, A.; Melissinos, A. C. Quantum Mechanics: A Modern Introduction; Gordon and Breach Science Publishers: New York, 1986.
3. Feynman, R. P.; Leighton, R. B.; Sands, M. The Feynman Lectures on Physics, Vol. 3; Addison-Wesley: Reading, 1965.
4. Martin, J. L. Basic Quantum Mechanics; Clarendon Press: Oxford, 1981.