# 7.4: Perturbation Theory Expresses the Solutions in Terms of Solved Problems

It is easier to compute the changes in the energy levels and wavefunctions with a scheme of successive corrections to the zero-field values. This method, termed *perturbation theory*, is the single most important method of solving problems in quantum mechanics, and is widely used in atomic physics, condensed matter and particle physics. Perturbation theory is another approach to finding approximate solutions to a problem, by starting from the exact solution of a related, simpler problem. A critical feature of the technique is a middle step that breaks the problem into "solvable" and "perturbation" parts. Perturbation theory is applicable if the problem at hand cannot be solved exactly, but can be formulated by adding a "small" term to the mathematical description of the exactly solvable problem.

**Figure \(\PageIndex{1}\):** Perturbed Energy Spectrum. Image used with permission (CC BY-SA 2.0, Frontier).

### The Perturbation Series

We begin with a Hamiltonian \(H^0\) having known eigenkets and eigenenergies:

\[ H^o | n^o \rangle = E_n^o | n^o \rangle \label{7.4.1}\]

The task is to find how these eigenstates and eigenenergies change if a small term \(H^1\) (an external field, for example) is added to the Hamiltonian, so:

\[ ( H^0 + H^1 ) | n \rangle = E_n | n \rangle \label{7.4.2}\]

That is to say, on switching on \(H^1\) changes the wavefunctions:

\[ \underbrace{ | n^o \rangle }_{\text{unperturbed}} \Rightarrow \underbrace{|n \rangle }_{\text{Perturbed}}\label{7.4.3}\]

and energies (Figure \(\PageIndex{1}\)):

\[ \underbrace{ E_n^o }_{\text{unperturbed}} \Rightarrow \underbrace{E_n }_{\text{Perturbed}} \label{7.4.4}\]

The basic assumption in perturbation theory is that \(H^1\) is sufficiently small that the leading corrections are the same order of magnitude as\(H^1\) itself, and the true energies can be better and better approximated by a successive series of corrections, each of order \(H^1/H^o\) compared with the previous one.

The strategy is to **expand** the true wavefunction and corresponding eigenenergy as series in \(H^1/H^o\). These series are then fed into Equation \(\ref{7.4.2}\), and terms of the same order of magnitude in \(H^1/H^o\) on the two sides are set equal. The equations thus generated are solved one by one to give progressively more accurate results.

To make it easier to identify terms of the same order in \(H^1/H^o\) on the two sides of the equation, it is convenient to introduce a dimensionless parameter \(\lambda\) which always goes with \(H^1\), and then expand both eigenstates and eigenenergies as power series in \(\lambda\),

\[ \color{red} | n \rangle = \sum _ i^m= \lambda ^i| n^i \rangle \label{7.4.5}\]

\[ \color{red} E_n = \sum_{i=0}^m \lambda ^i E_n^i \label{7.4.6}\]

where \(m\) is how many terms in the expansion we are considering. The ket \(|n^i \rangle\) is multiplied by \(\lambda^i\) and is therefore of order \((H^1/H^o)^i\). For example, in first order perturbation theory, Equations \(\ref{7.4.5}\) are truncated at \(m=1\) (and setting \(\lambda=1\)):

\[ | n \rangle \approx | n^o \rangle + | n^1 \rangle \label{7.4.7}\]

\[ E_n \approx E_n^o + E_n^1 \label{7.4.8} \]

\(\lambda\) is purely a bookkeeping device: we will set it equal to 1 when we are through! It’s just there to keep track of the orders of magnitudes of the various terms.

However, let's consider the general case for now. Adding the full expansions for the eigenstate (Equation \(\ref{7.4.5}\)) and energies (Equation \(\ref{7.4.6}\)) into the Schrödinger equation for the perturbation Equation \(\ref{7.4.2}\) in

\[ ( H^o + \lambda H^1) | n \rangle = E_n| n \rangle \label{7.4.9}\]

we have

\[ (H^o + \lambda H^1) \left( \sum _ {i=0}^m \lambda ^i| n^i \rangle \right) = \left( \sum_{i=0}^m \lambda^i E_n^i \right) \left( \sum _ {i=0}^m \lambda ^i| n^i \rangle \right) \label{7.4.10}\]

We’re now ready to match the two sides term by term in powers of \(\lambda\). Note that the zeroth-order term, of course, just gives back the unperturbed Schrödinger Equation (Equation \(\ref{7.4.1}\)).

Let's look at Equation \(\ref{7.4.10}\) with the first few terms of the expansion:

\[ (H^o + \lambda H^1) \left( | n ^o \rangle + \lambda | n^1 \rangle \right) = \left( E _n^0 + \lambda E_n^1 \right) \left( | n ^o \rangle + \lambda | n^1 \rangle \right) \label{7.4.11}\]

\[ H^o | n ^o \rangle + \lambda H^1 | n ^o \rangle + \lambda H^o | n^1 \rangle + \lambda^2 H^1| n^1 \rangle= E _n^0 | n ^o \rangle + \lambda E_n^1 | n ^o \rangle + E _n^0 | n ^1 \rangle + \lambda^2 E_n^1 | n^1 \rangle \label{7.4.11A}\]

Collecting terms in order of \(\lambda\)

\[ \underset{\text{zero order}}{H^o | n ^o \rangle} + \color{red} \underset{\text{1st order}}{\lambda ( H^1 | n ^o \rangle + H^o | n^1 \rangle )} + \color{blue} \underset{\text{2nd order}} {\lambda^2 H^1| n^1 \rangle} =\color{black}\underset{\text{zero order}}{E _n^0 | n ^o \rangle} + \color{red} \underset{\text{1st order}}{ \lambda (E_n^1 | n ^o \rangle + E _n^0 | n ^1 \rangle )} +\color{blue}\underset{\text{2nd order}}{\lambda^2 E_n^1 | n^1 \rangle} \label{7.4.12}\]

If we expanded Equation \(\ref{7.4.10}\) further we could express the energies and wavefunctions in higher order components.

#### Zero-Order Terms (\(\lambda=0\))

The zero order terms in the expansion of Equation \(\ref{7.4.10}\) results in just the Schrödinger Equation for the unperturbed system

\[ H^o | n^o \rangle = E_n^o | n^o \rangle \label{Zero}\]

#### First-Order Expression of Energy (\(\lambda=1\))

The summations in Equations \(\ref{7.4.5}\), \(\ref{7.4.6}\), and \(\ref{7.4.10}\) can be truncated at any order of \(\lambda\). For example, the first order perturbation theory has the truncation at \(\lambda=1\). Matching the terms that linear in \(\lambda\) (red terms in Equation \(\ref{7.4.12}\)) and setting \(\lambda=1\) on both sides of Equation \(\ref{7.4.12}\):

\[ H^o | n^1 \rangle + H^1 | n^o \rangle = E_n^o | n^1 \rangle + E_n^1 | n^o \rangle \label{7.4.13}\]

Equation \(\ref{7.4.13}\) is the key to finding the first-order change in energy \(E_n^1\). Taking the **inner product** of both sides with \(\langle n^o | \):

\[ \langle n^o | H^o | n^1 \rangle + \langle n^o | H^1 | n^o \rangle = \langle n^o | E_n^o| n^1 \rangle + \langle n^o | E_n^1 | n^o \rangle \label{7.4.14}\]

since operating the zero-order Hamiltonian on the bra wavefunction (this is just the Schrödinger equation (Equation \(\ref{Zero}\)) is

\[ \langle n^o | H^o = \langle n^o | E_n^o \label{7.4.15}\]

and via the orthonormality of the unperturbed \(| n^o \rangle\) wavefunctions both

\[ \langle n^o | n^o \rangle = 1 \label{7.4.16}\]

and Equation \(\ref{7.4.14}\) can be simplified

\[ \cancel{E_n^o \langle n^o | n^1 \rangle} + \langle n^o | H^1 | n^o \rangle = \cancel{ E_n^o \langle n^o | n^1 \rangle} + E_n^1 \cancelto{1}{\langle n^o | n^o} \rangle \label{7.4.14new}\]

since the unperturbed set of eigenstates are orthogonal and we can cancel the other term on each side of the equation, we find that

\[ \color{red} E_n^1 = \langle n^o | H^1 | n^o \rangle \label{7.4.17}\]

The first-order change in the energy of a state resulting from adding a perturbing term \(H^1\) to the Hamiltonian is just the

expectation valueof \(H^1\) in theunperturbedwavefunctions.

That is, within the first order limit, the energies are given by

\[ \color{red} \underbrace{E_n \approx E_n^o + E_n^1 = E_n^o + \langle n^o | H^1 | n^o \rangle}_{\text{First Order Perturbation Theory}} \label{7.4.17.2}\]

Example \(\PageIndex{1}\): A Perturbed Particle in a Box

Estimate the energy of the ground-state and first excited-state wavefunction within first-order perturbation theory of a system with the following potential energy

\[V(x)=\begin{cases}

V_o & 0\leq x\leq L \\

\infty & x< 0 \; and\; x> L \end{cases} \nonumber\]

**Solution**

The first step in any perturbation problem is to write the Hamiltonian in terms of a unperturbed component that the solutions (both eigenstates and energy) are known and a perturbation component (Equation \(\ref{7.4.2}\)). For this system, the unperturbed Hamilonian and solutions is the particle in an infiinitely high box and the perturbation is a shift of the potential within the box by \(V_o\).

\[ \hat{H}^1 = V_o \nonumber\]

Using Equation \(\ref{7.4.17}\) for the first-order term in the energy of the ground-state

\[ E_n^1 = \langle n^o | H^1 | n^o \rangle \nonumber\]

with the wavefunctions known from the particle in the box problem

\[ | n^o \rangle = \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) \nonumber\]

At this stage we can do two problems independently (i.e., the ground-state with \(| 1 \rangle\) and the first excited-state \(| 2 \rangle\)). However, in this case, the first-order perturbation to any particle-in-the-box sate can be easily derived.

\[ E_n^1 = \int_0^L \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) V_o \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) dx \nonumber\]

or better yet, instead of evaluating this integrals we can simplify the expression

\[ E_n^1 = \langle n^o | H^1 | n^o \rangle = \langle n^o | V_o | n^o \rangle = V_o \langle n^o | n^o \rangle = V_o \nonumber\]

so via Equation \(\ref{7.4.17.2}\), the energy of each perturbed eigenstate is

\[ E_n \approx E_n^o + V_o \nonumber\]

or

\[ E_n \approx \dfrac{h^2}{8mL^2}n^2 + V_o \nonumber\]

While this is the first order perturbation to the energy, it is also the exact value too.

Example \(\PageIndex{2}\): An Even More Perturbed Particle in a Box

Estimate the energy of the ground-state wavefunction within first-order perturbation theory of a system with the following potential energy

\[V(x)=\begin{cases}

V_o & 0\leq x\leq L/2 \\

\infty & x< 0 \; and\; x> L \end{cases} \nonumber\]

**Solution**

As with Example \(\PageIndex{1}\), we recognize that unperturbed component of the problem (Equation \(\ref{7.4.2}\)) is the particle in an infinitely high well. For this system, the unperturbed Hamiltonian and solutions is the particle in an infinitely high box and the perturbation is a shift of the potential within half a box by \(V_o\). This is essentially a step function.

Using Equation \(\ref{7.4.17}\) for the first-order term in the energy of the any state

\[ E_n^1 = \langle n^o | H^1 | n^o \rangle \nonumber\]

\[ E_n^1 = \int_0^{L/2} \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) V_o \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) + dx \nonumber\]

\[\int_{L/2}^L \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) 0 \sqrt{\dfrac{2}{L}} \sin \left ( \dfrac {n \pi}{L} x \right) dx \nonumber\]

The second integral is zero and the first integral is simplified to

\[ E_n^1 = \dfrac{2}{L} \int_0^{L/2} V_o \sin^2 \left( \dfrac {n \pi}{L} x \right) + dx \nonumber\]

this is evaluated to

\[ E_n^1 = \dfrac{2V_o}{L} \left[ \dfrac{-1}{2 \frac{\pi n}{a}} \cos \left( \dfrac {n \pi}{L} x \right) \sin \left( \dfrac {n \pi}{L} x \right) + \dfrac{x}{2} \right]_o^{L/2}\]

\[ = \dfrac{2V_o}{\cancel{L}} \dfrac{\cancel{L}}{4} = \dfrac{V_o}{2}\]

The energy of each perturbed eigenstate, via Equation \(\ref{7.4.17.2}\), is

\[ E_n \approx E_n^o + \dfrac{V_o}{2} \nonumber\]

or

\[ E_n \approx \dfrac{h^2}{8mL^2}n^2 + \dfrac{V_o}{2} \nonumber\]

#### First-Order Expression of Wavefunction (\(\lambda=1\))

The general expression for the first-order change in the *wavefunction* is found by taking the inner product of the first-order expansion (Equation \(\ref{7.4.13}\)) with the bra \( \langle m^o |\) with \(m \neq n\),

\[ \langle m^o | H^o | n^1 \rangle + \langle m^o |H^1 | n^o \rangle = \langle m^o | E_n^o | n^1 \rangle + \langle m^o |E_n^1 | n^o \rangle \label{7.4.18}\]

**Last term on right side of Equation \(\ref{7.4.18}\)**

The last integral on the right hand side of Equation \(\ref{7.4.18}\) is zero, since \(m \neq n\) so

\[ \langle m^o |E_n^1 | n^o \rangle = E_n^1 \langle m^o | n^o \rangle \label{7.4.19}\]

and

\[\langle m^o | n^0 \rangle = 0 \label{7.4.20}\]

**First term on right side of Equation \(\ref{7.4.18}\)**

The first integral is more complicated and can be expanded back into the \(H^o\)

\[ E_m^o \langle m^o | n^1 \rangle = \langle m^o|E_m^o | n^1 \rangle = \langle m^o | H^o | n^1 \rangle \label{7.4.21}\]

since

\[ \langle m^o | H^o = \langle m^o | E_m^o \label{7.4.22}\]

so

\[ \langle m^o | n^1 \rangle = \dfrac{\langle m^o | H^1 | n^o \rangle}{ E_n^o - E_m^o} \label{7.4.23}\]

and therefore the wavefunction corrected to first order is:

\[ \color{red} \underbrace{| n \rangle \approx | n^o \rangle + | n^1 \rangle = | n^o \rangle + \sum _{m \neq n} \dfrac{|m^o \rangle \langle m^o | H^1| n^o \rangle }{E_n^o - E_m^o}}_{\text{First Order Perturbation Theory}} \label{7.4.24}\]

Equation \(\ref{7.4.24}\) is essentially is an expansion of the unknown wavefunction correction as a **linear combination** of known unperturbed wavefunctions \(\ref{7.4.24.2}\):

\[ \color{red} | n \rangle \approx | n^o \rangle + | n^1 \rangle = | n^o \rangle + \sum _{m \neq n} c_{m,n} |m^o \rangle \label{7.4.24.2}\]

with the expansion coefficients determined by

\[ \color{red} c_{m,n} = \dfrac{\langle m^o | H^1| n^o \rangle }{E_n^o - E_m^o} \label{7.4.24.3}\]

Since is justified since the set of original zero-order wavefunctions forms a complete basis set that can describe any function.

**Figure \(\PageIndex{2}\): **The first order perturbation of the ground-state wavefunction for a perturbed (left) can be expressed as a linear combination of all excited-state wavefunctions of the unperturbed potential (Equation \(\ref{7.4.24.2}\), shown as a harmonic oscillator in this example (right). Note that the ground-state harmonic oscillator wavefuncion is not part of this expression and technically all wavefunctions need to be included in the expression, not just the first eight wavefunctions shown here.

Calculating the first order perturbation to the wavefunctions is in general an infinite sum of off diagonal matrix elements of \(H^1\) (Figure \(\PageIndex{2}\)). However, the denominator argues that terms in this sum will be weighted by states that are of** comparable energy**. That means in principle, these sum can be truncated easily based off of some criterion. Another point to consider is that many of these matrix elements will equal zero depending on the symmetry of the \(\{| n^o \rangle \}\) basis and \(H^1\) (e.g., some \(\langle m^o | H^1| n^o \rangle\) integrals in Equation \(\ref{7.4.24}\) could be zero due to the integrand having an odd symmetry; see Example \(\PageIndex{3}\)).

The denominators in Equation \(\ref{7.4.24}\) argues that terms in this sum will be preferentially dictated by states that are of

comparable energy. That is, eigenstates that have energies significantly greater or lower than the unperturbed eigenstate will weakly contribute to the perturbed wavefunction.

Example \(\PageIndex{3}\): Harmonic Oscillator with a Cubic Perturbation

Estimate the energy of the ground-state wavefunction associated with the Hamiltonian using perturbation theory

\[ \hat{H} = \dfrac{-\hbar}{2m} \dfrac{d^2}{dx^2} + \dfrac{1}{2} kx^2 + \epsilon x^3 \nonumber\]

**Solution**

**Energy**

The first steps in flowchart for applying perturbation theory (Figure \(\PageIndex{1}\)) is to separate the Hamiltonian of the difficult (or unsolvable) problem into a solvable one with a perturbation. For this case, we can rewrite the Hamiltonian as

\[ \hat{H}^{o} + \hat{H}^{1} \nonumber\]

where

- \(\hat{H}^{o}\) is the Hamitonian for the standard Harmonic Oscillator with
**known**eigenstates and eigenenergies \[ \hat{H}^{(0)}= \dfrac{-\hbar}{2m} \dfrac{d^2}{dx^2} + \dfrac{1}{2} kx^2 \nonumber\] - \(\hat{H}^{1}\) is the pertubtiation \[\hat{H}^{1} = \epsilon x^3 \nonumber\]

The first order perturbation is given by Equation \(\ref{7.4.17}\), which for this problem is

\[E_n^1 = \langle n^o | \epsilon x^3 | n^o \rangle \nonumber\]

Notice that the integrand has an odd symmetry (i.e., \(f(x)=-f(-x)\)) with the perturbation Hamiltonian being odd and the ground state harmonic oscillator wavefunctions being even. So

\[E_n^1=0 \nonumber\]

This means to first order purtbation theory, this cubic terms does not alter the ground state energy (via Equation \(\ref{7.4.17.2})\). However, this is not the case if second-order perturbation theory were used, which is more accurate (not shown).

**Wavefunction**

Calculating the first order perturbation to the wavefunctions (equation \(\ref{7.4.24}\) is more difficult than energy since multiple integrals must be evaluated (an infinite number if symmetry arguments are not applicable). The harmonic oscillator wavefunctions are often written in terms of \(Q\), the unscaled displacement coordinate:

\[ | \Psi _v (x) \rangle = N_v'' H_v (\sqrt{\alpha} Q) e^{-\alpha Q^2/ 2} \]

with \(\alpha\)

\[ \alpha =1/\sqrt{\beta} = \sqrt{\dfrac{k \mu}{\hbar ^2}} \]

and

\[ N_v'' = \sqrt {\dfrac {1}{2^v v!}} \left(\dfrac{\alpha}{\pi}\right)^{1/4} \]

Let's consider only the first six wavefunctions that use these Hermite polynomials \(H_v (x)\):

- \(H_0 = 1\)
- \(H_1 = 2x\)
- \(H_2 = -2 + 4x^2\)
- \(H_3 = -12x + 8x^3\)
- \(H_4 = 12 - 48x^2 +16x^4\)
- \(H_5 = 120x - 160x^3 + 32x^5\)

The first order perturbation to the wavefunction (Equation \(\ref{7.4.24}\))

\[ | 0^1 \rangle = \sum _{m \neq 0} \dfrac{|m^o \rangle \langle m^o | H^1| 0^o \rangle }{E_0^o - E_m^o}\]

given these truncated wavefunctions (we should technically use the infinite sum) and that we are considering only the ground state with \(n=0\):

\[| 0^1 \rangle = \dfrac{ \langle 1^o | H^1| 0^o \rangle }{E_0^o - E_1^o} |1^o \rangle + \dfrac{ \langle 2^o | H^1| 0^o \rangle }{E_0^o - E_2^o} |2^o \rangle + \dfrac{ \langle 3^o | H^1| 0^o \rangle }{E_0^o - E_3^o} |3^o \rangle + \dfrac{ \langle 4^o | H^1| 0^o \rangle }{E_0^o - E_4^o} |4^o \rangle + \dfrac{ \langle 5^o | H^1| 0^o \rangle }{E_0^o - E_5^o} |5^o \rangle \nonumber\]

We can use symmetry of the perturbation and unperturbed wavefunctions to solve the integrals above. We know that the unperturbed wavefunctions \(\{|n^{0}\} \rangle\) alternate between even (when \(v\) is even) and odd (when \(v\) is odd). Since the perturbation is an odd function, only when \(m= 2k+1\) with \(k=1,2,3\) would these integrals be non-zero (i.e., for \(m=1,3,5, ...\)).

So of the original five unperturbed wavefuncion, only \(|m=2\rangle\) and \(m=4 \rangle\) mix to make the first-order perturbed wavefunction so

\[| 0^1 \rangle = \dfrac{ \langle 1^o | H^1| 0^o \rangle }{E_0^o - E_1^o} |1^o \rangle + \dfrac{ \langle 3^o | H^1| 0^o \rangle }{E_0^o - E_3^o} |3^o \rangle + \dfrac{ \langle 5^o | H^1| 0^o \rangle }{E_0^o - E_5^o} |5^o \rangle \nonumber\]

At this stage, the integrals have to be manually calculated using the defined wavefuctions above, which is left as an exercise. Notice that each unperturbed wavefunction that can "mix" to generate the perturbed wavefunction will have a reciprocally decreasing contribution (w.r.t. energy) due to the growing denominator.

Exercise \(\PageIndex{3}\): Harmonic Oscillator with a Quartic Perturbation

Estimate the energy of the ground-state wavefunction associated with the Hamiltonian using perturbation theory

\[ \hat{H} = \dfrac{-\hbar}{2m} \dfrac{d^2}{dx^2} + \dfrac{1}{2} kx^2 + \gamma x^4 \nonumber\]

#### Second-Order Terms (\(\lambda=2\))

There are higher energy terms in the expansion of Equation \(\ref{7.4.5}\) (e.g., the blue terms in Equation \(\ref{7.4.12}\)), but are not discussed further here other than noting the whole perturbation process is an infitinite series of corrections that ideally converge to the correct answer. It is the truncating this series as a finite level that is the approximation.

The general approach to perturbation theory applications is giving in the flowchart below.

**Figure \(\PageIndex{1}\): **Simplified algorithmic flowchart of the Perturbation Theory approximation showing the first two perturbation orders. The process can be continued to third and higher orders.

Perturbation Theory Does not always Work

It should be noted that there *are* problems which cannot be solved using perturbation theory, even when the perturbation is very weak, although such problems are the exception rather than the rule. One such case is the one-dimensional problem of free particles perturbed by a localized potential of strength *l*. As we found earlier in the course, switching on an arbitrarily weak attractive potential causes the \(k=0\) free particle wavefunction to drop below the continuum of plane wave energies and become a localized bound state with binding energy of order \(\lambda^2\). However, changing the sign of \(\lambda\) to give a repulsive potential there is no bound state, the lowest energy plane wave state stays at energy zero. Therefore the energy shift on switching on the perturbation **cannot **be represented as a power series in \(\lambda\), the strength of the perturbation.

### Contributors

Michael Fowler (Beams Professor, Department of Physics, University of Virginia)