2.2.1: Collision theory, transition state theory, and the prediction of rate laws and rate constants

Last updated
Save as PDF

Page ID: 401770

Mark Tuckerman
University of Illinois Springfield

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\id}{\mathrm{id}}\)

\( \newcommand{\Span}{\mathrm{span}}\)

\( \newcommand{\kernel}{\mathrm{null}\,}\)

\( \newcommand{\range}{\mathrm{range}\,}\)

\( \newcommand{\RealPart}{\mathrm{Re}}\)

\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

\( \newcommand{\Argument}{\mathrm{Arg}}\)

\( \newcommand{\norm}[1]{\| #1 \|}\)

\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vectorC}[1]{\textbf{#1}} \)

\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

Rate Law and Collision Theory

Consider the reaction

\[\text{A} + \text{B} \longrightarrow \text{C}\]

In the last class, we regarded the rate law

\[r = k \left[ \text{A} \right] \left[ \text{B} \right] \label{20.1}\]

as empirical. As it happens, we can actually derive this using the collision theory discussed in Lecture 6. Recall, from lecture 6, that the collision rate between two atoms or molecules in a system is

\[\gamma = \rho \sigma \left< \left| \textbf{v} \right| \right> \label{20.2}\]

where \(\rho\) is the number density, \(\sigma\) is the collision cross section, and \(\left< \left| \textbf{v} \right| \right>\) is the relative velocity between the two atoms or molecules. Now, if the two colliding atoms or molecules are different, and we are interested in the rate of collisions of atoms/molecules of type \(\text{A}\) with those of type \(\text{B}\), then the collision rate must be written as

\[\gamma_\text{AB} = \sigma_\text{AB} \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{B} \label{20.3}\]

Here \(\rho_\text{B}\) is the density of atoms/molecules of type \(\text{B}\), \(\left| \textbf{v}_\text{AB} \right| = \left| \textbf{v}_\text{B} - \textbf{v}_\text{A} \right|\) is the relative speed between \(\text{A}\) and \(\text{B}\), and \(\sigma_\text{AB}\) is the cross section between \(\text{A}\) and \(\text{B}\), which, is given the average of arithmetic and geometric averages:

\[\sigma_\text{AB} = \frac{1}{2} \left[ \left( \frac{\sigma_\text{A} + \sigma_\text{B}}{2} \right) + \sqrt{\sigma_\text{A} \sigma_\text{B}} \right] \label{20.4}\]

From the Maxwell-Boltzmann distribution,

\[\left< \left| \textbf{v}_\text{AB} \right| \right> = \sqrt{\frac{8 k_B T}{\pi \mu}} \label{20.5}\]

where the reduced mass

\[\mu = \frac{m_\text{A} m_\text{B}}{m_\text{A} + m_\text{B}} \label{20.6}\]

The collision rate \(\gamma_\text{AB}\) is the rate for the collision of one atom/molecule of \(\text{A}\). If there are \(N_\text{A}\) atoms/molecules of \(\text{A}\), then the total collision rate for \(\text{A}\) with \(\text{B}\) is

\[\gamma_\text{tot} = N_\text{A} \gamma_\text{AB} = N_\text{A} \sigma_\text{AB} \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{B} \label{20.7}\]

However, the number density of \(\text{A}\) is \(\rho_\text{A} = N_\text{A}/V\), so we can write the total collision rate as

\[\gamma_\text{tot} = \sigma_\text{AB} V \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{A} \rho_\text{B} \label{20.8}\]

In a time interval \(dt\), the number of collisions \(dN_\text{coll}\) is

\[dN_\text{coll} = \gamma_\text{tot} dt = \sigma_\text{AB} V \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{A} \rho_\text{B} dt\]

Let \(P_\text{rxn}\) denote the probability that a collision between \(\text{A}\) and \(\text{B}\) leads to product \(\text{C}\). The rate of decrease of \(N_\text{A}\) must then be

\[dN_\text{A} = -P_\text{rxn} dN_\text{coll} = -\sigma_\text{AB} P_\text{rxn} V \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{A} \rho_\text{B} dt \label{20.9}\]

so that

\[\frac{dN_\text{A}}{dt} = -\sigma_\text{AB} P_\text{rxn} V \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{A} \rho_\text{B} \label{20.10}\]

Note that the rate is

\[r = -\frac{d \left[ \text{A} \right]}{dt} \label{20.11}\]

However, \(\left[ \text{A} \right]\) is in units of moles/liter. The ratio \(N_\text{A}/\left(N_0V \right)\), where \(N_0\) is Avogadro’s number, has the proper units of moles/liter, if \(V\) is in liters. Thus,

\[\begin{align} \frac{d \left[ \text{A} \right]}{dt} &= \frac{d \left( N_\text{A}/\left( N_0 V \right) \right)}{dt} \\ &= -\frac{1}{N_0 V} \sigma_\text{AB} P_\text{rxn} V \left< \left| \text{v}_\text{AB} \right| \right> \rho_\text{A} \rho_\text{B} \\ &= -\sigma_\text{AB} P_\text{rxn} N_0^{-1} \left< \left| \textbf{v}_\text{AB} \right| \right> \rho_\text{A} \rho_\text{B} \end{align} \label{20.12}\]

Since \(\rho_\text{A}\) has units of (molecules of \(\text{A}\)/liters), we can write \(\rho_\text{A} = N_0 \left[ \text{A} \right]\), and similarly, \(\rho_\text{B} = N_0 \left[ \text{B} \right]\). This gives

\[\frac{d \left[ \text{A} \right]}{dt} = -\sigma_\text{AB} P_\text{rxn} N_0 \left< \left| \textbf{v}_\text{AB} \right| \right> \left[ \text{A} \right] \left[ \text{B} \right] = -k \left[ \text{A} \right] \left[ \text{B} \right] \label{20.13}\]

where the rate constant is

\[k = \sigma_\text{AB} P_\text{rxn} N_0 \left< \left| \textbf{v}_\text{AB} \right| \right> \label{20.14}\]

To determine the reaction probability \(P_\text{rxn}\), consider the energy profile for the reaction in Figure \(\PageIndex{1}\). In the gas phase, the activation “energy”, denote \(E_a\) in the figure is the potential energy at the top of the hill, which we denote as \(\mathcal{E}^\ddagger\). If the reaction takes place in a condensed phase, such as in solution, then the activation “energy” is the free energy \(\Delta G^\ddagger\).

Tuckerman Screenshot 20-1.png — Figure \(\PageIndex{1}\): Illustration of a reaction energy profile.

If \(\text{A}\) and \(\text{B}\) are atoms, then \(P_\text{rxn}\) is the probability that the energy \(E_\text{AB}\) between \(\text{A}\) and \(\text{B}\) must be larger than this energy \(E_a\) in order for the collision to yield product \(\text{C}\). If \(\text{A}\) and \(\text{B}\) are molecules, then \(\text{A}\) and \(\text{B}\) must also have the right orientation in addition to a sufficiently high energy. The probability that they have the right orientation is a fraction \(f < 1\), which we call the steric factor. When \(\text{A}\) and \(\text{B}\) are atoms, \(f = 1\). Generally, we can write

\[P_\text{rxn} = f P \left( E_\text{AB} > E_a \right) \label{20.15}\]

The general probability distribution \(p \left( E_\text{AB} \right)\) is just given by the Boltzmann distribution

\[p \left( E_\text{AB} \right) Ce^{-\beta E_\text{AB}} \label{20.16}\]

where \(C\) is a normalization constant. The normalization condition is

\[\int_0^\infty p \left( E_\text{AB} \right) d E_\text{AB} = C \int_0^\infty e^{-\beta E_\text{AB}} = \left. -\frac{C}{\beta} e^{-\beta E_\text{AB}} \right|_0^\infty = 1 \label{20.17}\]

which gives \(C = \beta = 1/\left( k_B T \right)\). Thus,

\[p \left( E_\text{AB} \right) = \beta e^{-\beta E_\text{AB}} \label{20.18}\]

Now, the probability \(P \left( E_\text{AB} > E_a \right)\) that \(E_\text{AB} > E_a\) is

\[P \left( E_\text{AB} > E_a \right) = \beta \int_{E_a}^\infty e^{-\beta E_\text{AB}} d E_\text{AB} = e^{-\beta E_a} \label{20.19}\]

which gives the rate constant as

\[k = \sigma_\text{AB} N_0 f e^{-\beta E_a} \left< \left| \textbf{v}_\text{AB} \right| \right> = \sigma_\text{AB} N_0 f e^{-\beta E_a} \left( \frac{8}{\beta \pi \mu} \right)^{1/2} \label{20.20}\]

We see, generally, that

\[k = \text{A} e^{-\beta E_a} \label{20.21}\]

where \(E_a\) is the activation potential \(\mathcal{E}^\ddagger\) in the gas phase and the activation free energy \(\Delta G^\ddagger\) in condensed phases. This is known as the Arrhenius law.

Note that if we plot \(\text{ln} \: k\) vs. \(1/T\), which is given by

\[\text{ln} \: k = \text{ln} \: \text{A} - \frac{E_a}{k_B T} \label{20.22}\]

the plot will be a line with slope \(-E_a/k_B\). Such a plot is called an Arrhenius plot. Note, moreover, that if \(\text{A}\) and \(\text{B}\) are the same atom or molecule type, then the rate law we derived, would take the form of a second-order rate law

\[\frac{d \left[ \text{A} \right]}{dt} = -k \left[ \text{A} \right]^2 \label{20.23}\]

Transition State Theory

In Figure \(\PageIndex{1}\), the point at which we evaluate or measure \(E_a\) serves as a dividing line (also called a dividing surface) between reactants and products. At this point, we do not have \(\text{A} + \text{B}\), and we do not have \(\text{C}\). Rather, what we have is an activated complex of some kind called a transition state between reactants and products. The value of the reaction coordinate at the transition state is denoted \(q^\ddagger\). Recall our notation \(\text{x}\) for the complete set of coordinates and momenta of all of the atoms in the system. Generally, the reaction coordinate \(q\) is a function \(q \left( \text{x} \right)\) of all of the coordinates and momenta, although typically, \(q \left( \text{x} \right)\) is a function of a subset of the coordinates and, possibly, the momenta.

As an example, let us consider two atoms \(\text{A}\) and \(\text{B}\) undergoing a collision. An appropriate reaction coordinate could simply be the distance \(r\) between \(\text{A}\) and \(\text{B}\). This distance is a function of the positions \(\textbf{r}_\text{A}\) and \(\textbf{r}_\text{B}\) of the two atoms, in that

\[q = r = \left| \textbf{r}_\text{A} - \textbf{r}_\text{B} \right| \label{20.24}\]

When \(\text{A}\) and \(\text{B}\) are molecules, such as proteins, \(q \left( \text{x} \right)\) is a much more complicated function of \(\text{x}\).

Now, recall that the mechanical energy \(\mathcal{E} \left( \text{x} \right)\) is given by

\[\mathcal{E} \left( \text{x} \right) = \sum_{i=1}^N \frac{\textbf{p}_i^2}{2m_1} + U \left( \textbf{r}_1, \ldots, \textbf{r}_N \right) \label{20.25}\]

and is a sum of kinetic and potential energies. Transition state theory assumes the following:

We start a trajectory obeying this equation of motion with an initial condition \(\text{x}\) that makes \(q \left( \text{x} \right) = q^\ddagger\) and such that \(\dot{q} \left( \text{x} \right) > 0\) so that the reaction coordinate proceeds initial to the right, i.e., toward products.
We follow the motion \(\text{x}_t\) of the coordinates and momenta in time starting from this initial condition \(\text{x}\), which gives us a unique function \(\text{x}_t \left( \text{x} \right)\).
If \(q \left( \text{x}_t \left( \text{x} \right) \right) > q^\ddagger\) at time \(t\), then the trajectory is designated as “reactive” and contributes to the reaction rate.

Define a function \(\theta \left( y \right)\), which is \(1\) if \(y \geq 0\) and \(0\) if \(y < 0\). The function \(\theta \left( y \right)\) is known as a step function.

We now define a flux of reactive trajectories \(k \left( t \right)\) using statistical mechanics

\[k \left( t \right) = \frac{1}{h Q_r} \int_{q \left( \text{x} \right) = q^\ddagger} d \text{x} \: e^{-\beta \mathcal{E} \left( \text{x} \right)} \left| \dot{q} \left( \text{x} \right) \right| \theta \left( q \left( \text{x}_t \left( \text{x} \right) \right) - q^\ddagger \right) \label{20.27}\]

where \(h\) is Planck’s constant. Here \(Q_r\) is the partition function of the reactants

\[Q_r = \int d \text{x} \: e^{-\beta \mathcal{E} \left( \text{x} \right)} \theta \left( q^\ddagger - q \left( \text{x} \right) \right) \label{20.28}\]

The meaning of Equation \(\ref{20.27}\) is an ensemble average over a canonical ensemble of the product \(\left| \dot{q} \left( \text{x} \right) \right|\) and \(\theta \left( q \left( \text{x}_t \left( \text{x} \right) \right) - q^\ddagger \right)\). The first factor in this product \(\left| \dot{q} \left( \text{x} \right) \right|\) forces the initial velocity of the reaction coordinate to be positive, i.e., toward products, and the step function \(\theta \left( q \left( \text{x}_t \left( \text{x} \right) \right) - q^\ddagger \right)\) requires that the trajectory of \(q \left( \text{x}_t \left( \text{x} \right) \right)\) be reactive, otherwise, the step function will give no contribution to the flux. The function \(k \left( t \right)\) in Equation \(\ref{20.27}\) is known as the reactive flux. In the definition of \(Q_r\) the step function \(\theta \left( q^\ddagger - q \left( \text{x} \right) \right)\) measures the total number of microscopic states on the reactive side of the energy profile.

A plot of some examples of reactive flux functions \(k \left( t \right)\) is shown in Figure \(\PageIndex{2}\). These functions are discussed in greater detail in J. Chem. Phys. 95, 5809 (1991). These examples all show that \(k \left( t \right)\) decays at first but then finally reaches a plateau value. This plateau value is taken to be the true rate of the reaction under the assumption that eventually, all trajectories that will become reactive will have done so after a sufficiently long time. Thus,

\[k = \underset{t \rightarrow \infty}{\text{lim}} k \left( t \right) \label{20.29}\]

gives the true rate constant. On the other hand, a common approximation is to take the value \(k \left( 0 \right)\) as an estimate of the rate constant, and this is known as the transition state theory approximation to \(k\), i.e.,

\[\begin{align} k^\text{(TST)} &= k \left( 0 \right) \\ &= \frac{1}{Q_r} \int_{q \left( \text{x} \right) = q^\ddagger} d \text{x} \: e^{-\beta \mathcal{E} \left( \text{x} \right)} \left| \dot{q} \left( \text{x} \right) \right| \theta \left( q \left( \text{x} \right) - q^\ddagger \right) \end{align} \label{20.30}\]

However, note that since we require \(\dot{q} \left( \text{x} \right)\) to initially be toward products, then by definition, at \(t = 0\), \(q \left( \text{x} \right) \geq q^\ddagger\), and the step function in the above expression is redundant. In addition, if \(\dot{q} \left( \text{x} \right)\) only depends on momenta (or velocities) and not actually on coordinates, which will be true if \(q \left( \text{x} \right)\) is not curvilinear (and is true for some curvilinear coordinates \(q \left( \text{x} \right)\)), and if \(q \left( \text{x} \right)\) only depends on coordinates, then Equation \(\ref{20.30}\) reduces to

\[k^\text{(TST)} = \frac{1}{h Q_r} \int d \text{x}_\textbf{p} e^{-\beta \sum_{i=1}^N \textbf{p}_i^2/2m_i} \left| \dot{q} \left( \textbf{p}_1, \ldots, \textbf{p}_N \right) \right| \int_{q \left( \textbf{r}_1, \ldots, \textbf{r}_N \right) = q^\ddagger} d \text{x}_\textbf{r} e^{-\beta U \left( \textbf{r}_1, \ldots, \textbf{r}_N \right)} \label{20.31}\]

Tuckerman Screenshot 20-2.png — Figure \(\PageIndex{2}\): Examples of the reactive flux \(k \left( t \right)\).

The integral

\[Z^\ddagger = \int_{q \left( \textbf{r}_1, \ldots, \textbf{r}_N \right) = q^\ddagger} d \text{x}_\textbf{r} e^{-\beta U \left( \textbf{r}_1, \ldots, \textbf{r}_N \right)}\]

counts the number of microscopic states consistent with the condition \(q \left( \textbf{r}_1, \ldots, \textbf{r}_N \right) = q^\ddagger\) and is, therefore, a kind of partition function, and is denoted \(Q^\ddagger\). On the other hand, because it is a partition function, we can derive a free energy \(\Delta F^\ddagger\) from it

\[F^\ddagger \propto -k_B T \: \text{ln} \: Z^\ddagger \label{20.32}\]

Similarly, if we divide \(Q_r\) into its ideal-gas and configurational contributions

\[Q_r = Q_r^\text{(ideal)} Z_r \label{20.33}\]

then we can take

\[Z_r = e^{-\beta F_r} \label{20.34}\]

where \(F_r\) is the free energy of the reactants. Finally, setting \(\dot{q} = p/\mu\), where \(\mu\) is the associated mass, and \(p\) is the corresponding momentum of the reaction coordinate, then, canceling most of the momentum integrals between the numerator and \(Q_r^\text{(ideal)}\), the momentum integral we need is

\[\int_0^\infty e^{-\beta p^2/2 \mu} \frac{p}{\mu} = k_B T \label{20.35}\]

which gives the final expression for the transition state theory rate constant

\[k^\text{(TST)} = \frac{k_B T}{h} e^{-\beta \left( F^\ddagger - F_r \right)} = \frac{k_B T}{h} e^{-\beta \Delta F^\ddagger} \label{20.36}\]

Figure \(\PageIndex{2}\) actually shows \(k \left( t \right)/ k^\text{(TST)}\), which must start at \(1\). As the figure shows, in addition, for \(t > 0\), \(k \left( t \right) < k^\text{(TST)}\). Hence, \(k^\text{(TST)}\) is always an upper bound to the true rate constant. Transition state theory assumes that any trajectory that initially moves toward products will be a reactive trajectory. For this reason, it overestimates the reaction rate. In reality, trajectories can cross the dividing surface several or many times before eventually proceeding either toward products or back toward reactants.

Tuckerman Screenshot 20-3.png — Figure \(\PageIndex{3}\): Examples of the trajectories in a typical system, some of which are reactive but some of which return to reactants.

Figure \(\PageIndex{3}\) shows that one can obtain trajectories of both types. Here, the dividing surface lies at \(q = 0\). Left, toward \(q = -1\) is the reactant side, and right, toward \(q = 1\) is the product side. Because some trajectories return to reactants and never become products, the true rate is always less than \(k^\text{(TST)}\), and we can write

\[k = \kappa k^\text{(TST)} \label{20.37}\]

where the factor \(\kappa < 1\) is known as the transmission factor. This factor accounts for multiple recrossings of the dividing surface and the fact that some trajectories do not become reactive ones.

Search

Text Color

Text Size

Margin Size

Font Type

Rate Law and Collision Theory

Transition State Theory