Skip to main content
Chemistry LibreTexts

21.3: DNA Hybridization

  • Page ID
    294361
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    To illustrate the use of statistical thermodynamics to describe binding, we discuss simple models for the hybridization or melting of DNA. These models are similar to our description of the helix–coil transition in their approach. These do not distinguish the different nucleobases, only considering nucleotides along a chain that are paired (bp) or free (f).

    Consider the case of the pairing between self-complementary oligonucleotides.

    \[S+S \rightleftharpoons D \nonumber \]

    S refers to any fully dissociated ssDNA and D to any dimer forms that involve two strands which have at least one base pair formed. We can then follow expressions for monomer–dimer equilibria above. The equilibrium constant for the association of single strands is

    \[K_a = \dfrac{c_D}{c_S^2}\]

    This equilibrium constant is determined by the concentration-dependent free-energy barrier for two strands to diffuse into contact and create the first base pair. If the total concentration of molecules present is either monomer or dimer, the form is

    \[ C_{TOT} = c_S + 2c_D \]

    then the fraction of the DNA strands in the dimer form is

    \[ \theta_D = \dfrac{2c_D}{C_{tot}} \]

    and eq. (10) leads to

    \[ \theta_D = 1+(4K_aC_{tot})^{-1}-\sqrt{(1+(4K_aC_{tot})^{-1})^2-1} \]

    We see that at the total concentration, which results in a dimer fraction \(\theta_D = 0.5\), the association constant is obtained from \(K_a=(9C_{tot})^{-1} \). This is a traditional description of the thermodynamics of a monomer–dimer equilibrium.

    We can calculate Ka from the molecular partition functions for the S and D states:

    \[K_a = \dfrac{q_D}{q_S^2} \nonumber \]

    Different models for hybridization will vary in the form of these partition functions. For either state, we can separate the partition function into contributions from the conformational degrees of freedom relevant to the base-pairing and hybridization, and other degrees of freedom, qi = qi,confqi,ext. Assuming that the external degrees of freedom will be largely of an entropic nature, we neglect an explicit calculation and factor out the external degrees of freedom by defining the variable γ:

    \[\gamma = \dfrac{q_{D,ext}C_{tot}}{q_{S,ext}^2} \]

    then

    \[ \theta_D = 1+\dfrac{q_{S,int}^2}{4\gamma q_{D,int}}-\sqrt{\left( 1+ \dfrac{q_{S,int}^2}{4\gamma q_{D,int}} \right)^2-1} \]

    Short Oligonucleotides: The Zipper Model clipboard_ec4f9c90bde85937545c0b663dae72d54.png

    For short oligonucleotide hybridization, a common (and reasonable) approximation is the single stretch model, which assumed that base-pairing will only occur as a single continuous stretch of base pairs. This is reasonable for short oligomers (n < 20) where two distinct helical stretches separated by a bubble (loop) are unlikely given the persistence length of dsDNA. The zipper model refers to the single-stretch case with “perfect matching”, in which only pairing between the bases in precisely sequence-aligned DNA strands is counted. As a result of these two approximations, the only dissociated base pairs observed in this model appear at the end of a chain (fraying).

    The number of bases in a single strand is n and the number of bases that are paired is nbp. For the dimer, we consider all configurations that have at least one base pair formed. The dimer partition function can be written as

    \[ \begin{aligned} q_{D,int}(n) &=\sigma \sum_{n_{bp}=1}^ng(n,n_{bp})s^{n_{bp}} \\ &=\sigma \sum_{n_{bp}=1}^n (n-n_{bp} +1)s^{n_{bp}} \end{aligned} \]

    Here g is the number of ways of arranging nbp continuous base pairs on a strand with length n; σis the statistical weight for nucleating the first base pair; and s is the statistical weight for forming a base pair next to an already-paired segment: \( s=e^{-\Delta \varepsilon_{bp}/k_BT}\). Therefore, in the zipper model, the equilibrium constant in eq. (23) between ssDNA and dimers involving at least one intact base pair is: Kzip = σs. In the case of homogeneous polynucleotide chains, in which sliding of registry between chains is allowed: \( q_{D,int}(n) =\sigma \sum_{n_{bp}=1}^n (n-n_{bp} +1)^2s^{n_{bp}} \). The sum in eq. (27) can be evaluated exactly, giving

    \[q_{D,int}(n) = \dfrac{\sigma_S}{(s-1)^2}\left[ s^{n+1} -(n+1)s+n \right] \]

    In the case that s > 1 ( \( \Delta \varepsilon_{bp} < 0 \) ) and n≫1, qD,int→σsn. Also, the probability distribution of helical segments is

    \[ P_{bp}(n,n_{bp}) = \dfrac{(n-n_{bp}+1)\sigma s^{n_{bp}}}{q_{D,int}}\qquad 1\leq n_{bp} \leq n \]

    The plot below shows illustrations of the probability density and associated energy landscape for a narrow range of s across the helix–coil transition. These figures illustrate a duplex state that always has a single free-energy minimum characterized by frayed configurations.

    clipboard_e1e9611f0c5525fe8355ab4c8ddbf29ae.png

    In addition to the fraction of molecules that associate to form a dimer, we must also consider the fraction of contacts that successfully form a base pair in the dimer state

    \[ \theta_{bp} = \dfrac{ \langle n_{bp} \rangle }{n} \nonumber \]

    We can evaluate this using the identity

    \[ \langle n_H \rangle = \dfrac{s}{q} \dfrac{\partial q}{\partial s} \nonumber \]

    Using eq. (28) we have

    \[ \theta_{bp} = \dfrac{ns^{n+2}-(n+2)s^{n+1}+(n+2)s - n}{n(s-1)(s^{n+1}-s(n+1)+n)} \nonumber \]

    Similar to the helix–coil transition in polypeptides, θbp shows cooperative behavior with a transition centered at s = 1, which gets steeper with increasing n and decreasing σ.

    clipboard_ed39fb424bac14d2f906458f3fd209a6a.png

    Finally, we can write the total fraction of nucleobases that participate in a base pair as the product of the fraction of the DNA strands that are associated in a dimer form, and the average fraction of bases of the dimer that are paired.

    \[\theta_{tot} = \theta_D \theta_{bp} \nonumber \]


    This page titled 21.3: DNA Hybridization is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Andrei Tokmakoff via source content that was edited to the style and standards of the LibreTexts platform.