20.9: The Statistical Definition of Entropy is Analogous to the Thermodynamic Definition
- Page ID
- 13723
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\dsum}{\displaystyle\sum\limits} \)
\( \newcommand{\dint}{\displaystyle\int\limits} \)
\( \newcommand{\dlim}{\displaystyle\lim\limits} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)The laws of thermodynamics can be understood in two ways: by looking at large-scale properties like temperature and pressure, or by examining how countless tiny atoms move and interact. Both perspectives lead to the same laws, even though the reasoning behind them is quite different.
We learned earlier that entropy, \(S\), is related to the number of microstates, \(W\), in an ensemble with \(A\) systems:
\[S_{ensemble}=k_B \ln{W} \label{eq1} \]
and
\[W=\frac{A!}{\prod_j{a_j}!} \label{eq2} \]
Combining Equations \ref{eq1} and \ref{eq2} to get:
\[\begin{split} S_{ensemble} &= k_B \ln{\frac{A!}{\prod_j{a_j}!}} \\[4pt] &= k_B \ln{A!}-k_B\sum_j{\ln{a_j}!} \end{split} \nonumber \]
Using Sterling's approximation:
\[\ln{A!} \approx A\ln{A}-A \nonumber \]
We obtain:
\[S_{ensemble} = k_B A \ln{A}-k_BA-k_B\sum_j{a_j\ln{a_j}}+k_B\sum{a_j} \nonumber \]
Since:
\[A=\sum{a_j} \nonumber \]
The expression simplifies to:
\[S_{ensemble} = k_B A \ln{A}-k_B\sum_j{a_j\ln{a_j}} \nonumber \]
We can make use of the fact that the number of microstates in state \(j\) is equal to the total number of microstates multiplied by the probability of finding the system in state \(j\), \(p_j\):
\[a_j=p_jA \nonumber \]
Plugging in, we obtain
\[\begin{split}S_{ensemble} &= k_B A \ln{A}-k_B\sum_j{p_jA\ln{p_jA}} \\[4pt] &= k_B A \ln{A}-k_B\sum_j{p_jA\ln{p_j}}-k_B\sum_j{p_jA\ln{A}} \end{split} \nonumber \]
Since \(A\) is a constant and the sum of the probabilities of finding the system in state \(j\) is always 1:
\[\sum{p_j}=1 \nonumber \]
The first and last term cancel out:
\[S_{ensemble} = -k_BA\sum_j{p_j\ln{p_j}} \nonumber \]
We can use that the entropy of the system is the entropy of the ensemble divided by the number of systems:
\[S_{system}=S_{ensemble}/A \nonumber \]
Dividing by \(A\), we obtain:
\[S_{system} = -k_B\sum_j{p_j\ln{p_j}} \nonumber \]
We can differentiate this equation and dropping the subscript:
\[dS = -k_B\sum_j{\left(dp_j+\ln{p_j}dp_j\right)} \nonumber \]
Since \(\sum_j{p_j}=1\), the derivative \(\sum_j{dp_j}=0\):
\[dS = -k_B\sum_j{\ln{p_j}dp_j} \nonumber \]
In short:
\[\sum_j{\ln{p_j}dp_j}=-\frac{\delta q_{rev}}{k_BT} \nonumber \]
Plugging in:
\[dS = \frac{\delta q_{rev}}{T} \label{10} \]
The thermodynamic definition of entropy comes from macroscopic observations. In classical thermodynamics, entropy is defined as the measure of energy dispersal in a system (Equation \ref{10}). It tells us how much heat is spread out or distributed in a system at temperature. This definition is practical and works well at the bulk level but does not say why entropy behaves this way.
The statistical definition of entropy in Equation \ref{eq1}, developed by Boltzmann, defines entropy in terms of the number of possible microstates that correspond to the same overall macroscopic state (macrostate). This shows that entropy measures the degree of microscopic disorder or multiplicity of arrangements consistent with what we see at the macroscopic level.
Both definitions describe the same underlying concept:
- The thermodynamic view says entropy measures the spread of energy in terms of heat and temperature.
- The statistical view says entropy measures the spread of possible microstates that energy can occupy.
They are equivalent because a system with more accessible microstates (higher disorder) will also tend to have greater energy dispersal, and thus higher entropy. In fact, the statistical definition explains why the thermodynamic definition works: macroscopic energy dispersal arises from the overwhelming number of microscopic ways energy can be arranged.

