Chapter 7.2: Empirical and Molecular Formulas
 Page ID
 18855
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{\!\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\ #1 \}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\ #1 \}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{\!\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{\!\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left#1\right}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)
Learning Objectives
 To determine the empirical formula of a compound from its composition by mass.
 To derive the molecular formula of a compound from its empirical formula.
When a new chemical compound, such as a potential new pharmaceutical, is synthesized in the laboratory or isolated from a natural source, chemists determine its elemental composition, its empirical formula, and its structure to understand its properties. In this section, we focus on how to determine the empirical formula of a compound and then use it to determine the molecular formula if the molar mass of the compound is known.
Calculating Mass Percentages
The law of definite proportions states that a chemical compound always contains the same proportion of elements by mass; that is, the percent composition (the percentage of each element present in a pure substance). With few exceptions, the percent composition of a chemical compound is constant (see law of definite proportions).—the percentage of each element present in a pure substance—is constant (although we now know there are exceptions to this law). For example, sucrose (cane sugar) is 42.11% carbon, 6.48% hydrogen, and 51.41% oxygen by mass. This means that 100.00 g of sucrose always contains 42.11 g of carbon, 6.48 g of hydrogen, and 51.41 g of oxygen. First we will use the molecular formula of sucrose (C_{12}H_{22}O_{11}) to calculate the mass percentage of the component elements; then we will show how mass percentages can be used to determine an empirical formula.
According to its molecular formula, each molecule of sucrose contains 12 carbon atoms, 22 hydrogen atoms, and 11 oxygen atoms. A mole of sucrose molecules therefore contains 12 mol of carbon atoms, 22 mol of hydrogen atoms, and 11 mol of oxygen atoms. We can use this information to calculate the mass of each element in 1 mol of sucrose, which will give us the molar mass of sucrose. We can then use these masses to calculate the percent composition of sucrose. To three decimal places, the calculations are the following:
\( \begin{matrix}
mass\; of\; C/mol\; of\; sucrose & =12\cancel{mol}\left ( \dfrac{12.011\; g\; C}{\cancel{1\; mol\; C}} \right ) & =144.132\; g \; C\\
& & \\
mass\; of\; H/mol\; of\; sucrose & =22\cancel{mol}\left ( \dfrac{1.008\; g\; H}{\cancel{1\; mol\; H}} \right ) & =22.176\; g \; H\\
& & \\
mass\; of\; O/mol\; of\; sucrose & =11\cancel{mol}\left ( \dfrac{15.999\; g\; O}{\cancel{1\; mol\; O}} \right ) & =175.989\; g \; O
\end{matrix}) \tag{7.2.1} \)
Thus 1 mol of sucrose has a mass of 342.297 g; note that more than half of the mass (175.989 g) is oxygen, and almost half of the mass (144.132 g) is carbon.
The mass percentage of each element in sucrose is the mass of the element present in 1 mol of sucrose divided by the molar mass of sucrose, multiplied by 100 to give a percentage. The result is shown to two decimal places:
\( \begin{matrix}
mass\; \%\; C\; in\; sucrose & =\dfrac{mass\; of\; C/mol\; sucrose}{molar\; mass\; of\; sucrose}\times 100 & =\dfrac{144.132\; g\; C}{342.297\; g/mol}\times 100 & =42.12 \%\\
& & \\
mass\; \%\; H\; in\; sucrose & =\dfrac{mass\; of\; H/mol\; sucrose}{molar\; mass\; of\; sucrose}\times 100 & =\dfrac{22.176\; g\; H}{342.297\; g/mol}\times 100 & =6.48 \%\\
& & \\
mass\; \%\; O\; in\; sucrose & =\frac{mass\; of\; O/mol\; sucrose}{molar\; mass\; of\; sucrose}\times 100 & =\dfrac{175.989\; g\; O}{342.297\; g/mol}\times 100 & =51.41 \%
\end{matrix} \)
You can check your work by verifying that the sum of the percentages of all the elements in the compound is 100%:
\( 42.12 \% + 6.48 \% + 51.41 \% = 100.01 \% \)
If the sum is not 100%, you have made an error in your calculations. (Rounding to the correct number of decimal places can, however, cause the total to be slightly different from 100%.) Thus 100.00 g of sucrose contains 42.12 g of carbon, 6.48 g of hydrogen, and 51.41 g of oxygen; to two decimal places, the percent composition of sucrose is indeed 42.12% carbon, 6.48% hydrogen, and 51.41% oxygen.
Figure 7.2.1 Percent Composition
We could also calculate the mass percentages using atomic masses and molecular masses, with atomic mass units. Because the answer we are seeking is a ratio, expressed as a percentage, the units of mass cancel whether they are grams (using molar masses) or atomic mass units (using atomic and molecular masses).
Example 5
Aspartame is the artificial sweetener sold as NutraSweet and Equal. Its molecular formula is C_{14}H_{18}N_{2}O_{5}.
 Calculate the mass percentage of each element in aspartame.
 Calculate the mass of carbon in a 1.00 g packet of Equal, assuming it is pure aspartame.
Given: molecular formula and mass of sample
Asked for: mass percentage of all elements and mass of one element in sample
Strategy:
A Use atomic masses from the periodic table to calculate the molar mass of aspartame.
B Divide the mass of each element by the molar mass of aspartame; then multiply by 100 to obtain percentages.
C To find the mass of an element contained in a given mass of aspartame, multiply the mass of aspartame by the mass percentage of that element, expressed as a decimal.
Solution:

A We calculate the mass of each element in 1 mol of aspartame and the molar mass of aspartame, here to three decimal places:
14 C (14 mol C)(12.011 g/mol C) = 168.154 g 18 H (18 mol H)(1.008 g/mol H) = 18.114 g 2N (2 mol N)(14.007 g/mol N) = 28.014 g +5O (5 mol O)(15.999 g/mol O) = 79.995 g C_{14}H_{18}N_{2}O_{5} molar mass of aspartame 294.277 g/mol Thus more than half the mass of 1 mol of aspartame (294.277 g) is carbon (168.154 g).
B To calculate the mass percentage of each element, we divide the mass of each element in the compound by the molar mass of aspartame and then multiply by 100 to obtain percentages, here reported to two decimal places:
\( \begin{matrix}
mass\; \%\; C & = \dfrac{168.154\; g\; C}{294.277\; g\; aspartame}\times 100 & =57.14 \%\; C\\
& & \\
mass\; \%\; N & = \dfrac{18.114\; g\; H}{294.277\; g\; aspartame}\times 100 & =6.16 \%\; C\\
& & \\
mass\; \%\; N & = \dfrac{28.014\; g\; N}{294.277\; g\; aspartame}\times 100 & =9.52 \%\; C\\
& & \\
mass\; \%\; O & = \dfrac{79.995\; g\; O}{294.277\; g\; aspartame}\times 100 & =27.18 \%\; C\\
\end{matrix} \)As a check, we can add the percentages together:
57.14% + 6.16% + 9.52% + 27.18% = 100.00%If you obtain a total that differs from 100% by more than about ±1%, there must be an error somewhere in the calculation.

C The mass of carbon in 1.00 g of aspartame is calculated as follows:
\( mass\; of \; C =1.00 \cancel{g\; aspartame}\times \dfrac{57.14\; g\; C}{100\; \cancel{g\; aspartame}} = 0.57`\; g \; C \)
Exercise
Calculate the mass percentage of each element in aluminum oxide (Al_{2}O_{3}). Then calculate the mass of aluminum in a 3.62 g sample of pure aluminum oxide.
Answer: 52.93% aluminum; 47.08% oxygen; 1.92 g Al
Determining the Empirical Formula of Penicillin
Just as we can use the empirical formula of a substance to determine its percent composition, we can use the percent composition of a sample to determine its empirical formula, which can then be used to determine its molecular formula. Such a procedure was actually used to determine the empirical and molecular formulas of the first antibiotic to be discovered: penicillin.
Antibiotics are chemical compounds that selectively kill microorganisms, many of which cause diseases. Although we may take antibiotics for granted today, penicillin was discovered only about 80 years ago. The subsequent development of a wide array of other antibiotics for treating many common diseases has contributed greatly to the substantial increase in life expectancy over the past 50 years. The discovery of penicillin is a historical detective story in which the use of mass percentages to determine empirical formulas played a key role.
In 1928, Alexander Fleming, a young microbiologist at the University of London, was working with a common bacterium that causes boils and other infections such as blood poisoning. For laboratory study, bacteria are commonly grown on the surface of a nutrientcontaining gel in small, flat culture dishes. One day Fleming noticed that one of his cultures was contaminated by a bluishgreen mold similar to the mold found on spoiled bread or fruit. Such accidents are rather common, and most laboratory workers would have simply thrown the cultures away. Fleming noticed, however, that the bacteria were growing everywhere on the gel except near the contaminating mold (part (a) in Figure 7.2.2, and he hypothesized that the mold must be producing a substance that either killed the bacteria or prevented their growth. To test this hypothesis, he grew the mold in a liquid and then filtered the liquid and added it to various bacteria cultures. The liquid killed not only the bacteria Fleming had originally been studying but also a wide range of other diseasecausing bacteria. Because the mold was a member of the Penicillium family (named for their pencilshaped branches under the microscope) (part (b) in Figure 7.2.2), Fleming called the active ingredient in the broth penicillin.
Figure 7.2.2 Penicillium
(a) Penicillium mold is growing in a culture dish; the photo shows its effect on bacterial growth. (b) In this photomicrograph of Penicillium, its rod and pencilshaped branches are visible. The name comes from the Latin penicillus, meaning “paintbrush.”
Although Fleming was unable to isolate penicillin in pure form, the medical importance of his discovery stimulated researchers in other laboratories. Finally, in 1940, two chemists at Oxford University, Howard Florey (1898–1968) and Ernst Chain (1906–1979), were able to isolate an active product, which they called penicillin G. Within three years, penicillin G was in widespread use for treating pneumonia, gangrene, gonorrhea, and other diseases, and its use greatly increased the survival rate of wounded soldiers in World War II. As a result of their work, Fleming, Florey, and Chain shared the Nobel Prize in Medicine in 1945.
As soon as they had succeeded in isolating pure penicillin G, Florey and Chain subjected the compound to a procedure called combustion analysis (described later in this section) to determine what elements were present and in what quantities. The results of such analyses are usually reported as mass percentages. They discovered that a typical sample of penicillin G contains 53.9% carbon, 4.8% hydrogen, 7.9% nitrogen, 9.0% sulfur, and 6.5% sodium by mass. The sum of these numbers is only 82.1%, rather than 100.0%, which implies that there must be one or more additional elements. A reasonable candidate is oxygen, which is a common component of compounds that contain carbon and hydrogen;Do not assume that the “missing” mass is always due to oxygen. It could be any other element. for technical reasons, however, it is difficult to analyze for oxygen directly. If we assume that all the missing mass is due to oxygen, then penicillin G contains (100.0% − 82.1%) = 17.9% oxygen. From these mass percentages, the empirical formula and eventually the molecular formula of the compound can be determined.
To determine the empirical formula from the mass percentages of the elements in a compound such as penicillin G, we need to convert the mass percentages to relative numbers of atoms. For convenience, we assume that we are dealing with a 100.0 g sample of the compound, even though the sizes of samples used for analyses are generally much smaller, usually in milligrams. This assumption simplifies the arithmetic because a 53.9% mass percentage of carbon corresponds to 53.9 g of carbon in a 100.0 g sample of penicillin G; likewise, 4.8% hydrogen corresponds to 4.8 g of hydrogen in 100.0 g of penicillin G; and so forth for the other elements. We can then divide each mass by the molar mass of the element to determine how many moles of each element are present in the 100.0 g sample:
\( \frac{mass\left ( g \right )}{molar\;mass\left ( g/mol \right )}=\left ( \cancel{g} \right )\left ( \frac{mol}{\cancel{g}} \right )=mol \tag{7.2.2} \)
\( \begin{matrix}
53.9\; \cancel{g\; C}\dfrac{1\; mol\; C}{12.011\; \cancel{g\; C}} &=4.49\; mol\; C \\
& \\
4.8\; \cancel{g\; H}\dfrac{1\; mol\; H}{1.008\; \cancel{g\; H}} &=4.49\; mol\; H \\
& \\
7.9\; \cancel{g\; N}\dfrac{1\; mol\; N}{14.007\; \cancel{g\; N}} &=0.56\; mol\; N \\
& \\
9.0\; \cancel{g\; S}\dfrac{1\; mol\; S}{32.065\; \cancel{g\; S}} &=0.28\; mol\; S \\
& \\
6.5\; \cancel{g\; Na}\dfrac{1\; mol\; Na}{22.990\; \cancel{g\; Na}} &=0.28\; mol\; Na \\
& \\
17.9\; \cancel{g\; O}\dfrac{1\; mol\; O}{15.999\; \cancel{g\; O}} &=1.12\; mol\; O
\end{matrix} \)
Thus 100.0 g of penicillin G contains 4.49 mol of carbon, 4.8 mol of hydrogen, 0.56 mol of nitrogen, 0.28 mol of sulfur, 0.28 mol of sodium, and 1.12 mol of oxygen (assuming that all the missing mass was oxygen). The number of significant figures in the numbers of moles of elements varies between two and three because some of the analytical data were reported to only two significant figures.
These results tell us the ratios of the moles of the various elements in the sample (4.49 mol of carbon to 4.8 mol of hydrogen to 0.56 mol of nitrogen, and so forth), but they are not the wholenumber ratios we need for the empirical formula—the empirical formula expresses the relative numbers of atoms in the smallest whole numbers possible. To obtain whole numbers, we divide the numbers of moles of all the elements in the sample by the number of moles of the element present in the lowest relative amount, which in this example is sulfur or sodium. The results will be the subscripts of the elements in the empirical formula. To two significant figures, the results are
\( \begin{matrix}
C:\dfrac{4.49}{0.28} = 16 & H:\dfrac{4.8}{0.28} = 17 & N:\dfrac{0.56}{0.28} = 2.0 \\
& & \\
S:\dfrac{0.28}{0.28} = 1.0 & Na:\dfrac{0.28}{0.28} = 1.0 & O:\dfrac{1.12}{0.28} = 1.0
\end{matrix} \tag{7.2.3} \)
The empirical formula of penicillin G is therefore C_{16}H_{17}N_{2}NaO_{4}S. Other experiments have shown that penicillin G is actually an ionic compound that contains Na^{+} cations and [C_{16}H_{17}N_{2}O_{4}S]^{−} anions in a 1:1 ratio. The complex structure of penicillin G (Figure 7.2.3) was not determined until 1948.
Figure 7.2.3 Structural Formula
and BallandStick Model of the
Anion of Penicillin G
In some cases, one or more of the subscripts in a formula calculated using this procedure may not be integers. Does this mean that the compound of interest contains a nonintegral number of atoms? No; rounding errors in the calculations as well as experimental errors in the data can result in nonintegral ratios. When this happens, you must exercise some judgment in interpreting the results, as illustrated in Example 6. In particular, ratios of 1.50, 1.33, or 1.25 suggest that you should multiply all subscripts in the formula by 2, 3, or 4, respectively. Only if the ratio is within 5% of an integral value should you consider rounding to the nearest integer.
Example 6
Calculate the empirical formula of the ionic compound calcium phosphate, a major component of fertilizer and a polishing agent in toothpastes. Elemental analysis indicates that it contains 38.77% calcium, 19.97% phosphorus, and 41.27% oxygen.
Given: percent composition
Asked for: empirical formula
Strategy:
A Assume a 100 g sample and calculate the number of moles of each element in that sample.
B Obtain the relative numbers of atoms of each element in the compound by dividing the number of moles of each element in the 100 g sample by the number of moles of the element present in the smallest amount.
C If the ratios are not integers, multiply all subscripts by the same number to give integral values.
D Because this is an ionic compound, identify the anion and cation and write the formula so that the charges balance.
Solution:
A A 100 g sample of calcium phosphate contains 38.77 g of calcium, 19.97 g of phosphorus, and 41.27 g of oxygen. Dividing the mass of each element in the 100 g sample by its molar mass gives the number of moles of each element in the sample:
\( \begin{matrix}moles\; Ca & =38.77\; \cancel{g\; Ca}\dfrac{1\; mol\; Ca}{40.078\; \cancel{g\; Ca}} & =0.9674\; mol\; Ca\\
& & \\
moles\; P & =19.97\; \cancel{g\; P}\dfrac{1\; mol\; P}{30.9738\; \cancel{g\; Ca}} & =0.6447\; mol\; P\\
& & \\
moles\; O & =41.27\; \cancel{g\; O}\dfrac{1\; mol\; O}{15.994\; \cancel{g\; O}} & =2.5800\; mol\; O
\end{matrix} \)
B To obtain the relative numbers of atoms of each element in the compound, divide the number of moles of each element in the 100g sample by the number of moles of the element in the smallest amount, in this case phosphorus:
\( \begin{matrix}P:\frac{0.6447\; mol P}{0.6447\; mol P} = 1.00 & Ca:\dfrac{0.9674}{0.6447} = 1.501 & O:\dfrac{2.5800}{0.6447} = 4.002
\end{matrix} \)
C We could write the empirical formula of calcium phosphate as Ca_{1.501}P_{1.000}O_{4.002}, but the empirical formula should show the ratios of the elements as small whole numbers. To convert the result to integral form, multiply all the subscripts by 2 to get Ca_{3.002}P_{2.000}O_{8.004}. The deviation from integral atomic ratios is small and can be attributed to minor experimental errors; therefore, the empirical formula is Ca_{3}P_{2}O_{8}.
D The calcium ion (Ca^{2}^{+}) is a cation, so to maintain electrical neutrality, phosphorus and oxygen must form a polyatomic anion. We know from Section 6.3 2 that phosphorus and oxygen form the phosphate ion (PO_{4}^{3−}; see Table 7.21). Because there are two phosphorus atoms in the empirical formula, two phosphate ions must be present. So we write the formula of calcium phosphate as Ca_{3}(PO_{4})_{2}.
Exercise
Calculate the empirical formula of ammonium nitrate, an ionic compound that contains 35.00% nitrogen, 5.04% hydrogen, and 59.96% oxygen by mass; refer to Table 7.2.1 if necessary. Although ammonium nitrate is widely used as a fertilizer, it can be dangerously explosive. For example, it was a major component of the explosive used in the 1995 Oklahoma City bombing.
Answer: N_{2}H_{4}O_{3} is NH_{4}^{+}NO_{3}^{−}, written as NH_{4}NO_{3}
Combustion Analysis
One of the most common ways to determine the elemental composition of an unknown hydrocarbon is an analytical procedure called combustion analysis. A small, carefully weighed sample of an unknown compound that may contain carbon, hydrogen, nitrogen, and/or sulfur is burned in an oxygen atmosphere,Other elements, such as metals, can be determined by other methods. and the quantities of the resulting gaseous products (CO_{2}, H_{2}O, N_{2}, and SO_{2}, respectively) are determined by one of several possible methods. One procedure used in combustion analysis is outlined schematically in Figure 7.2.4, and a typical combustion analysis is illustrated in Example 7.
Figure 7.2.4 Steps for Obtaining an Empirical Formula from Combustion Analysis
Example 7
Naphthalene, the active ingredient in one variety of mothballs, is an organic compound that contains carbon and hydrogen only. Complete combustion of a 20.10 mg sample of naphthalene in oxygen yielded 69.00 mg of CO_{2} and 11.30 mg of H_{2}O. Determine the empirical formula of naphthalene.
Given: mass of sample and mass of combustion products
Asked for: empirical formula
Strategy:
A Use the masses and molar masses of the combustion products, CO_{2} and H_{2}O, to calculate the masses of carbon and hydrogen present in the original sample of naphthalene.
B Use those masses and the molar masses of the elements to calculate the empirical formula of naphthalene.
Solution:
A Upon combustion, 1 mol of CO_{2} is produced for each mole of carbon atoms in the original sample. Similarly, 1 mol of H_{2}O is produced for every 2 mol of hydrogen atoms present in the sample. The masses of carbon and hydrogen in the original sample can be calculated from these ratios, the masses of CO_{2} and H_{2}O, and their molar masses. Because the units of molar mass are grams per mole, we must first convert the masses from milligrams to grams:
\( mass\; of\; C=69.00\cancel{mg\; CO_{2}}\times \dfrac{1\;\cancel{g}}{1000\;\cancel{mg}}\times \dfrac{1\; \cancel{mol\;CO_{2}}}{44.010\; \cancel{g\; CO_{2}}}\times \dfrac{1\;\cancel{mol\; C}}{1\;\cancel{mol\; CO_{2}}}\times \dfrac{12.011\; g}{1 \cancel{mol\; C}} \)
\( = 1.883 \times 10^{2}\; g\; C) \)
\( mass\; of\; H=11.30\; \cancel{mg\; H_{2}O}\times \dfrac{1\;\cancel{g}}{1000\;\cancel{mg}}\times \dfrac{1\; \cancel{mol\;H_{2}O}}{18.015\; \cancel{g\; H_{2}O}}\times \dfrac{2\;\cancel{mol\; H}}{1\;\cancel{mol\; H_{2}O}}\times \dfrac{1.0079\; g}{1 \cancel{mol\; C}} \)
\( = 1.264 \times 10^{3}\; g\; H) \)
B To obtain the relative numbers of atoms of both elements present, we need to calculate the number of moles of each and divide by the number of moles of the element present in the smallest amount:
\( moles\; C= 1.883 \times 10^{2}\; \cancel{g\; C}\times \dfrac{1\; mol\; C}{12.011\; \cancel{g\; C}}= 1.568\times 10^{3}\; mol\; C \)
\( moles\; H= 1.264 \times 10^{3}\; \cancel{g\; H}\times \dfrac{1\; mol\; H}{1.0079\; \cancel{g\; H}}= 1.254\times 10^{3}\; mol\; H \)
Dividing each number by the number of moles of the element present in the smaller amount gives
\( \begin{matrix}
H:\dfrac{1.254\times 10^{3}}{1.254\times 10^{3}}=1.000 & C:\dfrac{1.568\times 10^{3}}{1.254\times 10^{3}}=1.250
\end{matrix} \)
Thus naphthalene contains a 1.25:1 ratio of moles of carbon to moles of hydrogen: C_{1.25}H_{1.0}. Because the ratios of the elements in the empirical formula must be expressed as small whole numbers, multiply both subscripts by 4, which gives C_{5}H_{4} as the empirical formula of naphthalene. In fact, the molecular formula of naphthalene is C_{10}H_{8}, which is consistent with our results.
Exercise
 Xylene, an organic compound that is a major component of many gasoline blends, contains carbon and hydrogen only. Complete combustion of a 17.12 mg sample of xylene in oxygen yielded 56.77 mg of CO_{2} and 14.53 mg of H_{2}O. Determine the empirical formula of xylene.
 The empirical formula of benzene is CH (its molecular formula is C_{6}H_{6}). If 10.00 mg of benzene is subjected to combustion analysis, what mass of CO_{2} and H_{2}O will be produced?
Answer:
 The empirical formula is C_{4}H_{5}. (The molecular formula of xylene is actually C_{8}H_{10}.)
 33.81 mg of CO_{2}; 6.92 mg of H_{2}O
From Empirical Formula to Molecular Formula
The empirical formula gives only the relative numbers of atoms in a substance in the smallest possible ratio. For a covalent substance, we are usually more interested in the molecular formula, which gives the actual number of atoms of each kind present per molecule. Without additional information, however, it is impossible to know whether the formula of penicillin G, for example, is C_{16}H_{17}N_{2}NaO_{4}S or an integral multiple, such as C_{32}H_{34}N_{4}Na_{2}O_{8}S_{2}, C_{48}H_{51}N_{6}Na_{3}O_{12}S_{3}, or (C_{16}H_{17}N_{2}NaO_{4}S)_{n}, where n is an integer. (The actual structure of penicillin G is shown in Figure 7.2.3).
Consider glucose, the sugar that circulates in our blood to provide fuel for our bodies and especially for our brains. Results from combustion analysis of glucose report that glucose contains 39.68% carbon and 6.58% hydrogen. Because combustion occurs in the presence of oxygen, it is impossible to directly determine the percentage of oxygen in a compound by using combustion analysis; other more complex methods are necessary. If we assume that the remaining percentage is due to oxygen, then glucose would contain 53.79% oxygen. A 100.0 g sample of glucose would therefore contain 39.68 g of carbon, 6.58 g of hydrogen, and 53.79 g of oxygen. To calculate the number of moles of each element in the 100.0 g sample, we divide the mass of each element by its molar mass:
\( \begin{matrix}
moles\; C &=39.68\; \cancel{g\; C}\dfrac{1\; mol\; C}{12.011\; \cancel{gC}} & = 3.304\; mol\; C \\
& & \\
moles\; H & =6.58\; \cancel{g\; H}\dfrac{1\; mol\; H}{1.0079\; \cancel{gH}} & = 6.53\; mol\; H \\
& & \\
moles\; O & =53.79\; \cancel{g\; O}\dfrac{1\; mol\; C}{15.9994\; \cancel{gO}} & = 3.362\; mol\; O
\end{matrix} \tag{7.2.4} \)
Once again, we find the subscripts of the elements in the empirical formula by dividing the number of moles of each element by the number of moles of the element present in the smallest amount:
\( \begin{matrix}
C:\dfrac{3.304}{3.304}=1.000 & H:\dfrac{6.53}{3.304}=1.98 & O:\dfrac{3.362}{3.304}=1.018
\end{matrix} \)
The oxygen:carbon ratio is 1.018, or approximately 1, and the hydrogen:carbon ratio is approximately 2. The empirical formula of glucose is therefore CH_{2}O, but what is its molecular formula?
Many known compounds have the empirical formula CH_{2}O, including formaldehyde, which is used to preserve biological specimens and has properties that are very different from the sugar circulating in our blood. At this point, we cannot know whether glucose is CH_{2}O, C_{2}H_{4}O_{2}, or any other (CH_{2}O)_{n}. We can, however, use the experimentally determined molar mass of glucose (180 g/mol) to resolve this dilemma.
First, we calculate the formula mass, the molar mass of the formula unit, which is the sum of the atomic masses of the elements in the empirical formula multiplied by their respective subscripts. For glucose,
\( formula\; mass\; of\; CH_{2}O=\left [ 1\; \cancel{mol\; C}\left ( \dfrac{12.011\; g}{1\; \cancel{mol\; C}} \right ) \right ]+\left [ 2\; \cancel{mol\; H}\left ( \dfrac{1.0079\; g}{1\; \cancel{mol\; H}} \right ) \right ]+\left [ 1\; \cancel{mol\; O}\left ( \dfrac{15.9994\; g}{1\; \cancel{mol\; O}} \right ) \right ] \tag{7.2.4} \)
This is much smaller than the observed molar mass of 180 g/mol.
Second, we determine the number of formula units per mole. For glucose, we can calculate the number of (CH_{2}O) units—that is, the n in (CH_{2}O)_{n}—by dividing the molar mass of glucose by the formula mass of CH_{2}O:
\( n=\dfrac{180\; g}{30.026\; g\; CH_{2}O}=5.99\approx 6 CH^{_{2}O\; formula\; units} \tag{7.2.5} \)
Each glucose contains six CH_{2}O formula units, which gives a molecular formula for glucose of (CH_{2}O)_{6}, which is more commonly written as C_{6}H_{12}O_{6}. The molecular structures of formaldehyde and glucose, both of which have the empirical formula CH_{2}O, are shown in Figure 7.2.5.
Figure 7.2.5 Structural Formulas and BallandStick Models of (a) Formaldehyde and (b) Glucose
Example 8
Calculate the molecular formula of caffeine, a compound found in coffee, tea, and cola drinks that has a marked stimulatory effect on mammals. The chemical analysis of caffeine shows that it contains 49.18% carbon, 5.39% hydrogen, 28.65% nitrogen, and 16.68% oxygen by mass, and its experimentally determined molar mass is 196 g/mol.
Given: percent composition and molar mass
Asked for: molecular formula
Strategy:
A Assume 100 g of caffeine. From the percentages given, use the procedure given in Example 6 to calculate the empirical formula of caffeine.
B Calculate the formula mass and then divide the experimentally determined molar mass by the formula mass. This gives the number of formula units present.
C Multiply each subscript in the empirical formula by the number of formula units to give the molecular formula.
Solution:
A We begin by dividing the mass of each element in 100.0 g of caffeine (49.18 g of carbon, 5.39 g of hydrogen, 28.65 g of nitrogen, 16.68 g of oxygen) by its molar mass. This gives the number of moles of each element in 100 g of caffeine.
\( \begin{matrix}
moles\; C &=49.18\; \cancel{g\; C}\dfrac{1\; mol\; C}{12.011\; \cancel{gC}} & = 4.095\; mol\; C \\
& & \\
moles\; H & =5.93\; \cancel{g\; H}\dfrac{1\; mol\; H}{1.0079\; \cancel{gH}} & = 5.35\; mol\; H \\
& & \\
moles\; N & =28.65\; \cancel{g\; N}\dfrac{1\; mol\; H}{14.0067\; \cancel{gH}} & = 2.045\; mol\; N \\
& & \\
moles\; O & =16.68\; \cancel{g\; O}\dfrac{1\; mol\; C}{15.9994\; \cancel{gO}} & = 1.043\; mol\; O
\end{matrix} \)
To obtain the relative numbers of atoms of each element present, divide the number of moles of each element by the number of moles of the element present in the least amount:
\( \begin{matrix}
O:\dfrac{1.043}{1.043}=1.000 & C:\dfrac{4.095}{1.043}=3.926 & H:\dfrac{5.35}{1.043}=5.13 & N:\dfrac{2.045}{1.043}=1.960
\end{matrix} \)
These results are fairly typical of actual experimental data. None of the atomic ratios is exactly integral but all are within 5% of integral values. Just as in Example 6, it is reasonable to assume that such small deviations from integral values are due to minor experimental errors, so round to the nearest integer. The empirical formula of caffeine is thus C_{4}H_{5}N_{2}O.
B The molecular formula of caffeine could be C_{4}H_{5}N_{2}O, but it could also be any integral multiple of this. To determine the actual molecular formula, we must divide the experimentally determined molar mass by the formula mass. The formula mass is calculated as follows:
Dividing the measured molar mass of caffeine (196 g/mol) by the calculated formula mass gives
\( \dfrac{196\; g/mol}{97.096\; g/C_{4}H_{5}N_{2}O}=2.02\approx 2C_{4}H_{5}N_{2}O \)
C There are two C_{4}H_{5}N_{2}O formula units in caffeine, so the molecular formula must be (C_{4}H_{5}N_{2}O)_{2} = C_{8}H_{10}N_{4}O_{2}. The structure of caffeine is as follows:
Exercise
Calculate the molecular formula of Freon114, which has 13.85% carbon, 41.89% chlorine, and 44.06% fluorine. The experimentally measured molar mass of this compound is 171 g/mol. Like Freon11, Freon114 is a commonly used refrigerant that has been implicated in the destruction of the ozone layer.
Answer: C_{2}Cl_{2}F_{4}
Summary
The empirical formula of a substance can be calculated from the experimentally determined percent composition, the percentage of each element present in a pure substance by mass. In many cases, these percentages can be determined by combustion analysis. If the molar mass of the compound is known, the molecular formula can be determined from the empirical formula.
Key Takeaway
 The empirical formula of a substance can be calculated from its percent composition, and the molecular formula can be determined from the empirical formula and the compound’s molar mass.
Conceptual Problems

What is the relationship between an empirical formula and a molecular formula?

Construct a flowchart showing how you would determine the empirical formula of a compound from its percent composition.
Numerical Problems
Please be sure you are familiar with the topics discussed in Essential Skills 2 (Section 7.7 ) before proceeding to the Numerical Problems.

What is the mass percentage of water in each hydrate?
 H_{3}AsO_{4}·0·5H_{2}O
 NH_{4}NiCl_{3}·6H_{2}O
 Al(NO_{3})_{3}·9H_{2}O

What is the mass percentage of water in each hydrate?
 CaSO_{4}·2H_{2}O
 Fe(NO_{3})_{3}·9H_{2}O
 (NH_{4})_{3}ZrOH(CO_{3})_{3}·2H_{2}O

Which of the following has the greatest mass percentage of oxygen—KMnO_{4}, K_{2}Cr_{2}O_{7}, or Fe_{2}O_{3}?

Which of the following has the greatest mass percentage of oxygen—ThOCl_{2}, MgCO_{3}, or NO_{2}Cl?

Calculate the percent composition of the element shown in bold in each compound.
 SbBr_{3}
 As_{2}I_{4}
 AlPO_{4}
 C_{6}H_{10}O

Calculate the percent composition of the element shown in bold in each compound.
 HBrO_{3}
 CsReO_{4}
 C_{3}H_{8}O
 FeSO_{4}

A sample of a chromium compound has a molar mass of 151.99 g/mol. Elemental analysis of the compound shows that it contains 68.43% chromium and 31.57% oxygen. What is the identity of the compound?

The percentages of iron and oxygen in the three most common binary compounds of iron and oxygen are given in the following table. Write the empirical formulas of these three compounds.
Compound % Iron % Oxygen Empirical Formula 1 69.9 30.1 2 77.7 22.3 3 72.4 27.6 
What is the mass percentage of water in each hydrate?
 LiCl·H_{2}O
 MgSO_{4}·7H_{2}O
 Sr(NO_{3})_{2}·4H_{2}O

What is the mass percentage of water in each hydrate?
 CaHPO_{4}·2H_{2}O
 FeCl_{2}·4H_{2}O
 Mg(NO_{3})_{2}·4H_{2}O

Two hydrates were weighed, heated to drive off the waters of hydration, and then cooled. The residues were then reweighed. Based on the following results, what are the formulas of the hydrates?
Compound Initial Mass (g) Mass after Cooling (g) NiSO_{4}·xH_{2}O 2.08 1.22 CoCl_{2}·xH_{2}O 1.62 0.88 
Which contains the greatest mass percentage of sulfur—FeS_{2}, Na_{2}S_{2}O_{4}, or Na_{2}S?

Given equal masses of each, which contains the greatest mass percentage of sulfur—NaHSO_{4} or K_{2}SO_{4}?

Calculate the mass percentage of oxygen in each polyatomic ion.
 bicarbonate
 chromate
 acetate
 sulfite

Calculate the mass percentage of oxygen in each polyatomic ion.
 oxalate
 nitrite
 dihydrogen phosphate
 thiocyanate

The empirical formula of garnet, a gemstone, is Fe_{3}Al_{2}Si_{3}O_{12}. An analysis of a sample of garnet gave a value of 13.8% for the mass percentage of silicon. Is this consistent with the empirical formula?

A compound has the empirical formula C_{2}H_{4}O, and its formula mass is 88 g. What is its molecular formula?

Mirex is an insecticide that contains 22.01% carbon and 77.99% chlorine. It has a molecular mass of 545.59 g. What is its empirical formula? What is its molecular formula?

How many moles of CO_{2} and H_{2}O will be produced by combustion analysis of 0.010 mol of styrene?

How many moles of CO_{2}, H_{2}O, and N_{2} will be produced by combustion analysis of 0.0080 mol of aniline?

How many moles of CO_{2}, H_{2}O, and N_{2} will be produced by combustion analysis of 0.0074 mol of aspartame?

How many moles of CO_{2}, H_{2}O, N_{2}, and SO_{2} will be produced by combustion analysis of 0.0060 mol of penicillin G?

Combustion of a 34.8 mg sample of benzaldehyde, which contains only carbon, hydrogen, and oxygen, produced 101 mg of CO_{2} and 17.7 mg of H_{2}O.
 What was the mass of carbon and hydrogen in the sample?
 Assuming that the original sample contained only carbon, hydrogen, and oxygen, what was the mass of oxygen in the sample?
 What was the mass percentage of oxygen in the sample?
 What is the empirical formula of benzaldehyde?
 The molar mass of benzaldehyde is 106.12 g/mol. What is its molecular formula?

Salicylic acid is used to make aspirin. It contains only carbon, oxygen, and hydrogen. Combustion of a 43.5 mg sample of this compound produced 97.1 mg of CO_{2} and 17.0 mg of H_{2}O.
 What is the mass of oxygen in the sample?
 What is the mass percentage of oxygen in the sample?
 What is the empirical formula of salicylic acid?
 The molar mass of salicylic acid is 138.12 g/mol. What is its molecular formula?

Given equal masses of the following acids, which contains the greatest amount of hydrogen that can dissociate to form H^{+}—nitric acid, hydroiodic acid, hydrocyanic acid, or chloric acid?

Calculate the formula mass or the molecular mass of each compound.
 heptanoic acid (a sevencarbon carboxylic acid)
 2propanol (a threecarbon alcohol)
 KMnO_{4}
 tetraethyllead
 sulfurous acid
 ethylbenzene (an eightcarbon aromatic hydrocarbon)

Calculate the formula mass or the molecular mass of each compound.
 MoCl_{5}
 B_{2}O_{3}
 bromobenzene
 cyclohexene
 phosphoric acid
 ethylamine

Given equal masses of butane, cyclobutane, and propene, which contains the greatest mass of carbon?

Given equal masses of urea [(NH_{2})_{2}CO] and ammonium sulfate, which contains the most nitrogen for use as a fertilizer?
Answers

To two decimal places, the percentages are:
 5.97%
 37.12%
 43.22%

% oxygen: KMnO_{4}, 40.50%; K_{2}Cr_{2}O_{7}, 38.07%; Fe_{2}O_{3}, 30.06%

To two decimal places, the percentages are:
 66.32% Br
 22.79% As
 25.40% P
 73.43% C

Cr_{2}O_{3}.

To two decimal places, the percentages are:
 29.82%
 51.16%
 25.40%

NiSO_{4} · 6H_{2}O and CoCl_{2} · 6H_{2}O

NaHSO_{4}

 72.71%
 69.55%
 65.99%
 0%

C_{4}H_{8}O_{2}

 27.6 mg C and 1.98 mg H
 5.2 mg O
 15%
 C_{7}H_{6}O
 C_{7}H_{6}O

hydrocyanic acid, HCN

To two decimal places, the values are:
 273.23 amu
 69.62 amu
 157.01 amu
 82.14 amu
 98.00 amu
 45.08 amu

Urea
Contributors
 Anonymous
Modified by Joshua Halpern