# 3.2: Determining Empirical and Molecular Formulas

- Page ID
- 38148

- Compute the percent composition of a compound
- Determine the empirical formula of a compound
- Determine the molecular formula of a compound

In the previous section, we discussed the relationship between the bulk mass of a substance and the number of atoms or molecules it contains (moles). Given the chemical formula of the substance, we were able to determine the amount of the substance (moles) from its mass, and vice versa. But what if the chemical formula of a substance is unknown? In this section, we will explore how to apply these very same principles in order to derive the chemical formulas of unknown substances from experimental mass measurements.

## Percent Composition

The elemental makeup of a compound defines its chemical identity, and chemical formulas are the most succinct way of representing this elemental makeup. When a compound’s formula is unknown, measuring the mass of each of its constituent elements is often the first step in the process of determining the formula experimentally. The results of these measurements permit the calculation of the compound’s percent composition, defined as the percentage by mass of each element in the compound. For example, consider a gaseous compound composed solely of carbon and hydrogen. The percent composition of this compound could be represented as follows:

\[\mathrm{\%H=\dfrac{mass\: H}{mass\: compound}\times100\%} \nonumber \]

\[\mathrm{\%C=\dfrac{mass\: C}{mass\: compound}\times100\%} \nonumber \]

If analysis of a 10.0-g sample of this gas showed it to contain 2.5 g H and 7.5 g C, the percent composition would be calculated to be 25% H and 75% C:

\[\mathrm{\%H=\dfrac{2.5\:g\: H}{10.0\:g\: compound}\times100\%=25\%} \nonumber \]

\[\mathrm{\%C=\dfrac{7.5\:g\: C}{10.0\:g\: compound}\times100\%=75\%} \nonumber \]

Analysis of a 12.04-g sample of a liquid compound composed of carbon, hydrogen, and nitrogen showed it to contain 7.34 g C, 1.85 g H, and 2.85 g N. What is the percent composition of this compound?

###### Solution

To calculate percent composition, we divide the experimentally derived mass of each element by the overall mass of the compound, and then convert to a percentage:

\[\mathrm{\%C=\dfrac{7.34\:g\: C}{12.04\:g\: compound}\times100\%=61.0\%} \nonumber \]

\[\mathrm{\%H=\dfrac{1.85\:g\: H}{12.04\:g\: compound}\times100\%=15.4\%} \nonumber \]

\[\mathrm{\%N=\dfrac{2.85\:g\: N}{12.04\:g\: compound}\times100\%=23.7\%} \nonumber \]

The analysis results indicate that the compound is 61.0% C, 15.4% H, and 23.7% N by mass.

A 24.81-g sample of a gaseous compound containing only carbon, oxygen, and chlorine is determined to contain 3.01 g C, 4.00 g O, and 17.81 g Cl. What is this compound’s percent composition?

**Answer**-
12.1% C, 16.1% O, 71.8% Cl

## Determining Percent Composition from Formula Mass

Percent composition is also useful for evaluating the relative abundance of a given element in different compounds of known formulas. As one example, consider the common nitrogen-containing fertilizers ammonia (NH_{3}), ammonium nitrate (NH_{4}NO_{3}), and urea (CH_{4}N_{2}O). The element nitrogen is the active ingredient for agricultural purposes, so the mass percentage of nitrogen in the compound is a practical and economic concern for consumers choosing among these fertilizers. For these sorts of applications, the percent composition of a compound is easily derived from its formula mass and the atomic masses of its constituent elements. A molecule of NH_{3} contains one N atom weighing 14.01 amu and three H atoms weighing a total of (3 × 1.008 amu) = 3.024 amu. The formula mass of ammonia is therefore (14.01 amu + 3.024 amu) = 17.03 amu, and its percent composition is:

\[\mathrm{\%N=\dfrac{14.01\:amu\: N}{17.03\:amu\:NH_3}\times100\%=82.27\%} \nonumber \]

\[\mathrm{\%H=\dfrac{3.024\:amu\: N}{17.03\:amu\:NH_3}\times100\%=17.76\%} \nonumber \]

This same approach may be taken considering a pair of molecules, a dozen molecules, or a mole of molecules, etc. The latter amount is most convenient and would simply involve the use of molar masses instead of atomic and formula masses, as demonstrated Example \(\PageIndex{2}\). As long as we know the chemical formula of the substance in question, we can easily derive percent composition from the formula mass or molar mass.

Aspirin is a compound with the molecular formula C_{9}H_{8}O_{4}. What is its percent composition?

###### Solution

To calculate the percent composition, we need to know the masses of C, H, and O in a known mass of C_{9}H_{8}O_{4}. It is convenient to consider 1 mol of C_{9}H_{8}O_{4} and use its molar mass (180.159 g/mole, determined from the chemical formula) to calculate the percentages of each of its elements:

\[\begin{align*}

\%\ce C&=\mathrm{\dfrac{9\:mol\: C\times molar\: mass\: C}{molar\: mass\:\ce{C9H18O4}}\times100=\dfrac{9\times12.01\:g/mol} \nonumber{180.159\:g/mol}\times100=\dfrac{108.09\:g/mol}{180.159\:g/mol}\times100} \nonumber\\

\%\ce C&=\mathrm{60.00\,\%\,C} \nonumber

\end{align*} \nonumber \]

\[\begin{align*}

\%\ce H&=\mathrm{\dfrac{8\:mol\: H\times molar\: mass\: H}{molar\: mass\:\ce{C9H18O4}}\times 100=\dfrac{8\times 1.008\:g/mol} \nonumber {180.159\:g/mol}\times 100=\dfrac{8.064\:g/mol}{180.159\:g/mol}\times 100} \nonumber\\

\%\ce H&=4.476\,\%\,\ce H \nonumber

\end{align*} \nonumber \]

\[\begin{align*}

\%\ce O&=\mathrm{\dfrac{4\:mol\: O\times molar\: mass\: O}{molar\: mass\: \ce{C9H18O4}}\times 100=\dfrac{4\times 16.00\:g/mol} \nonumber{180.159\:g/mol}\times 100=\dfrac{64.00\:g/mol}{180.159\:g/mol}\times 100} \nonumber \\

\%\ce O&=35.52\% \nonumber

\end{align*} \nonumber \]

Note that these percentages sum to equal 100.00% when appropriately rounded.

To three significant digits, what is the mass percentage of iron in the compound \(Fe_2O_3\)?

**Answer**-
69.9% Fe

## Determination of Empirical Formulas

As previously mentioned, the most common approach to determining a compound’s chemical formula is to first measure the masses of its constituent elements. However, we must keep in mind that chemical formulas represent the relative *numbers*, not masses, of atoms in the substance. Therefore, any experimentally derived data involving mass must be used to derive the corresponding numbers of atoms in the compound. To accomplish this, we can use molar masses to convert the mass of each element to a number of moles. We then consider the moles of each element relative to each other, converting these numbers into a whole-number ratio that can be used to derive the empirical formula of the substance. Consider a sample of compound determined to contain 1.71 g C and 0.287 g H. The corresponding numbers of atoms (in moles) are:

\[\mathrm{1.71\:g\: C\times \dfrac{1\:mol\: C}{12.01\:g\: C}=0.142\:mol\: C} \nonumber \]

\[\mathrm{0.287\:g\: H\times \dfrac{1\:mol\: H}{1.008\:g\: H}=0.284\:mol\: H} \nonumber \]

Thus, we can accurately represent this compound with the formula C_{0.142}H_{0.284}. Of course, per accepted convention, formulas contain whole-number subscripts, which can be achieved by dividing each subscript by the smaller subscript:

\[\ce C_{\Large{\frac{0.142}{0.142}}}\:\ce H_{\Large{\frac{0.284}{0.142}}}\ce{\:or\:CH2} \nonumber \]

(Recall that subscripts of “1” are not written, but rather assumed if no other number is present.)

The empirical formula for this compound is thus CH_{2}. This may or not be the compound’s *molecular formula* as well; however, we would need additional information to make that determination (as discussed later in this section).

Consider as another example a sample of compound determined to contain 5.31 g Cl and 8.40 g O. Following the same approach yields a tentative empirical formula of:

\[\mathrm{Cl_{0.150}O_{0.525}=Cl_{\Large{\frac{0.150}{0.150}}}\: O_{\Large{\frac{0.525}{0.150}}}=ClO_{3.5}} \nonumber \]

In this case, dividing by the smallest subscript still leaves us with a decimal subscript in the empirical formula. To convert this into a whole number, we must multiply each of the subscripts by two, retaining the same atom ratio and yielding Cl_{2}O_{7} as the final empirical formula.

In summary, empirical formulas are derived from experimentally measured element masses by:

- Deriving the number of moles of each element from its mass
- Dividing each element’s molar amount by the smallest molar amount to yield subscripts for a tentative empirical formula
- Multiplying all coefficients by an integer, if necessary, to ensure that the smallest whole-number ratio of subscripts is obtained

Figure \(\PageIndex{1}\) outlines this procedure in flow chart fashion for a substance containing elements A and X.

A sample of the black mineral hematite (Figure \(\PageIndex{2}\)), an oxide of iron found in many iron ores, contains 34.97 g of iron and 15.03 g of oxygen. What is the empirical formula of hematite?

###### Solution

For this problem, we are given the mass in grams of each element. Begin by finding the moles of each:

\[\begin{align*}

\mathrm{34.97\:g\: Fe\left(\dfrac{mol\: Fe}{55.85\:g}\right)}&=\mathrm{0.6261\:mol\: Fe} \nonumber\\ \nonumber\\

\mathrm{15.03\:g\: O\left(\dfrac{mol\: O}{16.00\:g}\right)}&=\mathrm{0.9394\:mol\: O} \nonumber

\end{align*} \nonumber \]

Next, derive the iron-to-oxygen molar ratio by dividing by the lesser number of moles:

\[\mathrm{\dfrac{0.6261}{0.6261}=1.000\:mol\: Fe} \nonumber \]

\[\mathrm{\dfrac{0.9394}{0.6261}=1.500\:mol\: O} \nonumber \]

The ratio is 1.000 mol of iron to 1.500 mol of oxygen (Fe_{1}O_{1.5}). Finally, multiply the ratio by two to get the smallest possible whole number subscripts while still maintaining the correct iron-to-oxygen ratio:

\[\mathrm{2(Fe_1O_{1.5})=Fe_2O_3} \nonumber \]

The empirical formula is \(Fe_2O_3\).

What is the empirical formula of a compound if a sample contains 0.130 g of nitrogen and 0.370 g of oxygen?

**Answer**-
\(N_2O_5\)

**Video \(\PageIndex{1}\)**: Additional worked examples illustrating the derivation of empirical formulas are presented in the brief video clip.

## Deriving Empirical Formulas from Percent Composition

Finally, with regard to deriving empirical formulas, consider instances in which a compound’s percent composition is available rather than the absolute masses of the compound’s constituent elements. In such cases, the percent composition can be used to calculate the masses of elements present in any convenient mass of compound; these masses can then be used to derive the empirical formula in the usual fashion.

The bacterial fermentation of grain to produce ethanol forms a gas with a percent composition of 27.29% C and 72.71% O (Figure \(\PageIndex{3}\)). What is the empirical formula for this gas?

###### Solution

Since the scale for percentages is 100, it is most convenient to calculate the mass of elements present in a sample weighing 100 g. The calculation is “most convenient” because, per the definition for percent composition, the mass of a given element in grams is numerically equivalent to the element’s mass percentage. This numerical equivalence results from the definition of the “percentage” unit, whose name is derived from the Latin phrase *per centum* meaning “by the hundred.” Considering this definition, the mass percentages provided may be more conveniently expressed as fractions:

\[\begin{align*}

27.29\,\%\,\ce C&=\mathrm{\dfrac{27.29\:g\: C}{100\:g\: compound}} \nonumber \\ \nonumber \\

72.71\,\%\,\ce O&=\mathrm{\dfrac{72.71\:g\: O}{100\:g\: compound}} \nonumber

\end{align*} \nonumber \]

The molar amounts of carbon and hydrogen in a 100-g sample are calculated by dividing each element’s mass by its molar mass:

\[\begin{align*}

\mathrm{27.29\:g\: C\left(\dfrac{mol\: C}{12.01\:g}\right)}&=\mathrm{2.272\:mol\: C} \nonumber \\ \nonumber \\

\mathrm{72.71\:g\: O\left(\dfrac{mol\: O}{16.00\:g}\right)}&=\mathrm{4.544\:mol\: O} \nonumber

\end{align*} \nonumber \]

Coefficients for the tentative empirical formula are derived by dividing each molar amount by the lesser of the two:

\[\mathrm{\dfrac{2.272\:mol\: C}{2.272}=1} \nonumber \]

\[\mathrm{\dfrac{4.544\:mol\: O}{2.272}=2} \nonumber \]

Since the resulting ratio is one carbon to two oxygen atoms, the empirical formula is CO_{2}.

What is the empirical formula of a compound containing 40.0% C, 6.71% H, and 53.28% O?

**Answer**-
\(CH_2O\)

## Derivation of Molecular Formulas

Recall that empirical formulas are symbols representing the *relative* numbers of a compound’s elements. Determining the *absolute* numbers of atoms that compose a single molecule of a covalent compound requires knowledge of both its empirical formula and its molecular mass or molar mass. These quantities may be determined experimentally by various measurement techniques. Molecular mass, for example, is often derived from the mass spectrum of the compound (see discussion of this technique in the previous chapter on atoms and molecules). Molar mass can be measured by a number of experimental methods, many of which will be introduced in later chapters of this text.

Molecular formulas are derived by comparing the compound’s molecular or molar mass to its empirical formula mass. As the name suggests, an empirical formula mass is the sum of the average atomic masses of all the atoms represented in an empirical formula. If we know the molecular (or molar) mass of the substance, we can divide this by the empirical formula mass in order to identify the number of empirical formula units per molecule, which we designate as *n*:

\[\mathrm{\dfrac{molecular\: or\: molar\: mass\left(amu\: or\:\dfrac{g}{mol}\right)}{empirical\: formula\: mass\left(amu\: or\:\dfrac{g}{mol}\right)}= \mathit n\: formula\: units/molecule} \nonumber \]

The molecular formula is then obtained by multiplying each subscript in the empirical formula by *n*, as shown by the generic empirical formula A_{x}B_{y}:

\[\mathrm{(A_xB_y)_n=A_{nx}B_{nx}} \nonumber \]

For example, consider a covalent compound whose empirical formula is determined to be CH_{2}O. The empirical formula mass for this compound is approximately 30 amu (the sum of 12 amu for one C atom, 2 amu for two H atoms, and 16 amu for one O atom). If the compound’s molecular mass is determined to be 180 amu, this indicates that molecules of this compound contain six times the number of atoms represented in the empirical formula:

\[\mathrm{\dfrac{180\:amu/molecule}{30\:\dfrac{amu}{formula\: unit}}=6\:formula\: units/molecule} \nonumber \]

Molecules of this compound are then represented by molecular formulas whose subscripts are six times greater than those in the empirical formula:

\[\ce{(CH2O)6}=\ce{C6H12O6} \nonumber \]

Note that this same approach may be used when the molar mass (g/mol) instead of the molecular mass (amu) is used. In this case, we are merely considering one mole of empirical formula units and molecules, as opposed to single units and molecules.

Nicotine, an alkaloid in the nightshade family of plants that is mainly responsible for the addictive nature of cigarettes, contains 74.02% C, 8.710% H, and 17.27% N. If 40.57 g of nicotine contains 0.2500 mol nicotine, what is the molecular formula?

###### Solution

Determining the molecular formula from the provided data will require comparison of the compound’s empirical formula mass to its molar mass. As the first step, use the percent composition to derive the compound’s empirical formula. Assuming a convenient, a 100-g sample of nicotine yields the following molar amounts of its elements:

\[\begin{alignat}{2}

&\mathrm{(74.02\:g\: C)\left(\dfrac{1\:mol\: C}{12.01\:g\: C}\right)}&&= \:\mathrm{6.163\:mol\: C}\\

&\mathrm{(8.710\:g\: H)\left(\dfrac{1\:mol\: H}{1.01\:g\: H}\right)}&&= \:\mathrm{8.624\:mol\: H}\\

&\mathrm{(17.27\:g\: N)\left(\dfrac{1\:mol\: N}{14.01\:g\: N}\right)}&&= \:\mathrm{1.233\:mol\: N}

\end{alignat} \nonumber \]

Next, we calculate the molar ratios of these elements relative to the least abundant element, \(\ce{N}\).

\(6.163 \: \text{mol} \: \ce{C}\) / \(1.233 \: \text{mol} \: \ce{N}\) = 5

\(8.264 \: \text{mol} \: \ce{H}\) / \(1.233 \: \text{mol} \: \ce{N}\) = 7

\(1.233 \: \text{mol} \: \ce{N}\) / \(1.233\: \text{mol} \: \ce{N}\) = 1

1.233/1.233 = \(1.000 \: \text{mol} \: \ce{N}\)

6.163/1.233 = \(4.998 \: \text{mol} \: \ce{C}\)

8.624/1.233 = \(6.994 \: \text{mol} \: \ce{H}\)

The C-to-N and H-to-N molar ratios are adequately close to whole numbers, and so the empirical formula is C_{5}H_{7}N. The empirical formula mass for this compound is therefore 81.13 amu/formula unit, or 81.13 g/mol formula unit.

We calculate the molar mass for nicotine from the given mass and molar amount of compound:

\[\mathrm{\dfrac{40.57\:g\: nicotine}{0.2500\:mol\: nicotine}=\dfrac{162.3\:g}{mol}} \nonumber \]

Comparing the molar mass and empirical formula mass indicates that each nicotine molecule contains two formula units:

\[\mathrm{\dfrac{162.3\:g/mol}{81.13\:\dfrac{g}{formula\: unit}}=2\:formula\: units/molecule} \nonumber \]

Thus, we can derive the molecular formula for nicotine from the empirical formula by multiplying each subscript by two:

\[\ce{(C5H7N)2}=\ce{C10H14N2} \nonumber \]

What is the molecular formula of a compound with a percent composition of 49.47% C, 5.201% H, 28.84% N, and 16.48% O, and a molecular mass of 194.2 amu?

**Answer**-
C

_{8}H_{10}N_{4}O_{2}

## Summary

The chemical identity of a substance is defined by the types and relative numbers of atoms composing its fundamental entities (molecules in the case of covalent compounds, ions in the case of ionic compounds). A compound’s percent composition provides the mass percentage of each element in the compound, and it is often experimentally determined and used to derive the compound’s empirical formula. The empirical formula mass of a covalent compound may be compared to the compound’s molecular or molar mass to derive a molecular formula.

## Key Equations

- \(\mathrm{\%X=\dfrac{mass\: X}{mass\: compound}\times100\%}\)
- \(\mathrm{\dfrac{molecular\: or\: molar\: mass\left(amu\: or\:\dfrac{g}{mol}\right)}{empirical\: formula\: mass\left(amu\: or\:\dfrac{g}{mol}\right)}=\mathit n\: formula\: units/molecule}\)
- (A
_{x}B_{y})_{n}= A_{nx}B_{ny}

## Glossary

- percent composition
- percentage by mass of the various elements in a compound

- empirical formula mass
- sum of average atomic masses for all atoms represented in an empirical formula