Skip to main content
Chemistry LibreTexts

1.3.4: Proteins- Polymers of Amino Acids

  • Page ID
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)
    Learning Objectives
    • Identify different amino acids.
    • Describe how short chain proteins (polypeptides) are formed from amino acids.

    Proteins may be defined as compounds of high molar mass consisting largely or entirely of chains of amino acids. Their masses range from several thousand to several million daltons (Da). In addition to carbon, hydrogen, and oxygen atoms, all proteins contain nitrogen and sulfur atoms, and many also contain phosphorus atoms and traces of other elements. Proteins serve a variety of roles in living organisms and are often classified by these biological roles. Muscle tissue is largely protein, as are skin and hair. Proteins are present in the blood, in the brain, and even in tooth enamel. Each type of cell in our bodies makes its own specialized proteins, as well as proteins common to all or most cells. We begin our study of proteins by looking at the properties and reactions of amino acids, which is followed by a discussion of how amino acids link covalently to form peptides and proteins. We end the chapter with a discussion of enzymes—the proteins that act as catalysts in the body.

    The proteins in all living species, from bacteria to humans, are constructed from the same set of 20 amino acids, so called because each contains an amino group attached to a carboxylic acid. The amino acids in proteins are α-amino acids, which means the amino group is attached to the α-carbon of the carboxylic acid. Humans can synthesize only about half of the needed amino acids; the remainder must be obtained from the diet and are known as essential amino acids. However, two additional amino acids have been found in limited quantities in proteins: Selenocysteine was discovered in 1986, while pyrrolysine was discovered in 2002.

    The amino acids are colorless, nonvolatile, crystalline solids, melting and decomposing at temperatures above 200°C. These melting temperatures are more like those of inorganic salts than those of amines or organic acids and indicate that the structures of the amino acids in the solid state and in neutral solution are best represented as having both a negatively charged group and a positively charged group. Such a species is known as a zwitterion.



    In addition to the amino and carboxyl groups, amino acids have a side chain or R group attached to the α-carbon. Each amino acid has unique characteristics arising from the size, shape, solubility, and ionization properties of its R group. As a result, the side chains of amino acids exert a profound effect on the structure and biological activity of proteins. Although amino acids can be classified in various ways, one common approach is to classify them according to whether the functional group on the side chain at neutral pH is nonpolar, polar but uncharged, negatively charged, or positively charged. The structures and names of the 20 amino acids, their one- and three-letter abbreviations, and some of their distinctive features are given in Table \(\PageIndex{1}\).

    Table \(\PageIndex{1}\): Common Amino Acids Found in Proteins
    Common Name Abbreviation Structural Formula (at pH 6) Molar Mass Distinctive Feature
    Amino acids with a nonpolar R group
    glycine gly (G) gly.jpg 75 the only amino acid lacking a chiral carbon
    alanine ala (A) ala.jpg 89
    valine val (V) val.jpg 117 a branched-chain amino acid
    leucine leu (L) leu.jpg 131 a branched-chain amino acid
    isoleucine ile (I) ile.jpg 131 an essential amino acid because most animals cannot synthesize branched-chain amino acids
    phenylalanine phe (F) phe.jpg 165 also classified as an aromatic amino acid
    tryptophan trp (W) trp.jpg 204 also classified as an aromatic amino acid
    methionine met (M) met.jpg 149 side chain functions as a methyl group donor
    proline pro (P) pro.jpg 115 contains a secondary amine group; referred to as an α-imino acid
    Amino acids with a polar but neutral R group
    serine ser (S) ser.jpg 105 found at the active site of many enzymes
    threonine thr (T) thr.jpg 119 named for its similarity to the sugar threose
    cysteine cys (C) cys.jpg 121 oxidation of two cysteine molecules yields cystine
    tyrosine tyr (Y) tyr.jpg 181 also classified as an aromatic amino acid
    asparagine asn (N) asn.jpg 132 the amide of aspartic acid
    glutamine gln (Q) gln.jpg 146 the amide of glutamic acid
    Amino acids with a negatively charged R group
    aspartic acid asp (D) asp.jpg 132 carboxyl groups are ionized at physiological pH; also known as aspartate
    glutamic acid glu (E) glu.jpg 146 carboxyl groups are ionized at physiological pH; also known as glutamate
    Amino acids with a positively charged R group
    histidine his (H) his.jpg 155 the only amino acid whose R group has a pKa (6.0) near physiological pH
    lysine lys (K) lys.jpg 147
    arginine arg (R) arg.jpg 175 almost as strong a base as sodium hydroxide

    The first amino acid to be isolated was asparagine in 1806. It was obtained from protein found in asparagus juice (hence the name). Glycine, the major amino acid found in gelatin, was named for its sweet taste (Greek glykys, meaning “sweet”). In some cases an amino acid found in a protein is actually a derivative of one of the common 20 amino acids (one such derivative is hydroxyproline). The modification occurs after the amino acid has been assembled into a protein.

    proline, hydroxyproline.jpg


    The structure of an amino acid allows it to act as both an acid and a base. An amino acid has this ability because at a certain pH value (different for each amino acid) nearly all the amino acid molecules exist as zwitterions. If acid is added to a solution containing the zwitterion, the carboxylate group captures a hydrogen (H+) ion, and the amino acid becomes positively charged. If base is added, ion removal of the H+ ion from the amino group of the zwitterion produces a negatively charged amino acid. In both circumstances, the amino acid acts to maintain the pH of the system—that is, to remove the added acid (H+) or base (OH) from solution.

    acid base addition.jpg

    The Peptide Bond: Peptides and Proteins

    Two or more amino acids can join together into chains called peptides. Previously, we discussed the reaction between ammonia and a carboxylic acid to form an amide. In a similar reaction, the amino group on one amino acid molecule reacts with the carboxyl group on another, releasing a molecule of water and forming an amide linkage as shown in Figure \(\PageIndex{1}\)

    Condensation reaction of two amino acids to form a peptide
    Figure \(\PageIndex{1}\) Amino acids undergo condensation to form a molecule called a dipeptide. The \(\ce{C-N}\) bond is called a peptide bond.

    An amide bond joining two amino acid units is called a peptide bond. Note that the product molecule still has a reactive amino group on the left and a reactive carboxyl group on the right. These can react with additional amino acids to lengthen the peptide. The process can continue until thousands of units have joined, resulting in large proteins.

    A chain consisting of only two amino acid units is called a dipeptide; a chain consisting of three is a tripeptide. By convention, peptide and protein structures are depicted with the amino acid whose amino group is free (the N-terminal end) on the left and the amino acid with a free carboxyl group (the C-terminal end) to the right.

    terminal end.jpg

    The general term peptide refers to an amino acid chain of unspecified length. However, chains of about 50 amino acids or more are usually called proteins or polypeptides. In its physiologically active form, a protein may be composed of one or more polypeptide chains.

    Peptide cells in our bodies have an intricate mechanism for the manufacture of proteins. Humans have to use other techniques in order to synthesize the same proteins in a lab. The chemistry of peptide synthesis is complicated. Both active groups on an amino acid can react and the amino acid sequence must be a specific one in order for the protein to function. Robert Merrifield developed the first synthetic approach for making proteins in the lab, a manual approach which was lengthy and tedious (and, he won the Nobel Prize in Chemistry in 1984 for his work). Today, however, automated systems can crank out a peptide in a very short period of time.

    The Sequence of Amino Acids

    The particular sequence of amino acids in a longer chain is called an amino acid sequence. By convention, the amino acid sequence is listed in the order such that the free amine group is on the left end of the molecule and the free carboxyl group is on the right end of the molecule.


    For example, suppose that a sequence of the amino acids glycine, tryptophan, and alanine is formed with the free amine group as part of the glycine and the free carboxyl group as part of the alanine. The amino acid sequence can be easily written using the abbreviations as Gly-Trp-Ala. This is a different sequence from Ala-Trp-Gly because the free amine and carboxyl groups would be on different amino acids in that case.


    For peptides and proteins to be physiologically active, it is not enough that they incorporate certain amounts of specific amino acids. The order, or sequence, in which the amino acids are connected is also of critical importance. Bradykinin is a nine-amino acid peptide (Figure \(\PageIndex{2}\)) produced in the blood that has the following amino acid sequence:


    This peptide lowers blood pressure, stimulates smooth muscle tissue, increases capillary permeability, and causes pain. When the order of amino acids in bradykinin is reversed,


    the peptide resulting from this synthesis shows none of the activity of bradykinin.

    Figure \(\PageIndex{2}\) Space-filling model of bradykinin. (Public Domain; Fvasconcellos)

    Just as millions of different words are spelled with our 26-letter English alphabet, millions of different proteins are made with the 20 common amino acids. However, just as the English alphabet can be used to write gibberish, amino acids can be put together in the wrong sequence to produce nonfunctional proteins. Although the correct sequence is ordinarily of utmost importance, it is not always absolutely required. Just as you can sometimes make sense of incorrectly spelled English words, a protein with a small percentage of “incorrect” amino acids may continue to function. However, it rarely functions as well as a protein having the correct sequence. There are also instances in which seemingly minor errors of sequence have disastrous effects. For example, in some people, every molecule of hemoglobin (a protein in the blood that transports oxygen) has a single incorrect amino acid unit out of about 300 (a single valine replaces a glutamic acid). That “minor” error is responsible for sickle cell anemia, an inherited condition that usually is fatal.

    Example \(\PageIndex{1}\)

    Draw the polypeptide Asp-Val-Ser.


    1. Identify the structures of each of the three given amino acids and draw them in the same order as given in the name.

    2. Leaving the order the same, connect the amino acids to one another by forming peptide bonds. Note that the order given in the name is the same way the amino acids are connected in the molecule. The first one listed is always the \(\ce{N}\)-terminus of the polypeptide.

    Example \(\PageIndex{2}\)

    List all of the possible polypeptides that can be formed from cysteine (Cys), leucine (Leu), and arginine (Arg).


    Although there are only three amino acids, the order in which they are bonded changes the identity, properties, and function of the resulting polypeptide. There are six possible polypeptides formed from these three amino acids.







    Exercise \(\PageIndex{1}\)

    Draw the structure for each peptide.

    1. gly-val
    2. val-gly


    a. EA1a.jpg

    b. EA1b.jpg


    • The amino group of one amino acid can react with the carboxyl group on another amino acid to form a peptide bond that links the two amino acids together.
    • Additional amino acids can be added on through the formation of addition peptide (amide) bonds.
    • A chain consisting of only two amino acid units is called a dipeptide; a chain consisting of three is a tripeptide.
    • Chains of about 50 amino acids or more are usually called proteins or polypeptides.
    • A sequence of amino acids in a peptide or protein is written with the N-terminal amino acid first and the C-terminal amino acid at the end (writing left to right).
    • The order, or sequence, in which the amino acids are connected is also of critical importance in order for a peptide or protein to be physiologically active.

    Contributors and Attributions

    1.3.4: Proteins- Polymers of Amino Acids is shared under a not declared license and was authored, remixed, and/or curated by LibreTexts.

    • Was this article helpful?