Proteinogenic amino acid
Encyclopedia
Proteinogenic amino acids are those amino acids that can be found in protein
Protein
Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form, facilitating a biological function. A polypeptide is a single linear polymer chain of amino acids bonded together by peptide bonds between the carboxyl and amino groups of...

s and require cellular machinery coded for in the genetic code
Genetic code
The genetic code is the set of rules by which information encoded in genetic material is translated into proteins by living cells....

  of any organism for their isolated production. There are 22 standard amino acids, but only 21 are found in eukaryotes. Of the 22, 20 are directly encoded by the universal genetic code. Humans can synthesize 11 of these 20 from each other or from other molecules of intermediary metabolism. The other 9 must be consumed in the diet, and so are called essential amino acids; those are histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

, isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

, leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

, lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

, methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

, phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

, threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

, tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

, and valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

. The remaining two, selenocysteine
Selenocysteine
Selenocysteine is an amino acid that is present in several enzymes .-Nomenclature:...

 and pyrrolysine
Pyrrolysine
Pyrrolysine is a naturally occurring, genetically coded amino acid used by some methanogenic archaea and one known bacterium in enzymes that are part of their methane-producing metabolism. It is similar to lysine, but with an added pyrroline ring linked to the end of the lysine side chain...

, are incorporated into proteins by unique synthetic mechanisms.

The word proteinogenic means "protein building". Proteinogenic amino acids can be assembled into a polypeptide (the subunit of a protein) through a process called translation (the second stage of protein biosynthesis
Protein biosynthesis
Protein biosynthesis is the process in which cells build or manufacture proteins. The term is sometimes used to refer only to protein translation but more often it refers to a multi-step process, beginning with amino acid synthesis and transcription of nuclear DNA into messenger RNA, which is then...

, part of the overall process of gene expression
Biosemiotics
Biosemiotics is a growing field that studies the production, action and interpretation of signs in the biological realm...

).

In contrast, non-proteinogenic amino acids are either not found in proteins (like carnitine
Carnitine
Carnitine is a quaternary ammonium compound biosynthesized from the amino acids lysine and methionine. In living cells, it is required for the transport of fatty acids from the cytosol into the mitochondria during the breakdown of lipids for the generation of metabolic energy. It is widely...

, GABA
Gamma-aminobutyric acid
γ-Aminobutyric acid is the chief inhibitory neurotransmitter in the mammalian central nervous system. It plays a role in regulating neuronal excitability throughout the nervous system...

, or L-DOPA), or are not produced directly and in isolation by standard cellular machinery (like hydroxyproline
Hydroxyproline
-4-Hydroxyproline, or L-hydroxyproline , is a common non-proteinogenic amino acid, abbreviated as HYP, e.g., in Protein Data Bank.-Structure and discovery:...

 and selenomethionine
Selenomethionine
Selenomethionine is an amino acid containing selenium. The L-enantiomer of selenomethionine, known as Se-met and Sem, is a common natural food source of selenium. In vivo, selenomethionine is randomly incorporated instead of methionine and is readily oxidized. Its antioxidant activity arises from...

). The latter often results from posttranslational modification
Posttranslational modification
Posttranslational modification is the chemical modification of a protein after its translation. It is one of the later steps in protein biosynthesis, and thus gene expression, for many proteins....

 of proteins.

There are clear reasons why organisms have not evolved
Evolution
Evolution is any change across successive generations in the heritable characteristics of biological populations. Evolutionary processes give rise to diversity at every level of biological organisation, including species, individual organisms and molecules such as DNA and proteins.Life on Earth...

 to incorporate certain non-proteinogenic amino acids into proteins: for example, ornithine
Ornithine
Ornithine is an amino acid that plays a role in the urea cycle.-Role in urea cycle:L-Ornithine is one of the products of the action of the enzyme arginase on L-arginine, creating urea. Therefore, ornithine is a central part of the urea cycle, which allows for the disposal of excess nitrogen....

 and homoserine
Homoserine
Homoserine is an α-amino acid with the chemical formula HO2CCHCH2CH2OH. L-Homoserine is not one of the common amino acids encoded by DNA. It differs from the proteinogenic amino acid serine by insertion of an additional methylene group...

 cyclize against the peptide backbone and fragment the protein with relatively short half-lives
Biological half-life
The biological half-life or elimination half-life of a substance is the time it takes for a substance to lose half of its pharmacologic, physiologic, or radiologic activity, as per the MeSH definition...

, while others are toxic because they can be mistakenly incorporated into proteins, such as the arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

 analog canavanine
Canavanine
L---Canavanine is a non-proteinogenic α-amino acid found in certain leguminous plants. It is structurally related to the proteinogenic α-amino acid L-arginine, the sole difference being the replacement of a methylene group in arginine with an oxa group in canavanine...

.

Non-proteinogenic amino acids are found in nonribosomal peptide
Nonribosomal peptide
Nonribosomal peptides are a class of peptide secondary metabolites, usually produced by microorganisms like bacteria and fungi. Nonribosomal peptides are also found in higher organisms, such as nudibranchs, but are thought to be made by bacteria inside these organisms...

s, which are not produced by the ribosome during translation.

Structures

The following illustrates the structures and abbreviations of the 21 amino acids that are directly encoded for protein synthesis by the genetic code of eukaryotes. The structures given below are standard chemical structures, not the typical zwitterion
Zwitterion
In chemistry, a zwitterion is a neutral molecule with a positive and a negative electrical charge at different locations within that molecule. Zwitterions are sometimes also called inner salts.-Examples:...

 forms that exist in aqueous solutions.
IUPAC/IUBMB now also recommends standard abbreviations for the following two amino acids:

Non-specific abbreviations

Sometimes the specific identity of an amino acid cannot be determined unambiguously. Certain protein sequencing
Protein sequencing
Protein sequencing is a technique to determine the amino acid sequence of a protein, as well as which conformation the protein adopts and the extent to which it is complexed with any non-peptide molecules...

 techniques do not distinguish among certain pairs. Thus, the following codes are used:
  • Asx (B) is "asparagine
    Asparagine
    Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

     or aspartic acid
    Aspartic acid
    Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

    "
  • Glx (Z) is "glutamic acid
    Glutamic acid
    Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

     or glutamine
    Glutamine
    Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

    "
  • Xle (J) is "leucine
    Leucine
    Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

     or isoleucine
    Isoleucine
    Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

    "

In addition, the symbol X is used to indicate an amino acid that is completely unidentified.

Chemical properties

Following is a table listing the one-letter symbols, the three-letter symbols, and the chemical properties of the side-chains of the standard amino acids. The masses listed are based on weighted averages of the elemental isotope
Isotope
Isotopes are variants of atoms of a particular chemical element, which have differing numbers of neutrons. Atoms of a particular element by definition must contain the same number of protons but may have a distinct number of neutrons which differs from atom to atom, without changing the designation...

s at their natural abundance
Natural abundance
In chemistry, natural abundance refers to the abundance of isotopes of a chemical element as naturally found on a planet. The relative atomic mass of these isotopes is the atomic weight listed for the element in the periodic table...

s. Note that forming a peptide bond
Peptide bond
This article is about the peptide link found within biological molecules, such as proteins. A similar article for synthetic molecules is being created...

 results in elimination of a molecule of water, so the mass of an amino acid unit within a protein chain is reduced by 18.01524 Da.

General chemical properties
Amino Acid Short Abbrev. Avg. Mass (Da
Atomic mass unit
The unified atomic mass unit or dalton is a unit that is used for indicating mass on an atomic or molecular scale. It is defined as one twelfth of the rest mass of an unbound neutral atom of carbon-12 in its nuclear and electronic ground state, and has a value of...

)
pI
Isoelectric point
The isoelectric point , sometimes abbreviated to IEP, is the pH at which a particular molecule or surface carries no net electrical charge....

pK
Dissociation constant
In chemistry, biochemistry, and pharmacology, a dissociation constant is a specific type of equilibrium constant that measures the propensity of a larger object to separate reversibly into smaller components, as when a complex falls apart into its component molecules, or when a salt splits up into...

1
(α-COOH)
pK2
(α-+NH3)
Alanine
Alanine
Alanine is an α-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 amino acids encoded by the genetic code. Its codons are GCU, GCC, GCA, and GCG. It is classified as a nonpolar amino acid...

A Ala 89.09404 6.01 2.35 9.87
Cysteine
Cysteine
Cysteine is an α-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that it is biosynthesized in humans. Its codons are UGU and UGC. The side chain on cysteine is thiol, which is polar and thus cysteine is usually classified as a hydrophilic amino acid...

C Cys 121.15404 5.05 1.92 10.70
Aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

D Asp 133.10384 2.85 1.99 9.90
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

E Glu 147.13074 3.15 2.10 9.47
Phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

F Phe 165.19184 5.49 2.20 9.31
Glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...

G Gly 75.06714 6.06 2.35 9.78
Histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

H His 155.15634 7.60 1.80 9.33
Isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

I Ile 131.17464 6.05 2.32 9.76
Lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

K Lys 146.18934 9.60 2.16 9.06
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

L Leu 131.17464 6.01 2.33 9.74
Methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

M Met 149.20784 5.74 2.13 9.28
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

N Asn 132.11904 5.41 2.14 8.72
Pyrrolysine
Pyrrolysine
Pyrrolysine is a naturally occurring, genetically coded amino acid used by some methanogenic archaea and one known bacterium in enzymes that are part of their methane-producing metabolism. It is similar to lysine, but with an added pyrroline ring linked to the end of the lysine side chain...

O Pyl
Proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...

P Pro 115.13194 6.30 1.95 10.64
Glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

Q Gln 146.14594 5.65 2.17 9.13
Arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

R Arg 174.20274 10.76 1.82 8.99
Serine
Serine
Serine is an amino acid with the formula HO2CCHCH2OH. It is one of the proteinogenic amino acids. By virtue of the hydroxyl group, serine is classified as a polar amino acid.-Occurrence and biosynthesis:...

S Ser 105.09344 5.68 2.19 9.21
Threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

T Thr 119.12034 5.60 2.09 9.10
Selenocysteine
Selenocysteine
Selenocysteine is an amino acid that is present in several enzymes .-Nomenclature:...

U Sec 168.053 5.47
Valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

V Val 117.14784 6.00 2.39 9.74
Tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

W Trp 204.22844 5.89 2.46 9.41
Tyrosine
Tyrosine
Tyrosine or 4-hydroxyphenylalanine, is one of the 22 amino acids that are used by cells to synthesize proteins. Its codons are UAC and UAU. It is a non-essential amino acid with a polar side group...

Y Tyr 181.19124 5.64 2.20 9.21

Side chain properties

Amino Acid Short Abbrev. Side chain Hydro-
phobic
pKa
Acid dissociation constant
An acid dissociation constant, Ka, is a quantitative measure of the strength of an acid in solution. It is the equilibrium constant for a chemical reaction known as dissociation in the context of acid-base reactions...

Polar
Chemical polarity
In chemistry, polarity refers to a separation of electric charge leading to a molecule or its chemical groups having an electric dipole or multipole moment. Polar molecules interact through dipole–dipole intermolecular forces and hydrogen bonds. Molecular polarity is dependent on the difference in...

pH Small Tiny Aromatic
Aromaticity
In organic chemistry, Aromaticity is a chemical property in which a conjugated ring of unsaturated bonds, lone pairs, or empty orbitals exhibit a stabilization stronger than would be expected by the stabilization of conjugation alone. The earliest use of the term was in an article by August...


or Aliphatic
van der Waals
volume
Van der Waals radius
The van der Waals radius, r, of an atom is the radius of an imaginary hard sphere which can be used to model the atom for many purposes. It is named after Johannes Diderik van der Waals, winner of the 1910 Nobel Prize in Physics, as he was the first to recognise that atoms had a finite size and to...

Alanine
Alanine
Alanine is an α-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 amino acids encoded by the genetic code. Its codons are GCU, GCC, GCA, and GCG. It is classified as a nonpolar amino acid...

A Ala -CH3 X - - - X X - 67
Cysteine
Cysteine
Cysteine is an α-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that it is biosynthesized in humans. Its codons are UGU and UGC. The side chain on cysteine is thiol, which is polar and thus cysteine is usually classified as a hydrophilic amino acid...

C Cys -CH2SH
Thiol
In organic chemistry, a thiol is an organosulfur compound that contains a carbon-bonded sulfhydryl group...

X 8.18 - acidic X - - 86
Aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

D Asp -CH2COOH - 3.90 X acidic X - - 91
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

E Glu -CH2CH2COOH - 4.07 X acidic - - - 109
Phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

F Phe -CH2C6H5 X - - - - - Aromatic 135
Glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...

G Gly -H X - - - X X - 48
Histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

H His -CH2-C3H3N2
Imidazole
Imidazole is an organic compound with the formula C3H4N2. This aromatic heterocyclic is a diazole and is classified as an alkaloid. Imidazole refers to the parent compound, whereas imidazoles are a class of heterocycles with similar ring structure, but varying substituents...

- 6.04 X weak basic - - Aromatic 118
Isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

I Ile -CH(CH3)CH2CH3 X - - - - - Aliphatic 124
Lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

K Lys -(CH2)4NH2 - 10.54 X basic - - - 135
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

L Leu -CH2CH(CH3)2 X - - - - - Aliphatic 124
Methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

M Met -CH2CH2S
Sulfur
Sulfur or sulphur is the chemical element with atomic number 16. In the periodic table it is represented by the symbol S. It is an abundant, multivalent non-metal. Under normal conditions, sulfur atoms form cyclic octatomic molecules with chemical formula S8. Elemental sulfur is a bright yellow...

CH3
X - - - - - - 124
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

N Asn -CH2CONH2 - - X - X - - 96
Pyrrolysine
Pyrrolysine
Pyrrolysine is a naturally occurring, genetically coded amino acid used by some methanogenic archaea and one known bacterium in enzymes that are part of their methane-producing metabolism. It is similar to lysine, but with an added pyrroline ring linked to the end of the lysine side chain...

O Pyl
Proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...

P Pro -CH2CH2CH2- X - - - X - - 90
Glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

Q Gln -CH2CH2CONH2 - - X - - - - 114
Arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

R Arg -(CH2)3NH-C(NH)NH2 - 12.48 X strongly basic - - - 148
Serine
Serine
Serine is an amino acid with the formula HO2CCHCH2OH. It is one of the proteinogenic amino acids. By virtue of the hydroxyl group, serine is classified as a polar amino acid.-Occurrence and biosynthesis:...

S Ser -CH2OH - - X - X X - 73
Threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

T Thr -CH(OH)CH3 - - X weak acidic X - - 93
Selenocysteine
Selenocysteine
Selenocysteine is an amino acid that is present in several enzymes .-Nomenclature:...

U Sec -CH2SeH
Selenol
Selenols are organic compounds that contain the functional group with the connectivity C-Se-H. Selenols are sometimes also called selenamercaptans, selenathiols, and selenothiols. Selenols are one of the principal classes of organoselenium compounds...

X 5.73 - - X - -
Valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

V Val -CH(CH3)2 X - - - X - Aliphatic 105
Tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

W Trp -CH2C8H6N
Indole
Indole is an aromatic heterocyclic organic compound. It has a bicyclic structure, consisting of a six-membered benzene ring fused to a five-membered nitrogen-containing pyrrole ring. Indole is a popular component of fragrances and the precursor to many pharmaceuticals. Compounds that contain an...

X - - - - - Aromatic 163
Tyrosine
Tyrosine
Tyrosine or 4-hydroxyphenylalanine, is one of the 22 amino acids that are used by cells to synthesize proteins. Its codons are UAC and UAU. It is a non-essential amino acid with a polar side group...

Y Tyr -CH2-C6H4OH - 10.46 X - - - Aromatic 141


Note: The pKa values of amino acids are typically slightly different when the amino acid is inside a protein. Protein pKa calculations
Protein pKa calculations
In computational biology, protein pKa calculations are used to estimate the pKa values of amino acids as they exist within proteins. These calculations complement the pKa values reported for amino acids in their free state, and are used frequently within the fields of molecular modeling, structural...

 are sometimes used to calculate the change in the pKa value of an amino acid in this situation.

Gene expression and biochemistry

Amino Acid Short Abbrev. Codon(s) Occurrence
in human proteins
(%)
Essential‡ in humans
Alanine
Alanine
Alanine is an α-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 amino acids encoded by the genetic code. Its codons are GCU, GCC, GCA, and GCG. It is classified as a nonpolar amino acid...

A Ala GCU, GCC, GCA, GCG 7.8 -
Cysteine
Cysteine
Cysteine is an α-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that it is biosynthesized in humans. Its codons are UGU and UGC. The side chain on cysteine is thiol, which is polar and thus cysteine is usually classified as a hydrophilic amino acid...

C Cys UGU, UGC 1.9 Conditionally
Aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

D Asp GAU, GAC 5.3 -
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

E Glu GAA, GAG 6.3 Conditionally
Phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

F Phe UUU, UUC 3.9 Yes
Glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...

G Gly GGU, GGC, GGA, GGG 7.2 Conditionally
Histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

H His CAU, CAC 2.3 Yes
Isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

I Ile AUU, AUC, AUA 5.3 Yes
Lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

K Lys AAA, AAG 5.9 Yes
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

L Leu UUA, UUG, CUU, CUC, CUA, CUG 9.1 Yes
Methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

M Met AUG 2.3 Yes
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

N Asn AAU, AAC 4.3 -
Pyrrolysine
Pyrrolysine
Pyrrolysine is a naturally occurring, genetically coded amino acid used by some methanogenic archaea and one known bacterium in enzymes that are part of their methane-producing metabolism. It is similar to lysine, but with an added pyrroline ring linked to the end of the lysine side chain...

O Pyl UAG* -
Proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...

P Pro CCU, CCC, CCA, CCG 5.2 -
Glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

Q Gln CAA, CAG 4.2 -
Arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

R Arg CGU, CGC, CGA, CGG, AGA, AGG 5.1 Conditionally
Serine
Serine
Serine is an amino acid with the formula HO2CCHCH2OH. It is one of the proteinogenic amino acids. By virtue of the hydroxyl group, serine is classified as a polar amino acid.-Occurrence and biosynthesis:...

S Ser UCU, UCC, UCA, UCG, AGU, AGC 6.8 -
Threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

T Thr ACU, ACC, ACA, ACG 5.9 Yes
Selenocysteine
Selenocysteine
Selenocysteine is an amino acid that is present in several enzymes .-Nomenclature:...

U Sec UGA** -
Valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

V Val GUU, GUC, GUA, GUG 6.6 Yes
Tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

W Trp UGG 1.4 Yes
Tyrosine
Tyrosine
Tyrosine or 4-hydroxyphenylalanine, is one of the 22 amino acids that are used by cells to synthesize proteins. Its codons are UAC and UAU. It is a non-essential amino acid with a polar side group...

Y Tyr UAU, UAC 3.2 Conditionally
Stop codon† - Term UAA, UAG, UGA - -

* UAG is normally the amber stop codon
Stop codon
In the genetic code, a stop codon is a nucleotide triplet within messenger RNA that signals a termination of translation. Proteins are based on polypeptides, which are unique sequences of amino acids. Most codons in messenger RNA correspond to the addition of an amino acid to a growing polypeptide...

, but encodes pyrrolysine if a PYLIS element
PYLIS downstream sequence
In biology, the PYLIS downstream sequence is a stem-loop structure which appears on some mRNA sequences. This structural motif causes the UAG stop codon to be translated to the amino acid pyrrolysine instead of ending the protein translation...

 is present.

** UGA is normally the opal (or umber) stop codon, but encodes selenocysteine if a SECIS element
SECIS element
In biology, the SECIS element is an RNA element around 60 nucleotides in length that adopts a stem-loop structure. This structural motif directs the cell to translate UGA codons as selenocysteines...

 is present.

The stop codon
Stop codon
In the genetic code, a stop codon is a nucleotide triplet within messenger RNA that signals a termination of translation. Proteins are based on polypeptides, which are unique sequences of amino acids. Most codons in messenger RNA correspond to the addition of an amino acid to a growing polypeptide...

 is not an amino acid, but is included for completeness.

An essential amino acid
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 cannot be synthesized in humans and must, therefore, be supplied in the diet. Conditionally essential amino acids are not normally required in the diet, but must be supplied exogenously to specific populations that do not synthesize it in adequate amounts.

Mass spectrometry

In mass spectrometry
Mass spectrometry
Mass spectrometry is an analytical technique that measures the mass-to-charge ratio of charged particles.It is used for determining masses of particles, for determining the elemental composition of a sample or molecule, and for elucidating the chemical structures of molecules, such as peptides and...

 of peptides and proteins, it is useful to know the masses of the residues. The mass of the peptide or protein is the sum of the residue masses plus the mass of water.
Amino Acid Short Abbrev. Formula Mon. Mass§ (Da
Atomic mass unit
The unified atomic mass unit or dalton is a unit that is used for indicating mass on an atomic or molecular scale. It is defined as one twelfth of the rest mass of an unbound neutral atom of carbon-12 in its nuclear and electronic ground state, and has a value of...

)
Avg. Mass (Da
Atomic mass unit
The unified atomic mass unit or dalton is a unit that is used for indicating mass on an atomic or molecular scale. It is defined as one twelfth of the rest mass of an unbound neutral atom of carbon-12 in its nuclear and electronic ground state, and has a value of...

)
Alanine
Alanine
Alanine is an α-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 amino acids encoded by the genetic code. Its codons are GCU, GCC, GCA, and GCG. It is classified as a nonpolar amino acid...

A Ala C3H5NO 71.03711 71.0788
Cysteine
Cysteine
Cysteine is an α-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that it is biosynthesized in humans. Its codons are UGU and UGC. The side chain on cysteine is thiol, which is polar and thus cysteine is usually classified as a hydrophilic amino acid...

C Cys C3H5NOS 103.00919 103.1388
Aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

D Asp C4H5NO3 115.02694 115.0886
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

E Glu C5H7NO3 129.04259 129.1155
Phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

F Phe C9H9NO 147.06841 147.1766
Glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...

G Gly C2H3NO 57.02146 57.0519
Histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

H His C6H7N3O 137.05891 137.1411
Isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

I Ile C6H11NO 113.08406 113.1594
Lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

K Lys C6H12N2O 128.09496 128.1741
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

L Leu C6H11NO 113.08406 113.1594
Methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

M Met C5H9NOS 131.04049 131.1986
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

N Asn C4H6N2O2 114.04293 114.1039
Pyrrolysine
Pyrrolysine
Pyrrolysine is a naturally occurring, genetically coded amino acid used by some methanogenic archaea and one known bacterium in enzymes that are part of their methane-producing metabolism. It is similar to lysine, but with an added pyrroline ring linked to the end of the lysine side chain...

O Pyl C12H21N3O3 255.15829 255.3172
Proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...

P Pro C5H7NO 97.05276 97.1167
Glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

Q Gln C5H8N2O2 128.05858 128.1307
Arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

R Arg C6H12N4O 156.10111 156.1875
Serine
Serine
Serine is an amino acid with the formula HO2CCHCH2OH. It is one of the proteinogenic amino acids. By virtue of the hydroxyl group, serine is classified as a polar amino acid.-Occurrence and biosynthesis:...

S Ser C3H5NO2 87.03203 87.0782
Threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

T Thr C4H7NO2 101.04768 101.1051
Selenocysteine
Selenocysteine
Selenocysteine is an amino acid that is present in several enzymes .-Nomenclature:...

U Sec C3H5NOSe 150.95364 150.0388
Valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

V Val C5H9NO 99.06841 99.1326
Tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

W Trp C11H10N2O 186.07931 186.2132
Tyrosine
Tyrosine
Tyrosine or 4-hydroxyphenylalanine, is one of the 22 amino acids that are used by cells to synthesize proteins. Its codons are UAC and UAU. It is a non-essential amino acid with a polar side group...

Y Tyr C9H9NO2 163.06333 163.1760

§ Monoisotopic mass
Monoisotopic mass
The monoisotopic mass is the sum of the masses of the atoms in a molecule using the unbound, ground-state, rest mass of the principal isotope for each element instead of the isotopic average mass. For typical organic compounds, where the monoisotopic mass is most commonly used, this also results...


Stoichiometry and metabolic cost in cell

Following table lists the abundance of amino acids in E.coli cell and the metabolic cost (ATP) for synthesis the amino acids. Negative numbers indicate the metabolic processes are energy favorable and do not cost net ATP of the cell. Note that the abundance of amino acids include amino acids in free-form and in polymerization form (proteins).
Amino acid Abundance
(# of molecules (×108)
per E. coli
Escherichia coli
Escherichia coli is a Gram-negative, rod-shaped bacterium that is commonly found in the lower intestine of warm-blooded organisms . Most E. coli strains are harmless, but some serotypes can cause serious food poisoning in humans, and are occasionally responsible for product recalls...

cell)
ATP cost in synthesis
under aerobic
condition
ATP cost in synthesis
under anaerobic
condition
Alanine
Alanine
Alanine is an α-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 amino acids encoded by the genetic code. Its codons are GCU, GCC, GCA, and GCG. It is classified as a nonpolar amino acid...

2.9 -1 1
Cysteine
Cysteine
Cysteine is an α-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that it is biosynthesized in humans. Its codons are UGU and UGC. The side chain on cysteine is thiol, which is polar and thus cysteine is usually classified as a hydrophilic amino acid...

0.52 11 15
Aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

1.4 0 2
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

1.5 -7 -1
Phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

1.1 -6 2
Glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...

3.5 -2 2
Histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

0.54 1 7
Isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

1.7 7 11
Lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

2.0 5 9
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

2.6 -9 1
Methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

0.88 21 23
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

1.4 3 5
Proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...

1.3 -2 4
Glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

1.5 -6 0
Arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

1.7 5 13
Serine
Serine
Serine is an amino acid with the formula HO2CCHCH2OH. It is one of the proteinogenic amino acids. By virtue of the hydroxyl group, serine is classified as a polar amino acid.-Occurrence and biosynthesis:...

1.2 -2 2
Threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

1.5 6 8
Tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

0.33 -7 7
Tyrosine
Tyrosine
Tyrosine or 4-hydroxyphenylalanine, is one of the 22 amino acids that are used by cells to synthesize proteins. Its codons are UAC and UAU. It is a non-essential amino acid with a polar side group...

0.79 -8 2
Valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

2.4 -2 2

Remarks

Amino Acid Abbrev. Remarks
Alanine
Alanine
Alanine is an α-amino acid with the chemical formula CH3CHCOOH. The L-isomer is one of the 20 amino acids encoded by the genetic code. Its codons are GCU, GCC, GCA, and GCG. It is classified as a nonpolar amino acid...

A Ala Very abundant, very versatile. More stiff than glycine, but small enough to pose only small steric limits for the protein conformation. It behaves fairly neutrally, and can be located in both hydrophilic regions on the protein outside and the hydrophobic areas inside.
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

 or aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

B Asx A placeholder when either amino acid may occupy a position.
Cysteine
Cysteine
Cysteine is an α-amino acid with the chemical formula HO2CCHCH2SH. It is a non-essential amino acid, which means that it is biosynthesized in humans. Its codons are UGU and UGC. The side chain on cysteine is thiol, which is polar and thus cysteine is usually classified as a hydrophilic amino acid...

C Cys The sulfur atom bonds readily to heavy metal
Heavy metals
A heavy metal is a member of a loosely-defined subset of elements that exhibit metallic properties. It mainly includes the transition metals, some metalloids, lanthanides, and actinides. Many different definitions have been proposed—some based on density, some on atomic number or atomic weight,...

 ions. Under oxidizing conditions, two cysteines can join together in a disulfide bond
Disulfide bond
In chemistry, a disulfide bond is a covalent bond, usually derived by the coupling of two thiol groups. The linkage is also called an SS-bond or disulfide bridge. The overall connectivity is therefore R-S-S-R. The terminology is widely used in biochemistry...

 to form the amino acid cystine
Cystine
Cystine is a dimeric amino acid formed by the oxidation of two cysteine residues that covalently link to make a disulfide bond. This organosulfur compound has the formula 2. It is a white solid, and melts at 247-249 °C...

. When cystines are part of a protein, insulin
Insulin
Insulin is a hormone central to regulating carbohydrate and fat metabolism in the body. Insulin causes cells in the liver, muscle, and fat tissue to take up glucose from the blood, storing it as glycogen in the liver and muscle....

 for example, the tertiary structure
Tertiary structure
In biochemistry and molecular biology, the tertiary structure of a protein or any other macromolecule is its three-dimensional structure, as defined by the atomic coordinates.-Relationship to primary structure:...

 is stabilized, which makes the protein more resistant to denaturation
Denaturation (biochemistry)
Denaturation is a process in which proteins or nucleic acids lose their tertiary structure and secondary structure by application of some external stress or compound, such as a strong acid or base, a concentrated inorganic salt, an organic solvent , or heat...

; therefore, disulfide bonds are common in proteins that have to function in harsh environments including digestive enzymes (e.g., pepsin
Pepsin
Pepsin is an enzyme whose precursor form is released by the chief cells in the stomach and that degrades food proteins into peptides. It was discovered in 1836 by Theodor Schwann who also coined its name from the Greek word pepsis, meaning digestion...

 and chymotrypsin
Chymotrypsin
Chymotrypsin is a digestive enzyme that can perform proteolysis. Chymotrypsin preferentially cleaves peptide amide bonds where the carboxyl side of the amide bond is a tyrosine, tryptophan, or phenylalanine. These amino acids contain an aromatic ring in their sidechain that fits into a...

) and structural proteins (e.g., keratin
Keratin
Keratin refers to a family of fibrous structural proteins. Keratin is the key of structural material making up the outer layer of human skin. It is also the key structural component of hair and nails...

). Disulfides are also found in peptides too small to hold a stable shape on their own (eg. insulin
Insulin
Insulin is a hormone central to regulating carbohydrate and fat metabolism in the body. Insulin causes cells in the liver, muscle, and fat tissue to take up glucose from the blood, storing it as glycogen in the liver and muscle....

).
Aspartic acid
Aspartic acid
Aspartic acid is an α-amino acid with the chemical formula HOOCCHCH2COOH. The carboxylate anion, salt, or ester of aspartic acid is known as aspartate. The L-isomer of aspartate is one of the 20 proteinogenic amino acids, i.e., the building blocks of proteins...

D Asp Behaves similarly to glutamic acid. Carries a hydrophilic acidic group with strong negative charge. Usually is located on the outer surface of the protein, making it water-soluble. Binds to positively-charged molecules and ions, often used in enzymes to fix the metal ion. When located inside of the protein, aspartate and glutamate are usually paired with arginine and lysine.
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

E Glu Behaves similar to aspartic acid. Has longer, slightly more flexible side chain.
Phenylalanine
Phenylalanine
Phenylalanine is an α-amino acid with the formula C6H5CH2CHCOOH. This essential amino acid is classified as nonpolar because of the hydrophobic nature of the benzyl side chain. L-Phenylalanine is an electrically neutral amino acid, one of the twenty common amino acids used to biochemically form...

F Phe Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Phenylalanine, tyrosine, and tryptophan contain large rigid aromatic
Aromaticity
In organic chemistry, Aromaticity is a chemical property in which a conjugated ring of unsaturated bonds, lone pairs, or empty orbitals exhibit a stabilization stronger than would be expected by the stabilization of conjugation alone. The earliest use of the term was in an article by August...

 group on the side-chain. These are the biggest amino acids. Like isoleucine, leucine and valine, these are hydrophobic and tend to orient towards the interior of the folded protein molecule. Phenylalanine can be converted into Tyrosine.
Glycine
Glycine
Glycine is an organic compound with the formula NH2CH2COOH. Having a hydrogen substituent as its 'side chain', glycine is the smallest of the 20 amino acids commonly found in proteins. Its codons are GGU, GGC, GGA, GGG cf. the genetic code.Glycine is a colourless, sweet-tasting crystalline solid...

G Gly Because of the two hydrogen atoms at the α carbon, glycine is not optically active. It is the smallest amino acid, rotates easily, adds flexibility to the protein chain. It is able to fit into the tightest spaces, e.g., the triple helix of collagen
Collagen
Collagen is a group of naturally occurring proteins found in animals, especially in the flesh and connective tissues of mammals. It is the main component of connective tissue, and is the most abundant protein in mammals, making up about 25% to 35% of the whole-body protein content...

. As too much flexibility is usually not desired, as a structural component it is less common than alanine.
Histidine
Histidine
Histidine Histidine, an essential amino acid, has a positively charged imidazole functional group. It is one of the 22 proteinogenic amino acids. Its codons are CAU and CAC. Histidine was first isolated by German physician Albrecht Kossel in 1896. Histidine is an essential amino acid in humans...

H His In even slightly acidic conditions protonation
Protonation
In chemistry, protonation is the addition of a proton to an atom, molecule, or ion. Some classic examples include*the protonation of water by sulfuric acid:*the protonation of isobutene in the formation of a carbocation:2C=CH2 + HBF4 → 3C+ + BF4−*the protonation of ammonia in the...

 of the nitrogen occurs, changing the properties of histidine and the polypeptide as a whole. It is used by many proteins as a regulatory mechanism, changing the conformation and behavior of the polypeptide in acidic regions such as the late endosome
Endosome
In biology, an endosome is a membrane-bound compartment inside eukaryotic cells. It is a compartment of the endocytic membrane transport pathway from the plasma membrane to the lysosome. Molecules internalized from the plasma membrane can follow this pathway all the way to lysosomes for...

 or lysosome
Lysosome
thumb|350px|Schematic of typical animal cell, showing subcellular components. [[Organelle]]s: [[nucleoli]] [[cell nucleus|nucleus]] [[ribosomes]] [[vesicle |vesicle]] rough [[endoplasmic reticulum]]...

, enforcing conformation change in enzymes. However only a few histidines are needed for this, so it is comparatively scarce.
Isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

I Ile Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Isoleucine, leucine and valine have large aliphatic hydrophobic side chains. Their molecules are rigid, and their mutual hydrophobic interactions are important for the correct folding of proteins, as these chains tend to be located inside of the protein molecule.
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

 or isoleucine
Isoleucine
Isoleucine is an α-amino acid with the chemical formula HO2CCHCHCH2CH3. It is an essential amino acid, which means that humans cannot synthesize it, so it must be ingested. Its codons are AUU, AUC and AUA....

J Xle A placeholder when either amino acid may occupy a position
Lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

K Lys Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Behaves similarly to arginine. Contains a long flexible side-chain with a positively-charged end. The flexibility of the chain makes lysine and arginine suitable for binding to molecules with many negative charges on their surfaces. E.g., DNA-binding proteins have their active regions rich with arginine and lysine. The strong charge makes these two amino acids prone to be located on the outer hydrophilic surfaces of the proteins; when they are found inside, they are usually paired with a corresponding negatively-charged amino acid, e.g., aspartate or glutamate.
Leucine
Leucine
Leucine is a branched-chain α-amino acid with the chemical formula HO2CCHCH2CH2. Leucine is classified as a hydrophobic amino acid due to its aliphatic isobutyl side chain. It is encoded by six codons and is a major component of the subunits in ferritin, astacin and other 'buffer' proteins...

L Leu Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Behaves similar to isoleucine and valine. See isoleucine.
Methionine
Methionine
Methionine is an α-amino acid with the chemical formula HO2CCHCH2CH2SCH3. This essential amino acid is classified as nonpolar. This amino-acid is coded by the codon AUG, also known as the initiation codon, since it indicates mRNA's coding region where translation into protein...

M Met Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Always the first amino acid to be incorporated into a protein; sometimes removed after translation. Like cysteine, contains sulfur, but with a methyl group instead of hydrogen. This methyl group can be activated, and is used in many reactions where a new carbon atom is being added to another molecule.
Asparagine
Asparagine
Asparagine is one of the 20 most common natural amino acids on Earth. It has carboxamide as the side-chain's functional group. It is not an essential amino acid...

N Asn Similar to aspartic acid. Asn contains an amide
Amide
In chemistry, an amide is an organic compound that contains the functional group consisting of a carbonyl group linked to a nitrogen atom . The term refers both to a class of compounds and a functional group within those compounds. The term amide also refers to deprotonated form of ammonia or an...

 group where Asp has a carboxyl.
Pyrrolysine
Pyrrolysine
Pyrrolysine is a naturally occurring, genetically coded amino acid used by some methanogenic archaea and one known bacterium in enzymes that are part of their methane-producing metabolism. It is similar to lysine, but with an added pyrroline ring linked to the end of the lysine side chain...

O Pyl Similar to lysine
Lysine
Lysine is an α-amino acid with the chemical formula HO2CCH4NH2. It is an essential amino acid, which means that the human body cannot synthesize it. Its codons are AAA and AAG....

, with a pyrroline
Pyrroline
Pyrrolines, also known under the name dihydropyrroles, are three different heterocyclic organic chemical compounds that differ in the position of the double bond. Pyrrolines are formally derived from the aromate pyrrole by hydrogenation...

 ring attached.
Proline
Proline
Proline is an α-amino acid, one of the twenty DNA-encoded amino acids. Its codons are CCU, CCC, CCA, and CCG. It is not an essential amino acid, which means that the human body can synthesize it. It is unique among the 20 protein-forming amino acids in that the α-amino group is secondary...

P Pro Contains an unusual ring to the N-end amine group, which forces the CO-NH amide sequence into a fixed conformation. Can disrupt protein folding structures like α helix
Alpha helix
A common motif in the secondary structure of proteins, the alpha helix is a right-handed coiled or spiral conformation, in which every backbone N-H group donates a hydrogen bond to the backbone C=O group of the amino acid four residues earlier...

 or β sheet
Beta sheet
The β sheet is the second form of regular secondary structure in proteins, only somewhat less common than the alpha helix. Beta sheets consist of beta strands connected laterally by at least two or three backbone hydrogen bonds, forming a generally twisted, pleated sheet...

, forcing the desired kink in the protein chain. Common in collagen
Collagen
Collagen is a group of naturally occurring proteins found in animals, especially in the flesh and connective tissues of mammals. It is the main component of connective tissue, and is the most abundant protein in mammals, making up about 25% to 35% of the whole-body protein content...

, where it often undergoes a posttranslational modification
Posttranslational modification
Posttranslational modification is the chemical modification of a protein after its translation. It is one of the later steps in protein biosynthesis, and thus gene expression, for many proteins....

 to hydroxyproline
Hydroxyproline
-4-Hydroxyproline, or L-hydroxyproline , is a common non-proteinogenic amino acid, abbreviated as HYP, e.g., in Protein Data Bank.-Structure and discovery:...

.
Glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

Q Gln Similar to glutamic acid. Gln contains an amide
Amide
In chemistry, an amide is an organic compound that contains the functional group consisting of a carbonyl group linked to a nitrogen atom . The term refers both to a class of compounds and a functional group within those compounds. The term amide also refers to deprotonated form of ammonia or an...

 group where Glu has a carboxyl. Used in proteins and as a storage for ammonia
Ammonia
Ammonia is a compound of nitrogen and hydrogen with the formula . It is a colourless gas with a characteristic pungent odour. Ammonia contributes significantly to the nutritional needs of terrestrial organisms by serving as a precursor to food and fertilizers. Ammonia, either directly or...

. The most abundant Amino Acid in the body.
Arginine
Arginine
Arginine is an α-amino acid. The L-form is one of the 20 most common natural amino acids. At the level of molecular genetics, in the structure of the messenger ribonucleic acid mRNA, CGU, CGC, CGA, CGG, AGA, and AGG, are the triplets of nucleotide bases or codons that codify for arginine during...

R Arg Functionally similar to lysine.
Serine
Serine
Serine is an amino acid with the formula HO2CCHCH2OH. It is one of the proteinogenic amino acids. By virtue of the hydroxyl group, serine is classified as a polar amino acid.-Occurrence and biosynthesis:...

S Ser Serine and threonine have a short group ended with a hydroxyl
Hydroxyl
A hydroxyl is a chemical group containing an oxygen atom covalently bonded with a hydrogen atom. In inorganic chemistry, the hydroxyl group is known as the hydroxide ion, and scientists and reference works generally use these different terms though they refer to the same chemical structure in...

 group. Its hydrogen is easy to remove, so serine and threonine often act as hydrogen donors in enzymes. Both are very hydrophilic, therefore the outer regions of soluble proteins tend to be rich with them.
Threonine
Threonine
Threonine is an α-amino acid with the chemical formula HO2CCHCHCH3. Its codons are ACU, ACA, ACC, and ACG. This essential amino acid is classified as polar...

T Thr Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Behaves similarly to serine.
Selenocysteine
Selenocysteine
Selenocysteine is an amino acid that is present in several enzymes .-Nomenclature:...

U Sec Selenated
Selenium
Selenium is a chemical element with atomic number 34, chemical symbol Se, and an atomic mass of 78.96. It is a nonmetal, whose properties are intermediate between those of adjacent chalcogen elements sulfur and tellurium...

 form of cysteine, which replaces sulfur
Sulfur
Sulfur or sulphur is the chemical element with atomic number 16. In the periodic table it is represented by the symbol S. It is an abundant, multivalent non-metal. Under normal conditions, sulfur atoms form cyclic octatomic molecules with chemical formula S8. Elemental sulfur is a bright yellow...

.
Valine
Valine
Valine is an α-amino acid with the chemical formula HO2CCHCH2. L-Valine is one of 20 proteinogenic amino acids. Its codons are GUU, GUC, GUA, and GUG. This essential amino acid is classified as nonpolar...

V Val Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Behaves similarly to isoleucine and leucine. See isoleucine.
Tryptophan
Tryptophan
Tryptophan is one of the 20 standard amino acids, as well as an essential amino acid in the human diet. It is encoded in the standard genetic code as the codon UGG...

W Trp Essential
Essential amino acid
An essential amino acid or indispensable amino acid is an amino acid that cannot be synthesized de novo by the organism , and therefore must be supplied in the diet.-Essentiality vs. conditional essentiality in humans:...

 for humans. Behaves similarly to phenylalanine and tyrosine (see phenylalanine). Precursor of serotonin
Serotonin
Serotonin or 5-hydroxytryptamine is a monoamine neurotransmitter. Biochemically derived from tryptophan, serotonin is primarily found in the gastrointestinal tract, platelets, and in the central nervous system of animals including humans...

. Naturally fluorescent.
Unknown X Xaa Placeholder when the amino acid is unknown or unimportant.
Tyrosine
Tyrosine
Tyrosine or 4-hydroxyphenylalanine, is one of the 22 amino acids that are used by cells to synthesize proteins. Its codons are UAC and UAU. It is a non-essential amino acid with a polar side group...

Y Tyr Behaves similarly to phenylalanine (precursor to Tyrosine) and tryptophan (see phenylalanine). Precursor of melanin
Melanin
Melanin is a pigment that is ubiquitous in nature, being found in most organisms . In animals melanin pigments are derivatives of the amino acid tyrosine. The most common form of biological melanin is eumelanin, a brown-black polymer of dihydroxyindole carboxylic acids, and their reduced forms...

, epinephrine
Epinephrine
Epinephrine is a hormone and a neurotransmitter. It increases heart rate, constricts blood vessels, dilates air passages and participates in the fight-or-flight response of the sympathetic nervous system. In chemical terms, adrenaline is one of a group of monoamines called the catecholamines...

, and thyroid hormone
Thyroid hormone
The thyroid hormones, thyroxine and triiodothyronine , are tyrosine-based hormones produced by the thyroid gland primarily responsible for regulation of metabolism. An important component in the synthesis of thyroid hormones is iodine. The major form of thyroid hormone in the blood is thyroxine ,...

s. Naturally fluorescent, although fluorescence is usually quenched by energy transfer to tryptophans.
Glutamic acid
Glutamic acid
Glutamic acid is one of the 20 proteinogenic amino acids, and its codons are GAA and GAG. It is a non-essential amino acid. The carboxylate anions and salts of glutamic acid are known as glutamates...

 or glutamine
Glutamine
Glutamine is one of the 20 amino acids encoded by the standard genetic code. It is not recognized as an essential amino acid but may become conditionally essential in certain situations, including intensive athletic training or certain gastrointestinal disorders...

Z Glx A placeholder when either amino acid may occupy a position.

Catabolism

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK