The 20 canonical amino acids can be classified according to their properties. Important factors are charge,
hydrophilicity or
hydrophobicity, size, and functional groups. There are various
hydrophobicity scales of amino acid residues. Some amino acids have special properties. Cysteine can form covalent
disulfide bonds to other cysteine residues.
Proline forms
a cycle to the polypeptide backbone, and glycine is more flexible than other amino acids. Glycine and proline are strongly present within low complexity regions of both eukaryotic and prokaryotic proteins, whereas the opposite is the case with cysteine, phenylalanine, tryptophan, methionine, valine, leucine, isoleucine, which are highly reactive, or complex, or hydrophobic. Many proteins undergo a range of
posttranslational modifications, whereby additional chemical groups are attached to the amino acid residue side chains sometimes producing
lipoproteins (that are hydrophobic), or
glycoproteins (that are hydrophilic) allowing the protein to attach temporarily to a membrane. For example, a signaling protein can attach and then detach from a cell membrane, because it contains cysteine residues that can have the fatty acid
palmitic acid added to them and subsequently removed.
Acid-base behavior Zwitterions The common natural forms of amino acids have a
zwitterionic structure, with ( in the case of proline) and functional groups attached to the same C atom, and are thus α-amino acids, and are the only ones found in proteins during translation in the ribosome. In aqueous solution at pH close to neutrality, amino acids are energetically favored in their
zwitterionic form, with a deprotonated group and a protonated group, because the high
dielectric constant of water and its hydrogen-bonding network effectively stabilize separated charges. Thus, the overall structure is , and the so-called "neutral forms" are not present to any measurable degree at
physiological pH. A zwitterion has a net charge of zero, but because it contains both positively and negatively charged sites, it is misleading to describe it as "uncharged." In contrast, in low-dielectric hydrophobic environments such as
organic solvents or
cell membrane interiors, charge separation is poorly stabilized and proton transfer tends to yield a neutral form, while in the
gas phase, where there is essentially no dielectric screening or solvation, spectroscopic and computational studies show that the lowest-energy structures of most amino acids are also neutral unless specific
intramolecular interactions or stepwise
hydration provide sufficient stabilization of the zwitterion. In strongly acidic conditions (pH below 3), the carboxylate group becomes protonated and the structure becomes an ammonio carboxylic acid, . In highly basic conditions (pH greater than 10, not normally seen in physiological conditions), the ammonio group is deprotonated to give . Although various definitions of acids and bases are used in chemistry, the only one that is useful for chemistry in aqueous solution is
that of Brønsted:
Isoelectric point s of twenty proteinogenic amino acids grouped by side chain category For amino acids with uncharged side-chains the zwitterion predominates at pH values between the two p
Ka values, but coexists in
equilibrium with small amounts of net negative and net positive ions. At the midpoint between the two p
Ka values, the trace amount of net negative and trace of net positive ions balance, so that average net charge of all forms present is zero. This pH is known as the
isoelectric point p
I, so p
I = (p
Ka1 + p
Ka2). For amino acids with charged side chains, the p
Ka of the side chain is involved. Thus for aspartate or glutamate with negative side chains, the terminal amino group is essentially entirely in the charged form , but this positive charge needs to be balanced by the state with just one C-terminal carboxylate group is negatively charged. This occurs halfway between the two carboxylate p
Ka values: p
I = (p
Ka1 + p
Ka(R)), where p
Ka(R) is the side chain p
Ka. Similar considerations apply to other amino acids with ionizable side-chains, including not only glutamate (similar to aspartate), but also cysteine, histidine, lysine, tyrosine and arginine with positive side chains. Zwitterionic amino acids exhibit minimal solubility at their isoelectric point, and in some cases can be isolated from aqueous solution by adjusting the pH to the appropriate isoelectric value.
Abbreviation and property tables Although one-letter symbols are included in the table, IUPAC–IUBMB recommend that "Use of the one-letter symbols should be restricted to the comparison of long sequences". • Initial letters are used where there is no ambiguity: C cysteine, H histidine, I isoleucine, M methionine, S serine, V valine, • D aspartate was assigned arbitrarily, with the proposed mnemonic aspar
Dic acid; E glutamate was assigned in alphabetical sequence being larger by merely one
methylene –CH2– group, In addition to the specific amino acid codes, placeholders are used in cases where
chemical or
crystallographic analysis of a peptide or protein cannot conclusively determine the identity of a residue. They are also used to summarize
conserved protein sequence motifs. The use of single letters to indicate sets of similar residues is similar to the use of
abbreviation codes for degenerate bases.
Unk is sometimes used instead of
Xaa,
Ter or
* (from termination) is used in notation for mutations in proteins when a stop codon occurs. It corresponds to no amino acid at all. In addition, many
nonstandard amino acids have a specific code. For example, several peptide drugs, such as
Bortezomib and
MG132, are
artificially synthesized and retain their
protecting groups, which have specific codes. Bortezomib is
Pyz–Phe–boroLeu, and MG132 is
Z–Leu–Leu–Leu–al. To aid in the analysis of protein structure,
photo-reactive amino acid analogs are available. These include
photoleucine (
pLeu) and
photomethionine (
pMet). ==Biological roles and occurrence==