Ubiquitylation (also known as ubiquitination or ubiquitinylation) is an enzymatic
post-translational modification in which a ubiquitin moiety is attached to a
substrate protein. This process most commonly binds the last
amino acid of ubiquitin (
glycine 76) to a
lysine residue on the substrate. An
isopeptide bond is formed between the
carboxyl group (COO−) of the ubiquitin's glycine and the epsilon-
amino group (ε-) of the substrate's lysine. Trypsin cleavage of a ubiquitin-conjugated substrate leaves a di-glycine "remnant" that is used to identify the site of ubiquitylation. Ubiquitin can also be bound to other sites in a protein which are electron-rich
nucleophiles, termed "non-canonical ubiquitylation". This was first observed with the
amine group of a protein's
N-terminus being used for ubiquitylation, rather than a lysine residue, in the protein
MyoD and has been observed since in 22 other proteins in multiple species, including ubiquitin itself. and the
hydroxyl group on threonine and serine. The end result of this process is the addition of one ubiquitin molecule (monoubiquitylation) or a chain of ubiquitin molecules (polyubiquitination) to the substrate protein. Ubiquitination requires three types of enzyme:
ubiquitin-activating enzymes,
ubiquitin-conjugating enzymes, and
ubiquitin ligases, known as E1s, E2s, and E3s, respectively. The process consists of three main steps: •
Activation: Ubiquitin is activated in a two-step reaction by an E1
ubiquitin-activating enzyme, which is dependent on
ATP. The initial step involves production of a ubiquitin-adenylate intermediate. The E1 binds both ATP and ubiquitin and catalyses the acyl-adenylation of the
C-terminus of the ubiquitin molecule. The second step transfers ubiquitin to an
active site cysteine residue, with release of
AMP. This step results in a
thioester linkage between the C-terminal carboxyl group of ubiquitin and the E1 cysteine
sulfhydryl group. The human genome contains two genes that produce enzymes capable of activating ubiquitin:
UBA1 and
UBA6. •
Conjugation: E2
ubiquitin-conjugating enzymes catalyse the transfer of ubiquitin from E1 to the
active site cysteine of the E2 via a trans(thio)esterification reaction. In order to perform this reaction, the E2 binds to both activated ubiquitin and the E1 enzyme. Humans possess 35 different E2 enzymes, whereas other
eukaryotic organisms have between 16 and 35. They are characterised by their highly conserved structure, known as the ubiquitin-conjugating catalytic (UBC) fold. •
Ligation: E3
ubiquitin ligases catalyse the final step of the ubiquitylation cascade. Most commonly, they create an isopeptide bond between a lysine of the target protein and the C-terminal glycine of ubiquitin. In general, this step requires the activity of one of the hundreds of E3s. E3 enzymes function as the
substrate recognition modules of the system and are capable of interaction with both E2 and substrate. Some E3 enzymes also activate the E2 enzymes. E3 enzymes possess one of two
domains: the homologous to the E6-AP carboxyl terminus (
HECT) domain and the really interesting new gene (
RING) domain (or the closely related U-box domain). HECT domain E3s transiently bind ubiquitin in this process (an obligate thioester intermediate is formed with the active-site cysteine of the E3), whereas RING domain E3s catalyse the direct transfer from the E2 enzyme to the substrate. The
anaphase-promoting complex (APC) and the
SCF complex (for Skp1-Cullin-F-box protein complex) are two examples of multi-
subunit E3s involved in recognition and ubiquitylation of specific target proteins for degradation by the
proteasome. In the ubiquitylation cascade, E1 can bind with many E2s, which can bind with hundreds of E3s in a hierarchical way. Having levels within the cascade allows tight regulation of the ubiquitylation machinery. Other ubiquitin-like proteins (UBLs) are also modified via the E1–E2–E3 cascade, although variations in these systems do exist. E4 enzymes, or ubiquitin-chain elongation factors, are capable of adding pre-formed polyubiquitin chains to substrate proteins. For example, multiple monoubiquitylation of the tumor suppressor
p53 by
Mdm2 can be followed by addition of a polyubiquitin chain using
p300 and CBP.
Ubiquitination of non-protein substrates Although ubiquitination was historically considered a protein-specific post-translational modification, recent studies have shown that ubiquitin can also be conjugated to certain non-protein molecules. These include lipopolysaccharides (LPS), phospholipids and other metabolites. A notable example is the E3 ligase RNF213 which can attach ubiquitin to the lipid A moiety of bacterial LPS, promoting xenophagic clearance of invading bacteria. These findings expand the known scope of ubiquitin signaling beyond classical post-translational protein modification.
Types Ubiquitylation affects cellular process by regulating the degradation of proteins (via the
proteasome and
lysosome), coordinating the
cellular localization of proteins, activating and inactivating proteins, and modulating
protein–protein interactions. These effects are mediated by different types of substrate ubiquitylation, for example the addition of a single ubiquitin molecule (monoubiquitylation) or different types of ubiquitin chains (polyubiquitylation). Monoubiquitylation affects cellular processes such as
membrane trafficking,
endocytosis and
viral budding.
Polyubiquitin chains Polyubiquitylation is the formation of a ubiquitin chain on a single lysine residue on the substrate protein. Following addition of a single ubiquitin moiety to a protein substrate, further ubiquitin molecules can be added to the first, yielding a polyubiquitin chain. Lysine 48-linked polyubiquitin chains target proteins for destruction, by a process known as
proteolysis. Multi-ubiquitin chains at least four ubiquitin molecules long must be attached to a lysine residue on the condemned protein in order for it to be recognised by the
26S proteasome. This is a barrel-shape structure comprising a central proteolytic core made of four ring structures, flanked by two cylinders that selectively allow entry of ubiquitylated proteins. Once inside, the proteins are rapidly degraded into small
peptides (usually 3–25 amino acid residues in length). Ubiquitin molecules are cleaved off the protein immediately prior to destruction and are recycled for further use. Although the majority of protein substrates are ubiquitylated, there are examples of non-ubiquitylated proteins targeted to the proteasome. In cells, lysine 63-linked chains are bound by the
ESCRT-0 complex, which prevents their binding to the proteasome. This complex contains two proteins, Hrs and STAM1, that contain a UIM, which allows it to bind to lysine 63-linked chains. Methionine 1-linked (or linear) polyubiquitin chains are another type of non-degradative ubiquitin chains. In this case, ubiquitin is linked in a head-to-tail manner, meaning that the C-terminus of the last ubiquitin molecule binds directly to the N-terminus of the next one. Although initially believed to target proteins for proteasomal degradation, linear ubiquitin later proved to be indispensable for NF-kB signaling. Currently, there is only one known E3 ubiquitin ligase generating M1-linked polyubiquitin chains -
linear ubiquitin chain assembly complex (LUBAC). Less is understood about atypical (non-lysine 48-linked) ubiquitin chains but research is starting to suggest roles for these chains. The function of these chains is unknown. K63- and M1-linked chains have a fairly linear conformation; they are known as open-conformation chains. K6-, K11-, and K48-linked chains form closed conformations. The ubiquitin molecules in open-conformation chains do not interact with each other, except for the covalent
isopeptide bonds linking them together. In contrast, the closed conformation chains have interfaces with interacting residues. Altering the chain conformations exposes and conceals different parts of the ubiquitin protein, and the different linkages are recognized by proteins that are specific for the unique
topologies that are intrinsic to the linkage. Proteins can specifically bind to ubiquitin via
ubiquitin-binding domains (UBDs). The distances between individual ubiquitin units in chains differ between lysine 63- and 48-linked chains. The UBDs exploit this by having small spacers between
ubiquitin-interacting motifs that bind lysine 48-linked chains (compact ubiquitin chains) and larger spacers for lysine 63-linked chains. The machinery involved in recognising polyubiquitin chains can also differentiate between K63-linked chains and M1-linked chains, demonstrated by the fact that the latter can induce proteasomal degradation of the substrate. == Function ==