Several methods of RNA splicing occur in nature; the type of splicing depends on the structure of the spliced intron and the
catalysts required for splicing to occur.
Spliceosomal complex Introns The word
intron is derived from the terms
intragenic region, and
intracistron, that is, a segment of DNA that is located between two exons of a
gene. The term intron refers to both the DNA sequence within a gene and the corresponding sequence in the unprocessed RNA transcript. As part of the RNA processing pathway, introns are removed by RNA splicing either shortly after or concurrent with
transcription. Introns are found in the genes of most organisms and many viruses. They can be located in a wide range of genes, including those that generate
proteins,
ribosomal RNA (rRNA), and
transfer RNA (tRNA). Within introns, a donor site (5' end of the intron), a branch site (near the 3' end of the intron) and an acceptor site (3' end of the intron) are required for splicing. The splice donor site includes an almost invariant sequence GU at the 5' end of the intron, within a larger, less highly conserved region. The splice acceptor site at the 3' end of the intron terminates the intron with an almost invariant AG sequence. Upstream (5'-ward) from the AG there is a region high in
pyrimidines (C and U), or
polypyrimidine tract. Further upstream from the polypyrimidine tract is the branchpoint, which includes an adenine nucleotide involved in lariat formation. The
consensus sequence for an intron (in IUPAC
nucleic acid notation) is: G-G-[cut]-G-U-R-A-G-U (donor site) ... intron sequence ... Y-U-R-A-C (branch sequence 20-50 nucleotides upstream of acceptor site) ... Y-rich-N-C-A-G-[cut]-G (acceptor site). However, it is noted that the specific sequence of intronic splicing elements and the number of nucleotides between the branchpoint and the nearest 3' acceptor site affect splice site selection. Also, point mutations in the underlying DNA or errors during transcription can activate a
cryptic splice site in part of the transcript that usually is not spliced. This results in a
mature messenger RNA with a missing section of an exon. In this way, a
point mutation, which might otherwise affect only a single amino acid, can manifest as a
deletion or truncation in the final protein.
1''' - 3' Splice site
2 - Poly pyrimidine Tract
3 - Branch site
4 - 5' splice site
Formation and activity Splicing is catalyzed by the
spliceosome, a large RNA-protein complex composed of five small nuclear ribonucleoproteins (
snRNPs). Assembly and activity of the spliceosome occurs during transcription of the pre-mRNA. The RNA components of snRNPs interact with the intron and are involved in catalysis. Two types of spliceosomes have been identified (major and minor) which contain different
snRNPs. • The
major spliceosome splices introns containing GU at the 5' splice site and AG at the 3' splice site. It is composed of the
U1,
U2,
U4,
U5, and
U6 snRNPs and is active in the nucleus. In addition, a number of proteins including
U2 small nuclear RNA auxiliary factor 1 (U2AF35),
U2AF2 (U2AF65) and
SF1 are required for the assembly of the spliceosome. The spliceosome forms different complexes during the splicing process: :*Complex E :**The U1 snRNP binds to the GU sequence at the 5' splice site of an intron; :**
Splicing factor 1 binds to the intron branch point sequence; :**
U2AF1 binds at the 3' splice site of the intron; :**
U2AF2 binds to the polypyrimidine tract; :*Complex A (pre-spliceosome) :**The U2 snRNP displaces SF1 and binds to the branch point sequence and ATP is hydrolyzed; :*Complex B (pre-catalytic spliceosome) :**The U5/U4/U6 snRNP trimer binds, and the U5 snRNP binds exons at the 5' site, with U6 binding to U2; :*Complex B* :**The U1 snRNP is released, U5 shifts from exon to intron, and the U6 binds at the 5' splice site; :*Complex C (catalytic spliceosome) :**U4 is released, U6/U2 catalyzes transesterification, making the 5'-end of the intron ligate to the A on intron and form a lariat, U5 binds exon at 3' splice site, and the 5' site is cleaved, resulting in the formation of the lariat; :*Complex C* (post-spliceosomal complex) :**U2/U5/U6 remain bound to the lariat, and the 3' site is cleaved and exons are ligated using ATP hydrolysis. The spliced RNA is released, the lariat is released and degraded, and the snRNPs are recycled. :This type of splicing is termed
canonical splicing or termed the
lariat pathway, which accounts for more than 99% of splicing. By contrast, when the intronic flanking sequences do not follow the GU-AG rule,
noncanonical splicing is said to occur (see "minor spliceosome" below). • The
minor spliceosome is very similar to the major spliceosome, but instead it splices out rare introns with different splice site sequences. While the minor and major spliceosomes contain the same U5
snRNP, the minor spliceosome has different but functionally analogous snRNPs for U1, U2, U4, and U6, which are respectively called
U11,
U12,
U4atac, and
U6atac.
Recursive splicing In most cases, splicing removes introns as single units from precursor
mRNA transcripts. However, in some cases, especially in mRNAs with very long introns, splicing happens in steps, with part of an intron removed and then the remaining intron is spliced out in a following step. This has been found first in the
Ultrabithorax (
Ubx) gene of the fruit fly,
Drosophila melanogaster, and a few other
Drosophila genes, but cases in humans have been reported as well.
Trans-splicing Trans-splicing is a form of splicing that removes introns or
outrons, and joins two exons that are not within the same RNA transcript.
Trans-splicing can occur between two different endogenous
pre-mRNAs or between an endogenous and an exogenous (such as from viruses) or artificial RNAs.
Self-splicing Self-splicing occurs for rare introns that form a
ribozyme, performing the functions of the spliceosome by RNA alone. There are three kinds of self-splicing introns,
Group I,
Group II and
Group III. Group I and II introns perform splicing similar to the spliceosome without requiring any protein. This similarity suggests that Group I and II introns may be evolutionarily related to the spliceosome. Self-splicing may also be very ancient, and may have existed in an
RNA world present before protein. Two
transesterifications characterize the mechanism in which group I introns are spliced: • 3'OH of a free guanine
nucleoside or a nucleotide cofactor (GMP, GDP, GTP) attacks phosphate at the 5' splice site. • 3'OH of the 5' exon becomes a nucleophile and the second transesterification results in the joining of the two exons. The mechanism in which group II introns are spliced (two
transesterification reactions) is as follows: • The 2'OH of a specific adenosine in the intron (also known as the "branchpoint") attacks the 5' splice site, thereby forming the
lariat. • The 3'OH of the 5' exon triggers the second transesterification at the 3' splice site, thereby joining the exons together. Both group I and II introns utilize two magnesium ions in the catalytic core to catalyze the splicing reaction, the same catalytic mechanism used by the spliceosome. Detailed structural characterization reveals that group II intron bears significant similarities to the spliceosome in terms of branchpoint adenosine recognition and structural dynamics during the two steps of transesterification.
tRNA splicing tRNA (also tRNA-like) splicing is another rare form of splicing that usually occurs in tRNA. The splicing reaction involves a different biochemistry than the spliceosomal and self-splicing pathways. In the
yeast Saccharomyces cerevisiae, a yeast tRNA splicing
endonuclease heterotetramer, composed of
TSEN54,
TSEN2,
TSEN34, and
TSEN15, cleaves pre-tRNA at two sites in the acceptor loop to form a 5'-half tRNA, terminating at a 2',3'-cyclic phosphodiester group, and a 3'-half tRNA, terminating at a 5'-hydroxyl group, along with a discarded intron. Yeast tRNA kinase then phosphorylates the 5'-hydroxyl group using
adenosine triphosphate. Yeast tRNA cyclic phosphodiesterase cleaves the cyclic phosphodiester group to form a 2'-phosphorylated 3' end. Yeast tRNA ligase adds an
adenosine monophosphate group to the 5' end of the 3'-half and joins the two halves together. NAD-dependent 2'-phosphotransferase then removes the 2'-phosphate group.
SOS splicing SOS splicing was discovered in
Caenorhabditis elegans where it protects
genes from DNA-
transposon-mediated disruption by removing
transposable elements (TEs) from
mRNAs. The process also occurs in humans and operates independently of the
spliceosome. Three
proteins are required for SOS splicing: AKAP17A (an mRNA-binding protein); RTCB (an
RNA ligase), and CAAP1 (which connects RTCB and AKAP17A). ==Evolution==