RNA-binding proteins exhibit highly specific recognition of their RNA targets by recognizing their sequences, structures, motifs and RNA modifications. Specific binding of the RNA-binding proteins allow them to distinguish their targets and regulate a variety of cellular functions via control of the generation, maturation, and lifespan of the RNA transcript. This interaction begins during transcription as some RBPs remain bound to RNA until degradation whereas others only transiently bind to RNA to regulate
RNA splicing, processing, transport, and localization.
Cross-linking immunoprecipitation (CLIP) methods are used to stringently identify direct RNA binding sites of RNA-binding proteins in a variety of tissues and organisms. In this section, three classes of the most widely studied RNA-binding domains (RNA-recognition motif, double-stranded RNA-binding motif, zinc-finger motif) will be discussed.
RNA-recognition motif (RRM) The
RNA recognition motif, which is the most common RNA-binding motif, is a small protein domain of 75–85
amino acids that forms a four-stranded
β-sheet against the two α-helices. This recognition motif exerts its role in numerous cellular functions, especially in mRNA/rRNA processing, splicing, translation regulation, RNA export, and RNA stability. Ten structures of an RRM have been identified through
NMR spectroscopy and
X-ray crystallography. These structures illustrate the intricacy of protein–RNA recognition of RRM as it entails RNA–RNA and protein–protein interactions in addition to protein–RNA interactions. Despite their complexity, all ten structures have some common features. All RRMs' main protein surfaces' four-stranded β-sheet was found to interact with the RNA, which usually contacts two or three nucleotides in a specific manner. In addition, strong RNA binding affinity and specificity towards variation are achieved through an interaction between the inter-domain linker and the RNA and between RRMs themselves. This plasticity of the RRM explains why RRM is the most abundant domain and why it plays an important role in various biological functions.
Double-stranded RNA-binding motif The double-stranded RNA-binding motif (dsRM, dsRBD), a 70–75 amino-acid domain, plays a critical role in
RNA processing,
RNA localization,
RNA interference,
RNA editing, and translational repression. All three structures of the domain solved as of 2005 possess uniting features that explain how dsRMs only bind to dsRNA instead of dsDNA. The dsRMs were found to interact along the RNA duplex via both α-helices and β1-β2 loop. Moreover, all three dsRBM structures make contact with the sugar-phosphate backbone of the major groove and of one minor groove, which is mediated by the β1-β2 loop along with the
N-terminus region of the
alpha helix 2. This interaction is a unique adaptation for the shape of an RNA double helix as it involves 2'-hydroxyls and phosphate oxygen. Despite the common structural features among dsRBMs, they exhibit distinct chemical frameworks, which permits specificity for a variety for RNA structures including stem-loops, internal loops, bulges or helices containing mismatches.
Zinc fingers " : Cartoon representation of the zinc-finger motif of proteins. The zinc ion (green) is coordinated by two histidine and two cysteine amino acid residues. CCHH-type
zinc-finger domains are the most common
DNA-binding domain within the eukaryotic
genome. In order to attain high sequence-specific recognition of DNA, several zinc fingers are utilized in a modular fashion. Zinc fingers exhibit ββα protein fold in which a
β-hairpin and a α-helix are joined via a ion. Furthermore, the interaction between protein side-chains of the α-helix with the DNA bases in the major groove allows for the DNA-sequence-specific recognition. Despite its wide recognition of DNA, there has been recent discoveries that zinc fingers also have the ability to recognize RNA. In addition to CCHH zinc fingers, CCCH zinc fingers were recently discovered to employ sequence-specific recognition of single-stranded RNA through an interaction between intermolecular
hydrogen bonds and Watson-Crick edges of the RNA bases. CCHH-type zinc fingers employ two methods of RNA binding. First, the zinc fingers exert non-specific interaction with the backbone of a
double helix whereas the second mode allows zinc fingers to specifically recognize the individual bases that bulge out. Differing from the CCHH-type, the CCCH-type zinc finger displays another mode of RNA binding, in which single-stranded RNA is identified in a sequence-specific manner. Overall, zinc fingers can directly recognize DNA via binding to dsDNA sequence and RNA via binding to ssRNA sequence. ==Role in embryonic development==