are the result of different levels of expression of
pigmentation genes in different areas of the
skin. Regulation of gene expression is the control of the amount and timing of appearance of the functional product of a gene. Control of expression is vital to allow a cell to produce the gene products it needs when it needs them; in turn, this gives cells the flexibility to adapt to a variable environment, external signals, damage to the cell, and other stimuli. More generally, gene regulation gives the cell control over all structure and function, and is the basis for
cellular differentiation,
morphogenesis and the versatility and adaptability of any organism. Numerous terms are used to describe types of genes depending on how they are regulated; these include: • A
constitutive gene is a gene that is transcribed continually as opposed to a facultative gene, which is only transcribed when needed. • A
housekeeping gene is a gene that is required to maintain basic cellular function and so is typically expressed in all cell types of an organism. Examples include
actin,
GAPDH and
ubiquitin. Some housekeeping genes are transcribed at a relatively constant rate and these genes can be used as a reference point in experiments to measure the expression rates of other genes. • A
facultative gene is a gene only transcribed when needed as opposed to a constitutive gene. • An
inducible gene is a gene whose expression is either responsive to environmental change or dependent on the position in the cell cycle. Any step of gene expression may be modulated, from the DNA-RNA transcription step to
post-translational modification of a protein. The stability of the final gene product, whether it is RNA or protein, also contributes to the expression level of the gene—an unstable product results in a low expression level. In general gene expression is regulated through changes in the number and type of interactions between molecules that collectively influence transcription of DNA and translation of RNA. Some simple examples of where gene expression is important are: • Control of
insulin expression so it gives a signal for
blood glucose regulation. •
X chromosome inactivation in female
mammals to prevent an "overdose" of the genes it contains. •
Cyclin expression levels control progression through the eukaryotic
cell cycle.
Transcriptional Regulation of transcription can be broken down into three main routes of influence; genetic (direct interaction of a control factor with the gene), modulation interaction of a control factor with the transcription machinery and epigenetic (non-sequence changes in DNA structure that influence transcription). transcription factor (green) binds as a dimer to
major groove of DNA target (red and blue) and disables initiation of transcription. From . Direct interaction with DNA is the simplest and the most direct method by which a protein changes transcription levels. Genes often have several protein binding sites around the coding region with the specific function of regulating transcription. There are many classes of regulatory DNA binding sites known as
enhancers,
insulators and
silencers. The mechanisms for regulating transcription are varied, from blocking key binding sites on the DNA for
RNA polymerase to acting as an
activator and promoting transcription by assisting RNA polymerase binding. The activity of transcription factors is further modulated by intracellular signals causing protein post-translational modification including
phosphorylation,
acetylation, or
glycosylation. These changes influence a transcription factor's ability to bind, directly or indirectly, to promoter DNA, to recruit RNA polymerase, or to favor elongation of a newly synthesized RNA molecule. The nuclear membrane in eukaryotes allows further regulation of transcription factors by the duration of their presence in the nucleus, which is regulated by reversible changes in their structure and by binding of other proteins. Environmental stimuli or endocrine signals may cause modification of regulatory proteins eliciting cascades of intracellular signals, which result in regulation of gene expression. It has become apparent that there is a significant influence of non-DNA-sequence specific effects on transcription. These effects are referred to as
epigenetic and involve the higher order structure of DNA, non-sequence specific DNA binding proteins and chemical modification of DNA. In general epigenetic effects alter the accessibility of DNA to proteins and so modulate transcription. . Note how the DNA (blue and green) is tightly wrapped around the protein core made of
histone octamer (ribbon coils), restricting access to the DNA. From . In eukaryotes the structure of
chromatin, controlled by the
histone code, regulates access to DNA with significant impacts on the expression of genes in
euchromatin and
heterochromatin areas.
Enhancers, transcription factors, mediator complex and DNA loops regulatory region is enabled to interact with the
promoter region of its target
gene by formation of a chromosome loop. This can initiate
messenger RNA (mRNA) synthesis by
RNA polymerase II (RNAP II) bound to the promoter at the
transcription start site of the gene. The loop is stabilized by one architectural protein anchored to the enhancer and one anchored to the promoter and these proteins are joined to form a dimer (red zigzags). Specific regulatory
transcription factors bind to DNA sequence motifs on the enhancer. General transcription factors bind to the promoter. When a transcription factor is activated by a signal (here indicated as
phosphorylation shown by a small red star on a transcription factor on the enhancer) the enhancer is activated and can now activate its target promoter. The active enhancer is transcribed on each strand of DNA in opposite directions by bound RNAP IIs. Mediator proteins (a complex consisting of about 26 proteins in an interacting structure) communicate regulatory signals from the enhancer DNA-bound transcription factors to the promoter. Gene expression in mammals is regulated by many
cis-regulatory elements, including
core promoters and promoter-proximal elements that are located near the
transcription start sites of genes,
upstream on the DNA (towards the 5' region of the
sense strand). Other important cis-regulatory modules are localized in DNA regions that are distant from the transcription start sites. These include
enhancers,
silencers,
insulators and tethering elements. Enhancers and their associated
transcription factors have a leading role in the regulation of gene expression.
Enhancers are genome regions that regulate genes. Enhancers control cell-type-specific gene expression programs, most often by looping through long distances to come in physical proximity with the promoters of their target genes. Multiple enhancers, each often tens or hundred of thousands of nucleotides distant from their target genes, loop to their target gene promoters and coordinate with each other to control gene expression. Several cell function-specific transcription factors (among the about 1,600 transcription factors in a human cell) generally bind to specific motifs on an enhancer. A small combination of these enhancer-bound transcription factors, when brought close to a promoter by a DNA loop, govern transcription level of the target gene. Mediator (a complex usually consisting of about 26 proteins in an interacting structure) communicates regulatory signals from enhancer DNA-bound transcription factors directly to the RNA polymerase II (pol II) enzyme bound to the promoter. Enhancers, when active, are generally transcribed from both strands of DNA with RNA polymerases acting in two different directions, producing two eRNAs as illustrated in the figure. An inactive enhancer may be bound by an inactive transcription factor. Phosphorylation of the transcription factor may activate it and that activated transcription factor may then activate the enhancer to which it is bound (see small red star representing phosphorylation of transcription factor bound to enhancer in the illustration). An activated enhancer begins transcription of its RNA before activating transcription of messenger RNA from its target gene.
DNA methylation and demethylation group to the DNA that happens at
cytosine. The image shows a cytosine single ring base and a methyl group added on to the 5 carbon. In mammals, DNA methylation occurs almost exclusively at a cytosine that is followed by a
guanine.
DNA methylation is a widespread mechanism for epigenetic influence on gene expression and is seen in
bacteria and
eukaryotes and has roles in heritable transcription silencing and transcription regulation. Methylation most often occurs on a cytosine (see Figure). Methylation of cytosine primarily occurs in dinucleotide sequences where a cytosine is followed by a guanine, a
CpG site. The number of
CpG sites in the human genome is about 28 million. Depending on the type of cell, about 70% of the CpG sites have a methylated cytosine. Methylation of cytosine in DNA has a major role in regulating gene expression. Methylation of CpGs in a promoter region of a gene usually represses gene transcription while methylation of CpGs in the body of a gene increases expression.
TET enzymes play a central role in demethylation of methylated cytosines. Demethylation of CpGs in a gene promoter by
TET enzyme activity increases transcription of the gene.
Post-transcriptional regulation In eukaryotes, where export of RNA is required before translation is possible, nuclear export is thought to provide additional control over gene expression. All transport in and out of the nucleus is via the
nuclear pore and transport is controlled by a wide range of
importin and
exportin proteins. Expression of a gene coding for a protein is only possible if the messenger RNA carrying the code survives long enough to be translated. In a typical cell, an RNA molecule is only stable if specifically protected from degradation. RNA degradation has particular importance in regulation of expression in eukaryotic cells where mRNA has to travel significant distances before being translated. In eukaryotes, RNA is stabilised by certain post-transcriptional modifications, particularly the
5′ cap and
poly-adenylated tail. If an mRNA molecule has a complementary sequence to a
small interfering RNA then it is targeted for destruction via the
RNA interference pathway.
Three prime untranslated regions and microRNAs Three prime untranslated regions (3′UTRs) of
messenger RNAs (mRNAs) often contain regulatory sequences that post-transcriptionally influence gene expression. Such 3′-UTRs often contain both binding sites for
microRNAs (miRNAs) as well as for regulatory proteins. By binding to specific sites within the 3′-UTR, miRNAs can decrease gene expression of various mRNAs by either inhibiting translation or directly causing degradation of the transcript. The 3′-UTR also may have silencer regions that bind repressor proteins that inhibit the expression of a mRNA. The 3′-UTR often contains
microRNA response elements (MREs). MREs are sequences to which miRNAs bind. These are prevalent motifs within 3′-UTRs. Among all regulatory motifs within the 3′-UTRs (e.g. including silencer regions), MREs make up about half of the motifs. As of 2014, the
miRBase web site, an archive of
miRNA sequences and annotations, listed 28,645 entries in 233 biologic species. Of these, 1,881 miRNAs were in annotated human miRNA loci. miRNAs were predicted to have an average of about four hundred target
mRNAs (affecting expression of several hundred genes). Friedman et al. Other experiments show that a single miRNA may repress the production of hundreds of proteins, but that this repression often is relatively mild (less than 2-fold). The effects of miRNA dysregulation of gene expression seem to be important in cancer. For instance, in gastrointestinal cancers, nine miRNAs have been identified as
epigenetically altered and effective in down regulating DNA repair enzymes. The effects of miRNA dysregulation of gene expression also seem to be important in neuropsychiatric disorders, such as schizophrenia, bipolar disorder, major depression, Parkinson's disease, Alzheimer's disease and autism spectrum disorders.
Translational is an example of a small molecule that reduces expression of all protein genes inevitably leading to cell death; it thus acts as an
antibiotic. Direct regulation of translation is less prevalent than control of transcription or mRNA stability but is occasionally used. Inhibition of protein translation is a major target for
toxins and
antibiotics, so they can kill a cell by overriding its normal gene expression control.
Protein synthesis inhibitors include the antibiotic
neomycin and the toxin
ricin.
Post-translational modifications Post-translational modifications (PTMs) are
covalent modifications to proteins. Like RNA splicing, they help to significantly diversify the proteome. These modifications are usually catalyzed by enzymes. Additionally, processes like covalent additions to amino acid side chain residues can often be reversed by other enzymes. However, some, like the
proteolytic cleavage of the protein backbone, are irreversible. PTMs play many important roles in the cell. For example, phosphorylation is primarily involved in activating and deactivating proteins and in signaling pathways. PTMs are involved in transcriptional regulation: an important function of acetylation and methylation is histone tail modification, which alters how accessible DNA is for transcription. One type of PTM can initiate another type of PTM, as can be seen in how
ubiquitination tags proteins for degradation through proteolysis. ==Measurement==