MarketSuper-enhancer
Company Profile

Super-enhancer

Cell differentiation in multicellular organisms with different cell types is determined, in each cell type, by the expression of genes under the regulatory control of typical enhancers and super-enhancers.

Frequency of super-enhancers
In many cell types, only a minority of activated enhancers are located in Super-Enhancers (SEs). For specialized tissue, such as skeletal muscle, a reduced number of genes are expressed and a low number of specialized and activated super-enhancers are found. In human skeletal muscle, there are nine identified types of cells. On average, the number of expressed genes in these nine cell types is 1,331. There are also about 22 super-enhancers specific to skeletal muscle cells among the nine types of skeletal muscle cells, indicating that specialized super-enhancers in these cells are about 1.7% of the number of typical enhancers (TEs). In immune-system B cells, a study identified 140 SEs and 4,290 TEs in non-stimulated B cells (SEs were 3.2% of activated transcription areas). In stimulated B cells SEs were 3.6% of activated transcription areas. Similarly, in mouse embryonic stem cells, 231 SEs were found, compared to 8,794 TEs, with SEs comprising 2.6% of activated chromatin regions. A study of neural stem cells found 445 SEs and 9436 TEs, so that SEs were 4.7% of active enhancer regions. ==Formation of super-enhancers==
Formation of super-enhancers
Hundreds of thousands of sites in the human genome can potentially act as enhancers. In one large 2020 study, 78 different types of human cells were examined for links between activated enhancers and genes coding for messenger RNA to produce gene products. Distributed among the 78 types of cells there were a total of 449,627 activated enhancers linked to 17,643 protein-coding genes. With this large number of potentially active enhancers, there are some genome regions with a cluster of enhancers that, when all are activated they can all loop to the same promoter and produce a super-enhancer, driving a gene to have very high messenger RNA output. One well-studied gene, MYC, has amplified expression in as many as 70% of all cancers. While about 28% of its over-expressions are due to genetic focal amplifications or translocations, the majority of cases of over-expression of MYC are due to activated super-enhancers. There are more than 10 different super-enhancers that can cause MYC over-expression. For each of 4 tumor types of cells grown in culture (HCT-116, MCF7, K562 and Jurkat) there were three to five super-enhancers specific to each tumor cell type. In one 2013 study, the length of typical enhancers was found to be about 700 base pairs while in the case of super-enhancers the length was about 9,000 base pairs (encompassing multiple single enhancers). A later study, in 2020, indicated that typical enhancers were about 200 nucleotides long and that there may be as many as 3.6 million potentially active enhancers occupying 21.55% of the human genome. In the nucleus of mammalian cells, almost all the DNA is wrapped around regularly spaced protein complexes, called nucleosomes (see top panel in Figure "Chromatin"). The protein complexes are composed of 4 pairs of histones, H2A, H2B, H3 and H4. The DNA plus these protein complexes is called chromatin (see Figure illustrating chromatin). Enhancer regions, as described above, are several hundred nucleotides long. To be activated, the enhancer region must have the nucleosomes evicted from the DNA so that the multiple transcription factors that bind to that enhancer DNA would have access to their binding sites (see bottom panel in Figure "Chromatin"). (To be an active enhancer, more than 10 different binding sites must be occupied by different transcription factors in the enhancer.) In eviction of nucleosomes from enhancer DNA, a pioneer transcription factor first loosens up the attachment of DNA to the nucleosome of an enhancer region. For instance, one transcription factor that does this is the pioneer transcription factor NF-κB. Five steps follow this: • NF-κB is acetylated by p300/CBP. • Acetylated NF-κB recruits a specific histone acetyltransferase enzyme, BRD4. • BRD4 acetylates histone 3 at histone 3 lysine 122 (see Figure "Nucleosome at enhancer with H3K122 acetylated"). • When histone 3 lysine 122 is acetylated the nucleosome is evicted from the enhancer sequence. • Opening up the enhancer DNA allows binding of the other transcription factors needed to form an activated enhancer. Presumably, when the activating signal for NF-κB is very strong, much more NF-κB is activated, and then greatly increased NF-κB can start the process of activating multiple nearby enhancers at the same time, forming a super-enhancer. ==Super-enhancers promote high levels of transcription==
Super-enhancers promote high levels of transcription
As described above, in forming a super-enhancer, BRD4 is complexed with NF-kB. This complex also recruits and forms a further complex with cyclin T1 and Cdk9. Cyclin T1/Cdk9 is also known as P-TEFb. P-TEFb acts as a kinase that phosphorylates RNA polymerase II (RNAP II), which then activates (in conjunction with the Mediator complex described below) the polymerase on the promoter of a gene to initiate transcription and to continue transcription (instead of pausing). The transcription factors, bound to their sites on each enhancer within the super-enhancer, recruit the Mediator complex between each enhancer and the RNA polymerase II that will initiate transcription of the gene to be actively transcribed (see Figure at top of article that illustrates a super-enhancer). The Mediator complex in humans is 1.4 MDa in size and includes 26 sub-units. The tail modules of the Mediator complex protein sub-units interact with the activation domains of transcription factors bound at enhancers and the head and middle modules interact with the pre-initiation complex (PIC) at gene promoters. The Mediator complex, when certain sub-units are phosphorylated and up-activated by particular cyclin-dependent kinases (Cdk8, Cdk9, Cdk19, etc.), will then promote higher levels of transcription. ==History==
History
The regulation of transcription by enhancers has been studied since the 1980s. Large or multi-component transcription regulators with a range of mechanistic properties, including locus control regions, clustered open regulatory elements, and transcription initiation platforms, were observed shortly thereafter. More recent research has suggested that these different categories of regulatory elements may represent subtypes of super-enhancer. In 2013, two labs identified large enhancers near several genes especially important for establishing cell identities. While Richard A. Young and colleagues identified super-enhancers, Francis Collins and colleagues identified stretch enhancers. As currently defined, the term "super-enhancer" was introduced by Young's lab to describe regions identified in mouse embryonic stem cells (ESCs). == Function ==
Function
The enhancers comprising super-enhancers share the functions of enhancers, including binding transcription factor proteins, looping to target genes, and activating transcription. High levels of many transcription factors and co-factors are seen at super-enhancers (e.g., CDK7, BRD4, and Mediator). The constituent enhancers of super-enhancers physically interact with each other and their target genes over a long range sequence-wise. Super-enhancers that control the expression of major cell surface receptors with a crucial role in the function of a given cell lineage have also been defined. This is notably the case for B-lymphocytes, the survival, the activation and the differentiation of which rely on the expression of membrane-form immunoglobulins (Ig). The Ig heavy chain locus super-enhancer is a very large (25kb) cis-regulatory region, including multiple enhancers and controlling several major modifications of the locus (notably somatic hypermutation, class-switch recombination and locus suicide recombination). ==Relevance to Disease==
Relevance to Disease
Mutations in super-enhancers have been noted in various diseases, including cancers, type 1 diabetes, Alzheimer's disease, lupus, rheumatoid arthritis, multiple sclerosis, systemic scleroderma, primary biliary cirrhosis, Crohn's disease, Graves disease, vitiligo, and atrial fibrillation. A similar enrichment in disease-associated sequence variation has also been observed for stretch enhancers. Altered super-enhancer function is also induced by mutations of chromatin regulators. Acquired super-enhancers may thus be biomarkers that could be useful for diagnosis and therapeutic intervention. For instance, super-enhancers rely on exceptional amounts of CDK7, and, in cancer, multiple papers report the loss of expression of their target genes when cells are treated with the CDK7 inhibitor THZ1. Similarly, super-enhancers are enriched in the target of the JQ1 small molecule, BRD4, so treatment with JQ1 causes exceptional losses in expression for super-enhancer—associated genes. ==Identification==
Identification
Super-enhancers have been most commonly identified by locating genomic regions that are highly enriched in ChIP-Seq signal. ChIP-Seq experiments targeting master transcription factors and co-factors like Mediator or BRD4 have been used, but the most frequently used is H3K27ac-marked nucleosomes. The program "ROSE" (Rank Ordering of Super-Enhancers) is commonly used to identify super-enhancers from ChIP-Seq data. This program stitches together previously identified enhancer regions and ranks these stitched enhancers by their ChIP-Seq signal. == References ==
tickerdossier.comtickerdossier.substack.com