of DUX4 protein full-length (FL), with short (S) version indicated The genetics of FSHD are complex. The
DUX4 gene is the focal point of FSHD genetics. Normally, full-length DUX4 protein (DUX4-fl) is expressed during early
embryogenesis, in testicular tissue of adults, and the
thymus; in all other tissues, it is
repressed. Each D4Z4 repeat is 3.3
kilobase pairs (kb) long and is the site of epigenetic regulation, containing both
heterochromatin and
euchromatin structures. In FSHD, the heterochromatin structure is lost, becoming euchromatin, Because 10q usually lacks a polyadenylation sequence, it is usually not implicated in disease. However,
chromosomal rearrangements can occur between 4q and 10q repeat arrays, and involvement in disease is possible if a 4q D4Z4 repeat and polyadenylation signal are transferred onto 10q, or if rearrangement causes FSHD1. D4Z4 repeat array types are subclassified into 4qA and 4qB alleles, with only 4qA alleles causing disease. 4qA alleles are defined by a specific sequence of DNA immediately downstream to the D4Z4 repeat array: a 260 base pair region named pLAM, followed by a 6,200 base pair beta satellite region. 4qA and 4qB alleles, together, can be subdivided into at least 17 types, based on the DNA upstream from the D4Z4 repeat array, the presence/absence of restriction enzyme sites within D4Z4, the size of the last D4Z4 repeat element, and the DNA present downstream to the D4Z4 repeat array. A polyadenylation signal is within exon 3. Because exon 3 and its containing polyadenylation signal are not contained within each D4Z4 repeat, only the last D4Z4 repeat of a D4Z4 repeat array can encode a stable mRNA transcript to produce the DUX4 protein. These transcripts can be spliced in several ways to form mature RNA. One of these transcripts encodes only a portion of DUX4 protein, termed DUX4-s (DUX4-short). One proposed mechanism is that DBE-T leads to the recruitment of the
trithorax-group protein Ash1L, an increase in
H3K36me2-methylation, and ultimately de-repression of 4q35 genes.
FSHD1 FSHD involving deletion of D4Z4 repeats (termed 'D4Z4 contraction') on 4q is classified as FSHD1, which accounts for 95% of FSHD cases. As of 2019, more detailed studies are needed to show definitively whether or not anticipation occurs. If anticipation does occur in FSHD, the mechanism is different than that of trinucleotide repeat disorders, since D4Z4 repeats are much larger than trinucleotide repeats, an underabundance of repeats (rather than overabundance) causes disease, and the repeat array size in FSHD is stable across generations.
FSHD2 FSHD with a D4Z4 array repeat size of 11 or greater is classified as FSHD2, responsible for 5% of FSHD cases. Another cause of FSHD2 is mutation in
DNMT3B (DNA methyltransferase 3B). Mutations in
DNMT3B can also cause
ICF syndrome. LRIF1 is known to interact with the SMCHD1 protein.
Two ends of a disease spectrum FSHD1 and FSHD2 have traditionally been viewed as separate entities with distinct genetic causes (although the downstream genetic mechanisms merge). Alternatively, the genetic causes of FSHD1 and FSHD2 can be viewed as
risk factors, each contributing to an FSHD disease spectrum. For example, in those with FSHD2, although they have do not have a 4qA allele with D4Z4 repeat number less than 11, they still have one less than 30 (shorter than the upper limit seen in the general population), suggesting that a large number of D4Z4 repeats can prevent the effects of an
SMCHD1 mutation. A combined FSHD1/FSHD2 presentation is most common in those with 9–10 repeats. A possible explanation is that a sizable portion of the general population has 9–10 repeats with difficult-to-detect or no disease. The additive effect of an
SMCHD1 mutation may be severe enough to make a diagnosis. The 9–10 repeat size can be considered as an overlap zone between FSHD1 and FSHD2. ==Pathophysiology==