TDP-43 is 414
amino acid residues long. It consists of four
domains: an N-terminal domain spanning residues 1–76 (NTD) with a well-defined
fold that has been shown to form a
dimer or
oligomer; two highly conserved folded
RNA recognition motifs spanning residues 106–176 (RRM1) and 191–259 (RRM2), respectively, required to bind target
RNA and
DNA; an unstructured C-terminal domain encompassing residues 274–414 (CTD), which contains a
glycine-rich region, is involved in protein-protein interactions, and harbors most of the
mutations associated with familial
amyotrophic lateral sclerosis. The entire protein devoid of large solubilising tags has been purified. The full-length protein is a dimer. but shortly after it was argued that in the context of
FTLD-TDP the protein involved could be
TMEM106B (which has been also resolved with cryo-EM), rather than of TDP-43.
N-Terminal domain (NTD) The
NTD located between residues 1 and 76 is involved in TDP-43
polymerization. Indeed, dimers are formed by head-to-head interactions between NTDs, and the polymer thus obtained allows for
pre-mRNA splicing. However, further
oligomerization brings to more toxic accumulates. This process of polymerization into dimers, larger forms or just stabilizing monomers is dependent on TDP-43 conformational equilibrium between monomers, homodimers and oligomers. Hence, in TDP-43 diseased
cells, TDP-43's over-expression leads to the NTD showing high propensity to aggregate. Contrary to this, in normal cells, normal levels of TDP-43 allow for folded NTD, preventing aggregates and polymers formation. More recently, this domain was found to have a
ubiquitin-like structure. It bears 27,6% of homology with
Ubiquitin-1 and a
β1-β2-
α1-β3-β4-β5-β6 + 2*
SO42- form. Ubiquitin-like domain are usually associated with a greater affinity for
RNA/
DNA. However, in the unique case of TDP-43, the Ubiquitin-like NTD binds directly to
ssDNA. This interaction permits the conformational equilibrium cited higher to shift towards non-aggregated forms. The domain spanning from [1,80] has a
solenoid-like structure which sterically impedes interactions between aggregation prone
C-term regions.
Mitochondrial localization signal There are six
mitochondrial localization signals to be accounted on TDP-43's
amino acid sequence, although only M1, M3, and M5 were shown to be essential for mitochondrial localization. Indeed, their ablation leads to a lessened mitochondrial localization. These localizing sequences are found on the following amino acids: M1: [35, 41], M2: [105, 112], M3: [146-150], M4: [228, 235], M5: [294, 300], M6: [228, 236].
Nuclear localization signal (NLS) The
nuclear localization signal (NLS) domain is located between residues 82 and 98 is of critical importance in
ALS, and such is witnessed by the depletion or the mutations (notably A90V) of this domain, which cause loss-of-function from nucleus and promote aggregating, two processes very likely to conduct to TDP-43's toxic gain of function.
Nuclear export signal (NES) The nuclear export signal is located between residues 239 and 251 sequence probably bears a role in TDP-43's shuttling function, and was recently found using a prediction algorithm.
Disordered glycin rich C-terminal domain (CTD) The Disordered
Glycin Rich C-terminal domain is located between residues 277 and 414. Much like 70 other
RNA binding proteins, TDP-43 bears a
Q/
N rich domain [344, 366] which resembles
yeast prion sequence. This sequence is called a Prion-Like Domain (PLD). PLDs are low complexity sequences that have been reported to mediate gene regulation via Liquid-Liquid Phase Transition (LLP) thus driving RNP granule assembly. It is here noted that LLP are reversible phenomenons of de-mixing a solution into two distinct liquid phases, hereby forming granules. Mutations within the TDP-43 proteins Glycine Rich Region (GRR) have recently been identified as associates that can contribute to various neurodegenerative diseases, with the most notable and common NDD being ALS, about 10% of the mutations causing familial ALS are accredited with the TDP-43 protein This CTD is often reported to play important role in pathogenic behavior of TDP-43:
RNPs granules could have a role in stress response, and thus, aging, or persistence stress could lead the LLPs to turn into irreversible Liquid Solid Phase separation, pathological aggregates notably found in
ALS neurons. The CTD's disorganized structure can turn into a full fledged
amyloid-like
beta-sheet rich structure, causing it to adopt
prion-like properties. Moreover, CTFs are a common marker in diseased
neurons and are argued to be of high toxicity. However, notice is to be taken that some points are not always consensual. Indeed, due to its
hydrophobic structure, TDP-43 can be hard to analyze, and parts of it remain somewhat vague. Precise sites of
phosphorylation,
methylation, or even binding are still a bit elusive. == Function ==