Eukaryotic SMCs Eukaryotes have at least six SMC proteins in individual organisms, and they form three distinct heterodimers with specialized functions: •
SMC1-SMC3: A pair of SMC1 and SMC3 constitutes the core subunits of the
cohesin complexes involved in sister chromatid cohesion. •
SMC2-SMC4: A pair of SMC2 and SMC4 acts as the core of the
condensin complexes implicated in
chromosome condensation. •
SMC5-SMC6: A pair of SMC5 and SMC6 functions as part of a yet-to-be-named complex implicated in
DNA repair and checkpoint responses. The pairings of SMC proteins in eukaryotes, SMC1-SMC3, SMC2–SMC4, and SMC5–SMC6, are highly specific and invariant; no exceptions to these combinations have been reported to date. Sequence comparisons reveal that SMC1 and SMC4, as well as SMC2 and SMC3, share a high degree of similarity, while SMC5 and SMC6 form a more distinct clade (Figure 1). It is hypothesized that the
last eukaryotic common ancestor (LECA) possessed all six SMC proteins. While SMC1–4 are conserved in all known eukaryotic species, some lineages (such as the
ciliate Tetrahymena thermophila) have lost SMC5 and SMC6 during evolution, suggesting that the SMC5/6 complex may not be strictly essential for eukaryotic cell viability. In addition to the six subtypes, some organisms have variants of SMC proteins. For instance, mammals have a
meiosis-specific variant of SMC1, known as SMC1β. The nematode
Caenorhabditis elegans has an SMC4-variant that has a specialized role in
dosage compensation. The following table shows the SMC proteins names for several
model organisms and vertebrates:
Prokaryotic SMCs The evolutionary origin of SMC proteins is ancient, and homologs are widely conserved in both
bacteria and
archaea. •
SMC (canonical type): Many bacteria (e.g.,
Bacillus subtilis) and archaea possess canonical SMC proteins that closely resemble their eukaryotic counterparts. These bacterial and archaeal SMCs form homodimers and associate with regulatory subunits to form
condensin-like complexes,
SMC-ScpAB. It is hypothesized that the eukaryotic ancestor (most likely the
Asgard archaeon) possessed two types of SMC proteins: a canonical SMC (SMCc) and a non-canonical SMC (SMCnc). Gene duplications of these two ancestral types are thought to have given rise to the six SMC subfamilies present in the
last eukaryotic common ancestor (LECA): SMC1–4 evolved from the canonical lineage, while SMC5 and SMC6 evolved from the non-canonical lineage (Figure 1). •
MukB: In some
γ-proteobacteria, including
Escherichia coli, SMC function is carried out by a distantly related protein called MukB. MukB also forms homodimers and, together with regulatory subunits, assembles into a
MukBEF complex, which performs
condensin-like functions in organizing bacterial chromosomes. •
MksB/JetC/EptC: A third type of prokaryotic SMC protein, known as MksB, has been identified in certain bacterial species. Like MukB, MksB forms a distantly-related
condensin-like complex, MksBEF. More recently, a variant complex called
MksBEFG, which includes a nuclease subunit MksG, has been shown to function in plasmid defense. In other bacterial lineages, orthologous systems have been identified, including JetABCD and EptABCD. These systems are collectively referred to as the
Wadjet family of SMC-like complexes.
SMC-related proteins In a broader sense, several proteins with structural similarities to SMC are considered members of the SMC superfamily. • In eukaryotes,
Rad50 is a well-known SMC-related protein involved in the repair of DNA double-strand breaks. • In bacteria, several proteins related to DNA repair also belong to the extended SMC family, including
SbcC,
RecF, and
RecN. • In archaea, a subfamily known as Archaea-specific SMC-related proteins (
ASRPs) has been identified. Previously described archaeal proteins such as Sph1/2 and ClsN (also known as
coalescin) are now considered members of this ASRP subfamily. ==Subunit composition of SMC protein complexes==