The canonical Cr2/CD21 gene of subprimate mammals produces two types of complement receptor (CR1, ca. 200 kDa; CR2, ca. 145 kDa) via alternative mRNA splicing. The murine Cr2 gene contains 25 exons; a common first exon is spliced to exon 2 and to exon 9 in transcripts encoding CR1 and CR2, respectively. A transcript with an
open reading frame of 4,224 nucleotides encodes the long isoform, CR1; this is predicted to be a protein of 1,408 amino acids that includes 21 short consensus repeats (SCR) of ca. 60 amino acids each, plus transmembrane and cytoplasmic regions. Isoform CR2 (1,032 amino acids) is encoded by a shorter transcript (3,096 coding nucleotides) that lacks exons 2–8 encoding SCR1-6. CR1 and CR2 on murine B cells form complexes with a co-accessory activation complex containing CD19, CD81, and the fragilis/Ifitm (murine equivalents of LEU13) proteins. The
complement receptor 2 (CR2) gene of primates produces only the smaller isoform, CR2; primate CR1, which recapitulates many of the structural domains and presumed functions of Cr2-derived CR1 in subprimates, is encoded by a distinct CR1 gene (apparently derived from the gene Crry of subprimates). Isoforms CR1 and CR2 derived from the Cr2 gene possess the same C-terminal sequence, such that association with and activation through CD19 should be equivalent. CR1 can bind to C4b and C3b complexes, whereas CR2 (murine and human) binds to C3dg-bound complexes. CR1, a surface protein produced primarily by
follicular dendritic cells, appears to be critical for generation of appropriately activated B cells of the germinal centre and for mature antibody responses to bacterial infection. The most common allelic variant of the human CR1 gene (CR1*1) is composed of 38
exons spanning 133kb encoding a
protein of 2,039
amino acids with a predicted molecular weight of 220 kDa. Large
insertions and
deletions have given rise to four structurally variant
genes and some alleles may extend up to 160 kb and 9 additional exons. The
transcription start site has been mapped to 111 bp upstream of the
translation initiation codon ATG and there is another possible start site 29 bp further upstream. The
promoter region lacks a distinct
TATA box sequence. The gene is expressed principally on
erythrocytes,
monocytes,
neutrophils and
B cells but is also present on some
T lymphocytes,
mast cells and
glomerular podocytes. == Structure ==