Many repeat sequences are likely to be non-functional, decaying remnants of
Transposable elements, these have been labelled "
junk" or "
selfish" DNA. Nevertheless, occasionally some repeats may be
exapted for other functions.
Tandem repeats Tandem repeats are repeated sequences which are directly adjacent to each other in the genome. Tandem repeats may vary in the number of nucleotides comprising the repeated sequence, as well as the number of times the sequence repeats. When the repeating sequence is only 2–10 nucleotides long, the repeat is referred to as a short tandem repeat (STR) or
microsatellite. When the repeating sequence is 10–60 nucleotides long, the repeat is referred to as a
minisatellite. For minisatellites and microsatellites, the number of times the sequence repeats at a single locus can range from twice to hundreds of times. Tandem repeats have a wide variety of biological functions in the genome. For example, minisatellites are often hotspots of meiotic
homologous recombination in eukaryotic organisms. Recombination is when two homologous chromosomes align, break, and rejoin to swap pieces. Recombination is important as a source of genetic diversity, as a mechanism for repairing damaged DNA, and a necessary step in the appropriate segregation of chromosomes in meiosis. These repeats fold into highly organized
G quadruplex structures which protect the ends of chromosomal DNA from degradation. Repetitive elements are enriched in the middle of chromosomes as well.
Centromeres are the highly compact regions of chromosomes which join sister chromatids together and also allow the mitotic spindle to attach and separate sister chromatids during cell division. Centromeres are composed of a 177 base pair tandem repeat named the α-satellite repeat. Some repetitive sequences, such as those with structural roles discussed above, play roles necessary for proper biological functioning. Other tandem repeats have deleterious roles which drive diseases. Many other tandem repeats, however, have unknown or poorly understood functions.
Interspersed repeats Interspersed repeats are identical or similar DNA sequences which are found in different locations throughout the genome. Interspersed repeats are distinguished from tandem repeats in that the repeated sequences are not directly adjacent to each other but instead may be scattered among different chromosomes or far apart on the same chromosome. Most interspersed repeats are
transposable elements (TEs), mobile sequences which can be "cut and pasted" or "copied and pasted" into different places in the genome. TEs were originally called "jumping genes" for their ability to move, yet this term is somewhat misleading as not all TEs are discrete genes. Transposable elements that are transcribed into RNA, reverse-transcribed into DNA, then reintegrated into the genome are called
retrotransposons. Short interspersed nuclear elements (
SINEs) are typically 100-300 base pairs and no longer than 600 base pairs. Since uncontrolled propagation of TEs could wreak havoc on the genome, many regulatory mechanisms have evolved to silence their spread, including DNA methylation, histone modifications, non-coding RNAs (ncRNAs) including small interfering RNA (siRNA), chromatin remodelers, histone variants, and other epigenetic factors. Furthermore, TEs contribute to regulating the expression of other genes by serving as distal
enhancers and transcription factor binding sites. The prevalence of interspersed elements in the genome has garnered attention for more research on their origins and functions. Some specific interspersed elements have been characterized, such as the Alu repeat and LINE1.
Intrachromosomal recombination Homologous recombination between chromosomal repeated sequences in somatic cells of
Nicotiana tabacum was found to be increased by exposure to
mitomycin C, a bifunctional alkylating agent that
crosslinks DNA strands. This increase in recombination was attributed to increased intrachromosomal recombinational repair. ==Evolutionary emergence of meiosis==