DNA replication, like all biological polymerization processes, proceeds in three enzymatically catalyzed and coordinated steps: initiation, elongation and termination.
Initiation For a
cell to divide, it must first replicate its DNA. DNA replication is an all-or-none process; once replication begins, it proceeds to completion. Once replication is complete, it does not occur again in the same cell cycle. This is made possible by the division of initiation of the
pre-replication complex.
Pre-replication complex In late
mitosis and early
G1 phase, a large complex of initiator proteins assembles into the pre-replication complex at particular points in the DNA, known as "
origins". Sequences used by initiator proteins tend to be "AT-rich" (rich in adenine and thymine bases), because A-T base pairs have two hydrogen bonds (rather than the three formed in a C-G pair) and thus are easier to strand-separate. In eukaryotes, the origin recognition complex (ORC) catalyzes the assembly of initiator proteins into the pre-replication complex. In addition, a recent report suggests that budding yeast ORC dimerizes in a cell cycle dependent manner to control licensing. In turn, the process of ORC dimerization is mediated by a cell cycle-dependent Noc3p dimerization cycle in vivo, and this role of Noc3p is separable from its role in ribosome biogenesis. An essential Noc3p dimerization cycle mediates ORC double-hexamer formation in replication licensing and Noc3p are continuously bound to the chromatin throughout the cell cycle.
Cdc6 and
Cdt1 then associate with the bound origin recognition complex at the origin in order to form a larger complex necessary to load the
Mcm complex onto the DNA. In eukaryotes, the Mcm complex is the helicase that will split the DNA helix at the replication forks and origins. The Mcm complex is recruited at late G1 phase and loaded by the ORC-Cdc6-Cdt1 complex onto the DNA via ATP-dependent protein remodeling. The loading of the MCM complex onto the origin DNA marks the completion of pre-replication complex formation. If environmental conditions are right in late G1 phase, the G1 and G1/S
cyclin-
Cdk complexes are activated, which stimulate expression of genes that encode components of the DNA synthetic machinery. G1/S-Cdk activation also promotes the expression and activation of S-Cdk complexes, which may play a role in activating replication origins depending on species and cell type. Control of these Cdks vary depending on cell type and stage of development. This regulation is best understood in
budding yeast, where the S cyclins
Clb5 and
Clb6 are primarily responsible for DNA replication. Clb5,6-Cdk1 complexes directly trigger the activation of replication origins and are therefore required throughout S phase to directly activate each origin. The TOPRIM fold contains an α/β core with four conserved strands in a
Rossmann-like topology. This structure is also found in the catalytic domains of
topoisomerase Ia, topoisomerase II, the OLD-family nucleases and DNA repair proteins related to the RecR protein. The primase used by archaea and eukaryotes, in contrast, contains a highly derived version of the
RNA recognition motif (RRM). This primase is structurally similar to many viral RNA-dependent RNA polymerases, reverse transcriptases, cyclic nucleotide generating cyclases and DNA polymerases of the A/B/Y families that are involved in DNA replication and repair. In eukaryotic replication, the primase forms a complex with Pol α. Multiple DNA polymerases take on different roles in the DNA replication process. In
E. coli,
DNA Pol III is the polymerase enzyme primarily responsible for DNA replication. It assembles into a replication complex at the replication fork that exhibits extremely high processivity, remaining intact for the entire replication cycle. In contrast,
DNA Pol I is the enzyme responsible for replacing RNA primers with DNA. DNA Pol I has a 5′ to 3′
exonuclease activity in addition to its polymerase activity, and uses its exonuclease activity to degrade the RNA primers ahead of it as it extends the DNA strand behind it, in a process called
nick translation. Pol I is much less processive than Pol III because its primary function in DNA replication is to create many short DNA regions rather than a few very long regions. In
eukaryotes, the low-processivity enzyme, Pol α, helps to initiate replication because it forms a complex with primase. In eukaryotes, leading strand synthesis is thought to be conducted by Pol ε; however, this view has recently been challenged, suggesting a role for Pol δ. Primer removal is completed Pol δ while repair of DNA during replication is completed by Pol ε. As DNA synthesis continues, the original DNA strands continue to unwind on each side of the bubble, forming a
replication fork with two prongs. In bacteria, which have a single origin of replication on their circular chromosome, this process creates a "
theta structure" (resembling the Greek letter theta: θ). In contrast, eukaryotes have longer linear chromosomes and initiate replication at multiple origins within these.
Replication fork The replication fork is a structure that forms within the long helical DNA during DNA replication. It is produced by enzymes called helicases that break the hydrogen bonds that hold the DNA strands together in a helix. The resulting structure has two branching "prongs", each one made up of a single strand of DNA. These two strands serve as the template for the leading and lagging strands, which will be created as DNA polymerase matches complementary nucleotides to the templates; the templates may be properly referred to as the leading strand template and the lagging strand template. '''DNA is read by DNA polymerase in the 3′ to 5′ direction, meaning the new strand is synthesized in the 5' to 3' direction.''' Since the leading and lagging strand templates are oriented in opposite directions at the replication fork, a major issue is how to achieve synthesis of new lagging strand DNA, whose direction of synthesis is opposite to the direction of the growing replication fork.
Leading strand The leading strand is the strand of new DNA which is synthesized in the same direction as the growing replication fork. This sort of DNA replication is continuous.
Lagging strand The lagging strand is the strand of new DNA whose direction of synthesis is opposite to the direction of the growing replication fork, although the chemical reaction is exactly the same. Because of its orientation, replication of the lagging strand is more complicated as compared to that of the leading strand. As a consequence, the DNA polymerase on this strand is seen to "lag behind" the other strand. The lagging strand is synthesized in short segments. On the lagging strand
template, a
primase "reads" the template DNA and initiates synthesis of a short complementary
RNA primer. A DNA polymerase extends the primed segments, forming
Okazaki fragments. The RNA primers are then removed by another DNA polymerase and replaced with DNA, and the fragments of DNA are joined by
DNA ligase. As helicase unwinds DNA at the replication fork, the DNA ahead is forced to rotate. This process results in a build-up of twists in the DNA ahead. This build-up creates a torsional load that would eventually stop the replication fork. Topoisomerases are enzymes that temporarily break the strands of DNA, relieving the tension caused by unwinding the two strands of the DNA helix; topoisomerases (including
DNA gyrase) achieve this by adding negative
supercoils to the DNA helix. Bare single-stranded DNA tends to fold back on itself forming
secondary structures; these structures can interfere with the movement of DNA polymerase. To prevent this,
single-strand binding proteins bind to the DNA until a second strand is synthesized, preventing secondary structure formation. Double-stranded DNA is coiled around
histones that play an important role in regulating gene expression so the replicated DNA must be coiled around histones at the same places as the original DNA. To ensure this, histone
chaperones disassemble the
chromatin before it is replicated and replace the histones in the correct place. Some steps in this reassembly are somewhat speculative. Clamp proteins act as a sliding clamp on DNA, allowing the DNA polymerase to bind to its template and aid in processivity. The inner face of the clamp enables DNA to be threaded through it. Once the polymerase reaches the end of the template or detects double-stranded DNA, the sliding clamp undergoes a conformational change that releases the DNA polymerase. Clamp-loading proteins are used to initially load the clamp, recognizing the junction between template and RNA primers.
In vitro single-molecule experiments (using
optical tweezers and
magnetic tweezers) have found synergetic interactions between the replisome enzymes (
helicase,
polymerase, and
Single-strand DNA-binding protein) and with the DNA replication fork enhancing
DNA-unwinding and DNA-replication. In 1980 Bruce Alberts proposed the "trombone model" of DNA replication to visualize the coordination between the leading and lagging strand of DNA synthesis. In this model the replication machinery is clustered while the leading strand synthesis flows through and the lagging strand expands in and out like the slide on a trombone. Peter Meister et al. observed directly replication sites in
budding yeast by monitoring
green fluorescent protein (GFP)-tagged DNA polymerases α. They detected DNA replication of pairs of the tagged loci spaced apart symmetrically from a replication origin and found that the distance between the pairs decreased markedly by time. This finding suggests that the mechanism of DNA replication aligns with the trombone model with a cluster of replisomes loaded on the origin of replication, proceeding in both directions. Meister's finding is the first direct evidence of trombone model. Subsequent research has shown that DNA helicases form dimers in many eukaryotic cells and bacterial replisomes stay in a single intranuclear location during DNA synthesis. Replication Factories Disentangle Sister Chromatids. The disentanglement is essential for distributing the chromatids into daughter cells after DNA replication. Because sister chromatids after DNA replication hold each other by
cohesin rings, there is the only chance for the disentanglement in DNA replication. Fixing of replication machineries as replication factories can improve the success rate of DNA replication. If replication forks move freely in chromosomes, catenation of nuclei is aggravated and impedes mitotic segregation.
Termination Eukaryotes initiate DNA replication at multiple points in the chromosome, so replication forks meet and terminate at many points in the chromosome. Because eukaryotes have linear chromosomes, DNA replication is unable to reach the very end of the chromosomes. Due to this problem, DNA is lost in each replication cycle from the end of the chromosome.
Telomeres are regions of repetitive DNA close to the ends and help prevent loss of genes due to this shortening. Shortening of the telomeres is a normal process in
somatic cells. This shortens the telomeres of the daughter DNA chromosome. As a result, cells can only divide a certain number of times before the DNA loss prevents further division. (This is known as the
Hayflick limit.) Within the
germ cell line, which passes DNA to the next generation,
telomerase extends the repetitive sequences of the telomere region to prevent degradation. Telomerase can become mistakenly active in somatic cells, sometimes leading to
cancer formation. Increased telomerase activity is one of the hallmarks of cancer. Termination requires that the progress of the DNA replication fork must stop or be blocked. Termination at a specific locus, when it occurs, involves the interaction between two components: (1) a termination site sequence in the DNA, and (2) a protein which binds to this sequence to physically stop DNA replication. In various bacterial species, this is named the DNA replication terminus site-binding protein, or
Ter protein. Because bacteria have circular chromosomes, termination of replication occurs when the two replication forks meet each other on the opposite end of the parental chromosome.
E. coli regulates this process through the use of termination sequences that, when bound by the
Tus protein, enable only one direction of replication fork to pass through. As a result, the replication forks are constrained to always meet within the termination region of the chromosome. == Regulation ==