The CRISPR/Cas system evolved in nature as a means for bacteria to protect themselves from invading viruses and bacteriophages by inserting pieces of their DNA into the host genome. This allowed the adaptive immune system to respond accordingly on a subsequent infection. It was discovered in
Streptococcus pyogenes and later found across many other species.
Repeated sequences The discovery of clustered DNA repeats took place independently in three parts of the world. The first description of what would later be called CRISPR is
Ishino, et al. in 1987. They accidentally cloned part of a CRISPR sequence together with the iap
gene (isozyme conversion of alkaline phosphatase
) from
Escherichia coli. The organization of the repeats surprised them, as clustered repeated sequences are more typically arranged consecutively, without interspersing sequences. and used this property to design a typing method called
spoligotyping, which remains in use.
Mojica studied the function of repeats in the archaeal
genera Haloferax and
Haloarcula. Mojica's supervisor surmised that the clustered repeats had a role in segregating replicated DNA into daughter cells during cell division, because plasmids and chromosomes with identical repeat arrays could not coexist in
Haloferax volcanii. They noted the transcription of the interrupted repeats for the first time; CRISPR's first full characterization. By 2000, Mojica and his students, after an automated search of published genomes, identified interrupted repeats in 20 species of microbes as belonging to the same family. Because those sequences were interspaced, Mojica initially called these sequences "short regularly spaced repeats" (SRSR). In 2001, Mojica and Jansen, who were searching for additional interrupted repeats, proposed the acronym CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) to encompass the numerous acronyms then in use. In 2002, Tang, et al. showed evidence that CRISPR repeat regions in
Archaeoglobus fulgidus were transcribed into long RNA molecules subsequently processed into unit-length small RNAs, plus some longer forms of 2, 3, or more spacer-repeat units. In 2005,
Barrangou discovered that
S thermophilus, after iterative phage infection challenges, develops increased phage resistance due to the incorporation of additional CRISPR spacer sequences.
CRISPR-associated systems A major advance came with Jansen's observation that the prokaryote repeat cluster was accompanied by four homologous genes, CRISPR-associated systems,
Cas 1–4. The Cas proteins showed
helicase and
nuclease motifs, suggesting a role in the dynamic structure of the CRISPR loci. However, CRISPR's function remained enigmatic. In 2005, three independent research groups showed that some CRISPR spacers are derived from
phage DNA and
extrachromosomal DNA such as
plasmids. In effect, the spacers are fragments of DNA gathered from viruses that previously attacked the cell. The source of the spacers was a sign that the CRISPR-
cas system could have a role in adaptive immunity in
bacteria. All three studies proposing this idea were initially rejected by high-profile journals, but eventually appeared elsewhere. The first publication Experimental work by several groups revealed the basic mechanisms of CRISPR-Cas immunity. In 2007, the first experimental evidence that CRISPR was part of the adaptive immune system was published. Cascade, crRNA and a helicase/nuclease (
Cas3) were required to provide a bacterial host with immunity against infection by a
DNA virus. By designing an anti-virus CRISPR, they demonstrated that two orientations of the crRNA (sense/antisense) provided immunity, indicating that the crRNA guides were targeting
dsDNA. That year Marraffini and Sontheimer confirmed that a CRISPR sequence of
S. epidermidis targeted DNA and not RNA to prevent
conjugation. This finding was at odds with the proposed RNA-interference-like mechanism of CRISPR-Cas immunity, although a CRISPR-Cas system that targets foreign RNA was later found in
Pyrococcus furiosus. In 2012, Jinek et al. fused crRNA and
tracrRNA into a single-guide RNA, simplifying Cas9 targeting. Šikšnys, et al., reported that Cas9 from
S. thermophilus can target specific DNA by altering crRNA. In 2013, Cong, et al., as well as Mali, et al., applied CRISPR-Cas9 to edit human cell cultures. In 2015, Liang, et al., edited human tripronuclear
zygotes, achieving successful cleavage in 28 of 54
embryos.
Cas9 and
Cas9 with the position of DNA cleavage shown relative to their
PAM sequences in a zoom-in A simpler CRISPR system from
S. pyogenes uses
Cas9, an endonuclease functioning with two small RNAs—crRNA and tracrRNA—to form a four-component complex. In 2012,
Doudna and
Charpentier simplified this into a two-component system by fusing the RNAs into a
single-guide RNA, enabling Cas9 to target and cut specific DNA sequences—a breakthrough that earned them the 2020
Nobel Prize in Chemistry. Parallel work showed the
S. thermophilus Cas9 could be similarly reprogrammed by altering the crRNA sequence.
Cas12a Cas12a, a Class II Type V CRISPR-associated nuclease, was characterized in 2015 and was formerly known as Cpf1. This collateral cleavage property is exploited in the development of various diagnostic technologies. == Locus structure ==