Gene duplication Genome duplication occurs through various mechanisms mentioned below
Whole genome duplication Similar to gene duplication, whole genome duplication is the process by which an organism's entire genetic information is copied, once or multiple times which is known as
polyploidy. This may provide an evolutionary benefit to the organism by supplying it with multiple copies of a gene thus creating a greater possibility of functional and selectively favored genes. However, tests for enhanced rate and innovation in teleost fishes with duplicated genomes compared with their close relative holostean fishes (without duplicated genomes) found that there was little difference between them for the first 150 million years of their evolution. In 1997, Wolfe & Shields gave evidence for an ancient duplication of the
Saccharomyces cerevisiae (
Yeast) genome. It was initially noted that this yeast genome contained many individual gene duplications. Wolfe & Shields hypothesized that this was actually the result of an entire genome duplication in the yeast's distant evolutionary history. They found 32 pairs of homologous chromosomal regions, accounting for over half of the yeast's genome. They also noted that although
homologs were present, they were often located on different
chromosomes. Based on these observations, they determined that
Saccharomyces cerevisiae underwent a whole genome duplication soon after its evolutionary split from
Kluyveromyces, a genus of ascomycetous yeasts. Over time, many of the duplicate genes were deleted and rendered non-functional. A number of chromosomal rearrangements broke the original duplicate chromosomes into the current manifestation of homologous chromosomal regions. This idea was further solidified in looking at the genome of yeast's close relative
Ashbya gossypii. Whole genome duplication is common in fungi as well as plant species. An example of extreme genome duplication is represented by the Common Cordgrass (
Spartina anglica) which is a dodecaploid, meaning that it contains 12 sets of chromosomes, in stark contrast to the human diploid structure in which each individual has only two sets of 23 chromosomes.
Transposable elements Transposable elements are regions of DNA that can be inserted into the genetic code through one of two mechanisms. These mechanisms work similarly to "cut-and-paste" and "copy-and-paste" functionalities in word processing programs. The "cut-and-paste" mechanism works by excising DNA from one place in the genome and inserting itself into another location in the code. The "copy-and-paste" mechanism works by making a genetic copy or copies of a specific region of DNA and inserting these copies elsewhere in the code. The most common transposable element in the
human genome is the
Alu sequence, which is present in the genome over one million times.
Mutation Spontaneous
mutations often occur which can cause various changes in the genome. A mutation in a
promoter region, enhancer region or
transcription factor binding region can also result in either a loss of function, or an up or downregulation in the
transcription of the gene targeted by these regulatory elements. Mutations are constantly occurring in an organism's genome and can cause either a negative effect, positive effect or neutral effect (no effect at all).
Pseudogenes Often a result of spontaneous
mutation,
pseudogenes are dysfunctional genes derived from previously functional gene relatives. There are many mechanisms by which a functional gene can become a pseudogene including the deletion or insertion of one or multiple
nucleotides. This can result in a shift of
reading frame, causing the gene to no longer code for the expected protein, introduce a premature
stop codon or a mutation in the
promoter region. Often cited examples of pseudogenes within the human genome include the once functional
olfactory gene families. Over time, many olfactory genes in the human genome became pseudogenes and were no longer able to produce functional proteins, explaining the poor sense of smell humans possess in comparison to their mammalian relatives. Similarly, bacterial pseudogenes commonly arise from
adaptation of free-living bacteria to
parasitic lifestyles, so that many metabolic genes become superfluous as these species become adapted to their host. Once a parasite obtains nutrients (such as
amino acids or
vitamins) from its host it has no need to produce these nutrients itself and often loses the genes to make them.
Exon shuffling Exon shuffling is a mechanism by which new genes are created. This can occur when two or more
exons from different genes are combined or when exons are duplicated. Exon shuffling results in new genes by altering the current intron-exon structure. This can occur by any of the following processes:
transposon mediated shuffling, sexual recombination or non-homologous recombination (also called
illegitimate recombination). Exon shuffling may introduce new genes into the genome that can be either selected against and deleted or selectively favored and conserved.
Genome reduction and gene loss Many species exhibit genome reduction when subsets of their genes are not needed anymore. This typically happens when organisms adapt to a parasitic life style, e.g. when their nutrients are supplied by a host. As a consequence, they lose the genes needed to produce these nutrients. In many cases, there are both free living and parasitic species that can be compared and their lost genes identified. Good examples are the genomes of
Mycobacterium tuberculosis and
Mycobacterium leprae, the latter of which has a dramatically reduced genome (see figure under
pseudogenes above). Another beautiful example are
endosymbiont species. For instance,
Polynucleobacter necessarius was first described as a cytoplasmic endosymbiont of the ciliate
Euplotes aediculatus. The latter species dies soon after being cured of the endosymbiont. In the few cases in which
P. necessarius is not present, a different and rarer bacterium apparently supplies the same function. No attempt to grow symbiotic
P. necessarius outside their hosts has yet been successful, strongly suggesting that the relationship is obligate for both partners. Yet, closely related free-living relatives of P. necessarius have been identified. The endosymbionts have a significantly reduced genome when compared to their free-living relatives (1.56 Mbp vs. 2.16 Mbp). == Speciation ==