Rate of gene duplication Comparisons of genomes demonstrate that gene duplications are common in most species investigated. This is indicated by variable copy numbers (
copy number variation) in the genome of humans or fruit flies. However, it has been difficult to measure the rate at which such duplications occur. Recent studies yielded a first direct estimate of the genome-wide rate of gene duplication in
Caenorhabditis elegans, the first multicellular eukaryote for which such as estimate became available. The gene duplication rate in
C. elegans is on the order of 10−7 duplications/gene/generation, that is, in a population of 10 million worms, one will have a gene duplication per generation. This rate is two orders of magnitude greater than the spontaneous rate of point mutation per nucleotide site in this species. Older (indirect) studies reported locus-specific duplication rates in bacteria,
Drosophila, and humans ranging from 10−3 to 10−7/gene/generation.
Genome duplication in cancer Genome duplication does not occur as a single event but as a continuous process during tumor progression, generating cells with different degrees of ploidy. More than 60% of the tumors analyzed showed multiple whole-genome duplication (WGD) events, suggesting an active evolutionary model within the tumor.
Neofunctionalization Gene duplications are an essential source of genetic novelty that can lead to evolutionary innovation. Duplication creates genetic redundancy, where the second copy of the gene is often free from
selective pressure—that is,
mutations of it have no deleterious effects to its host organism. If one copy of a gene experiences a mutation that affects its original function, the second copy can serve as a 'spare part' and continue to function correctly. Thus, duplicate genes accumulate mutations faster than a functional single-copy gene, over generations of organisms, and it is possible for one of the two copies to develop a new and different function. Some examples of such neofunctionalization is the apparent mutation of a duplicated digestive gene in a family of
ice fish into an antifreeze gene and duplication leading to a novel snake venom gene and the synthesis of 1 beta-hydroxytestosterone in pigs. Gene duplication is believed to play a major role in
evolution; this stance has been held by members of the scientific community for over 100 years.
Susumu Ohno was one of the most famous developers of this theory in his classic book
Evolution by gene duplication (1970). Ohno argued that gene duplication is the most important evolutionary force since the emergence of the
universal common ancestor. Major
genome duplication events can be quite common. It is believed that the entire
yeast genome underwent duplication about 100 million years ago.
Plants are the most prolific genome duplicators. For example,
wheat is hexaploid (a kind of
polyploid), meaning that it has six copies of its genome.
Subfunctionalization Another possible fate for duplicate genes is that both copies are equally free to accumulate degenerative mutations, so long as any defects are complemented by the other copy. This leads to a neutral "
subfunctionalization" (a process of
constructive neutral evolution) or DDC (duplication-degeneration-complementation) model, in which the functionality of the original gene is distributed among the two copies. Neither gene can be lost, as both now perform important non-redundant functions, but ultimately neither is able to achieve novel functionality. Subfunctionalization can occur through neutral processes in which mutations accumulate with no detrimental or beneficial effects. However, in some cases subfunctionalization can occur with clear adaptive benefits. If an ancestral gene is
pleiotropic and performs two functions, often neither one of these two functions can be changed without affecting the other function. In this way, partitioning the ancestral functions into two separate genes can allow for adaptive specialization of subfunctions, thereby providing an adaptive benefit.
Loss Often the resulting genomic variation leads to gene dosage dependent neurological disorders such as
Rett-like syndrome and
Pelizaeus–Merzbacher disease. Such detrimental mutations are likely to be lost from the population and will not be preserved or develop novel functions. However, many duplications are, in fact, not detrimental or beneficial, and these neutral sequences may be lost or may spread through the population through random fluctuations via
genetic drift. ==Identifying duplications in sequenced genomes==