Since the 1950s, with the emergence of various molecular techniques, the genome sizes of thousands of
eukaryotes have been analyzed, and these data are available in online databases for animals, plants, and fungi (see external links). Nuclear genome size is typically measured in eukaryotes using either
densitometric measurements of
Feulgen-stained nuclei (previously using specialized densitometers, now more commonly using computerized
image analysis) or
flow cytometry. In
prokaryotes,
pulsed field gel electrophoresis and complete
genome sequencing are the predominant methods of genome size determination. Nuclear genome sizes are well known to vary enormously among eukaryotic species. In animals they range more than 3,300-fold, and in land plants they differ by a factor of about 1,000.
Protist genomes have been reported to vary more than 300,000-fold in size, but the high end of this range (
Amoeba) has been called into question. Genome size in eukaryotes was thought to be proportional to the complexity of an organism but by the mid-20th century it became apparent that closely related species could differ substantially in the size of their genomes. This counter-intuitive observation gave rise to what became known as the "
C-value paradox." By the end of 1960s this "paradox" was resolved by the discovery of
repetitive DNA and the realization that much of the differences in genomes sizes was due to the presence or absence of large amounts of repetitive DNA. Further advances at that time indicated that most of the DNA in large genomes was non-functional
junk DNA and only a small fraction corresponded to functional DNA, including the functional parts of genes. Genome size correlates with a range of measurable characteristics at the
cell and organism levels, including cell size,
cell division rate, and, depending on the
taxon, body size,
metabolic rate, developmental rate,
organ complexity, geographical distribution, or
extinction risk. Although the latter contrasts with the previous view that no correlation exists for the eukaryotes, the observed nonlinear correlation for eukaryotes may reflect disproportionately fast-increasing
junk DNA in increasingly large eukaryotic genomes. Although sequenced genome data are practically biased toward small genomes, which may compromise the accuracy of the empirically derived correlation, and ultimate proof of the correlation remains to be obtained by sequencing some of the largest eukaryotic genomes, current data do not seem to rule out a possible correlation.
Human genome size of a human. It shows 22
homologous chromosomes, both the female (XX) and male (XY) versions of the
sex chromosome (bottom right), as well as the
mitochondrial genome (to scale at bottom left). The blue scale to the left of each chromosome pair (and the mitochondrial genome) shows its length in terms of millions of DNA
base pairs. In humans, the total female
diploid nuclear genome per cell extends for 6.37 Gigabase pairs (Gbp), is 208.23 cm long and weighs 6.51 picograms (pg). Male values are 6.27 Gbp, 205.00 cm, 6.41 pg. In
eukaryotes, in addition to
nuclear DNA, there is also
mitochondrial DNA (mtDNA) which encodes certain proteins used by the mitochondria. The mtDNA is usually relatively small in comparison to the nuclear DNA. For example, the
human mitochondrial DNA forms closed circular molecules, each of which contains 16,569 DNA base pairs, with each such molecule normally containing a full set of the mitochondrial genes. Each human mitochondrion contains, on average, approximately 5 such mtDNA molecules. ==Genome reduction==