Biologists and taxonomists have made many attempts to define species, beginning from
morphology and moving towards
genetics. Early taxonomists such as Linnaeus had no option but to describe what they saw: this was later formalised as the typological or morphological species concept.
Ernst Mayr emphasised reproductive isolation, but this, like other species concepts, can be hard or even impossible to test for groups of organisms separated in space or time. Later biologists have tried to refine Mayr's definition with the recognition and cohesion concepts, among others. Many of the concepts are quite similar or overlap, so they are not easy to count: the biologist R. L. Mayden recorded about 24 concepts, and the philosopher of science John Wilkins counted 26. Wilkins further grouped the species concepts into seven basic kinds of concepts: (1)
agamospecies for asexual organisms (2) biospecies for reproductively isolated sexual organisms (3)
ecospecies based on ecological niches (4) evolutionary species based on lineage (5) genetic species based on gene pool (6) morphospecies based on form or phenotype and (7) taxonomic species, a species as determined by a taxonomist.
Typological or morphological species s share the same coloration, unmistakably identifying the
morphospecies. A typological species is a group of organisms in which individuals conform to certain fixed properties (a type, which may be defined by a chosen 'nominal species'), so that even pre-literate people often recognise the same taxon as do modern taxonomists. Modern-day field guides and identification websites such as
iNaturalist use this concept. The clusters of variations or phenotypes within specimens (such as longer or shorter tails) would differentiate the species. This method was used as a "classical" method of determining species, such as with Linnaeus, early in evolutionary theory. However, different phenotypes are not necessarily different species (e.g. a four-winged
Drosophila born to a two-winged mother is not a different species). Species named in this manner are called
morphospecies. In the 1970s,
Robert R. Sokal, Theodore J. Crovello and
Peter Sneath proposed a variation on the morphological species concept, a
phenetic species, defined as a set of organisms with a similar
phenotype to each other, but a different phenotype from other sets of organisms. It differs from the morphological species concept in including a numerical measure of distance or similarity to cluster entities based on multivariate comparisons of a reasonably large number of phenotypic traits.
Recognition and cohesion species A mate-recognition species is a group of sexually reproducing organisms that recognise one another as potential mates. Expanding on this to allow for post-mating isolation, a cohesion species is the most inclusive population of individuals having the potential for phenotypic cohesion through intrinsic cohesion mechanisms; no matter whether populations can hybridise successfully, they are still distinct cohesion species if the amount of hybridisation is insufficient to completely mix their respective
gene pools. A further development of the recognition concept is provided by the biosemiotic concept of species.
Genetic similarity and barcode species for the
cytochrome c oxidase enzyme is used to distinguish species in the
Barcode of Life Data Systems database.
Single locus (barcoding) In
microbiology, genes can move freely even between distantly related bacteria, possibly extending to the whole bacterial domain. As a rule of thumb, microbiologists had assumed that members of
Bacteria or
Archaea with
16S ribosomal RNA gene sequences more similar than 97% to each other need to be checked by
DNA–DNA hybridisation to decide if they belong to the same species. This concept was narrowed in 2006 to a similarity of 98.7%. The 16S sequence is an example of a single locus which is simple enough for non-specialists to apply and, in most cases, sufficient to distinguish species. Using a single easy-to-use locus to distinguish taxa is called
DNA barcoding. One of the barcodes for eukaryotes is a region of mitochondrial DNA within the gene for
cytochrome c oxidase. A database,
Barcode of Life Data System, contains DNA barcode sequences from over 190,000 species. However, scientists such as Rob DeSalle have expressed concern that classical taxonomy and DNA barcoding, which they consider a misnomer, need to be reconciled, as they delimit species differently.
Genetic introgression mediated by
endosymbionts and other vectors can further make barcodes ineffective in the identification of species. A singular locus can be a good proxy of time of divergence assuming the chosen locus evolved like most of the rest of the genome. This assumption can be broken by horizontal gene transfer affecting the locus itself.
Rapid modes of evolution separating biological species (speciation) over a short timespan would also decouple the species concept from time itself. Among bacteria, there are several cases where very different genomes share 99.9% 16S identity.
Multilocus comparison Using multiple (usually fewer than 10) loci for comparison provides more phylogenetic signal compared to comparing versions of the same loci as more mutations can be captured. As a result, it provides improved taxonomic resolution compared to single-locus comparison, giving results more similar to the expensive "gold standard" of whole genome comparison at a small increase in cost. Even when whole genomes are available, there are good reasons to only compare select genes: many genes are not universally found in all genomes, so they provide limited taxonomic signal while still adding to the computational cost of comparison. In situations like this, tens to hundreds of loci may be extracted from each genome and used together. As with comparisons with fewer loci, marker genes used for this purpose should be genes with low rates of horizontal transfer and gene duplication, few known instances of horizontal transfer, and high occurrence in the sampled genomes. With prokaryotes, marker genes can be used to delimit taxa down to the genus level, with whole-genome comparison reserved to separate species from each other.
Whole genome comparison The surefire way to capture all gene flow among populations is to compare their entire genomes. The
average nucleotide identity (ANI) method quantifies
genetic distance between entire
genomes, using regions of about 10,000
base pairs. With enough data from genomes of one genus, algorithms can be used to categorize species, as for
Pseudomonas avellanae in 2013, and for all sequenced bacteria and archaea since 2020. Observed ANI values among prokaryotic sequences appear to have an "ANI gap" at 85–95%, suggesting that a genetic boundary suitable for defining a species concept is present.
Phylogenetic or cladistic species A phylogenetic or
cladistic species is "the smallest aggregation of populations (sexual) or lineages (asexual) diagnosable by a unique combination of character states in comparable individuals (semaphoronts)". The empirical basis – observed character states – provides the evidence to support hypotheses about evolutionarily divergent lineages that have maintained their hereditary integrity through time and space. Molecular markers may be used to determine diagnostic genetic differences in the nuclear or
mitochondrial DNA of various species. For example, in a study done on
fungi, studying the nucleotide characters using cladistic species produced the most accurate results in recognising the numerous fungi species of all the concepts studied. Versions of the phylogenetic species concept that emphasise monophyly or diagnosability may lead to splitting of existing species, for example in
Bovidae, by recognising old
subspecies as species, despite the fact that there are no reproductive barriers, and populations may intergrade morphologically. Others have called this approach
taxonomic inflation, diluting the species concept and making taxonomy unstable. Yet others defend this approach, considering "taxonomic inflation" pejorative and labelling the opposing view as "taxonomic conservatism"; claiming it is politically expedient to split species and recognise smaller populations at the species level, because this means they can more easily be included as
endangered in the
IUCN red list and can attract conservation legislation and funding. Unlike the biological species concept, a cladistic species does not rely on reproductive isolation – its criteria are independent of processes that are integral in other concepts. This differs from the biological species concept in embodying persistence over time. Wiley and Mayden stated that they see the evolutionary species concept as "identical" to
Willi Hennig's species-as-lineages concept, and asserted that the biological species concept, "the several versions" of the phylogenetic species concept, and the idea that species are of the same kind as higher taxa are not suitable for biodiversity studies (with the intention of estimating the number of species accurately). They further suggested that the concept works for both asexual and sexually-reproducing species. A version of the concept is
Kevin de Queiroz's "General Lineage Concept of Species".
Ecological species An ecological species is a set of organisms adapted to a particular set of resources, called a niche, in the environment. According to this concept, populations form the discrete phenetic clusters that we recognise as species because the ecological and evolutionary processes controlling how resources are divided up tend to produce those clusters.
Genetic species A genetic species as defined by Robert Baker and Robert Bradley is a set of genetically isolated interbreeding populations. This is similar to Mayr's Biological Species Concept, but stresses genetic rather than reproductive isolation. In the 21st century, a genetic species could be established by comparing DNA sequences. Earlier, other methods were available, such as comparing
karyotypes (sets of
chromosomes) and
allozymes (
enzyme variants).
Evolutionarily significant unit An
evolutionarily significant unit (ESU) or "wildlife species" is a population of organisms considered distinct for purposes of conservation.
Chronospecies is defined in a single lineage (solid line) whose
morphology changes with time. At some point, palaeontologists judge that enough change has occurred that two species (A and B), separated in time and anatomy, once existed. In
palaeontology, with only
comparative anatomy (morphology) and
histology from
fossils as evidence, the concept of a
chronospecies can be applied. During
anagenesis (evolution, not necessarily involving branching), some palaeontologists seek to identify a sequence of species, each one derived from the
phyletically extinct one before through continuous, slow and more or less uniform change. In such a time sequence, some palaeontologists assess how much change is required for a morphologically distinct form to be considered a different species from its ancestors.
Viral quasispecies Viruses have enormous populations, are doubtfully living since they consist of little more than a string of DNA or RNA in a protein coat, and mutate rapidly. All of these factors make conventional species concepts largely inapplicable. A viral
quasispecies is a group of genotypes related by similar mutations, competing within a highly
mutagenic environment, and hence governed by a
mutation–selection balance. It is predicted that a viral quasispecies at a low but
evolutionarily neutral and highly connected (that is, flat) region in the
fitness landscape will outcompete a quasispecies located at a higher but narrower fitness peak in which the surrounding mutants are unfit, "the quasispecies effect" or the "survival of the flattest". There is no suggestion that a viral quasispecies resembles a traditional biological species. The
International Committee on Taxonomy of Viruses has since 1962 developed a universal taxonomic scheme for viruses; this has stabilised viral taxonomy.
Mayr's biological species concept proposed the widely used Biological Species Concept of reproductive isolation in 1942. Most modern textbooks make use of
Ernst Mayr's 1942 definition, known as the
biological species concept, as a basis for further discussion on the definition of species. It is also called a reproductive or isolation concept. This defines a species as It has been argued that this definition is a natural consequence of the effect of sexual reproduction on the dynamics of natural selection. Mayr's use of the adjective "potentially" has been a point of debate; some interpretations exclude unusual or artificial matings that occur only in captivity, or that involve animals capable of mating but that do not normally do so in the wild. extends Mayr's biological species definition to also include agamous (asexual) organisms. It extends the concept of
interbreeding, the "glue" binding separate individuals into a species, to the concept of
renewal cohesion, the "glue" that works both for sexual and agamous organisms.
Renewal cohesion is "the restriction of genetic diversity of a group of related individuals as a result of their recent origin from a common ancestor". This limitation on genetic diversity results from the effects of
genetic drift explained by
coalescent theory, which is sufficient to restrict genetic diversity even in the abscence of selection. Reproductive isolation as "anti-interbreeding" and a prerequisite for speciation, is extended to
renewal isolation, inclusive of both sexual and agamous organisms. A
renewing species "is the most inclusive group of individuals, which are united by renewal cohesion, i.e. 1) are reproduced in a specific ecological niche and 2) are genetically similar due to recent origin of their genomes (as whole uninterrupted DNA stretches or in parts) from genomes of ancestral individuals. [...] Applied to sexuals, the concept of renewal species is reduced to Mayr’s biological concept".Apart from the connection with Mayr's biological species concept, renewing species borrows its mechanism of cohesion from Templeton's "cohesion via genetic drift" (one of
cohesion species mechanisms described in ). ==The species problem==