MarketBaltimore classification
Company Profile

Baltimore classification

Baltimore classification is a system used to classify viruses by their routes of transferring genetic information from the genome to messenger RNA (mRNA). Seven Baltimore groups, or classes, exist and are numbered in Roman numerals from I to VII. Groups are defined by whether the viral genome is made of deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), whether the genome is single- or double-stranded, whether a single-stranded RNA genome is positive-sense (+) or negative-sense (–), and whether the virus makes DNA from RNA. Viruses within Baltimore groups typically have the same replication method, but other characteristics such as virion structure are not directly related to Baltimore classification.

Overview
Baltimore classification groups viruses together by their routes of transferring genetic information from the genome to messenger RNA (mRNA). Characteristics that determine the Baltimore group of a virus include whether the genome is made of deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), the strandedness of the genome, which can be either single- or double-stranded, the sense of a single-stranded RNA genome, which can be either positive (+) or negative (–), and whether the virus synthesizes DNA from RNA (reverse transcription (RT)). There are seven Baltimore groups or classes, numbered with Roman numerals, listed hereafter. • Group I: double-stranded DNA viruses (dsDNA) • Group II: single-stranded DNA viruses (ssDNA) • Group III: double-stranded RNA viruses (dsRNA) • Group IV: positive-sense single-stranded RNA viruses (+ssRNA) • Group V: negative-sense single-stranded RNA viruses (–ssRNA) • Group VI: single-stranded RNA viruses with a DNA intermediate in their life cycle (ssRNA-RT) • Group VII: double-stranded DNA viruses with an RNA intermediate in their life cycle (dsDNA-RT) Baltimore classification is chiefly based on the path toward transcription of the viral genome, and viruses within each group usually share the manner by which the mRNA synthesis occurs. While not the direct focus of Baltimore classification, groups are organized in such a manner that viruses in each group also typically have the same mechanisms of replicating the viral genome. Structural characteristics of the extracellular virus particle, called a virion, such as the shape of the viral capsid, which stores the genome, and the presence of a viral envelope, a lipid membrane that surrounds the capsid, have no direct relation to Baltimore groups, nor do the groups necessarily show genetic relation based on evolutionary history. ==Baltimore groups==
Baltimore groups
Group I: double-stranded DNA viruses es are dsDNA viruses. The first Baltimore group contains viruses that have a double-stranded DNA (dsDNA) genome. All dsDNA viruses have their mRNA synthesized in a three-step process. First, a transcription preinitiation complex binds to the DNA upstream of the transcription site, recruiting a host RNA polymerase enzyme. Once the RNA polymerase is recruited, it uses the negative-sense strand as a template for synthesizing mRNA strands, which are positive sense. The RNA polymerase then terminates transcription upon reaching a specific signal, such as a polyadenylation site. dsDNA viruses make use of several mechanisms to replicate their genome. A widely used method is bidirectional replication, in which two replication forks are established at a replication origin site and move in opposite directions on a DNA molecule. A rolling circle mechanism that produces linear strands while progressing in a loop around a circular genome is also common. Many dsDNA viruses use a strand displacement method whereby one strand is synthesized from a template strand, and a complementary strand is then synthesized from the previously synthesized strand to form a dsDNA genome. Lastly, some dsDNA viruses are replicated as part of a process called replicative transposition, whereby a viral genome that is integrated into a host cell's genome is replicated to another part of the host cell's genome. dsDNA viruses can be divided informally into those that replicate in the nucleus, and as such are relatively dependent on host cell machinery for transcription and replication, and those that replicate in cytoplasm, in which case they have obtained their own means of transcription and replication. dsDNA viruses are also sometimes divided between tailed dsDNA viruses, which refers to members of the realm Duplodnaviria, specifically the head-tail of the class Caudoviricetes, and tailless or non-tailed (icosahedral) dsDNA viruses, which refers to viruses in the realms Singelaviria and Varidnaviria. dsDNA viruses are classified into six realms and many taxa that are unassigned to a realm: • Viruses in the realms Adnaviria, Duplodnaviria, are dsDNA viruses. • In the realm Floreoviria, members of the class Papovaviricetes are dsDNA viruses. • The realm Pleomoviria contains both dsDNA viruses and ssDNA viruses. • The following taxa that are unassigned to a realm exclusively contain dsDNA viruses: • Classes: Naldaviricetes • Families: Ampullaviridae, Basaltiviridae, Bicaudaviridae, Clavaviridae, Eurekaviridae, Fuselloviridae, Globuloviridae, Guttaviridae, Halspiviridae, Huangdiviridae, Itzamnaviridae, Lomiviridae, Nipumfusiviridae, Ovaliviridae, Plasmaviridae, Portogloboviridae, Thaspiviridae, Xigoviridae, • Genera: Dinodnavirus Group II: single-stranded DNA viruses es are ssDNA viruses. The second Baltimore group contains viruses that have a single-stranded DNA (ssDNA) genome. ssDNA viruses have the same manner of transcription as dsDNA viruses. Because the genome is single-stranded, however, it is first made into a double-stranded form by a DNA polymerase enzyme upon entering a host cell. mRNA is then synthesized from the double-stranded form. The double-stranded form of ssDNA viruses may be produced either directly after entry into a cell or as a consequence of replicating the viral genome. Most ssDNA viruses contain circular genomes that are replicated by rolling circle replication (RCR). ssDNA RCR is initiated by an endonuclease enzyme that bonds to and cleaves the positive-sense strand, which allows a DNA polymerase to use the negative-sense strand as a template for replication. Replication progresses in a loop around the genome by extending the 3′-end ("three prime end") of the positive-sense strand, which displaces the prior positive-sense strand. The endonuclease then cleaves the positive-sense strand again to create a standalone genome that is joined (ligated) into a circular loop. The new ssDNA genome may be packaged into virions or replicated by a DNA polymerase to create a double-stranded form for transcription or additional rounds of replication. Nearly all ssDNA viruses have positive-sense genomes, but a few exceptions and peculiarities exist. Anelloviruses are the only ssDNA viruses that have negative-sense genomes. Parvoviruses may package either the positive- or negative-sense strand into capsids. Group III: double-stranded RNA viruses es are dsRNA viruses. The third Baltimore group contains viruses that have a double-stranded RNA (dsRNA) genome. After entering a host cell, the viral RNA-dependent RNA polymerase (RdRp) synthesizes a positive-sense strand from the negative-sense strand of the dsRNA genome. This positive-sense strand may be used either as mRNA for translation or as a template for replication to form the dsRNA genome. dsRNA is not a molecule made by cells, so eukaryotes have evolved antiviral systems to detect and inactivate viral dsRNA. To counter this, many dsRNA viruses replicate their genomes inside of capsids, thereby avoiding detection inside of the host cell's cytoplasm. Positive-sense strands are then forced out from the capsid to be translated or translocated from the mature capsid to a progeny capsid. • Viruses in the phylum Duplornaviricota are dsRNA viruses. • In the phylum Pisuviricota, members of the class Duplopiviricetes are dsRNA viruses. Group IV: positive-sense single-stranded RNA viruses es are +ssRNA viruses. The fourth Baltimore group contains viruses that have a positive-sense single-stranded RNA (+ssRNA) genome. For +ssRNA viruses, the genome functions as mRNA, so no transcription is required for translation. +ssRNA viruses will, however, produce positive-sense copies of the genome from negative-sense strands of an intermediate dsRNA genome. This acts as both a transcription and replication process since the replicated +ssRNA is also mRNA. Many +ssRNA viruses are able to have only a portion of their genome transcribed. Typically, subgenomic RNA (sgRNA) strands are used for the translation of structural and movement proteins needed during intermediate and late stages of infection. sgRNA transcription may occur by commencing RNA synthesis within the genome rather than from the 5′-end ("five prime end"), by stopping RNA synthesis at specific sequences in the genome, or, as a part of both aforementioned methods, by synthesizing leader sequences from viral RNA that are then attached to sgRNA strands. During infection, the viral RdRp is always translated directly from the genome first because replication, performed by the RdRp, is required for sgRNA synthesis. Because the process of replicating the viral genome produces intermediate dsRNA molecules, +ssRNA viruses can be targeted by the host cell's immune system. To avoid detection, +ssRNA viruses replicate in membrane-associated vesicles that are used as replication factories. From there, only +ssRNA strands enter the main cytoplasmic area of the cell. These strands may be used as mRNA or as progeny genomes. The fifth Baltimore group contains viruses that have a negative-sense, single-stranded RNA (–ssRNA) genome. At least two lineages of –ssRNA viruses exist, which transcribe and replicate their genomes differently. The first are viruses of the phylum Negarnaviricota in the kingdom Orthornavirae, realm Riboviria. Negarnaviricots transcribe mRNA, which is positive sense, directly from the negative-sense genome. The first process for –ssRNA transcription involves the viral RdRp binding to a leader sequence on the 3′-end of the genome, transcribing a 5′ triphosphate-leader RNA sequence, then stopping and restarting on a transcription signal that is capped, continuing until a stop signal is reached. There, the RdRp synthesizes a polyadenylated tail and releases the mRNA or, for polycistronic genomes, continues transcription. The second manner is similar, but instead of synthesizing a cap, the RdRp may use its endonuclease activity to snatch a short sequence of nucleotides from host cell mRNA and use it as the 5′ cap of viral mRNA. Genomic –ssRNA is replicated from the positive-sense antigenome in a manner similar to transcription, except in reverse using the antigenome as a template for the genome. The RdRp complex moves from the 3′-end to the 5′-end of the antigenome and ignores all transcription signals when synthesizing genomic –ssRNA. Various –ssRNA viruses use special mechanisms for transcription. The way of polyadenylating the end of an mRNA sequence may be through polymerase stuttering, during which the RdRp transcribes an adenine from uracil and then moves back in the RNA sequence to transcribe it again, continuing this process until hundreds of adenines have been added to the 3′-end of the mRNA. Some –ssRNA viruses are ambisense, as both the positive- and negative-sense strands separately encode viral proteins. These viruses produce one mRNA strand from the genome and one from a complementary strand. –ssRNA viruses in Negarnaviricota can be divided informally into those that have non-segmented and segmented genomes. Non-segmented –ssRNA viruses replicate in the cytoplasm, and segmented –ssRNA viruses replicate in the nucleus. For segmented viruses, the RdRp transcribes one monocistronic mRNA strand from each segment of the genome. This distinction is closely followed within Negarnaviricota, as viruses in the subphylum Haploviricotina usually have non-segmented genomes, and viruses in the subphylum Polyploviricotina have segmented genomes. Moreover, –ssRNA viruses that synthesize a cap structure on viral mRNA are assigned to Haploviricotina, whereas –ssRNA viruses that snatch caps from host mRNA belong to Polyploviricotina. , a ribozyvirian. HBsAg proteins are spikes on the viral envelope, and HDAg proteins are nucleoproteins associated with the genome. The second lineage of –ssRNA viruses is the realm Ribozyviria, which includes Hepatitis D virus (HDV) and its relatives. Ribozyvirians have covalently-closed circular –ssRNA genomes that are covered in nucleocapsid proteins to form a ribonucleoprotein (RNP) complex. After entering a cell, the RNP complex migrates from the cytosol to the nucleus, where the genome is replicated by RCR by a host RNA polymerase II enzyme. This process creates a long molecule with many copies of the genome, called a concatemer, that has a series of positive-sense genomic strands. Ribozymes encoded in this antigenome catalyze cleavage of the concatemer to form individual strands that are either translated or ligated for replication through RCR to produce concatemers of –ssRNA antigenomic strands. Ribozymes encoded in the negative-sense strands then catalyze cleavage of the negative-sense concatemer to produce individual genomic –ssRNA strands. Lastly, there is a group of –ssRNA viruses assigned to the tentative phylum Arctiviricota in the kingdom Orthornavirae. Arctiviricots inhabit the Arctic Ocean and are believed to represent a separate –ssRNA lineage in Orthornavirae from Negarnaviricota. Their mechanisms of replication and transcription have not been described. In summary, –ssRNA viruses belong to the following taxa: • In the realm Riboviria, viruses in the phyla Arctiviricota (tentative) and Negarnaviricota are –ssRNA viruses. • Viruses in the realm Ribozyviria are –ssRNA viruses. Group VI: single-stranded RNA viruses with a DNA intermediate is a ssRNA-RT virus. The small spherical particles are HIV virions budding from an infected cell. The sixth Baltimore group contains viruses that have a (positive-sense) single-stranded RNA genome with a DNA intermediate ((+)ssRNA-RT) in their replication cycle. ssRNA-RT viruses are transcribed in the same manner as DNA viruses, but their genomes are first converted to a dsDNA form through a process called reverse transcription (RT). The viral reverse transcriptase enzyme then synthesizes a DNA strand from the +ssRNA strand, and the RNA strand is degraded and replaced with a DNA strand to create a dsDNA copy of the genome. The viral enzyme integrase then integrates the dsDNA molecule into the DNA of the host cell, where it is now called a provirus. The host cell's RNA polymerase II then transcribes RNA in the nucleus from the proviral DNA. Some of this RNA becomes mRNA whereas other strands become copies of the viral genome for replication. ssRNA-RT viruses are all included in the class Revtraviricetes, the sole class in the kingdom Pararnavirae, realm Riboviria. Excluding the family Caulimoviridae, which belongs to group VII, all members of the Revtraviricetes order Ortervirales are ssRNA-RT viruses. ssRNA-RT viruses are sometimes called retroviruses, a term shared with members of the ssRNA-RT family Retroviridae. The seventh Baltimore group contains viruses that have a double-stranded DNA genome with an RNA intermediate (dsDNA-RT) in their replication cycle. dsDNA-RT viruses have gaps in their circular genomes so that parts of the genome are ssDNA. These gaps are repaired to create a complete, covalently-closed circular dsDNA genome before transcription. dsDNA-RT viruses are, like ssRNA-RT viruses, all included in the class Revtraviricetes. Two families of dsDNA-RT viruses are recognized: Caulimoviridae, which belongs to the order Ortervirales, and Hepadnaviridae, which is the sole family in the order Blubervirales. The provisional family Nudnaviridae is considered to be a sister family to hepadnavirids. dsDNA-RT viruses are often called pararetroviruses. ==Multi-group viruses==
Multi-group viruses
Some viruses can be classified into two Baltimore groups. Pleolipoviruses, for example, encapsidate either ssDNA or dsDNA genomes. For betapleolipoviruses, an encapsidated genome contains ssDNA regions and dsDNA regions. Similarly, bacilladnaviruses are ssDNA viruses with short dsDNA regions in their genomes. Therefore, both pleolipoviruses and bacilladnaviruses can be considered both dsDNA and ssDNA viruses. These RNA viruses may constitute a new Baltimore group, or they can be considered both +ssRNA and –ssRNA viruses. ==Correlates to Baltimore groups==
Correlates to Baltimore groups
, which are –ssRNA viruses, make use of RNA editing. Many characteristics of viruses do not define which Baltimore group a virus belongs to but still correspond to specific Baltimore groups. This includes the use of RNA editing, alternative splicing during transcription, whether the virus's genome is segmented, the size and structure of the virus's genome, the host range of viruses, whether the virus packages replication and transcription machinery into virions, and unorthodox methods of translating mRNA. RNA editing RNA editing is used by various ssRNA viruses to produce different proteins from a single gene. This can be done by polymerase slippage during transcription or by post-transcriptional editing. During polymerase slippage, the RNA polymerase slips one nucleotide back during transcription, which adds a nucleotide not included in the template strand to the mRNA strand. Editing of a genomic template would impair gene expression, so RNA editing is only done during and after transcription. For ebola viruses, RNA editing is used to express three different proteins from a single gene, which increases their ability to adapt to their hosts. Alternative splicing differs from RNA editing in that alternative splicing does not change the mRNA sequence like RNA editing but instead changes the coding capacity of an mRNA sequence as a result of alternative splicing sites. The two processes otherwise have the same result: multiple proteins are expressed from a single gene. Genome segmentation virion, the eight-segment genome shown in the center as part of ribonucleoprotein complexes Viral genomes can exist as a single (monopartite) segment, a segmented genome, or a multipartite genome. For monopartite viruses, all genes are on a single genome segment. For segmented viruses, the genome is separated into at least two molecules that are packaged together into one virion. Multipartite viruses are segmented viruses that package their genome segments into separate virions. Monopartite and segmented viruses are found in all cellular life, whereas multipartite viruses mainly infect plants and fungi. By Baltimore group, dsDNA and RT viruses are non-segmented, ssDNA and +ssRNA viruses are mostly non-segmented, dsRNA viruses are mostly segmented, and around half of –ssRNA viruses are segmented. Genome structure Viral genomes may be either linear with ends or circular in a loop. Whether a virus has a linear or circular genome varies from group to group. Most dsDNA and –ssRNA viruses have linear genomes, ssDNA viruses mainly have circular genomes, dsRNA, +ssRNA, and ssRNA-RT viruses have linear genomes, and dsDNA-RT viruses have circular genomes. As all +ssRNA viruses have genomes that can act as mRNA, and circular mRNA does not exist in cellular life, all +ssRNA viruses have linear genomes. Among –ssRNA viruses, those of the phylum Negarnaviricota have linear genomes, and those of the realm Ribozyviria have circular genomes. The size, or length, of a genome varies by Baltimore group. dsDNA viruses have genomes ranging from about 5 to 2,500 kilobases (kb) in length, ssDNA viruses 1–25 kb, dsRNA viruses 4–30 kb, +ssRNA viruses 3.5–40 kb, –ssRNA viruses 1.7–20 kb, ssRNA-RT viruses 5–13 kb, and dsDNA-RT viruses 3–10 kb in length. The relatively small genomes of viruses that are not group I are likely due to physical limitations. For example, ssDNA has the potential to form extensive secondary structures, and ssRNA is relatively chemically unstable. dsDNA viruses have much more varied genome sizes, likely because they have the same genomic organization as cells. This enables them to either exploit cellular machinery or encode their own machinery. As such, dsDNA seems to be the only genomic organization that can support genomes that exceed about 50 kb. Biological factors influence host range. For example, dsDNA viruses do not infect plants because large dsDNA molecules are unable to pass through plasmodesmata, intercellular channels that connect plant cells. The dominance of DNA viruses in prokaryotes may be because they outcompete RNA viruses. In eukaryotic cells, however, the nucleus is a barrier that requires adaptation by DNA viruses. They either have to evolve means to enter the nucleus to replicate or obtain their own replication and transcription machinery to replicate in virus factories in the cytosol. In contrast, the endomembranes of eukaryotic cells appear to be a beneficial environment for RNA virus replication. Translation , a dsDNA-RT virus, Translation is the process by which proteins are synthesized from mRNA by ribosomes. Baltimore groups do not directly pertain to the translation of viral mRNA to proteins, but atypical types of translation used by viruses are usually found within specific Baltimore groups. For example: • Non-canonical translation initiation: • Viral initiation of translation: some viruses have evolved mechanisms to initiate translation. Methods include having internal ribosomal entry sites to allow cap-independent initiation, having downstream hairpin loops that allow for cap-dependent translation without an eIF2 initiation factor, and initiating translation at a CUG codon or other start codon with a leucine amino acid. These methods are used by various +ssRNA and ssRNA-RT viruses. Ribosomal shunting is used by various dsDNA, +ssRNA, –ssRNA, and RT viruses. This is used by various dsRNA and +ssRNA viruses. uses ribosomal skipping. • Suppression of termination, also called stop codon read-through: certain viruses contain codons in their mRNA that would normally signal for termination of translation upon being recognized by a release factor but are instead partially recognized by tRNA during translation, which allows for continued translation up to the next stop codon to produce extended polypeptides at the end of the amino acid sequence. This is used by various dsRNA, +ssRNA, and ssRNA-RT viruses, ==Evolutionary origins and relations==
Evolutionary origins and relations
, not to scale. Most ssDNA viruses are likely descended from plasmids. ssRNA-RT viruses all belong to Ortervirales and thus share common ancestry. dsDNA-RT viruses, on the other hand, are found in both orders and therefore represent two separate lineages of dsDNA-RT viruses. Ribozyvirians constitute a lineage of –ssRNA viruses unrelated to other RNA viruses. Most ssDNA viruses likely originate from plasmids that, on multiple occasions, recombined with other genomes to obtain the structural proteins needed to form virions. The evolutionary history of dsDNA viruses is the most complex as they appear to have emerged independently on numerous occasions. Two major lineages of dsDNA viruses exist: the realm Duplodnaviria and the realm Varidnaviria, the latter of which also contains ssDNA viruses that are descended from dsDNA viruses. The opposite is true in the realm Floreoviria, which contains dsDNA viruses descended from ssDNA viruses. There are also two minor realms, Adnaviria and Singelaviria, that exclusively contain dsDNA viruses. Lastly, there are dsDNA virus families unassigned to higher taxa that are unique from existing realms and which likely constitute small realms. Of the replication-expression systems used by viruses, only Baltimore group I (dsDNA) is used by cells. The other groups may be remnants of the primordial stage of life before the emergence of modern-like cells, during which the dsDNA system used by extant cells had not yet become uniform. The ancestors of RNA viruses in particular may have emerged during the time of the RNA world. And although virus realms are evolutionarily independent from each other, the replicative proteins encoded by viruses in the four major realms (Duplodnaviria, Floreoviria, Riboviria, and Varidnaviria) are built on the core RNA recognition motif, one of the most common nucleic acid-binding domains in nature. Therefore, the replication-expression cycles most likely diversified before the separation of large dsDNA replicators, which became the ancestors of cellular life, from other types of replicators, which became selfish genetic elements and gave rise to viruses. ==History==
History
in 2021 Before Baltimore classification was created, a variety of classification systems for viruses had been proposed, classifying viruses by host, structure, biochemical properties, and other characteristics. Baltimore classification was proposed in 1971 by virologist David Baltimore in a paper titled "Expression of Animal Virus Genomes" that was published in the academic journal Bacteriology Reviews (now named Microbiology and Molecular Biology Reviews). Baltimore focused on classifying animal viruses but effectively classified all viruses by their routes of information transmission from genomic nucleic acid to mRNA. The system initially contained the first six groups but was later expanded to include group VII after the discovery of dsDNA-RT viruses. Because of the utility of Baltimore classification, it came to be used alongside standard virus taxonomy, which is based on evolutionary relationships and governed by the International Committee on Taxonomy of Viruses (ICTV). A year later, Riboviria was expanded to also include RT viruses. Within the realm, RT viruses are included in the kingdom Pararnavirae and the three other Baltimore groups in the kingdom Orthornavirae as defining traits of the kingdom's phyla. A year later, Ribozyviria was established for HDV and its relatives. In 2026, however, Monodnaviria was split into four realms (Efunaviria, Floreoviria, Pleomoviria, and Volvereviria) based on evidence that its four kingdoms did not share common ancestry. In 1974, virologist Vadim Agol proposed an extension of Baltimore classification to encompass all possible means of genetic information transmission and describe the hierarchical routes of information transmission, including both expression and replication, rather than solely mRNA synthesis. In the expanded system, there are 35 classes, 17 superclasses, and six types of genetic information transfer. The system was revisited in 2021 by Koonin et al. in light of discoveries made since the 1970s. Known viruses occupy 13 classes, one of which is shared with cells, seven superclasses, and three types. A fourteenth class is occupied by F-like plasmids. Ambisense viruses occupy two classes simultaneously, though separate classes could be made for them. Most unoccupied classes are of DNA-RNA hybrids, which appear to be disfavored by evolution since it may be advantageous to convert such molecules to dsDNA, the molecule most suitable for genome replication. According to Koonin et al., viruses that belong to the unoccupied classes are unlikely to be discovered unless they are rare in nature. ==Notes==
tickerdossier.comtickerdossier.substack.com