Viral taxonomy and phylogeny I

Overview

Teaching: 60 min
Exercises: 120 min
Objectives
  • Understand the differences between taxonomic approaches for viral and cellular organisms

  • Explain the challenges with viral taxonomy, and how they may be overcome

Viromics workflow

How are viruses taxonomically classified?

A widely accepted approach for taxonomic classification is by selecting a marker gene that is shared by all organisms, creating a multiple sequence alignment and phylogenetic tree, and identifying taxa as characteristic branches or lineages in the tree. This approach can be applied to all cellular organisms including bacteria, archaea, and eukaryotes, particularly with marker genes functioning in the ribosome. But only to subgroups of viruses, as no marker gene is universally conserved in all viruses. This lack of a universal genomic feature is thought to reflect their multiple evolutionary origins.

There is no single method to classify viruses. Many experts from the global virology community have done their part by classifying viruses according to their specific knowledge. This has generated a patchwork of methods that, ideally, capture the features of different viral lineages and generate meaningful taxa that are in agreement with biology. With the advance of viromics and the discovery of viruses by their genome sequences only, bioinformaticians have developed many methods to cluster them. Thus far, these methods have not been widely adopted for official ICTV virus taxonomy. They are rarely universally applicable, and if they are, there is bound to be some level of disagreement with some taxa that were previously defined by experts.

One popular method in the bacteriophage field is the gene-sharing network. This involves creating a network where nodes are genomes and edges represent shared gene families. Tight clusters in this gene-sharing network represent groups of viral genomes that share many genes, and those are interpreted as taxonomic groups. Clusters can be identified with different gene-sharing cutoffs, corresponding to different taxonomic ranks, where closely related viruses (species, genera) share more genes than distant ones (families). Tools like VICTOR and vConTACT3 work this way.

As more and more viruses are sequenced and we get a better view of the virosphere, marker genes are making a comeback. Although a universal marker gene shared by all viruses does not exist, marker genes are certainly shared by viruses of lower-ranking taxa (species, genera, families, orders, and in some cases above). This was already clear from the gene-sharing network. Detecting the marker gene is evidence that a virus belongs to a given taxon, and phylogenetic trees can help resolve the lower-level taxonomy, just like ribosomal marker gene trees for cellular organisms. Tools like vClassifier and geNomad use marker gene approaches.

Exercise - An overview of viral taxonomy

  1. Check out the ICTV website: https://ictv.global/.

Discuss with your fellow students and write down your answers:

  1. What is ICTV? (not just a TV channel featuring polar bears)
  2. How many taxonomic ranks are available to classify the virosphere?
  3. What is a realm?
  4. How many viral genera are there?

Read the following sections from the review “Global Organization and Proposed Megataxonomy of the Virus World” by Koonin et al.:

Discuss with your fellow students and write down your answers:

  1. What are the Baltimore Classes, and what are they based on?
  2. What triggered the change in viral taxonomy?
  3. What are the pros and cons of a “genomic taxonomy”?

Key Points

  • Viruses have multiple origins, so there is no universal marker gene

  • Viral taxonomy is based on many different methods

  • Gene-sharing networks and marker genes are popular methods for bacteriophage taxonomy