Visualization of seasonal influenza antigenic evolution

John Huddleston, PhD

Bedford Lab

Fred Hutchinson Cancer Research Center

March 18, 2022

https://huddlej.github.io/talks/2022-03-18-vizbi

Seasonal influenza kills 100,000s
of people every year

influenza patient

Photo: WHO / Tom Pietrasik

Surface proteins determine
influenza virus subtypes like H3N2

Hemagglutinin enables infection

Hemagglutinin is the primary target of our immune system

Mutations in hemagglutinin allow viruses to escape existing immunity

Mutations in hemagglutinin allow viruses to escape existing immunity

The Global Influenza Surveillance and Response System tracks influenza year-round

Map of World Health Organization Global Influenza Surveillance and Response System

Surveillance groups sequence
the genetic code of viruses

influenza sequences

Visualizing genomes as an inferred genealogy reveals related groups of viruses

phylogenies reveal closely related groups of viruses

Which single virus should be in the next vaccine?

phylogenies reveal closely related groups of viruses

Experimental assays identify viruses
that could escape our immunity

HI assays

Assays provide antigenic distances

schematic of antigenic distances by experimental assays

Historically, distances were viewed as tables

HI assays

Visualization as tables does not scale well

Binder of VCM materials

Antigenic cartography reduces
high dimensionality

antigenic cartography of H3N2 HI titers

Bayesian antigenic cartography integrates
assays and genetic data

Bayesian antigenic cartography of H3N2 HI titers and genomes

Antigenic trees map assays to phylogenies
and infer missing data

antigenic tree colored by inferred antigenic distance from H3N2 HI titers

Antigenic trees synthesize all assay data
into a single visualization

antigenic tree colored by inferred antigenic distance from H3N2 HI titers

Scatterplot trees encode antigenic distance
on a positional axis

antigenic scatterplot colored by inferred antigenic distance from H3N2 HI titers

Matrix view emphasizes coverage by
each serum of test viruses by clade

matrix of mean titer values between reference sera (y-axis) and test viruses in recent clades titers

Distribution view communicates uncertainty about coverage of test viruses by clade

distributions of mean +/- 89% confidence intervals by reference serum and test clades

Distributions with raw data reveal details about coverage of test viruses by clade

raw distances and distributions of mean +/- 89% confidence intervals by reference serum and test clades

Interactive visualizations support
multiple views on the data

interactive visualization of antigenic distances from H3N2 HI titers

Composite tree and measurements views provide all information required for decision making

interactive visualization of tree and antigenic distances from H3N2 HI titers in Auspice

Implementation complexity hinders
domain-specific visualizations

domain-specific visualizations often require complex implementations

Thank you!

Fred Hutch Cancer
Research Center

  • Trevor Bedford
  • Allison Black
  • Jover Lee
  • James Hadfield
  • Thomas Sibley

Collaborators

  • Richard Neher
  • David Wentworth
  • Rebecca Kondor

Data Sources

  • GISAID
  • WHO GISRS Network

Funding

  • NIH/NIAID R01 AI127893-01
  • NIAID Centers of Excellence for Influenza Research and Response (CEIRR)
  • Bill and Melinda Gates Foundation

Continue the conversation

By email at jhuddles@fredhutch.org

In public at discussion.nextstrain.org

In private at hello@nextstrain.org