Lineage frequency time series reveal elevated levels of genetic drift in SARS-CoV-2 transmission in England

The COVID-19 Genomics UK (COG-UK) Consortium, QinQin Yu, Joao A. Ascensao, Takashi Okada, Olivia Boyd, Erik Volz, Oskar Hallatschek, Matthew Bashton, Darren Smith, Andrew Nelson, Clare McCann

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
1 Downloads (Pure)

Abstract

Genetic drift in infectious disease transmission results from randomness of transmission and host recovery or death. The strength of genetic drift for SARS-CoV-2 transmission is expected to be high due to high levels of superspreading, and this is expected to substantially impact disease epidemiology and evolution. However, we don't yet have an understanding of how genetic drift changes over time or across locations. Furthermore, noise that results from data collection can potentially confound estimates of genetic drift. To address this challenge, we develop and validate a method to jointly infer genetic drift and measurement noise from time-series lineage frequency data. Our method is highly scalable to increasingly large genomic datasets, which overcomes a limitation in commonly used phylogenetic methods. We apply this method to over 490,000 SARS-CoV-2 genomic sequences from England collected between March 2020 and December 2021 by the COVID-19 Genomics UK (COG-UK) consortium and separately infer the strength of genetic drift for pre-B.1.177, B.1.177, Alpha, and Delta. We find that even after correcting for measurement noise, the strength of genetic drift is consistently, throughout time, higher than that expected from the observed number of COVID-19 positive individuals in England by 1 to 3 orders of magnitude, which cannot be explained by literature values of superspreading. Our estimates of genetic drift suggest low and time-varying establishment probabilities for new mutations, inform the parametrization of SARS-CoV-2 evolutionary models, and motivate future studies of the potential mechanisms for increased stochasticity in this system.

Original languageEnglish
Article numbere1012090
Number of pages37
JournalPLoS Pathogens
Volume20
Issue number4
DOIs
Publication statusPublished - 15 Apr 2024

Keywords

  • COVID-19/transmission
  • Humans
  • SARS-CoV-2/genetics
  • Genetic Drift
  • England/epidemiology
  • Phylogeny
  • Genome, Viral

Cite this