Nature geneticsSeptember 1, 2019

Reconciling disparate estimates of viral genetic diversity during human influenza infections

Katherine S Xue, Jesse D Bloom
doi:10.1038/s41588-019-0349-3

Abstract

To the Editor — A key question in the study of influenza virus evolution is how rapidly viral genetic variation arises within infected humans1,2. Recently, several studies have measured influenza’s within-host genetic diversity in large cohorts of infected humans through high-throughput deep sequencing3,4,5,6 (Supplementary Table 1). These studies disagree in their estimates of influenza’s within-host genetic diversity. In a Nature Genetics letter titled ‘Quantifying influenza virus diversity and transmission in humans’, analyzing a household cohort in Hong Kong, Poon et al.4 have estimated that within-host genetic diversity is high, and 200–250 viral genomes are transmitted between individuals. However, several recent studies conducted in Wisconsin3, Michigan6, and Washington7 that used similar methodologies have estimated lower levels of viral genetic diversity. In particular, the Michigan study has estimated a narrow transmission bottleneck of just one or two viral genomes6. We sought to examine whether technical differences in the underlying deep-sequencing datasets or the methods used to analyze them might explain the disparate estimates of within-host viral genetic diversity. We identified an anomaly in the Hong Kong data that provides a technical explanation for these discrepancies: read pairs from this study are often split between different biological samples, thus indicating that some reads are incorrectly assigned.