Hostname: page-component-586b7cd67f-tf8b9 Total loading time: 0 Render date: 2024-11-22T17:43:45.787Z Has data issue: false hasContentIssue false

Genomic epidemiology of SARS-CoV-2 in large university hospital cohort: the UnCoVER-Brazil project

Published online by Cambridge University Press:  20 July 2023

Felipe Santos de Carvalho
Affiliation:
Social Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil Infectious Diseases Division, Internal Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil Medicine Department, Federal University of São Carlos, São Carlos, Brazil
Sarah Danielle Slack
Affiliation:
Infectious Diseases Division, Internal Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil Oncology and Imaging Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Francisco Barbosa-Júnior
Affiliation:
Social Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Mateus Rennó de Campos
Affiliation:
Social Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil Epidemiological Surveillance Service, University Hospital of Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Guilherme Silveira Castro
Affiliation:
Social Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Sabrina Baroni
Affiliation:
Oncology and Imaging Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Livia Mara Torres Bueno
Affiliation:
Oncology and Imaging Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Fernanda Borchers Coeli
Affiliation:
Oncology and Imaging Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Aparecida Yulie Yamamoto
Affiliation:
Virology Laboratory, University Hospital of Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Jorgete Maria Silva
Affiliation:
Social Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil Epidemiological Surveillance Service, University Hospital of Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Rodrigo do Tocantins Calado
Affiliation:
Oncology and Imaging Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Benedito Antônio Lopes Fonseca
Affiliation:
Infectious Diseases Division, Internal Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Leandro Machado Colli
Affiliation:
Oncology and Imaging Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Fernando Bellissimo-Rodrigues*
Affiliation:
Social Medicine Department, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
*
Corresponding author: Fernando Bellissimo-Rodrigues; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

This work aimed to study the role of different SARS-CoV-2 lineages in the epidemiology of multiple waves of the COVID-19 pandemic in Ribeirão Preto (São Paulo state), with comparison within Brazil and globally. Viral genomic sequencing was combined with clinical and sociodemographic information of 2,379 subjects at a large Brazilian hospital. On the whole 2,395 complete SARS-CoV-2 genomes were obtained from April 2020 to January 2022. We report variants of concern (VOC) and interest (VOI) dynamics and the role of Brazilian lineages. We identified three World Health Organization VOCs (Gamma, Delta, Omicron) and one VOI (Zeta), which caused distinct waves in this cohort. We also identified 47 distinct Pango lineages. Consistent with the high prevalence of Gamma in Brazil, Pango lineage P.1 dominated infections in this cohort for half of 2021. Each wave of infection largely consisted of a single variant group, with each new group quickly and completely rising to dominance. Despite increasing vaccination in Brazil starting in 2021, this pattern was observed throughout the study and is consistent with the hypothesis that herd immunity tends to be SARS-CoV-2 variant-specific and does not broadly protect against COVID-19.

Type
Short Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes coronavirus disease (COVID-19), remains a global threat despite massive diagnostic testing, isolation, therapies, and vaccines. Emerging SARS-CoV-2 variants of concern (VOCs) continue to challenge these measures with increased transmissibility or virulence, escape of host antibody neutralization, and decreased efficacy of detection, therapeutics, and vaccination [Reference Aleem, Akbar Samad and Slenker1]. Viral genomic sequencing can improve the understanding of VOCs while providing insight into the dynamics of all circulating SARS-CoV-2 lineages.

Of the over 650 million global cases of COVID-19 reported by the end of December 2022, 36 million cases were reported in Brazil, with deaths approaching 700,000. Almost 180,000 of these cases were reported in Ribeirão Preto, located in São Paulo state [2]. Ribeirão Preto is home to a prominent national medical centre, making it a unique setting for the study of COVID-19 pandemic dynamics with multiple ongoing studies [Reference Bonifácio, Csizmar, Barbosa-Júnior, Pereira, Koenigkam-Santos, Wada, Gaspar, Carvalho, Bollela, Santana, Souza and Bellissimo-Rodrigues3]. This ongoing study offers insight into the role of different SARS-CoV-2 lineages in the epidemiology of multiple waves of the COVID-19 pandemic in Ribeirão Preto, with comparison within Brazil and globally.

All patients cared for at the Hospital das Clínicas, Faculdade da Medicina de Ribeirão Preto, Universidade de São Paulo (HCFMRP-USP), were eligible to be included in the Uncovering new CoronaVirus Encoded Ramifications in Brazil (UnCoVER-Brazil) study. Nasopharyngeal and/or oropharyngeal samples were collected for over 6,000 patients in outpatient and inpatient settings at HCFMRP-USP between 4 April 2020 and 31 January 2022. The detection of SARS-CoV-2 was performed by multiplex real-time reverse transcription PCR (RT-PCR) targeting N, E, and RDRP genes (Gene Finder™ COVID-19 Plus Real Amp Kit; OSANG Healthcare). Subjects were included in the UnCoVER study with a cycle threshold for SARS-CoV-2 less than or equal to 30. From 2,637 viral samples meeting this criterion, we generated complete SARS-CoV-2 genomes from 2,395 distinct infections of 2,379 patients. Multiple distinct infections from a single individual were required to have a minimum sample collection interval of 60 days. Sequencing was performed at the Laboratory of Translational Oncology at the Hemocentro of Ribeirão Preto, with the Illumina COVIDSeq Test (RUO Version) kit and the included ARTIC Network nCoV-2019 Amplicon Panel (V3). Most samples were sequenced on HiSeq 4000 (Illumina), with some sequenced on MiSeq (Illumina). FASTQ generation was completed in BaseSpace (Illumina) using the FASTQ Generation app (1.0.0). DRAGEN COVID Lineage (version 3.5.10) was used for alignment to SARS-CoV-2 reference genome NC_045512 and for Pango lineage classification (Pangolin software 4.1.2, Pangolin data 1.14). Only genomes that passed Pangolin quality control and had at least 90% non-ambiguous content were included in analysis [4]. All included SARS-CoV-2 genomes are available on GISAID under the identifier EPI_SET_230119wh (doi: 10.55876/gis8.230119wh). Sequencing results were merged with sociodemographic and clinical data in RStudio (2022.07.01), which was also used for analysis and figure generation. REDCap (7.6.3) was used for data storage.

Sociodemographic information and COVID-19 case details were extracted from the HCFMRP-USP COVID-19 surveillance reporting system, with some entries supplemented with information from electronic medical records. Of the 2,395 distinct SARS-CoV-2 infections captured by sequencing, the median age of subjects was 41.0 (IQR 31–56) years, and 1,469 (61.3%) subjects were female. A total of 2,015 cases were symptomatic (84.1%, missing data 15.2%). Less than a third of patients were hospitalized (666 patients, 27.8%), and 176 patients died (7.3%, missing data 17.5%).

We identified four World Health Organization (WHO) VOCs (Gamma, Alpha, Delta, Omicron) and one WHO variant of interest (VOI) (Zeta), four of which (Zeta, Gamma, Delta, Omicron) caused distinct waves in this cohort. Within these waves, we identified 47 distinct Pango lineages, including multiple Brazilian lineages (Figure 1, Table 1).

Figure 1. Distribution of SARS-CoV-2 sequenced genomes per epidemiological week, plotted by weekend date (Saturday) of RT-PCR test collection [Reference Batra5]. Bars coloured by SARS-CoV-2 Pango lineage and grouped by VOC, where possible [Reference Andrés, Piñana, Borràs-Bermejo, González-Sánchez, García-Cehic, Esperalba, Rando, Zules-ña, Campos, Codina, Blanco-Grau, Colomer-Castell, Martín, Castillo, García-Comuñas, Vásquez-Mercado, Martins-Martins, Saubi, Campins-Martí, Pumarola, Quer and Antón6]: B & B descendant lineages in shades of purple, Zeta (P.2) in light orange, N lineages in shades of red, non-VOC P lineage in black, Alpha (B.1.1.7) in dark orange, Gamma (P.1) and descendant lineages in shades of blue, Delta (AY lineages) in shades of yellow to brown, and Omicron (BA lineages) in shades of green.

Throughout 2020, the SARS-CoV-2 B lineage and descendants made up nearly all cases sequenced, with Zeta rising at the end of 2020. Gamma spiked early in 2021 and, within 2 months, dominated most sequenced genomes. Gamma continued as the dominant variant until Delta emerged in the last third of 2021. Delta was completely replaced by the Omicron spike in early 2022. Each new dominant variant group quickly and nearly completely replaced the dominant group preceding it (Figure 1). This pattern, which continued despite vaccination increasing in Brazil starting in 2021, is consistent with the hypothesis that herd immunity is SARS-CoV-2 variant-specific and does not broadly cover COVID-19 [Reference Barber7].

We focused our analysis on comparative outcomes between variant groups. Relative to the overall median age of 41.0 years and the largely similar variant group median ages (Table 1), a decrease to 35.0 and 33.0 was observed for N lineages and the non-VOC P lineage, respectively. However, interpretation is limited by the low number of cases sequenced in these groups. Hospitalization was similarly consistent among variant groups, with an overall average rate of 27.8% and range from 20% to 45% for all groups except Omicron. Less than 10% of individuals with Omicron were hospitalized (Table 1); however, we only captured patients early in its wave, and increased vaccination is a likely confounder for decreased severity.

Table 1. Identified SARS-CoV-2 Pango lineages, grouped by VOC where possible, with median age and hospitalization counts by variant groups

The Pango lineage composition of variant groups offers additional insight into transmission dynamics unique to HCFMRP-USP. Combined, B.1.1.28 and B.1.1.33 made up 80% of the sequences observed in the B & B descendant group that dominated throughout 2020. Of global sequences reported to date, over 75% of all B.1.1.28 lineages and 85% of all B.1.1.33 were reported in Brazil [Reference O’Toole8]. Both descended from European lineage B.1.1 and were considered primary drivers of the first wave of the pandemic in Brazil. VOI Zeta (P.2) and VOC Gamma (P.1) descended from B.1.1.28, while N.9 – which was present at the same time at HCFMRP-USP but dominated by Zeta – descended from B.1.1.33 [Reference Resende9]. More than half of global lineages attributed to Gamma were observed in Brazil [Reference O’Toole8]. The long dominance of Gamma at HCFMRP-USP is clear in Figure 1, with over 80% observed being P.1 (Table 1).

All Delta lineages observed in this study were AY lineages, and no B.1.617.2 – the lineage first attributed to the global surge of Delta – was observed. Over 50% of Delta lineages were AY.99.2, a Brazilian lineage with over 95% of all global sequences reported in Brazil [Reference O’Toole8]. AY.99.2 descended from AY.99, which in turn descended from B.1.617.2. In Brazil, Delta was responsible for a more modest surge of cases than was observed in other countries in the second half of 2021. A recent study from Minas Gerais suggested that despite Delta’s increased transmissibility compared to Gamma – which dominated infections in Brazil when Delta arrived – the over 80% vaccination rate in Brazil at the time may have protected against a surge [Reference Fonseca10]. Unfortunately, no conclusions can be drawn from this study about case surges, as sequencing did not have any fixed correlation with COVID-19 case count at HCFMRP-USP. Although our data suggest that herd immunity is variant-specific, we speculate that Gamma – which was responsible for a larger outbreak in Brazil than in many countries – may have produced an immune response that was more cross-reactive with Delta than other variants.

In this dispatch, we describe the preliminary insights gained from the large COVID-19 study based at HCFMRP-USP, one of Brazil’s largest public academic hospitals located in Ribeirão Preto, São Paulo state. Here we observed trends in VOC prevalence consistent with those observed in general in Brazil. We also identified the contribution of key Brazilian lineages to pandemic waves. Our results are consistent with the hypothesis that herd immunity to SARS-CoV-2 tends to be variant-specific.

Data availability statement

Data that support the findings of this study are available upon a reasonable request addressed to the corresponding author.

Acknowledgements

We thank all individuals who participated in this study by donating specimen. We also thank all members of the UnCoVER-Brazil project. We acknowledge the Virology Laboratory at HCFMRP-USP for completing SARS-CoV-2 RT-PCR testing; Vivian Neves Dias Arantes for selecting and preparing samples for SARS-CoV-2 genomic sequencing; and the sequencing team in the Laboratory of Translational Oncology at the Hemocentro of Ribeirão Preto for completing sequencing. We also acknowledge Núcleo de Vigilância Epidemiológica Hospitalar (NVEH) at HCFMRP-USP for clinical data collection.

Author contribution

Conceptualization: F.B., B.A.L.F., J.M.S., F.B.J., L.M.C., F.S.D.C.; Data curation: F.B., S.D.S., F.B.C., G.S.C., F.B.J., L.M.C., L.M.T.B., M.R.C., S.B., A.Y.Y., F.S.D.C.; Funding acquisition: F.B., L.M.C., R.d.T.C.; Investigation: F.B., S.D.S., F.B.C., G.S.C., L.M.T.B., F.S.D.C.; Methodology: F.B., B.A.L.F., F.B.C., G.S.C., J.M.S., F.B.J., L.M.C., L.M.T.B., M.R.C., S.B., A.Y.Y., F.S.D.C.; Project administration: F.B., B.A.L.F., F.B.J., L.M.C., M.R.C., F.S.D.C.; Supervision: F.B., B.A.L.F., L.M.C., R.d.T.C.; Validation: F.B., F.B.J., M.R.C.; Visualization: F.B.; Writing – review & editing: F.B.; Formal analysis: S.D.S., F.B.J., F.S.D.C.; Writing – original draft: S.D.S., F.S.D.C.; Resources: M.R.C.

Financial support

This work was funded by the Department of Social Medicine at HCFMRP-USP, the Hemocentro of Ribeirão Preto, and the Provost of Graduate Studies of the University of São Paulo. We acknowledge financial support for S.D.S. by the Fulbright US Student Program sponsored by the US Department of State and the Fulbright Commission in Brazil. The content of this publication is solely the responsibility of the authors and does not necessarily represent the official views of those funding the work.

Competing interest

The authors declare none.

Ethical standard

This work was approved by the Research Ethics Committee at HCFMRP-USP and complies with the Brazilian General Data Protection Law.

Footnotes

First authors of F.S.d.C. and S.D.S. and senior authors of B.A.L.F., L.M.C., and F.B-R. are equally contributed to this article.

References

Aleem, A, Akbar Samad, AB and Slenker, AK (2022) Emerging Variants of SARS-CoV-2 And Novel Therapeutics Against Coronavirus (COVID-19). In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; [cited 2022 Nov 9]. Available at http://www.ncbi.nlm.nih.gov/books/NBK570580/.Google Scholar
Epidemiological Surveillance Ribeirão Preto [in Portuguese] (2022) [Internet]. Prefeitura da Cidade Ribeirão Preto [cited 2023 Jan 26]. Available at https://www.ribeiraopreto.sp.gov.br/portal/saude/boletim-novo-coronavirus-covid-19.Google Scholar
Bonifácio, LP, Csizmar, VNF, Barbosa-Júnior, F, Pereira, APS, Koenigkam-Santos, M, Wada, DT, Gaspar, GG, Carvalho, FS, Bollela, VR, Santana, RC, Souza, JP and Bellissimo-Rodrigues, F (2022) Long-term symptoms among COVID-19 survivors in prospective cohort study, Brazil. Emerging Infectious Diseases 28(3),730733.Google Scholar
Recommendations for SARS-CoV-2 Sequence Data Quality & Reporting Version 1 [Internet] (2021) Association of Public Health Laboratories; [cited 2022 Nov 9]. Available at https://www.aphl.org/programs/preparedness/Crisis-Management/Documents/APHL-SARS-CoV-2-Sequencing.pdf.Google Scholar
Batra, N, et al. (2021) The Epidemiologist R Handbook [Internet] [cited 2022 Nov 9]. Available at https://epirhandbook.com/en/ (accessed 5 May 2023).Google Scholar
Andrés, C, Piñana, M, Borràs-Bermejo, B, González-Sánchez, A, García-Cehic, D, Esperalba, J, Rando, A, Zules-ña, RG, Campos, C, Codina, MG, Blanco-Grau, A, Colomer-Castell, S, Martín, MC, Castillo, C, García-Comuñas, K, Vásquez-Mercado, R, Martins-Martins, R, Saubi, N, Campins-Martí, M, Pumarola, T, Quer, J, Antón, A (2022) A year living with SARS-CoV-2: An epidemiological overview of viral lineage circulation by whole-genome sequencing in Barcelona city (Catalonia, Spain). Emerging Microbes & Infections 11(1), 172181.Google ScholarPubMed
Barber, RM, et al. (2022) Estimating global, regional, and national daily and cumulative infections with SARS-CoV-2 through Nov 14, 2021: A statistical analysis. The Lancet 399(10344), 23512380.Google Scholar
O’Toole, Á, et al. (2021) Tracking the international spread of SARS-CoV-2 lineages B.1.1.7 and B.1.351/501Y-V2 with grinch. Wellcome Open Research 6, 121.Google Scholar
Resende, PC, et al. (2021) A potential SARS-CoV-2 variant of interest (VOI) harboring mutation E484K in the spike protein was identified within lineage B.1.1.33 circulating in Brazil. Viruses 13(5), 724.Google ScholarPubMed
Fonseca, PLC, et al. (2022) Tracking the turnover of SARS-CoV-2 VOCs gamma to Delta in a Brazilian state (Minas Gerais) with a high-vaccination status. Virus Evolution 8(2), veac064.Google Scholar
Figure 0

Figure 1. Distribution of SARS-CoV-2 sequenced genomes per epidemiological week, plotted by weekend date (Saturday) of RT-PCR test collection [5]. Bars coloured by SARS-CoV-2 Pango lineage and grouped by VOC, where possible [6]: B & B descendant lineages in shades of purple, Zeta (P.2) in light orange, N lineages in shades of red, non-VOC P lineage in black, Alpha (B.1.1.7) in dark orange, Gamma (P.1) and descendant lineages in shades of blue, Delta (AY lineages) in shades of yellow to brown, and Omicron (BA lineages) in shades of green.

Figure 1

Table 1. Identified SARS-CoV-2 Pango lineages, grouped by VOC where possible, with median age and hospitalization counts by variant groups