Hostname: page-component-745bb68f8f-cphqk Total loading time: 0 Render date: 2025-01-11T11:29:58.107Z Has data issue: false hasContentIssue false

Hindsight2020: Characterizing Uncertainty in the COVID-19 Scientific Literature

Published online by Cambridge University Press:  25 July 2023

Kinga Dobolyi*
Affiliation:
George Washington University, Department of Computer Science, Washington, DC, USA
George P. Sieniawski
Affiliation:
Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
David Dobolyi
Affiliation:
University of Notre Dame, Indiana, USA
Joseph Goldfrank
Affiliation:
George Washington University, Department of Computer Science, Washington, DC, USA
Zigfried Hampel-Arias
Affiliation:
Los Alamos National Laboratory, Los Alamos, New Mexico, USA
*
Corresponding author: Kinga Dobolyi; Email: [email protected]

Abstract

Following emerging, re-emerging, and endemic pathogen outbreaks, the rush to publish and the risk of data misrepresentation, misinterpretation, and even misinformation puts an even greater onus on methodological rigor, which includes revisiting initial assumptions as new evidence becomes available. This study sought to understand how and when early evidence emerges and evolves when addressing different types of recurring pathogen-related questions. By applying claim-matching by means of deep learning Natural Language Processing (NLP) of coronavirus disease 2019 (COVID-19) scientific literature against a set of expert-curated evidence, patterns in timing across different COVID-19 questions-and-answers were identified, to build a framework for characterizing uncertainty in emerging infectious disease (EID) research over time. COVID-19 was chosen as a use case for this framework given the large and accessible datasets curated for scientists during the beginning of the pandemic. Timing patterns in reliably answering broad COVID-19 questions often do not align with general publication patterns, but early expert-curated evidence was generally stable. Because instability in answers often occurred within the first 2 to 6 mo for specific COVID-19 topics, public health officials could apply more conservative policies at the start of future pandemics, to be revised as evidence stabilizes.

Type
Concepts in Disaster Medicine
Copyright
© In-Q-Tel, Inc. and the Author(s), 2023. Published by Cambridge University Press on behalf of the Society for Disaster Medicine and Public Health

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

SeyedAlinaghi, S, Oliaei, S, Kianzad, S, et al. Reinfection risk of novel coronavirus (COVID-19): a systematic review of current evidence. World J Virol. 2020;9(5):79-90. doi: 10.5501/wjv.v9.i5.79.CrossRefGoogle Scholar
Savvides, C, Siegel, R. Asymptomatic and presymptomatic transmission of SARS-CoV-2: a systematic review. 2020. doi: 10.1101/2020.06.11.20129072 CrossRefGoogle Scholar
Udow-Phillips, M, Lantz, PM. Trust in public health is essential amid the COVID-19 pandemic. J Hosp Med. 2020;15(7):431-433. doi: 10.12788/jhm.3474 CrossRefGoogle ScholarPubMed
Berger, L, Berger, N, Bosetti, V, et al. Rational policymaking during a pandemic. Proc Natl Acad Sci USA. 2021;118(4):e2012704118. doi: 10.1073/pnas.2012704118.CrossRefGoogle ScholarPubMed
Soares-Weiser, K, Lasserson, T, Juhl Jorgensen, K, et al. Policy makers must act on incomplete evidence in responding to COVID-19. Cochrane Database Syst Rev. 2020;11: ED000149. doi: 10.1002/14651858.ED000149 CrossRefGoogle Scholar
US Department of Homeland Security. Master question list for COVID-19 (caused by SARS-CoV-2). Accessed December 21, 2020. https://www.dhs.gov/publication/st-master-question-list-COVID-19, 2022 Google Scholar
Schünemann, HJ, Santesso, N, Vist, GE, et al. Using GRADE in situations of emergencies and urgencies: certainty in evidence and recommendations matters during the COVID-19 pandemic, now more than ever and no matter what. J Clin Epidemiol. 2020;127:202-207. doi: 10.1016/j.jclinepi.2020.05.030 CrossRefGoogle Scholar
Jalali, R, Hosseinian-Far, A, Mohammadi, M. Contradictions in the promotion of publishing academic and scientific journal articles, and the inability to cope with the new coronavirus (COVID-19). Antimicrob Resist Infect Control. 2021;10(1):10. doi: 10.1186/s13756-021-00884-0 CrossRefGoogle ScholarPubMed
Odone, A, Galea, S, Stuckler, D, et al. The first 10 000 COVID-19 papers in perspective: are we publishing what we should be publishing? Eur J Public Health. 2020;30(5):849-850. doi: 10.1093/eurpub/ckaa170 CrossRefGoogle ScholarPubMed
Wang, LL, Lo, K, Chandrasekhar, Y, et al. CORD-19: The COVID-19 open research dataset. In: Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020. 2020; arXiv:2004.10706v4. Association for Computational Linguistics.Google Scholar
Älgå, A, Eriksson, O, Nordberg, M. The development of preprints during the COVID-19 pandemic. J Intern Med. 2021;290(2):480-483. doi: 10.1111/joim.13240 CrossRefGoogle ScholarPubMed
Elgendy, IY, Nimri, N, Barakat, AF, et al. A systematic bias assessment of top-cited full-length original clinical investigations related to COVID-19. Eur J Intern Med. 2021;86:104-106. doi: 10.1016/j.ejim.2021.01.018 CrossRefGoogle ScholarPubMed
Raynaud, M, Zhang, H, Louis, K, et al. COVID-19-related medical research: a meta-research and critical appraisal. BMC Med Res Methodol. 2021;21(1):1. doi: 10.1186/s12874-020-01190-w CrossRefGoogle ScholarPubMed
Whitmore, KA, Laupland, KB, Vincent, CM, et al. Changes in medical scientific publication associated with the COVID-19 pandemic. Med J Australia. 2020;213(11):496-499. doi: 10.5694/mja2.50855 CrossRefGoogle ScholarPubMed
Palayew, A, Norgaard, O, Safreed-Harmon, K, et al. Pandemic publishing poses a new COVID-19 challenge. Nat Hum Behav. 2020;4(7):666-669. doi: 10.1038/s41562-020-0911-0 CrossRefGoogle ScholarPubMed
Kang, M, Gurbani, SS, Kempker, JA. The published scientific literature on COVID-19: an analysis of Pubmed abstracts. J Med Sys. 2020;45(1):3. doi: 10.1007/s10916-020-01678-4 CrossRefGoogle ScholarPubMed
Fiske, ST, Dupree, C. Gaining trust as well as respect in communicating to motivated audiences about science topics. Proc Natl Acad Sci USA. 2014;111(Suppl 4):13593-13597. doi: 10.1073/pnas.1317505111 CrossRefGoogle ScholarPubMed
Pearce, W. Trouble in the trough: how uncertainties were downplayed in the UK’s science advice on COVID-19. Humanit Soc Sci Commun. 2020. doi: 10.1057/s41599-020-00612-w CrossRefGoogle Scholar
Mohammed, M, Sha’aban, A, Jatau, AI, et al. Assessment of COVID-19 information overload among the general public. J Racial Ethn Health Disparities. 2021;9(1):184-192. doi: 10.1007/s40615-020-00942-0 CrossRefGoogle ScholarPubMed
Montani, I, Honnibal, M, Van Landeghem, S, et al. spaCy: industrial-strength natural language processing in Python. 2020. Accessed May 29, 2023. https://zenodo.org/record/4021943 Google Scholar
Reimers, N, Gurevych, I. Sentence-BERT: Sentence embeddings using Siamese BERT-Networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 2019. doi: 10.48550/arXiv.1908.10084 CrossRefGoogle Scholar
BioASQ.org. BioASQ releases continuous space word vectors obtained by Applying Word2Vec to PubMed Abstracts. Accessed December 3, 2021. http://bioasq.org/news/bioasq-releases-continuous-space-word-vectors-obtained-applying-word2vec-pubmed-abstracts Google Scholar
Meyers, B. meyersbs/uncertainty. Installation & usage. Accessed May 29, 2023. https://github.com/meyersbs/uncertainty/wiki/installation-&-usage.Google Scholar
Vincze, V. Uncertainty Detection in Natural Language Texts. University of Szeged. 2014. doi: 10.14232/phd.2291 Google Scholar
Bero, L, Lawrence, R, Leslie, L, et al. Cross-sectional study of preprints and final journal publications from COVID-19 studies: discrepancies in results reporting and spin in interpretation. BMJ Open. 2021;11(7):e051821.CrossRefGoogle ScholarPubMed