Introduction
This study set out to identify trends and seasonal variations; made a prediction based on the globally reported cases of the Middle East respiratory syndrome coronavirus (MERS-CoV), extrapolated into the future by forecasting the trend and assessed contributions of various risk factors for the MERS-CoV cases. Specifically, we consider the questions: (1) what is the underlying growth or trend of the globally reported MERS-CoV cases? (2) Are there seasonal variations present in globally reported MERS-CoV cases over time and how do they affect the series? (3) What are the contributions of various risk factors for MERS-CoV cases?
The motivation for this study is that to date, the World Health Organisation (WHO) places the MERS-CoV among the list of blueprint priority diseases. Although a survey of the literature shows a rapid increase in research activities related to MERS-CoV [Reference Zyoud1], yet there is still a general paucity of empirical literature examining trends and seasonality. To the best of our knowledge, there is a single study, which anecdotally examined seasonality and time series patterns of MERS-CoV to date [Reference He2]. Given MERS-CoV remains a potential pandemic disease globally, it is important to understand the dynamics of the underlying growth or trend of the globally reported cases.
The motivation for this study also comes from the aetiology of MERS-CoV, especially its causes, spread, the complexity of its diagnosis and mortality. MERS-CoV is a virus that causes severe viral pneumonia in humans, known to have a high mortality rate [Reference Ahmed3–Reference Sherbini7] and has clinical symptoms similar to severe acute respiratory syndrome coronavirus [Reference Hui, Memish and Zumla8, Reference Corman9]. It was first reported in Saudi Arabia [10] and after that, the virus exhibited outbreaks in several regions of the world, particularly Saudi Arabia and the Republic of Korea [10, Reference Khan11]. Additionally, the complexity of MERS-CoV and its diagnosis of infection have been acknowledged in the literature [Reference Al Johani and Hajeer12, Reference Sung13]. According to WHO, 2160 laboratory-confirmed cases of MERS-CoV were reported at the end of January 2018, including 773 associated deaths (case–fatality rate: 35.8%) that were reported globally [14]. The majority of these cases were reported in Saudi Arabia.
Methods
This is a time series analysis using publicly reported MERS-CoV monthly global cases. The WHO receives confirmed MERS-CoV cases from countries across the world. To date, new cases continue to be identified and reported to the WHO, specifically from the Middle East region. These data are available at http://www.who.int/csr/don/archive/disease/coronavirus_infections/en/. The latest report included one case from Malaysia on 8 January 2018. We used time series of MERS-CoV cases reported between January 2015 and January 2018, where WHO began using standard case report. A research assistant retrieved data from WHO webpage and reviewed for quality by the second study author. The data retrieved include patient and clinical data such as age, gender, healthcare worker, comorbidity, the source of infection and geographical regions. The main outcome was the number of cases reported on a monthly basis from January 2015 to January 2018.
Statistical analysis
The analysis was performed using STATA 12 (STATA Corp., Texas, USA) and Microsoft Excel 10. Using the classical multiplicative time series model, we decomposed the original series into seasonal, irregular and trend components and examined their effects. Additionally, we identified patterns, smoothened series, generated predictions and employed forecasting techniques based on linear regression. Finally, we assessed contributions of various risk factors in MERS-CoV cases over time. P-values <0.05 were considered statistically significant.
Results
Figure 1 shows that although cases of MERS-CoV are decreasing in the range period selected, the series exhibits peaks and spikes.
Figure 1 also reveals a negative trend of MERS-CoV series from January 2015 to January 2018.
We investigated direction and significance of this trend, as well as stationarity of the series. While a unit root test for non-stationarity confirmed the MERS-CoV cases had a negative and statistically significant trend, the series was found to be stationary. The negative and statistically significant trend was also confirmed by subsequent regressions.
We collapsed the monthly series into quarters and then smoothened out the series using a centred moving average in order to understand underlying growth component. We assumed the classical multiplicative time series model by decomposing original series into seasonal, irregular, and trend components (Table 1)
Q1, Q2, Q3, Q4 are the Quarters of the year and * = implies forecast.
Our decomposition of the MERS-CoV cases shows that in 2016Q2, the seasonality and irregularity components of the series were 8% below the baseline (the centred moving average). Decomposing further, seasonality component was 11% above the baseline in 2016Q2, while it was 41% below the baseline in 2016Q4. Our analysis also shows the de-seasonalised series of the original MERS-CoV by removing seasonality and irregularity components. Using a linear regression, we then estimated the effect of time on the deseasonalised series to capture the underlying growth or trend component using a linear regression in order to make predictions. Since the last available data was in January 2018, we also made a forecast of three more quarters (2018Q2, Q3, Q4), an additional 9 months into the future. The forecast of the series revealed that MERS-CoV cases would approach zero by end of 2018 or beginning of 2019, making further extrapolation into the horizon infeasible.
Figure 2 shows decomposition of MERS-CoV series of the original series, centred moving average (smoothed series), forecast and linear forecast. Together, after accounting for irregular and trend components of the series, seasonality was found to range between 41% below the baseline i.e. the centred moving average (of order 4) in some Quarters and 11% above the baseline in other Quarters. The average seasonality component was found to be 14% below the baseline. However, regression estimation revealed that, unlike the trend component, seasonality was not statistically significant, a fact also backed by our statistical test, which showed the monthly global MERS-CoV cases series were stationary.
Regressions
Table 2 shows results of the effect of trend and seasonal on MERS-CoV cases. We compared the seasonal dummy variables, interpreted by comparing them with Quarter 4 (Q4) (base season) while holding time constant. Time, in this analysis, was represented by successive months and was interpreted as the effect of the linear trend on MERS-CoV cases over time, holding the effect of the seasons constant.
a Significant at <5% level.
The regression results revealed that after accounting for the trend, MERS-CoV cases Quarter 1 (Q1) each year averaged about 14 cases more than Q4 cases, although the effect was not found to be statistically significant. Similarly, after adjusting for the trend, cases in Quarter 2 (Q2) averaged around 16 cases more than Q4 cases. Quarter 3 (Q3) cases averaged 19 cases more than Q4 cases after accounting for the trend component time. It is important to note that the effects of seasonality were not statistically significant. However, what was revealed statistically significant is the negative trend. Consistent with the negative trend shown in Figure 1, the regression results revealed that each additional quarter registered approximately an average decrease of one case, after adjusting for the season. In other words, the MERS-CoV cases decreased, on average, by four (4 × −0.8667627) per quarter year to year. The model as a whole appears to suggest statistically significant predictive capacity.
We analysed fluctuations of reported MERS-CoV cases from period to another by graphing the residuals (generated from the regression of trend and seasonality on MERS-CoV cases) against time, as is the convention with time series analysis. The results indicated no clear patterns, suggesting that correlated errors are not a problem with this model. Other diagnostic tests also revealed neither violation of the classical linear assumptions nor correlation between the reported MERS-CoV cases in each month with cases reported in earlier months.
We further examined the effects of various risk factors of MERS-CoV cases such as camel contact, healthcare worker contact, exposure, gender and region. Table 3 shows regression that adjusts for these factors. The results reveal camel contact, Saudi Arabia and the Middle East regions, as well as being male significantly contribute to the global reported MERS-CoV cases (Table 3). The model in this estimation also appears to suggest statistically significant predictive capacity.
a Significant at <5% level.
Discussion
Using linear time series models and their application to the modelling and prediction of the globally reported MERS-CoV data, the present study identified trends, analysed seasonality, predicted and forecast evolution of MERS-CoV cases and assessed the contribution of various risk factors. The decomposition of the time series of MERS-CoV cases into trend and seasonality components and making predictions have not hitherto been studied in the context of MERS-CoV pandemic. In this study, we set out to understand the dynamics of its growth over time.
The results of our time series analysis of globally reported MERS-CoV cases suggest a significant negative trend that is forecast to be eradicated in the near future unless something unexpected happens.
Our study showed that although seasonality oscillated below or above the baseline i.e. the centred moving average (of order 4) over time, the average seasonality component was found to be 14% below the baseline. Even then, our analysis showed that, unlike the trend component, seasonality did not affect the series over time. Many risk factors are associated with MERS-CoV cases, mortalities, or complications. Our results indicate those aged under 30 years (reference category) are less likely to be a MERS-CoV case than those aged over 30, consistent with several studies that associated MERS-COV with elderly patients [Reference Ahmed15, Reference Al-Turaiki, Alshahrani and Almutairi16]. Surprisingly, comorbidity did not show a statistically significant contribution to MERS-CoV cases. However, there are studies that showed MERS-CoV cases were associated with patients with comorbidities [Reference Choi17–Reference Rivers, Majumder and Lofgren19]. A recent systematic study, for example, suggests the prevalence of comorbidities of MERS-CoV cases, diabetes, hypertension and cardiac diseases [Reference Badawi and Ryoo20]. While our analysis suggests males contribute to the global reported MERS-CoV cases, gender was reported to have a mixed effect on MERS-CoV mortalities in the literature. Some studies showed men as high risk [Reference Sherbini7, Reference Banik18] and MERS-CoV infects more males than females [Reference Assiri5, 21–Reference Assiri23]. Other literature indicated that the frequency of deaths was less in men [Reference Al Ghamdi24].
The literature showed that MERS-CoV can be spread via human–human [25–Reference Zhou29], or healthcare facilities [Reference Assiri23, Reference Harriman, Brosseau and Trivedi30–Reference Oboho32]. Other studies revealed animal to human [Reference Gossner33, Reference Ahmed34] as the primary culprit of MERS-CoV virus transmission. Specifically, the literature showed camels act as a direct source of human MERS-CoV infection [Reference Memish, Al-Tawfiq and Assiri31, Reference Memish35], while healthcare workers were reported to be at higher risk [Reference Sherbini7, Reference Al-Turaiki, Alshahrani and Almutairi16, Reference Rivers, Majumder and Lofgren19, Reference Al Ghamdi24]. The results of our study in this regard were mixed. While our study indicated that the effect of camel cases on overall MERS-CoV reported cases are positive and significant, the contribution of healthcare workers was not. Our analysis also showed evidence of geographical contributions to MERS-CoV cases such as Saudi Arabia and greater Middle East compared with South Korea. This can be seen as somewhat consistent with earlier studies that demonstrated a link between mortality associated with MERS-CoV and geography [Reference Ahmed15]. This finding is also intuitive in that Saudi Arabia and Middle East, in general, remain the global epicentre of MERS-CoV, hence the name.
The contribution of our study is that it adduces empirical evidence by making inferences and predictions based on the globally reported cases of MERS-CoV and extrapolated into the future by forecasting the trend. Unlike previous studies that descriptively analysed seasonality patterns of MERS-CoV and influenza in the Middle East [Reference He2], our study presents statistically significant results of trends of global MERS-CoV cases, consistent with regularities underlying the empirical dynamics and classical time series analysis. However, there are limitations of this study. First, the data used for this study comprised 37 months (January 2015 to January 2018). While this was just enough for several years’ worth of monthly observations to appropriately model seasonality, time series analysis can be sensitive to the number of observations. Hence, sufficiently large number of observations might have provided a better fit and results. Additionally, the analysis utilised WHO open source globally reported data, which may lack harmonisation from the various country sources.
Conclusion
This study contributes to the time series analysis of MERS-CoV literature. In particular, our analysis of trends and seasonality components the series, the prediction based on the globally reported cases of MERS-CoV and extrapolation into the future by forecasting the trend is envisaged to help in understanding the dynamics of the reported cases over time. The study findings suggest a significant negative trend of the monthly and quarterly data from 2015 to 2018. However, a further extrapolation into the future reveals that the MERS-CoV cases are forecast to be zero by end 2018 or beginning of 2019 unless something unexpected happens. Seasonality component of the series oscillated below or above the baseline, i.e. the centred moving average but did not affect the series over time. The results demonstrated that camel contact, exposure, gender, age and geography/region significantly contributed to the overall global reported MERS-CoV cases. The findings may well inform healthcare practitioners and policymakers about the underlying dynamics that produced the globally reported MERS-CoV cases.
Data
These dataset used and/or analysed are available at http://www.who.int/csr/don/archive/disease/coronavirus_infections/en/.
Acknowledgements
The authors are grateful for the collegiality and research support at the College of Public Health and Health Informatics, King Saud bin Abdulaziz University for Health Sciences. The information and opinions contained in this work do not necessarily reflect the views or policy of these institutions. This research is supported by King Abdullah International Medical Research Centre (KAIMRC), King Saud bin Abdulaziz University for Health Sciences, National Guard Health Affairs, Riyadh, Saudi Arabia.
Author contributions
OBD analysed the data and wrote manuscript. AEA retrieved the data from the WHO /registry website. AEA reviewed analysis and manuscript. All authors approved final manuscript for submission.
Conflict of interest
None declared.
Ethical standards
Not applicable.