Introduction
Monitoring SARS-CoV-2 infections in the endemic phase is important to estimate the incidence in the population, e.g. to identify risk groups [Reference Vos, van Hagen, Wong, Smits, Kuijer and Wijmenga-Monsuur1]or to estimate the real-world vaccine effectiveness of COVID-19 vaccines [Reference Huiberts, de Gier, Hoeve, de Melker, Hahne and den Hartog2-Reference Huiberts, Hoeve, de Gier, Cremer, van der Veer and de Melker4]. At the start of the pandemic, wide-scale community testing was available to monitor the incidence of infections. In the Netherlands, SARS-CoV-2 community testing facilities have been scaled down and eventually closed in March 2023. In addition, the commitment to self-testing declined. As wide-scale testing is no longer available, alternative methods are necessary to monitor SARS-CoV-2 infections.
One method to detect SARS-CoV-2 infections is through detecting antibodies induced by infection but not by vaccination. COVID-19 vaccines authorized in the Netherlands induce Spike S1-antibodies following vaccination, but these are also induced by infection. Spike S1 antibodies are therefore less suitable to detect SARS-CoV-2 infections in a highly vaccinated population, leaving antibodies against the Nucleoprotein (N), one of the structural proteins of SARS-CoV-2, a more specific marker to identify (re)infection [Reference van den Hoogen, Smits, van Hagen, Wong, vos and van Boven5]. While a relative increase in antibodies can be used as a marker for reinfection [Reference Cauchemez, Horby, Fox, Mai le, Thanh le and Thai6-Reference den Hartog, Andeweg, Hoeve, Smits, Voordouw and Eggink8], it is currently unknown how long N-antibodies persist after infection and which increase in N-antibodies can reliably detect reinfection.
The aim of this study is to describe the persistence of N-antibody seropositivity and boosting after infection, to estimate the most suitable fold increase to detect reinfection, and to determine whether there are factors that affect the sensitivity of detecting (re)infection.
Methods
Study design and population
VASCO (VAccine Study COvid-19) is an ongoing 5-year prospective cohort study and has included approximately 45,000 community-dwelling participants aged 18–85 years in the Netherlands [Reference Huiberts, Kooijman, de Melker, Hahne, Grobbee and Hoeve9]. The study started in May 2021, a few months after COVID-19 vaccines were introduced in the Netherlands, and the primary objective is to assess the real-world vaccine effectiveness against SARS-CoV-2 infection. Participants are asked to take a self-collected fingerprick blood sample every 6 months and one month after primary vaccination for detection of serum antibodies. In addition, participants are asked to complete monthly digital questionnaires in the first year and three-monthly in years 2–5, including questions on sociodemographic factors, health status, COVID-19 vaccination, SARS-CoV-2 related symptoms, positive SARS-CoV-2 tests (PCR or (self-administered) rapid antigen test), and willingness to test when having symptoms. Participants can also be notified of positive SARS-CoV-2 tests or COVID-19 vaccinations in the study app at any moment. After April 2022, regular testing has been scaled down in the Netherlands. Since then, participants have received self-tests free of charge and are encouraged to test when having COVID-19-like symptoms.
Antibody measurements
Self-collected fingerprick blood samples were collected in 0.5 μL Minicollect tubes (Greiner, #450533), returned in a pre-printed and addressed safety bag envelope, and centrifuged immediately upon arrival. The serum is subsequently separated, aliquoted, and stored at −80°C. Serum samples were analyzed for total immunoglobulin (Ig) levels against N-antibodies on the Cobas e801 (Roche Diagnostics, Mannheim, Germany) using batch-specific, linear calibration lines obtained with a dilution range of the NIBSC 20/136 WHO standard (NIBSC) or an internal pool of 125 N-antibody positive, anonymized patient sera calibrated against the WHO standard. The cut-off for N-positivity ranged from 2.05 to 3.79 BAU/mL between batches [Reference Huiberts, Hoeve, Kooijman, de Melker, Hahne and Grobbee10]. The clinical sensitivity of the Roche assay is 99.5% (95% confidence interval: 97.0%–100%), and the clinical specificity is 99.80% (99.69%–99.88%) [11].
To evaluate reproducibility, a total of 278 samples, ranging from 0.4 BAU/mL to 477 BAU/mL, were measured twice using different assay batches. The concordance of seropositivity was 99% (275/278). To evaluate reproducibility between assay batches, we selected all twice-measured seropositive samples (201/278). Proportions of samples with a 1.5-, 2-, 3- and 4-fold difference between measurements were calculated for these samples. Of the 201 samples, 6 (3.0%) samples, ranging from 12 BAU/mL to 107 BAU/mL, had a difference in N-antibody levels of at least 1.5-fold, and 1 (0.5%) of 2-fold (12 BAU/mL). There were no samples with a difference in N-antibody levels of 3-fold or more between measurements. The coefficient of variation ranged between 0.01% and 20.4%.
Determinants
The following potential determinants were considered: age group (18–59 years vs. 60–85 years), vaccination status (unvaccinated, partly vaccinated [one primary series dose +7 days], primary series [two primary series doses +14 days, or one dose JCovden +28 days], first booster, second booster, third booster [booster doses +7 days]), calendar time (quarters), and log N-antibody concentration of first sample (continuous in BAU/mL). In case of a reported infection between the first and second sample, the following potential determinants were also included: COVID-19-related symptoms (yes, no, unknown) and severity of infection (local, systemic, other, not reported).
For each sample pair, vaccination status was determined at the sampling date of the second blood sample, as described before [Reference Huiberts, Hoeve, Kooijman, de Melker, Hahne and Grobbee10]. Calendar time was determined by the sampling date of the second blood sample. Occurrence and type of COVID-19 symptoms were collected if participants reported a new positive test in the study app and monthly follow-up questionnaires after infection. Infections were defined as systemic when at least one of the following symptoms was reported: fever, general malaize, extreme fatigue, joint pain, muscle pain, irritability or confusion, nausea or vomiting, diarrhoea, stomach pain, pain while breathing, and shortness of breath. Infections were defined as local if at least one of the following symptoms was reported without any of the systemic symptoms present: cough, sore throat, runny nose, loss of smell and/or taste, and headache.
Data analysis
First, N-antibody levels 200 days pre- and post-infection were explored and visualized using a generalized additive model for first, second, and third reported infections separately. Models were fitted with time since infection as a base-spline with six knots. The most favourable number of knots and knot positions were determined for each subsequent infection by the AIC value of the models.
Then, we included participants with at least two blood samples with an interval of 4–8 months between May 2021 and May 2023, for which antibody assessment was available. Subsequently, the data was organized and analyzed in sample pairs consisting of two consecutive samples of the same participant. A participant could contribute more than one sample pair if the participant had submitted more than two samples, i.e. the second sample of the first sample pair could serve as a first sample in the second sample pair (Figure 1, panel A).

Figure 1. Organization of sample pairs. (a) Subsequent samples (S) form a sample pair (SP). A second sample may serve as a first sample in the next sample pair of the same individual (e.g. sample 2 is considered a second sample in sample pair 1 but a first sample in sample pair 2). (b) In case of infection (star symbol) between a first and second sample of a sample pair, a third sample may be added to evaluate a longer time interval since infection (sample triple (ST)). A third sample can only be added to the sample pair if there is no infection between the second and third samples. Fourth, samples were not included due to limited numbers. S3 may, therefore, form an ST sample pair with S1 in the upper figure and a sample pair with S2 in panel A.
All sample pairs were stratified by the N-seropositivity of the first sample. Subsequently, the sample pairs were stratified by whether an infection was reported within the sampling interval. In case an infection was reported, the time since infection was calculated as the time between the infection and the second sample. In case of a reported infection, the third sample, if available, was added to the sample pair, provided that between the second and third sample, no infection was reported and no increase in N-levels of > = twofold was observed (Figure 1, panel B).
For sample pairs where the first sample was seronegative, we calculated the percentage and 95% confidence interval (CI) of N-seroconversion overall and by categories of the determinants, stratified by reported infection within the sample pair interval. To estimate the association between potential determinants and N-seroconversion, we performed univariable and multivariable logistic regression using generalized estimating equation (GEE) models with exchangeable correlation structures to account for dependencies within participants. Odds ratios with 95% CI and p-values were provided. Variables included in the univariable and multivariable models were age group (18–59 years, 60–85 years), vaccination status, time period (year-quarter), and severity of infection (only for sample pairs with a reported infection in the sampling interval). A sensitivity analysis was performed on participants who reported to (almost) always test in case of symptoms. For sample pairs with a reported infection, the duration of seropositivity after infection was assessed by calculating the proportion (and 95% CI) of seropositive samples over time.
For sample pairs where the first sample was seropositive, the percentage of sample pairs with a 1.5-, 2-, 3-, and 4-fold was calculated overall and by categories of the determinants, stratified by reported infection within the sample pair interval. The fold increase levels were chosen arbitrarily to explore different relative increase measures as a marker for unreported reinfections. A univariable and multivariable GEE model with an exchangeable correlation structure was used to estimate the association between potential determinants and n-fold increase. Odds ratios with 95% CI and p-values were provided. Variables included in the univariable models were age group, vaccination status, time period, the concentration of the first sample in the sample pair, occurrence of symptoms, and severity of infection (only for sample pairs with a reported infection in the sampling interval). In the multivariable models, the same variables were included except for the occurrence of symptoms due to collinearity with the variable severity of infection. A sensitivity analysis was performed on participants who reported to (almost) always test in case of symptoms. For sample pairs with a reported infection, the fold increase over time was assessed by calculating the percentage (and 95% CI) of samples with a 1.5-, 2-, 3- and 4-fold increase relative to the first sample of the sample pair.
All analyses were performed using R version 4.4.1, including the tidyverse, geepack, and mgcv packages [12-Reference Wood15].
Ethics
The VASCO study is conducted in accordance with the principles of the Declaration of Helsinki, and the study protocol was approved by the not-for-profit independent Medical Ethics Committee of the Stichting Beoordeling Ethiek Biomedisch Onderzoek (BEBO), Assen, the Netherlands (NL76815.056.21). VASCO was registered in the online Dutch clinical trials register (trialregister.nl, registration number NL9279). Written informed consent was obtained from all participants prior to enrollment into the study.
Results
N-antibody concentrations following reported infection
Between May 2021 and May 2023, a total of 131,791 samples were collected from 44,407 participants. N-antibody levels 200 days before and after the first three reported infections are plotted in Figure 2. N-antibody levels post-infection showed a high degree of variation between participants (Figure 2). Overall, after the first reported infection, N-antibody levels showed an increase during the first weeks, after which a geometric mean concentration (GMC) of 39 BAU/mL was observed. This was followed by a gradual decline to 30 BAU/mL 200 days post-infection. After a second reported infection, we observed a peak GMC of 162 BAU/mL, after which N-antibody levels declined to 91 BAU/mL 200 days post-infection. After a third reported infection, N-antibody levels reached a peak GMC of 215 BAU/mL, which then declined to 150 BAU/mL 200 days post-infection.

Figure 2. Generalized additive model showing N-antibody geometric mean concentration (GMC) over time before and after a first ((a), n = 46,090 samples), second ((b), n = 9,607 samples) and third ((c), n = 719 samples) reported infection. Black lines represent N-antibody GMC and 95% confidence interval, blue scatter represents all individual samples used in the model, dotted vertical lines represent the moment of infection; red area corresponds with the cut-off range for seropositivity for different assay batches.
Selection of sample pairs
In total, 67,632 sample pairs of 33,283 participants were included. Eight thousand six hundred twenty-seven participants contributed one sample pair, 14,963 participants two sample pairs, and 9,693 three sample pairs. Figure 3 shows the number of sample pairs by seropositivity of the first sample. Infection was reported in the sampling interval for 31% of sample pairs with the first sample N-seronegative and for 14% of sample pairs with the first sample N-seropositive.

Figure 3. Flowchart of included sample pairs. Sample pairs are grouped based on the seropositivity of the first sample of a sample pair and a reported infection in the sampling interval.
Among sample pairs where the first sample was seronegative, the median N-antibody level of the second samples was 32.6 BAU/mL for sample pairs with a reported infection in the sampling interval and 1.3 BAU/mL for sample pairs without a reported infection. Additionally, the distribution of N-antibody levels was visually distinct between the two groups (Figure 4, panel A). Among sample pairs where the first sample was seropositive, the median N-antibody level of the second samples was 237.4 BAU/mL for sample pairs with a reported infection in the sampling interval and 30.6 BAU/mL for sample pairs without a reported infection (Figure 4, panel B).

Figure 4. Histogram of N-antibody levels of the second sample of sample pairs. (a) sample pairs with the first sample seronegative (n = 47,186) stratified by reported infection in the sampling interval, (b) sample pairs with the first sample seropositive (n = 20,446) stratified by reported infection in the sampling interval. Bars are plotted with overlap.
First infections
Among the sample pairs with a seronegative first sample, 14,685 infections were reported, and an additional 3,495 sample pairs for which no infection was reported showed seroconversion. Since the specificity of the Roche antibody test for N-seropositivity is 99.5%, this indicates that 19% of first infections would be missed in this study population when only considering reported infections.
Among participants who reported an infection with a seronegative first sample, the overall seroconversion rate was 89%. The seroconversion rate varied around 90% for the different age groups, levels of vaccination status, and severity of infection (Figure 5). For samples collected during Q4 of 2021 and Q1 of 2022 (mostly Delta infections), the N-seroconversion rate was significantly lower at 71% compared to ~90% in samples taken later (mostly Omicron infections) (p < 0.01). The N-seroconversion rate was significantly higher for infections with local and systemic symptoms (both 90%) than for asymptomatic infections (81%, p < 0.001). In the multivariable model, N-seroconversion was more likely for participants over 60 years compared to those under 60 years (OR: 1.25 [1.10–1.41]), unvaccinated participants compared to vaccinated participants (e.g. 0.47 [0.30–0.73] for the primary series), and for local infections (1.92 [1.40–2.65]) and systemic infections (1.81 [1.34–2.45]) compared to asymptomatic infections (Supplementary Table S1A). The odds of N-seroconversion increased with calendar time (8.04 [5.41–11.93] in 2022 Q2 to 24.39 [6.65–89.45] in 2023 Q2, compared to 2021 Q4).

Figure 5. N-seroconversion of sample pairs with (left) and without (right) reported infection. When there were less than 10 data points for a determinant, data is not shown in the figure. Calendar time was determined by the sampling date of the second blood sample.
The overall N-seroconversion rate was 11% among participants with a seronegative first sample who did not report an infection in the sample interval. The N-seroconversion rate was significantly higher among participants aged 18–59 years (12%) compared to 60–85 years (10%, p < 0.001). The N-seroconversion rate was 18% for unvaccinated participants and was significantly lower for vaccinated participants (4–16%, p < 0.05). N-seroconversion rates increased significantly with calendar-time from 1% in 2021 Q4 to 26% in 2023 Q2 (p < 0.001). In the multivariable analysis, only vaccination status and calendar time remained significant factors (Supplementary Table S1A).
N-seroconversion increased to 96% 8–9 weeks after a reported infection and started declining after 20–29 weeks (Figure 6). At 50–59 weeks after infection, the N-seroconversion rate was still 83%.

Figure 6. N-seroconversion in sample pairs with or without third sample, with a reported infection only between 1st and 2nd sample, by time since infection. Third samples were only included if there was no reported infection between the second and third samples and the absence of a 2-fold increase. The error bars represent the 95% confidence interval around the percentage. When there were less than 10 data points for a period, data was excluded from the figure.
*time since infection equals time between infection and third sample.
Reinfections
In sample pairs where the first sample was seropositive, the median fold increase in N-antibody levels was 7.6 (IQR: 2.3–22.9) for sample pairs with a reported infection in the sample interval (n = 2,934) and 0.6 (IQR: 0.4–1.1) for sample pairs without reported infection (n = 17,512).
Among the sample pairs with a seropositive first sample, a total of 2,934 infections were reported, and an additional 3,297 sample pairs for which no infection was reported showed a 1.5-fold increase, 2,676 a 2-fold increase, 2,040 a 3-fold increase, and 1,723 a 4-fold increase. Using a 1.5, 2-, 3-, or 4-fold increase suggests that, respectively, 53%, 48%, 41%, and 37% of infections are missed when only using reported infections for detection.
Figure 7 shows the percentage of sample pairs with a 1.5-, 2-, 3- or 4-fold increase in N-antibody level by determinants for sample pairs with and without a reported infection between samples. Among sample pairs with a reported infection in the sampling interval, 82% showed a 1.5-fold increase, 77% a 2-fold increase, 71% a 3-fold increase, and 65% a 4-fold increase. A fold increase was more likely among vaccinated participants, following local and systemic infections, following infections after the first quarter of 2022, and less likely with increasing antibody concentrations of the first sample of a sample pair (Supplementary Tables S1B-S1E). Age group did not affect fold increase. In the multivariable model, most of these effects were sustained, but there was no significant association between vaccination status and fold increase.

Figure 7. Percentage of sample pairs with 1.5-, 2-, 3- or 4-fold increase by determinant, stratified by reported infection in-between samples. When there were less than 10 data points for a determinant, data was excluded from the figure. Fold increase is presented by the saturation of the bars, with the lightest bars representing samples with a 1.5-fold increase and the darker bars samples with a 4-fold increase. Calendar time was determined by the sampling date of the second blood sample.
Among sample pairs without a reported infection in the sampling interval, the proportion of sample pairs with a 1.5-, 2-, 3-, and 4-fold increase were respectively 19%, 15%, 12%, and 10%. Among these sample pairs, a fold increase was more likely in the 18–59 year age group, in unvaccinated participants, for infections after 2022 Q1, and less likely with increasing antibody concentrations of the first sample of a sample pair. These results were maintained in the multivariable model (Supplementary Tables S1B-S1E).
The percentage of sample pairs with a 1.5-fold increase rose from 57% 0–1 weeks after infection to 89% 30–39 weeks after infection (Figure 8). A similar pattern is seen for the 2-, 3- and 4-fold increase in N-antibody levels, but with lower proportions with a higher fold increase. The 3-fold and 4-fold increase in N-level appeared to decline 20–29 weeks after infection.

Figure 8. Percentage of 1.5-, 2-, 3- or 4-fold increase in sample pairs with or without third sample, with a reported infection only between 1st and 2nd sample. Third samples were only included if there was no reported infection between the second and third samples and the absence of a 2-fold increase. The error bars represent the 95% confidence interval around the percentage. When there were less than 10 data points for a period, data was excluded from the figure.
*time since infection equals time between infection and the third sample.
Sensitivity analysis
A sensitivity analysis on participants who report to (almost) always test in case of symptoms revealed slightly lower proportions (approximately 2% points) of participants with seroconversion or fold increase among those that did not report an infection, particularly in the 18–59 year age group (Supplementary Tables S2A-S2E).
Discussion
In this study, we aimed to describe the persistence of N-antibody seropositivity after infection and to evaluate the sensitivity of different fold increases in order to detect reinfection(s) and whether this differed by several potential determinants. We found that the seroconversion rate to detect primary infections (using reported positive tests as a gold standard) is 89%, and using fold-increases to detect reinfections provided a sensitivity of 82% for 1.5-fold to 65% for 4-fold. We found that in our cohort, by not using serology data but only reporting infections, we would have missed 19% of primary infections and 37%–53% of reinfections, depending on the fold increase used. Among participants with a reported infection, seroconversion, 1.5-fold and 2-fold increases were maintained in more than 80% of participants, 3-fold increases in 73% of participants, and 4-fold increases in 67% after 6 months. This allows the detection of infections well for at least half a year after the occurrence of infection, making the method suitable for sampling intervals such as in the VASCO study (a 6-month interval).
We observed several determinants that affected seroconversion/fold increase. First, we observed that seroconversion/fold increase was more likely among participants with symptomatic infections. Higher antibody responses can be expected with more severe infections, resulting in a higher chance of seroconversion or a fold increase. Since widescale testing has discontinued, it is likely that participants only test when being symptomatic. This is supported by our observation that the proportion with a fold increase among those without a test increases over time. Furthermore, our sensitivity analysis among participants who (almost) always test when having symptoms showed marginal differences in proportions for sample pairs without reported infections, suggesting that unreported infections are indeed mostly asymptomatic. Second, higher seroconversion rates were seen among unvaccinated compared to vaccinated participants. After (multiple) doses, vaccinated individuals usually have higher levels of antibodies against the Spike protein [Reference Hoeve, Huiberts, de Gier, Andeweg, den Hartog and de Melker16]. As a consequence, infections may be cleared earlier before substantial levels of N-antibodies are induced, resulting in less fold increase and a lower rate of seroconversion in vaccinated participants and lower sensitivity of the assay, similar to what was shown by Bazin et al. [Reference Bazin, Rochette, Perreault, Fournier, Gregoire and Boivin17]. Furthermore, due to the absence of protective antibodies, unvaccinated participants may experience more severe infections and, therefore, induce higher antibody concentrations, resulting in higher seroconversion rates [Reference van den Hoogen, Smits, van Hagen, Wong, vos and van Boven5, Reference Van Elslande, Oyaert, Lorent, Vande Weygaerde, Van Pottelbergh and Godderis18]. Fold increase rates were not significantly different between vaccinated and unvaccinated participants, suggesting that vaccination status does not affect the detection of reinfections. Third, among sample pairs without reported infections, a fold increase was more likely among the younger age group (18–59 years). Participants in the younger age group are expected to experience fewer symptoms when infected and are less likely to test when having symptoms. This is supported by our findings in the sensitivity analysis among participants who (almost) always test when having symptoms, where the age effect was no longer present. In contrast, seroconversion among sample pairs with a reported test was more likely among the older age group. The older age group possibly experienced more severe first infections compared with the younger age group, which may have led to enhanced antibody induction in the older age group. While we adjust for symptoms after infections, we may not be able to fully adjust for severity as categorization is only based on the type of symptom (local vs. systemic) and does not include a measure for the seriousness of the symptoms. Fourth, we observed lower rates of seroconversion or fold increase among those with and without a reported infection during the Delta prevalent period. The Omicron variant has deviated more from the vaccine strain than the Delta variant. This may, in turn, result in a less adequate immune response among vaccinated participants during an Omicron infection than during a Delta infection due to immune escape. This, in turn, might result in more severe symptoms and, thus, higher rates of seroconversion following Omicron infections. Finally, we observed that a fold increase is less likely when the N-antibody levels in the first sample were already high. Recent research has shown that higher serum N-antibody levels are associated with higher mucosal immunity, thus reducing the risk of reinfection [Reference Verheul, Kaczorowska, Hofstee, Schepp, Smits and Wessels Beljaars19]. However, in addition to the observed boosting of N-antibody levels following each additional infection (at least up to three infections), we also observed a ceiling effect. The higher the N-antibody levels in the first sample, the lower the chance that a fold increase will be found, as high levels cannot always accurately be measured due to the saturation of the assay. This limits the possibility of using the fold increase as a proxy for reinfection when multiple infections occur in a relatively short period of time. Overall, using serology in addition to reported positive tests in population-based studies will help identify more infections, specifically in unvaccinated participants and younger age groups.
A 4-fold increase in antibody concentrations is considered a gold standard for various pathogens but has been posed as too strict for population studies and resulting in under-detection of reinfections [Reference Cauchemez, Horby, Fox, Mai le, Thanh le and Thai6, Reference Zhao, Siegel, Chen and Cook7]. Traditionally, a 2-fold increase was considered to carry a risk of measurement error causing false positives [Reference Cauchemez, Horby, Fox, Mai le, Thanh le and Thai6]. False positives may be caused by differences in sample concentrations due to factors other than infection, such as variations in sample distribution and storage conditions or variations in lab measurement. This may cause small variations in concentrations that may be picked up as a fold increase (e.g. a difference of 1.5 BAU/mL between a sample with 3 and 4.5 BAU/mL is a 1.5-fold increase, but not necessarily due to an infection). We found 3% false-positives when using a 1.5-fold increase and 0.5% false-positives when using a 2-fold increase in samples measured in duplicate. The extent of other sources of false positives, such as conditions during the transport of samples, could not be measured in the current study. Where false positives might not be wanted in a clinical setting, in the context of cohort studies such as the VASCO study, a small proportion of false positives may be more acceptable. For example, if one would like to exclude participants with a recent infection, this would result in the inappropriate exclusion of only a small group but simultaneously identify a large number of true positives.
Not all infections can be identified using seroconversion or fold increases. One factor that affects the possibility of detection is the time of sampling in relation to the moment of infection. N-antibody levels are still increasing during the first 4–6 weeks after infection and then decrease over time. For example, if the infection occurred just before the second sample, an infection could be missed. For reinfections, if the first sample is shortly after a prior infection or the second sample is too long after the reinfection, a fold increase may not be observed. We showed that fold increases remained high for at least half a year, making the fold increase method suitable for serial sampling with an interval of 6 months and possibly longer. Other approaches have been suggested, for example, to only look at fold increases after a downward trend in antibodies has been observed [Reference Richard, Nisenbaum, Colwill, Mishra, Dayam and Liu20]. However, this approach requires sampling with smaller intervals, as one would need to identify the downward trend before the reinfection occurs.
Recommendation
When choosing the most suitable fold increase to detect reinfections, a trade-off needs to be made between having more false positives with a lower fold increase or having less true positives with a higher fold increase. In our study, we gained 82% true positives and 3% false positives when using a 1.5-fold increase, whereas we gained 77% true positives and 0.5% false positives when using a 2-fold increase, thus decreasing the proportion of false positives to near zero. Using a 3-fold increase, the proportion of true positives decreased to 71%, whereas only a marginal reduction in false positives was found compared to using a 2-fold increase (0%). Furthermore, due to increasing antibody levels and the assay limit of detection, a higher fold increase is hard to maintain over time as larger fold increases become less practical to measure for commercial kits. At this time, a 2-fold increase, therefore, appears most favourable. In the end, the most favourable fold increase may depend on the research question.
Strengths and limitations
VASCO has a large study population with blood sampling at a regular interval, allowing us to identify (re)infections in addition to reported infections based on (self-)testing. However, the interval of 6 months makes it more difficult to study trends (e.g. downward trends) within individuals. It should be noted that the duration of N-seropositivity, but also the sensitivity and specificity is dependent on the assay used [Reference Alfego, Sullivan, Poirier, Williams, Adcock and Letovsky21, Reference Stone, Grebe, Sulaeman, Di Germanio, Dave and Kelly22]. When using an assay with lower sensitivity over time, this may limit the applicability of seroconversion or a fold increase as a proxy for (re)infection. Our findings should, therefore, be read in the context of the Roche assay measuring total Ig. In addition, the Roche assay has a high specificity (99.5%), making it suitable for this application. Finally, our method assumes that sample collection is complete, samples are collected between 4–8 months, and all have a measurable result. In reality, this is not always the case, which leads to less detection of infections than presented here.
Conclusion
Seroconversion and fold-increase are suitable methods to detect (re)infections in population-based prospective research at least up to a year after infection. Which fold increase to use requires a trade-off to include either more false positives or less true positives, and this may depend on the research question. Overall, using a 2-fold increase resulted in the detection of a large proportion of additional infections in our data, with only a small share of false positives.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S095026882500010X.
Data availability statement
Anonymized data reported from this study can be obtained from the corresponding author upon request. The dataset may include individual data, and a data dictionary will be provided. Data requests should include a proposal for the planned analyses. Data transfer will require a signed data-sharing agreement.
Author contribution
Writing – review & editing: A.J.H., H.d.M., R.v.B., E.R.V., G.d.H., M.K., S.P.A., S.v.d.H.; Conceptualization: E.R.V., M.K., C.E.H.; Methodology: E.R.V., M.K., N.N., C.E.H.; Data curation: N.N., C.E.H.; Formal analysis: C.E.H.; Visualization: C.E.H.; Writing – original draft: C.E.H.
Funding statement
This study is funded by the Dutch Ministry of Health, Welfare and Sport.
Competing interest
The authors declare none.
Ethical standard
Medical Ethics Committee of the Stichting Beoordeling Ethiek Biomedisch Onderzoek, Assen, the Netherlands, gave ethical approval for this work.