Introduction
There is limited evidence informing the detection of anxiety in older adults, with even less evidence available for those living in long-term care (LTC) (Therrien & Hunsley, Reference Therrien and Hunsley2012). LTC, also known as “nursing homes”, “continuing care”, or “residential care homes” provide 24-hour nursing support to residents with complex medical or physical needs (Canadian Institute for Health Information, 2021). In Alberta, the average age in LTC was reported to be 82.5, the majority of residents were female, and 59 per cent of residents had a diagnosis of dementia (Alberta Health Continuing Care, 2018). Prevalence estimates of anxiety symptoms and disorders in LTC are wide ranging, but are as high as 58.4 per cent for anxiety symptoms and 20 per cent for anxiety disorders (Creighton, Davison, & Kissane, Reference Creighton, Davison and Kissane2016). Other common conditions such as dementia and depression impact 69 per cent and 44 per cent of residents of LTC, respectively (Canadian Institute for Health Information, 2010; Canadian Institute for Health Information, 2018). Anxiety in residents of LTC has been associated with increased suffering and negative health outcomes, including reduced levels of functioning, reduced well-being, and increased health service use (Lenze, Reference Lenze2003; Smalbrugge et al., Reference Smalbrugge, Pot, Jongenelis, Gundy, Beekman and Eefsting2006).
Anxiety in residents of LTC can be difficult to detect because of the medical complexity, frailty, and symptom overlap with prevalent co-morbidities such as dementia and depression (Ballard et al., Reference Ballard, Neill, O’Brien, McKeith, Ince and Perry2000; Fuentes & Cox, Reference Fuentes and Cox1997; Seignourel, Kunik, Snow, Wilson, & Stanley, Reference Seignourel, Kunik, Snow, Wilson and Stanley2008). Because of the difficulties in detecting anxiety, residents of LTC may be at increased risk of experiencing the effects of unrecognized and untreated anxiety. Anxiety detection tools are often used to detect anxiety symptoms, but it is not clear which tool to use in the LTC population.
Commonly used anxiety detection tools for older adults have been identified, but the majority lack psychometric evidence for use in the older adult population and clinically relevant cut-off scores (Therrien & Hunsley, Reference Therrien and Hunsley2012). A systematic review identified 22 tools that have been used to detect anxiety in the LTC population (Creighton, Davison, & Kissane, Reference Creighton, Davison and Kissane2018a). Although psychometrics specific to the identified tools were collected, the diagnostic performance of the identified tools was not compared to a reference standard. Without comparing a tool’s diagnostic performance to a reference standard, the diagnostic accuracy, or the ability of the tool to detect true cases of anxiety within a specific population, cannot be evaluated. The lack of synthesized evidence of rigorous diagnostic accuracy studies for anxiety within LTC highlights a gap in knowledge as to which anxiety detection tools are supported for use with the LTC population, demonstrated through validation studies that utilize a reference standard.
To assess the diagnostic accuracy of tools to detect anxiety, the performance of the anxiety detection tool must be compared to a reference standard that is used to accurately define the target condition (Leeflang, Reference Leeflang2014). Our research objective was to identify anxiety detection tools that have been evaluated using a reference standard, and to determine which tools are most effective at detecting anxiety disorders or symptoms in residents of LTC. The findings from this systematic review may improve the ability of practitioners to detect anxiety in residents of LTC. Time and treatment resources may also be better utilized through an understanding of which anxiety detection tools are most accurate for use in the LTC population. Through increased detection of anxiety, residents of LTC may have increased access to appropriate treatment options which may then improve the health of residents downstream.
Methods
Search Strategy
The initial search strategy was developed by K.A., S.S., and Z.G. A health sciences librarian reviewed and edited the search strategy to ensure appropriate database, keyword, and controlled vocabulary selection. Four electronic databases (MEDLINE®, Embase, PsycINFO, and the Cochrane Database of Systematic Reviews) were searched to identify literature relevant to the research objective. In addition to the electronic databases, grey literature sources including general (e.g., Google Scholar) and theses (e.g., Open Access Theses and Dissertations) databases, as well as Web sites of relevant organizations, were searched using the keyword search terms anxiety and LTC (Appendix I). The systematic review was registered with PROSPERO (CRD42020155206).
The two search concepts used were anxiety and LTC. The keywords and the controlled vocabulary (MeSH, Emtree, and PsycINFO terms) were combined within each concept using “OR”. The two search concepts were then combined using “AND”. No search filters or limits to language, year of publication, or publication status were applied to the search. The MEDLINE search strategy is reported in Appendix I.
Inclusion Criteria
At the level of title/abstract, included literature had to detail individuals residing in LTC who were screened or assessed for anxiety symptoms or disorders. At the level of full text, included literature had to be primary research, detail the use of a tool to detect anxiety (i.e., any tool, measure, questionnaire, or scale used to detect anxiety), and compare the anxiety detection tool to a reference or gold standard method of anxiety detection (i.e., clinical interview, Diagnostic and Statistical Manual of Mental Disorders [DSM] criteria, or International Classification of Diseases [ICD] criteria). The reference lists of included studies were hand searched. Google translate was used to screen any non-English articles.
Study Selection
A Population, Index Tool(s), Reference Tool(s), Diagnosis (PIRD) statement was used to articulate the inclusion criteria for the review (Munn, Stern, Aromataris, Lockwood, & Jordan, Reference Munn, Stern, Aromataris, Lockwood and Jordan2018). The population of interest was adults 65 years of age and older, of any cognitive status, residing in LTC settings. Given the range in cognitive status in LTC, it is useful to look at the population as a whole and not only at those with dementia to obtain a more complete understanding of which anxiety detection tools might be useful to the broader population in LTC. Anxiety detection tools were described as tools or questionnaires used to detect anxiety disorders or symptoms of anxiety. Anxiety detection tools could include co-morbidity-specific tools (e.g., Rating Anxiety in Dementia [RAID] scale [Shankar, Walker, Frost, & Orrell, Reference Shankar, Walker, Frost and Orrell1999]), population-specific tools (e.g., Geriatric Anxiety Inventory [GAI] [Pachana et al., Reference Pachana, Byrne, Siddle, Koloski, Harley and Arnold2007], Geriatric Anxiety Scale [Segal, June, Payne, Coolidge, & Yochim, Reference Segal, June, Payne, Coolidge and Yochim2010]), or other neuropsychiatric tools that measured anxiety or included anxiety items as part of global measures (e.g., Neuropsychiatric Inventory [Cummings et al., Reference Cummings, Mega, Gray, Rosenberg-Thompson, Carusi and Gornbein1994]). The reference standard was used to evaluate the diagnostic accuracy of the anxiety detection tool. Reference standards, also called “gold standards”, had to be a set of criteria used by clinicians to diagnose anxiety symptoms or disorders. There is no single reference standard method for anxiety detection in older adults, therefore, the DSM criteria, the ICD criteria, or a clinical diagnosis by a health care practitioner were considered acceptable methods for anxiety diagnosis (American Psychiatric Association, 2013; World Health Organization, 2004). Clinical diagnoses included the use of clinical interviews based on the DSM or ICD criteria (e.g., Structural Clinical Interviews for DSM [SCID] or Schedules for Clinical Assessment in Neuropsychiatry [SCAN]). The outcome of interest was the diagnostic accuracy of the anxiety detection tool as compared to the reference standard.
Following the removal of duplicates, all citations were independently reviewed by two reviewers at the level of title/abstract. All literature that met the inclusion criteria at the level of title/abstract was then reviewed at the level of full text in duplicate. Disagreements between reviewers were resolved through discussion at the level of title/abstract screening, and by a third reviewer at the level of full text. Literature was screened and data from included studies were extracted between September and November 2019. After adequate agreement was established with the first 50 articles, reviewers evaluated consensus every 500 articles at the level of title/abstract and every 50 articles at the level of full text.
Data Extraction
Included studies had all relevant data extracted independently by two reviewers using a standardized data extraction form. Data extracted from studies included: the study authors, year of publication, country of origin, anxiety detection tool or questionnaire, reference standard, population characteristics, cognitive assessment tool used, cognitive status of participants, LTC setting characteristics (e.g., level of care provided, funding model), co-morbidities/diagnoses, and tool administrator. The reported outcomes of diagnostic accuracy extracted for each cited anxiety detection tool included: true positives, true negatives, false positives, false negatives, sensitivity, specificity, predictive values, likelihood ratios, area under the curve, and the prevalence of anxiety.
Risk of Bias Assessment
The risk of bias assessment addressed concerns of the applicability of each study to the review question and concerns of bias within domains specific to the study population, anxiety detection tool, reference standard, and flow and timing of the study. The risk of bias assessment for all included studies was completed independently in duplicate using the Quality Assessment for Diagnostic Accuracy Studies-2 (QUADAS-2) tool (Whiting et al., Reference Whiting, Rutjes, Westwood, Mallett, Deeks and Reitsma2011). The QUADAS-2 tool was selected to account for blinding, verification bias, and spectrum bias. Each domain was assessed to have an overall low, unclear, or high risk of bias based on questions within the domain. For example, the three questions (1) was a consecutive or random sample of patients enrolled; (2) was a case-control design avoided; and (3) did the study avoid inappropriate exclusions, were used to assess an overall level of bias within the domain of patient selection for each study (Whiting et al., Reference Whiting, Rutjes, Westwood, Mallett, Deeks and Reitsma2011). Any conflict among reviewers in the risk of bias assessment was discussed and resolved by reviewers.
Data Synthesis and Analysis
A descriptive synthesis of the systematic review findings was completed. The diagnostic anxiety detection tool and reference standard findings were reported in tabular format. The sensitivity, specificity, and likelihood ratios were collected for each tool at each reported cut-off point. There were limited studies per anxiety detection tool; therefore, combined analyses were not pursued. Diagnostic accuracy outcomes were not plotted on the receiver operating characteristic (ROC) space nor was a summary estimate generated. With only two studies per anxiety detection tool, the heterogeneity among studies made combined analyses inappropriate, as the statistical models for such analyses are not accurate. All findings from this systematic review were reported in adherence to the Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy (PRISMA-DTA) studies checklist (McInnes et al., Reference McInnes, Moher, Thombs, McGrath, Bossuyt and Clifford2018).
Results
Study Selection
A total of 4,620 articles were returned from the four databases searched. After the removal of duplicates, 3,464 articles were screened at the level of title/abstract. A total of 167 articles were reviewed in full text, with four articles meeting the inclusion criteria and being included in the final descriptive synthesis. An additional 1,018 records from grey literature sources and reference lists of included articles were reviewed at the level of title/abstract but none met inclusion for full text review. Web sites of relevant organizations were searched with no relevant records identified.
The greatest number of articles were excluded at the level of full text because of the absence of a reference tool when assessing for anxiety using an anxiety detection tool (n = 82). Other reasons for exclusion at the level of full text included the study population not being residents of LTC (n = 21) or the absence of an anxiety detection tool or anxiety assessment (e.g., global measures of mood) (n = 20). All other reasons for full text exclusion are reported in Figure 1.
Characteristics of Included Studies
The study and participant characteristics of the four included studies are summarized in Table 1. All studies were published between 2008 and 2019. Studies originated from either Australia, Norway, or the Netherlands (Boddice, Pachana, & Byrne, Reference Boddice, Pachana and Byrne2008; Creighton, Davison, & Kissane, Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011; Goyal, Bergh, Engedal, Kirkevold, & Kirkevold Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). LTC settings included nursing homes (n = 2) and residential care homes (n = 2), with multiple locations being included within the analysis of each study. Residential care homes were reported to provide daily care to residents over the age of 65 and a lower level of care than that provided by nursing homes, according to one study (Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011), whereas another study used the term “residential aged care facilities” to refer to nursing homes, assisted living, LTC/residential homes, and hostels (Creighton et al., Reference Creighton, Davison and Kissane2019). No further information on the level of care within the settings of included studies was provided. The sample size of included studies ranged between 27 and 277 participants with a combined total of 585 participants from all studies. Females made up between 63 and 78.2 per cent of study participants. The mean age of participants ranged from 82.8 to 86 years of age. Information about the co-morbidities of the study participants was absent from all studies. The mean Mini-Mental State Examination (MMSE) (Folstein, Folstein, & McHugh, Reference Folstein, Folstein and McHugh1975) score of participants, a measure of cognitive function, was reported by three studies (Boddice et al., Reference Boddice, Pachana and Byrne2008; Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The mean MMSE scores and standard deviations were found to be 24(4.1), 24.7(3.6), and 14(5.6) indicating samples with both mild and moderate levels of cognitive impairment (Boddice et al., Reference Boddice, Pachana and Byrne2008; Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The prevalence of diagnosed anxiety ranged from 6.5% to 27.7%. Two studies investigated GAD as the outcome of interest (Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017), one study assessed for DSM-IV defined anxiety disorders (Boddice et al., Reference Boddice, Pachana and Byrne2008), and the remaining study separately assessed for GAD and anxiety disorders inclusive of GAD, panic disorder, social phobia, and agoraphobia (Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011).
Note. CES-D = Center for Epidemiologic Studies Depression Scale; CIDI = Composite International Diagnostic Interview; DSM-5 = Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition; GAD = generalized anxiety disorder; GAI = Geriatric Anxiety Inventory; HADS-A = Hospital Anxiety and Depression Scale-Anxiety Subscale; LTC = long-term care; MINI = Mini-International Neuropsychiatric Interview; MMSE = Mini-Mental State Examination; n = sample size; RAID = Rating Anxiety in Dementia Scale; RAID-N = Rating Anxiety in Dementia Scale-Norwegian version; SD = standard deviation
a Anxiety disorders defined using DSM-IV include: GAD, obsessive-compulsive disorder, panic disorder (with or without agoraphobia), post-traumatic stress disorder, and social anxiety disorder.
b Anxiety disorders included one or more of the following: GAD, panic disorder, social phobia, agoraphobia.
c Mini-Mental State Examination-Norwegian Revised 2 (MMSE-NR2) used to assess 94 of 101 patients.
Risk of Bias Assessment
The complete risk of bias assessment is reported in Table 2. There were no concerns related to applicability to the research question in the four included studies. One study had a low risk of bias across all four domains (Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The remaining three studies had unclear ratings for anxiety detection tool and reference standard categories (Boddice et al., Reference Boddice, Pachana and Byrne2008; Creighton et al., Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011). The unclear ratings were a result of inadequate reporting on who administered the anxiety detection and reference tools and whether or not the tool administrator was blinded to the results of the other tool. One study also scored a high risk of bias in patient selection because of participant exclusions for reference standard testing and analysis. Specifically, only participants with a Center of Epidemiologic Studies Depression Scale (CES-D) tool score above the cut-off of 8, indicating a suspected depressive or anxiety disorder, were tested using the reference standard (Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011).
Note. Unclear indicates that sufficient information to support a definitive answer of yes/no was not reported in the study.
a Applies to all three index tools (Geriatric Anxiety Inventory [GAI], Hospital Anxiety and Depression Scale-Anxiety Subscale [HADS-A], Rating Anxiety in Dementia Scale [RAID]) reported by Creighton et al. (Reference Creighton, Davison and Kissane2019).
Anxiety Detection Tools and Reference Standards
Four anxiety detection tools were identified from the four included studies (Table 3). Three studies reported on the use of only one tool (Boddice et al., Reference Boddice, Pachana and Byrne2008; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017), whereas one study reported on the use of three different tools, all in comparison to a single reference standard (Creighton et al., Reference Creighton, Davison and Kissane2019). Two studies investigated the diagnostic accuracy of the Geriatric Anxiety Inventory (GAI) (Boddice et al., Reference Boddice, Pachana and Byrne2008; Creighton et al., Reference Creighton, Davison and Kissane2019). Two studies looked at the Rating Anxiety in Dementia (RAID) scale. One study each looked at the English and Norwegian versions of the RAID scale (Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The diagnostic accuracy of the CES-D and Hospital Anxiety and Depression Scale-Anxiety (HADS-A) were each investigated by one study (Creighton et al., Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011).
The number of cut-offs reported per tool ranged from one to seven. Three studies reported the diagnostic accuracy of the anxiety detection tool(s) using multiple cut-offs. The cut-offs tested in the three studies were not pre-specified but did include the tool’s original recommended cut-off. Study authors identified an optimal threshold when multiple cut-offs were reported (Table 4). Optimal cut-offs were defined by a sensitivity and specificity above 80 per cent and 60 per cent, respectively (Creighton et al., Reference Creighton, Davison and Kissane2019); a sensitivity above 80 per cent (Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011); and the highest sensitivity (Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017).
Note. aRows indicate optimal cut-off for each tool or outcome as defined by the study authors.
AxD = anxiety disorder; CI = confidence interval; CES-D = Center for Epidemiologic Studies Depression Scale; CIDI = Composite International Diagnostic Interview; DSM-5 = Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition; GAD = generalized anxiety disorder; GAI = Geriatric Anxiety Inventory; HADS-A = Hospital Anxiety and Depression Scale-Anxiety Subscale; LR(-) = negative likelihood ratio; LR(+) = positive likelihood ratio; MINI = Mini-International Neuropsychiatric Interview; RAID = Rating Anxiety in Dementia scale; RAID-N = Rating Anxiety in Dementia Scale-Norwegian version.
Two studies reported on RAID and identified ≥ 11 as the optimal cut-off with sensitivities of 85.0 per cent and 85.7 per cent (Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). Although the sensitivities were similar, one study reported a higher specificity at the ≥ 11 cut-off (Creighton et al., Reference Creighton, Davison and Kissane2019). Additionally, two studies reported on the GAI and either examined the GAI at only the ≥ 9 cut-off or identified the ≥ 9 cut-off as optimal, which produced sensitivities of 80.0 per cent and 90.0 per cent (Boddice et al., Reference Boddice, Pachana and Byrne2008; Creighton et al., Reference Creighton, Davison and Kissane2019).
The study that used the CES-D tool identified an optimal cut-off of ≥ 18 for the detection of both GAD and anxiety disorders in the LTC population (Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011). In the study that looked at three different tools, the GAI appeared to have the best balance of sensitivity and specificity at the optimal cut-off; however, both the GAI and HADS-A had sensitivities of 90.0 per cent at the author identified optimal cut-offs (Creighton et al., Reference Creighton, Davison and Kissane2019). In the studies that provided diagnostic accuracy data at more than one cut-off, lower cut-offs that produced high sensitivities and adequate specificities were generally selected as optimal for use in the LTC population.
All reference tools used in the four included studies were based on or derived from the DSM, which was considered a reference standard method for detecting anxiety. One study used the Composite International Diagnostic Interview (CIDI) 2.1, which is a structured clinical interview based on the DSM-IV (Boddice et al., Reference Boddice, Pachana and Byrne2008). Two studies used the Mini-International Neuropsychiatric Interview (MINI), a diagnostic interview that uses DSM-5 and ICD-10 criteria (Creighton et al., Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011). The final study used a clinical interview based on DSM-5 criteria (Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017).
Discussion
We identified four tools (RAID, GAI, HADS-A, and CES-D) used to detect anxiety in residents of LTC, which were validated using a reference standard. There was a high degree of clinical heterogeneity among the four included studies that limited direct comparisons among studies. Results from the included studies found that most tools performed similarly with respect to sensitivity. The limited findings indicate that there remains an evidence gap as to which anxiety detection tool and corresponding cut-off is best for use in the LTC population.
The GAI (cut-off ≥ 9) and HADS-A (cut-off ≥ 6) appeared to have the best sensitivity (both 90%) while maintaining acceptable specificity for detecting GAD in the LTC population (Creighton et al., Reference Creighton, Davison and Kissane2019). The performance characteristics of the GAI may have been dependent upon the condition of interest, as a lower sensitivity but higher specificity was reported when identifying anxiety disorders at the same cut-off (Boddice et al., Reference Boddice, Pachana and Byrne2008).
To maximize sensitivity for screening purposes, the studies that tested the RAID scale recommended a cut-off of ≥ 11 (sensitivity [SN]: 85.0% and 85.7%), aligned with what is suggested by the original scale, however, these sensitivities were lower when compared with other tools (Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017; Shankar et al., Reference Shankar, Walker, Frost and Orrell1999). A cut-off of ≥ 12 (SN: 82.1%) was also recommended for the RAID scale to optimize clinical utility with high sensitivity and moderate specificity (Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The studies that tested the GAI either tested the previously recommended cut-off of ≥ 9 (SN: 80.0%) (Boddice et al., Reference Boddice, Pachana and Byrne2008) or found the cut-off of ≥ 9 (SN: 90.0%) to be the optimal threshold with maximized sensitivity and specificity (Creighton et al., Reference Creighton, Davison and Kissane2019). When the HADS-A was tested, a cut-off of ≥ 6 (SN: 90.0%) was recommended for screening purposes in residents of LTC, which is lower than the recommended threshold for the tool (Creighton et al., Reference Creighton, Davison and Kissane2019; Zigmond & Snaith, Reference Zigmond and Snaith1983). To detect GAD, the study that used the CES-D recommended a cut-off of ≥ 18 (SN: 88.9%), compared with the recommended cut-off of ≥ 16 used for depressive symptoms (McQuaid, Stein, McCahill, Laffaye, & Ramel, Reference McQuaid, Stein, McCahill, Laffaye and Ramel2000; Sawyer Radloff & Teri, Reference Sawyer Radloff and Teri1986).
All studies reported optimal cut-offs, which were those with high sensitivity and specificity. The optimal cut-offs were often selected over those producing the highest sensitivity. Maintaining a high sensitivity, or detecting most true cases of anxiety, is important for initiating treatment and decreasing anxiety morbidity. The high prevalence of anxiety in LTC may justify the use of a detection tool, as identifying cases of anxiety in residents may prove effective in benefiting residents’ quality of life and caregiver resources.
The included studies provided limited descriptions of the feasibility of the tools or their ease of use in the LTC setting. The RAID and GAI tools were constructed specifically to detect anxiety in the older adult population (Pachana et al., Reference Pachana, Byrne, Siddle, Koloski, Harley and Arnold2007; Shankar et al., Reference Shankar, Walker, Frost and Orrell1999). Three tools (RAID, HADS-A, and CES-D) were identified as appropriate for use in detecting anxiety in the presence of co-morbidities (Creighton et al., Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The RAID scale is a tool used to screen for anxiety and has evidence of validity in detecting GAD in those with dementia (Shankar et al., Reference Shankar, Walker, Frost and Orrell1999). It was suggested that the RAID scale, although comprehensive in its account of the resident, caregiver, and provider perspectives may prove to be too consuming for use as a screening tool in LTC (Creighton et al., Reference Creighton, Davison and Kissane2019; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The GAI is a brief measure of anxiety severity that may be applicable in the LTC setting given its ease of use. The HADS is a screening tool for the detection of anxiety and depression symptoms with a dedicated subscale for anxiety. It was noted that the low internal reliability and the limited ability of the HADS-A to capture GAD symptoms may restrict the use of this tool in the LTC setting (Creighton et al., Reference Creighton, Davison and Kissane2019). Previous research has found older adults to be more likely than those in younger age groups to present with symptoms of both anxiety and depression (Parmelee, Katz, & Lawton, Reference Parmelee, Katz and Lawton1993). The CES-D was suggested to be a method to detect both depression and anxiety in those living in LTC, despite the tool’s original purpose to screen for symptoms of depression (Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011; McQuaid et al., Reference McQuaid, Stein, McCahill, Laffaye and Ramel2000; Sawyer Radloff & Teri, Reference Sawyer Radloff and Teri1986).
The diagnostic accuracy outcomes and quality assessment of the included studies facilitate an understanding of the applicability and feasibility of tool use in the LTC setting. The majority of the tools used were aimed at detecting GAD, which is aligned with the high prevalence of GAD in the LTC population (Creighton, Davison, & Kissane, Reference Creighton, Davison and Kissane2018b). Although not identical, all reference tools used across included studies were based on DSM criteria (Boddice et al., Reference Boddice, Pachana and Byrne2008; Creighton et al., Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011; Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017). The common use of the DSM allowed for a degree of consistency in the reported prevalence of anxiety disorders amongst included studies. Established reference standard criteria such as the DSM were not created specifically for persons living in LTC; therefore, population-specific characteristics including common co-morbidities and the prevalence of anxiety must be considered when applying such criteria (Bryant et al., Reference Bryant, Mohlman, Gum, Stanley, Beekman and Wetherell2013; Shead, Rodriguez, Dreeben, & McBride, Reference Shead, Rodriguez, Dreeben, McBride, Byrne and Pachana2021). The findings from this review identified that most anxiety detection tools were used for the detection of GAD or, more generally, anxiety disorders. Following GAD, agoraphobia and other specific phobias are frequently reported in LTC (Creighton et al., Reference Creighton, Davison and Kissane2018b). Future studies looking at detecting anxiety in LTC may aim to investigate the diagnostic accuracy of anxiety detection tools that are used to detect agoraphobia and other specific anxiety disorders. Additionally, future studies may aim to determine the utility of tools for detecting clinically relevant anxiety symptoms in persons with anxiety symptoms not meeting the established criteria for an anxiety disorder using a given tool.
The included studies were found to be highly variable in clinical heterogeneity and study quality, therefore limiting the ability to generate a summary estimate. Two studies each reported on the GAI and RAID scale, which was insufficient for meta-analysis comparisons. Baseline characteristics of the study populations, such as age, sex, and MMSE scores, varied among studies. Previous research has found cognitive impairment, depression, pain, and visual and other functional impairments, as well as higher education to be related to increased risk of anxiety symptoms (Smalbrugge, Pot, Jongenelis, Beekman, & Eefsting, Reference Smalbrugge, Pot, Jongenelis, Beekman and Eefsting2005). Factors such as co-morbidities, length of stay in LTC, medication use, previous diagnoses, and indexes of frailty and activities of daily living were all unknown in the included studies. The high degree of unknown variables had the potential to introduce significant heterogeneity, making the comparison of the study samples inappropriate.
Based on the risk of bias assessment, a general lack of clarity on how the studies were conducted resulted in poor study quality. Three studies were identified as having unclear or high concern for risk of bias given inadequate reporting on the study procedures or exclusion of select participants. The concern of bias from the included studies highlights the need to follow reporting standards when conducting diagnostic accuracy studies. The included studies were limited by the number of study participants and low prevalence of anxiety disorders (Boddice et al., Reference Boddice, Pachana and Byrne2008), the exclusion of select participants from the study or analysis (Creighton et al., Reference Creighton, Davison and Kissane2019; Dozeman et al., Reference Dozeman, van Schaik, van Marwijk, Stek, van der Horst and Beekman2011), the lack of understanding of scale content validity (Goyal et al., Reference Goyal, Bergh, Engedal, Kirkevold and Kirkevold2017), and reference standard shortcomings (Creighton et al., Reference Creighton, Davison and Kissane2019).
Future studies that clearly report study procedures, setting characteristics, and population characteristics may provide better quality evidence for the diagnostic accuracy of anxiety detection tools in LTC. A reduction in clinical heterogeneity among studies will allow for more meaningful comparisons of tool performance to be completed. Future research must also consider how feasible the use of tools in LTC may be when used to detect anxiety by staff who must be trained in their use. Further, there is a need for interdisciplinary collaboration to occur internationally to establish best practice standards for the complex clinical issue of anxiety detection in LTC.
At the level of full text review, the absence of a reference standard resulted in the exclusion of many anxiety detection studies. Our review found that anxiety detection tools such as the Neuropsychiatric Inventory, Anxiety in Cognitive Impairment and Dementia scales, Brief Anxiety and Depression scale, Brief Symptom Inventory, and Clinical Anxiety Scale, have been investigated in LTC settings (Gerolimatos et al., Reference Gerolimatos, Ciliberti, Gregg, Nazem, Bamonti and Cavanagh2015; Mansbach, Mace, & Clark, Reference Mansbach, Mace and Clark2015; McNeil, Reference McNeil1999; Neville & Teri, Reference Neville and Teri2011; Selbaek, Kirkevold, Sommer, & Engedal, Reference Selbaek, Kirkevold, Sommer and Engedal2008). The diagnostic accuracy of these tools was assessed using other anxiety detection tools rather than a reference standard based on DSM or ICD criteria. More rigorous diagnostic accuracy studies should be completed with these tools to further evaluate their use in LTC.
A major strength of this study is that to the best of the authors’ knowledge, it is the first study of the diagnostic accuracy of anxiety detection tools in LTC with a broad search strategy that likely captured all studies of interest. Limitations of the present systematic review include the small number of articles that met the inclusion criteria and the heterogeneity of included studies that prohibited pooling and the ability to discern optimal tools and cut-offs for use in LTC. There was a lack of information on participant co-morbidities that could have confounded the detection of anxiety. The lack of reporting on co-morbid conditions and previous diagnoses restricted the ability to complete pooled estimates of diagnostic accuracy measures (e.g., sensitivity, specificity) and further explore the results and effectiveness of the identified anxiety detection tools.
Through this review, we were able to identify anxiety detection tools for use in the LTC setting that have been evaluated using a reference standard. There is a gap in the diagnostic accuracy literature that uses a reference standard comparator to assess the detection of anxiety in LTC. The choice of which tool to use should reflect resident characteristics such as co-morbidities, as well as resource constraints. The findings from this review may be best suited to inform clinicians on which tools exist for use in the detection of anxiety in older adults living in LTC.
Supplementary Materials
To view supplementary material for this article, please visit http://doi.org/10.1017/S0714980822000101.