Cognitive tests used in chronic adult human randomised controlled trial micronutrient and phytochemical intervention studies

Anna L. Macready; Laurie T. Butler; Orla B. Kennedy; Judi A. Ellis; Claire M. Williams; Jeremy P. E. Spencer

doi:10.1017/S0954422410000119

Cognitive tests used in chronic adult human randomised controlled trial micronutrient and phytochemical intervention studies

Published online by Cambridge University Press: 12 August 2010

Claire M. Williams and

Jeremy P. E. Spencer

Show author details

Anna L. Macready: Affiliation:
School of Psychology and Clinical Language Sciences, University of Reading, Reading, UK
Laurie T. Butler*: Affiliation:
School of Psychology and Clinical Language Sciences, University of Reading, Reading, UK
Orla B. Kennedy: Affiliation:
Hugh Sinclair Human Nutrition Unit, Department of Food Biosciences, University of Reading, Reading, UK
Judi A. Ellis: Affiliation:
School of Psychology and Clinical Language Sciences, University of Reading, Reading, UK
Claire M. Williams: Affiliation:
School of Psychology and Clinical Language Sciences, University of Reading, Reading, UK
Jeremy P. E. Spencer: Affiliation:
Hugh Sinclair Human Nutrition Unit, Department of Food Biosciences, University of Reading, Reading, UK
*: *Corresponding author: Dr Laurie T. Butler, fax +44 118 9316715, email [email protected]

Article contents

Abstract
Introduction
Cognitive domains and associated brain regions
Research aims and questions
Methods
Results
Discussion
Conclusion and recommendations
References

Rights & Permissions

Abstract

In recent years there has been a rapid growth of interest in exploring the relationship between nutritional therapies and the maintenance of cognitive function in adulthood. Emerging evidence reveals an increasingly complex picture with respect to the benefits of various food constituents on learning, memory and psychomotor function in adults. However, to date, there has been little consensus in human studies on the range of cognitive domains to be tested or the particular tests to be employed. To illustrate the potential difficulties that this poses, we conducted a systematic review of existing human adult randomised controlled trial (RCT) studies that have investigated the effects of 24 d to 36 months of supplementation with flavonoids and micronutrients on cognitive performance. There were thirty-nine studies employing a total of 121 different cognitive tasks that met the criteria for inclusion. Results showed that less than half of these studies reported positive effects of treatment, with some important cognitive domains either under-represented or not explored at all. Although there was some evidence of sensitivity to nutritional supplementation in a number of domains (for example, executive function, spatial working memory), interpretation is currently difficult given the prevailing ‘scattergun approach’ for selecting cognitive tests. Specifically, the practice means that it is often difficult to distinguish between a boundary condition for a particular nutrient and a lack of task sensitivity. We argue that for significant future progress to be made, researchers need to pay much closer attention to existing human RCT and animal data, as well as to more basic issues surrounding task sensitivity, statistical power and type I error.

Keywords

Cognitive tests Micronutrients Phytochemicals Adult randomised controlled trials

Type: Review Article
Information: Nutrition Research Reviews , Volume 23 , Issue 2 , December 2010 , pp. 200 - 229

DOI: https://doi.org/10.1017/S0954422410000119 [Opens in a new window]
Copyright: Copyright © The Authors 2010

Introduction

As the number of individuals over the age of 60 years is expected to double between 2000 and 2050⁽¹⁾, the projected incidence of age-related neurodegenerative diseases and associated health care costs is also set to rise significantly. In a recent report commissioned by the Alzheimer's Society, the current annual cost of dementia in the UK alone was estimated to be about £17·03 billion⁽Reference Knapp, Prince and Albanese²⁾, with the total worldwide cost estimated to be US$315·4 billion annually⁽Reference Wimo, Winblad and Jönsson³⁾. Moreover, given that individuals aged 65 years can now expect to live for at least another 20 years, there is an urgent need to identify means of mitigating age-related changes in healthy older adults. Diet is crucial in this respect as it is thought to reduce the impact of age-related cognitive decline, for instance, by combating oxidative stress, reducing LDL-cholesterol, and modulating neurological mechanisms such as cell-signalling pathways.

Over the last decade or so a significant, albeit mixed, body of evidence regarding the effects of diet on cognition has accumulated from human and animal work. For instance, in longitudinal and human observational studies, vitamin E intake has been associated with reduced age-related cognitive decline through its antioxidant properties⁽Reference Morris, Evans and Bienias⁴⁾, and poorer memory performance has been linked to lower levels of serum vitamin E per unit of cholesterol⁽Reference Perkins, Hendrie and Callahan⁵⁾. Evidence from The Rotterdam Study has shown an association between higher plasma folate and better cognitive function, in particular for tests measuring psychomotor processing speed⁽Reference de Lau, Refsum and Smith⁶⁾, episodic memory and verbal ability⁽Reference Feng, Ng and Chuah⁷⁾. In the PAQUID (Personnes Agées QUID, loosely translated to ‘What about the elderly?’) study, older adults with the highest dietary flavonoid intake showed significantly lower cognitive decline over 10 years than those with the lowest intake⁽Reference Letenneur, Proust-Lima and Le Gouge⁸⁾. Studies using animal models have also demonstrated that certain groups of flavonoids may slow and even reverse the effects of ageing and dementia⁽Reference Galli, Shukitt-Hale and Youdim⁹^–Reference Joseph, Shukitt-Hale and Denisova¹¹⁾. For example, memory deficits may be prevented by the consumption of foods rich in anthocyanins, a flavonoid subgroup⁽Reference Barros, Amaral and Izquierdo¹²^–Reference Shukitt-Hale, Carey and Simon¹⁶⁾.

While still an emerging area, an examination of the available human randomised controlled trial (RCT) literature reveals rather more variable evidence for the beneficial effects of diet on cognition. For example, in a systematic review of B vitamin and antioxidant supplement studies, Jia et al. ⁽Reference Jia, McNeill and Avenell¹⁷⁾ found very little evidence for cognitive benefits from taking antioxidant supplements or B vitamins. A similar story is shown for flavonoid studies⁽Reference Macready, Kennedy and Ellis¹⁸⁾, with reports of both significant⁽Reference File, Hartley and Elsabagh¹⁹^–Reference Le Bars, Velasco and Ferguson²³⁾ and non-significant⁽Reference Ho, Chan and Ho²⁴^–Reference van Dongen, van Rossum and Kessels²⁹⁾ effects of supplementation.

Developing a better understanding of the conditions under which particular nutrients do or do not derive cognitive benefits represents a key challenge for research. However, one major problem facing researchers aiming to do this is that there is currently little consensus across studies in terms of either the cognitive domains to be explored or the specific tests to be used. Thus it is hard to determine whether a failure to reproduce a previously reported effect has established an important boundary condition for that nutrient (for example, supplementation with X not effective for population Y) or, alternatively, is a reflection of the idiosyncrasies of the respective tasks employed across the two studies. For example, although it is often assumed that all tests of working memory performance reflect common mechanisms or processes, it is quite possible that different tests measure partially separate cognitive capacities⁽Reference Salthouse³⁰⁾ and that performance dissociates across different tests. Indeed, Waters & Caplan (2003)⁽Reference Waters and Caplan³¹⁾ reported only moderate correlations between a series of seven different working memory measures. Thus simply assuming a one-to-one correspondence between two different cognitive measures purporting to measure the same domain (for example, working memory) is ill-advised. In the present paper we aim to establish the extent of this practice, as well as make recommendations for future studies.

Cognitive domains and associated brain regions

For the purposes of structuring the present review we now briefly outline the major taxonomies within human cognition. Importantly, attempts to characterise the effects of dietary nutrients on human cognition need to utilise a wide range of tasks to fully assess cognitive ability. In so doing, two points should be borne in mind. Firstly, although a particular task might be identified as having a primary neuropsychological focus such as ‘executive function’ or ‘episodic memory’, such measures are not ‘task pure’⁽Reference Burgess and Rabbitt³²⁾. For example, a range of processes may support a nominally ‘executive’ task such as memory, processing speed and motor function. Secondly, in terms of the underlying brain regions supporting cognitive performance, it is important to recognise that any task is likely to recruit multiple neural regions. For example, functional neuroimaging studies have revealed activations in the prefrontal cortex, medial and lateral parietal cortex, as well as hippocampal/medial temporal lobe activations during episodic memory retrieval⁽Reference Rugg, Henson, Parker, Wilding and Bussey³³⁾. A thorough understanding of the brain regions underpinning performance on particular cognitive tests is important, especially when attempting to relate findings from human studies to animal work. We return to this point in the Discussion.

Executive function

Executive function is a complex term used to describe a number of distinct, specifiable ‘control’ functions that are distinguishable from processing speed, memory, and motor functions. Examples of executive functions include ‘switching’ or ‘shifting’ (for example, alternating between behaviours or information sources), ‘inhibition’ (the ability to suppress automatic and habitual responses or behaviours), ‘updating’ (the ability to discard and replace information⁽Reference Miyake, Friedman and Emerson³⁴^, Reference Rabbitt³⁵⁾), ‘sustained attention’ (requiring sustained concentration and monitoring skills⁽Reference Rabbitt³⁵^, Reference Manly, Robertson and Rabbitt³⁶⁾), ‘strategic memory search’ (conscious, controlled retrieval of structured information⁽Reference Burgess and Rabbitt³²^, Reference Rabbitt³⁵^–Reference Phillips and Rabbitt³⁷⁾), and ‘planning’ (the ability to deal with novel information, generate goals and make decisions on a suitable course of action⁽Reference Burgess and Rabbitt³²^, Reference Rabbitt³⁵⁾). Neuroimaging studies suggest that the prefrontal cortex and striatum interact to perform specific executive functions⁽Reference Robbins³⁸⁾, and that distinct brain regions are recruited for different executive functions. For instance, the left inferior frontal gyrus in the prefrontal cortex is recruited in verbal fluency tasks⁽Reference Costafreda, Fu and Lee³⁹⁾, whereas the right inferior frontal gyrus shows greater activation in tasks measuring both shifting and inhibition⁽Reference Robbins³⁸⁾.

Working memory

All of the above executive functions are dependent on ‘working memory’, a psychological construct used to describe a hypothetical system for the temporary maintenance and manipulation of speech-based and/or visuospatial information, requiring the control of attentional resources⁽Reference Burgess and Rabbitt³²⁾. Functional neuroimaging work shows that working memory is not a unitary or dedicated system, and is not localised to a single brain region⁽Reference D'Esposito⁴⁰⁾. D'Esposito described working memory as ‘an emergent property of functional interactions between the PFC [prefrontal cortex] and the rest of the brain’ (p. 769)⁽Reference D'Esposito⁴⁰⁾, and evidence suggests that the network of brain regions recruited for the active maintenance of task-relevant information will depend on the type of information being maintained⁽Reference D'Esposito⁴⁰^, Reference Curtis, Rao and D'Esposito⁴¹⁾.

Memory

A number of key distinctions can be drawn between different types of memory. Specifically, researchers frequently distinguish between short-term memory (retrieval occurs within 30 s of stimulus presentation) v. long-term memory (retrieval occurs after 30 s); explicit memory (consciously and intentionally retrieved) v. implicit memory (unconsciously retrieved); episodic (memory for events) v. semantic (memory for meaning); retrospective (memory for past events) v. prospective memory (remembering to perform actions in the future); memory for skills (procedural memory) v. memory for facts (declarative memory); and verbal memory v. visual or visuospatial memory.

As might be expected, a wide range of brain regions are thought to be involved in supporting these various forms of memory. For instance, activation of left-lateralised posterior temporal regions, the supramarginal gyrus, dorsolateral premotor cortex and Broca's area have been associated with short-term memory⁽Reference Henson, Burgess and Frith⁴²⁾, in contrast to activation of bilateral ventrolateral prefrontal regions and dorsolateral prefrontal regions during encoding and recognition in long-term episodic memory⁽Reference Ranganath, Johnson and D'Esposito⁴³⁾. In terms of the neural substrate of explicit and implicit memory tasks⁽Reference Voss and Paller⁴⁴⁾, explicit memory has been linked to left frontal, and bilateral hippocampal, parahippocampal and parietal activation⁽Reference Wagner, Shannon and Kahn⁴⁵⁾, whereas implicit memory is primarily associated with reduced left fusiform gyrus and bilateral frontal and occipital activity⁽Reference Voss, Reber and Mesulam⁴⁶^, Reference Schott, Henson and Richardson-Klavehn⁴⁷⁾. The hippocampus, parahippocampus and parietal areas are typically implicated in spatial memory tasks⁽Reference Spiers and Maguire⁴⁸⁾, whereas the anterior prefrontal cortex has been shown to be actively involved in prospective memory tasks⁽Reference Simons, Schölvinck and Gilbert⁴⁹⁾.

Motor function, perception and intelligence quotient

Motor function may be measured with or without a cognitive component, and encompasses a range of measures from psychomotor processing speed to planning of movement. Voluntary movement is controlled by the basal ganglia system, which includes the striatum and substantia nigra, by enabling required motor mechanisms and inhibiting competing mechanisms⁽Reference Mink⁵⁰⁾. Various brain regions are thought to be involved during motor skill acquisition: prefrontal regions are recruited initially, with a subsequent shift to posterior regions, for example, premotor, posterior parietal and cerebellar cortex structures, as the task becomes more automatic⁽Reference Shadmehr and Holcomb⁵¹⁾.

Visual perception relies on visual acuity, field of view and contrast sensitivity, abilities that are reduced with age⁽Reference Attebo, Mitchell and Smith⁵²^, Reference Ivers, Cumming and Mitchell⁵³⁾ and which underpin any cognitive function with a visual component. Visual perception is associated with a wide range of brain regions in neuroimaging studies, namely the striate cortex and other occipital areas, parietal, temporal and prefrontal regions⁽Reference Ganis, Thompson and Kosslyn⁵⁴⁾.

Intelligence tasks may be sub-divided into crystallised intelligence (measuring acquired knowledge) and fluid intelligence (measuring non-verbal ability, problem-solving and pattern recognition independently of acquired knowledge)⁽Reference Cattell⁵⁵⁾. General intelligence, or Spearman's g, is associated most closely with fluid intelligence and activation of the lateral frontal⁽Reference Duncan, Seitz and Kolodny⁵⁶⁾ or prefrontal cortex and parietal areas⁽Reference Gray and Thompson⁵⁷⁾.

Research aims and questions

The primary aim of the present paper is to review the cognitive methods used in existing RCT studies that have explored the effects of nutrition on human cognition, with a view to identifying domains (for example, executive function, episodic memory) and individual tasks within those domains (for example, category fluency task, common objects recall task) that have shown greatest sensitivity to chronic supplementation (for example, supplementation of a nutrient over a number of days, weeks or months, as opposed to an acute intake of a nutrient on a single experimental day). A related aim is to catalogue the cognitive tasks used in existing chronic RCT studies within a single framework enabling researchers to better choose suitable tasks, as well as identify potential gaps in terms of the domains measured.

It should be noted here that significant outcomes for cognitive testing in dietary intervention studies rely on two things: (1) the potential for cognitive change as a result of direct dietary intervention with respect to dose and duration in the cognitive domain or cognitive aspect being measured and (2) cognitive methodologies sensitive enough to measure such cognitive change. The most important consideration in setting up a suitable framework for measuring human cognitive function in nutritional research is to determine methods that are sensitive to dietary changes and repeatable over time, are simple to interpret, and specific to cognitive domains. In this respect, brief measures such as the Mini-Mental State Examination (MMSE)⁽Reference Folstein, Folstein and McHugh⁵⁸⁾ and the Alzheimer's Disease Assessment Scale Cognitive Subscale (ADAS-Cog)⁽Reference Rosen, Mohs and Davis⁵⁹⁾ are suitable for cognitive screening of dementia and mild cognitive impairment, a term generally used to describe the level of cognitive impairment found in the intermediate stage between normal ageing and fully developed dementia⁽Reference Petersen, Smith and Waring⁶⁰⁾. Both the MMSE and the ADAS-Cog consist of items covering a broad range of cognitive functions: orientation, attention and calculation, memory, language, and motor skills, but they cannot truly be said to measure ‘global cognitive function’, as the individual test items do not measure the full range of cognitive functions. The term ‘general cognitive function’ is therefore preferred here.

In terms of examining changes in cognitive performance over time, the MMSE and ADAS-Cog may be useful for the measurement of widespread, gross cognitive changes in longitudinal studies⁽Reference Letenneur, Proust-Lima and Le Gouge⁸⁾; in Alzheimer's disease research, where the fastest rate of deterioration over time is likely to be seen, the MMSE has shown an overall progression rate of 0·24 points per month, although this was moderated by education duration, sex, disease incidence and drug therapy⁽Reference Roselli, Tartaglione and Federico⁶¹⁾. However, such measures are unlikely to be sensitive to smaller changes over shorter time periods in healthier individuals at pre-dementia stages, and will indeed show ceiling effects in young and/or cognitively healthy adult populations.

Overall we address four main questions:

(1) What proportion of chronic dietary interventions has reported significant benefits to cognition and in which domains?
(2) How much consistency is there across studies in terms of cognitive domains measured and tasks employed?
(3) Are there any cognitive domains that are under-represented in existing intervention studies?
(4) What are the implications for future chronic dietary intervention studies?

Methods

A search of five databases (PUBMED, Web of Knowledge, PsychINFO, CINAHL and the Cochrane Central Register of Controlled Trials) was carried out for micronutrient or phytochemical adult human randomised controlled intervention trials exploring cognitive function as the primary or secondary outcome. The following search terms were used:

Cognitive tasks: cogniti*, executive function, switching, shifting, updating, inhibition, vigilance, attention, memory, episodic, semantic, implicit, explicit, spatial, visuospatial, prospective, declarative, procedural, processing speed, psychomotor, reaction time, accuracy.

(1) Nutrient: vitamin*, thiamin*, riboflavin, niacin, nicotinamide, pantothenic, pyridox*, biotin, cobalamin*, folic acid, folate, ascorbic, tocopherol, iron, copper, zinc, magnesium, manganese, selenium, flavan*, flavon*, isoflavone, caroten*.
(2) Population sample: human adult [not] adolescent, child, infant, maternal, rat, mouse, mice, rodent, dog, monkey.
(3) Experimental design: randomi*, controlled trial, RCT.
(4) Type of article: journal article, peer-reviewed.

All studies identified through the literature search were evaluated according to the eligibility criteria by one reviewer and independently verified by a second reviewer. The search was limited to the previous 10 years (for example, 1 January 1999 to 31 August 2009).

Chronic studies that specifically focused on the cognitive effects of the target nutrients, both single and combined, were included. Studies exploring phytochemicals administered in extract rather than whole-food form (for example, Ginkgo biloba) were included. Studies on populations suffering from age-related cognitive impairment or dementia were included, except in the case of traumatic head injury or specific neurological disorders, whose findings may not be generalisable to a normal human adult population.

Acute interventions, non-randomised studies, RCT with sample sizes of less than twenty participants (or less than ten for cross-over designs), and studies that did not include a proper baseline, or control and/or placebo group were excluded. Studies using whole foods or treatments combined only with macronutrients or drug therapy were also excluded, as were drug therapy studies using micronutrients as a placebo control. As the present review is focusing on micronutrient or phytochemical strategies for the attenuation and prevention of cognitive decline throughout the adult lifespan, cognitive development studies (for example, maternal, infant and child) were excluded. As we were primarily interested in comparing uniquely human, language-based cognitive paradigms across studies, animal studies were excluded. Also excluded were studies of specific hospital-based patient groups (for example, CHD, stroke or diabetes), as were studies comparing pre- and post-operative cognitive performance. The literature search and screening process is shown in Fig. 1.

Fig. 1 Randomised controlled trial (RCT) literature search and screening process.

Outcomes were changes in cognitive performance. The first author categorised the tasks and the second author resolved any inconsistencies encountered during classification. Tasks were initially categorised by their primary neuropsychological focus, which was determined by the descriptions provided in the selected papers. Where disagreement occurred, the authors used the task descriptions cited in the majority of the included studies and checked these, where possible, against Lezak et al. ⁽Reference Lezak, Howieson and Loring⁶²⁾, a comprehensive sourcebook of neuropsychological assessment.

All included studies were rated using a three-category quality assessment grading system (A, B, C), based on a method successfully trialled elsewhere⁽Reference Balk, Chung and Raman⁶³^, Reference Levey, Coresh and Balk⁶⁴⁾, to identify any methodological shortcomings which might affect interpretation of the results. Briefly, category A studies employ the best designs, such that they are sufficiently powered with less than 20 % drop-out, methods (double-blind, intervention, comparator, outcome measures and statistical tests) are appropriate, results are clearly reported and assessed as valid. Category B studies may contain some weaknesses, but show no major problems and are still considered valid. Category C studies show significant methodological difficulties which may invalidate results, with flawed designs and analysis, missing information, greater than 20 % drop-out, randomisation issues (for example, unequal between-group baseline scores), reporting discrepancies and low power. Grading was carried out and agreed by the first and second authors.

Studies were examined for cognitive outcome status in order to see if the RCT was designed primarily to test cognitive function, or if cognition was only a secondary outcome measure, as this again may affect the validity of the results. Cognitive outcome status was then designated as ‘primary’ or ‘secondary’ for all RCT.

Results

Assessment of cognitive performance in existing studies

Thirty-nine studies met the inclusion criteria (see Table 1). Five used multivitamins⁽Reference Cockle, Haller and Kimber⁶⁵^–Reference Wolters, Hickstein and Flintermann⁶⁹⁾, and two of these also included minerals⁽Reference McNeill, Avenell and Campbell⁶⁷^, Reference Wouters-Wesseling, Wagenaar and Rozendaal⁶⁸⁾. Ten studies examined vitamin B treatments⁽Reference Hvas, Juul and Lauritzen⁷⁰^–Reference Bryan, Calvaresi and Hughes⁷⁹⁾ and three looked at specific minerals: Zn⁽Reference Maylor, Simpson and Secker⁸⁰⁾, Fe⁽Reference Murray-Kolb and Beard⁸¹⁾ and Cu⁽Reference Kessler, Pajonk and Bach⁸²⁾. One trialled β-carotene with vitamins C and E⁽Reference Smith, Clark and Nutt⁸³⁾. Of the studies targeting individual micronutrients or phytochemicals, twenty looked at flavonoids⁽Reference File, Hartley and Elsabagh¹⁹^–Reference van Dongen, van Rossum and Kessels²⁹^, Reference Casini, Marelli and Papaleo⁸⁴^–Reference Stough, Clarke and Lloyd⁹²⁾. Twelve of these were isoflavone RCT⁽Reference File, Hartley and Elsabagh¹⁹^–Reference Duffy, Wiseman and File²²^, Reference Ho, Chan and Ho²⁴^, Reference Basaria, Wisniewski and Dupree²⁵^, Reference Fournier, Ryan and Robison²⁷^, Reference Kreijkamp-Kaspers, Kok and Grobbee²⁸^, Reference Casini, Marelli and Papaleo⁸⁴^–Reference Kritz-Silverstein, Von Muhlen and Barrett-Connor⁸⁶^, Reference Woo, Lau and Ho⁸⁹⁾, six used G. biloba extracts containing 24 or 25 % flavonoids and 6 % terpenes⁽Reference Le Bars, Velasco and Ferguson²³^, Reference Elsabagh, Hartley and Ali²⁶^, Reference van Dongen, van Rossum and Kessels²⁹^, Reference Mix and Crews⁸⁸^, Reference Santos, Galduróz and Barbieri⁹¹^, Reference Stough, Clarke and Lloyd⁹²⁾, one used pine bark⁽Reference Ryan, Croft and Mori⁹⁰⁾ and one used cocoa flavanols⁽Reference Francis, Head and Morris⁸⁷⁾. No other intervention met the study criteria within the time frame of the review.

Table 1 Characteristics of included micronutrient and phytochemical human chronic randomised controlled trials

MCI, mild cognitive impairment; DSM, Diagnostic and Statistical Manual of Mental Disorders; MMSE, Mini-Mental State Examination; T, treatment; C, control; ADAS-Cog, Cognitive element of the Alzheimer's Disease Assessment Scale; n/a, not applicable; IQ, intelligence quotient; ITT, intention-to-treat; D/O, dropped out; RDA, recommended daily allowance; WAIS, Wechsler Adult Intelligence Scale (all versions); HRNTB, Halstead–Reitan Neuropsychological Test Battery; CAMCOG, Cambridge Cognitive Examination; WMS, Wechsler Memory Scale (all versions); mITT, modified intention-to-treat; TICS, Telephone Interview for Cognitive Status; NINDS-ADRDA, National Institute of Neurological Disorders and Stroke and Alzheimer's Disease and Related Disorders Association; CAT, Cognitive Abilities Test; CANTAB, Cambridge Neuropsychological Test Automated Battery; IDED, Intra Dimensional/Extra Dimensional Set Shifting Task; fMRI, functional magnetic resonance imaging; BOLD, blood oxygenation level-dependent; CDR, cognitive drug research; TP, Toulouse–Pieron Test; CBT, Cognometer Battery of Tests; SCOLP, Speed and Capacity of Language-Processing Test; HRT, hormone replacement therapy. * Mean treatment group scores were significantly better than those of the control group: * P < 0·05, ** P < 0·01, *** P < 0·005. † Mean treatment group scores were significantly worse than those of the control group: † P < 0·05, †† P < 0·01, ††† P < 0·005.

‡ Study designed with cognitive function as ‘primary’ or ‘secondary’ target outcome measure.

§ Study graded for quality: category A, highest quality, no bias; category B, medium quality, some bias but results are deemed valid; category C, poor quality, significant bias that may invalidate the results.

∥ 1 IU vitamin A = 0·3 μg retinol or 0·6 μg β-carotene.

Fifteen studies (38 %) were graded as category C and judged to contain significant bias that may invalidate the results, mostly as a result of lack of any quantitative cognitive screening, missing information and reporting errors. Eighteen studies (46 %) were classed as category B and judged to be susceptible to bias, but not sufficiently so to invalidate the results. Only six studies (15 %) met the more rigorous criteria for category A, as described earlier (see Table 1). In the assessment of cognitive outcome status, 69 % of the RCT were found to be specifically designed to measure cognitive function as the primary outcome (see Table 1).

Only seventeen studies (44 %) reported benefits of treatment on cognitive function in the expected direction⁽Reference File, Hartley and Elsabagh¹⁹^–Reference Le Bars, Velasco and Ferguson²³^, Reference Cockle, Haller and Kimber⁶⁵^, Reference Wouters-Wesseling, Wagenaar and Rozendaal⁶⁸^, Reference van Uffelen, Chinapaw and van Mechelen⁷⁵^, Reference Durga, van Boxtel and Schouten⁷⁷^, Reference Bryan, Calvaresi and Hughes⁷⁹^, Reference Maylor, Simpson and Secker⁸⁰^, Reference Casini, Marelli and Papaleo⁸⁴^, Reference Kritz-Silverstein, Von Muhlen and Barrett-Connor⁸⁶^, Reference Woo, Lau and Ho⁸⁹^–Reference Stough, Clarke and Lloyd⁹²⁾, of which two were graded as category A⁽Reference Durga, van Boxtel and Schouten⁷⁷^, Reference Maylor, Simpson and Secker⁸⁰⁾, seven were category B⁽Reference File, Jarrett and Fluck²¹^–Reference Le Bars, Velasco and Ferguson²³^, Reference Cockle, Haller and Kimber⁶⁵^, Reference Bryan, Calvaresi and Hughes⁷⁹^, Reference Kritz-Silverstein, Von Muhlen and Barrett-Connor⁸⁶^, Reference Ryan, Croft and Mori⁹⁰⁾ and the rest were category C. Twelve of the seventeen RCT were flavonoid⁽Reference File, Hartley and Elsabagh¹⁹^–Reference Le Bars, Velasco and Ferguson²³^, Reference Casini, Marelli and Papaleo⁸⁴^, Reference Kritz-Silverstein, Von Muhlen and Barrett-Connor⁸⁶^, Reference Mix and Crews⁸⁸^–Reference Stough, Clarke and Lloyd⁹²⁾ including seven isoflavone studies⁽Reference File, Hartley and Elsabagh¹⁹^–Reference Duffy, Wiseman and File²²^, Reference Casini, Marelli and Papaleo⁸⁴^, Reference Kritz-Silverstein, Von Muhlen and Barrett-Connor⁸⁶^, Reference Woo, Lau and Ho⁸⁹⁾ and four G. biloba interventions⁽Reference Le Bars, Velasco and Ferguson²³^, Reference Mix and Crews⁸⁸^, Reference Santos, Galduróz and Barbieri⁹¹^, Reference Stough, Clarke and Lloyd⁹²⁾.

In evaluating these effects of treatment, there was found to be considerable variability in the statistical rigour employed in individual studies. Gleason et al. ⁽Reference Gleason, Carlsson and Barnet²⁰⁾ reported both positive and negative effects with a small sample size of thirty, but did not appear to have accounted for the possibility of type I error. Mix & Crews⁽Reference Mix and Crews⁸⁸⁾ reported a small significant effect of treatment on a single outcome measure using a one-tailed t test, a result unlikely to survive cut-off if an arguably more appropriate two-tailed convention had been employed. Additionally, both Casini et al. ⁽Reference Casini, Marelli and Papaleo⁸⁴⁾ and Santos et al. ⁽Reference Santos, Galduróz and Barbieri⁹¹⁾ reported multiple t tests without any apparent correction for type I error. Howes et al. ⁽Reference Howes, Bray and Lorenz⁸⁵⁾ also carried out a large number of tests on a small sample (n 30) initially reporting a series of significant effects. However, as an illustration of good practice, these disappeared after the authors statistically accounted for type I error. Finally, Stough et al. ⁽Reference Stough, Clarke and Lloyd⁹²⁾ provided no descriptive statistics at all, making it impossible to evaluate the quality or rigour of their experimental design and analysis.

Of the micronutrient studies, three vitamin B studies reported some positive effects of treatment⁽Reference van Uffelen, Chinapaw and van Mechelen⁷⁵^, Reference Durga, van Boxtel and Schouten⁷⁷^, Reference Bryan, Calvaresi and Hughes⁷⁹⁾, although Bryan et al. ⁽Reference Bryan, Calvaresi and Hughes⁷⁹⁾ also reported negative effects. Two multivitamin interventions reported benefits⁽Reference Cockle, Haller and Kimber⁶⁵^, Reference Wouters-Wesseling, Wagenaar and Rozendaal⁶⁸⁾, and Maylor et al. ⁽Reference Maylor, Simpson and Secker⁸⁰⁾ found both positive and negative effects of Zn treatment on cognitive function.

Interestingly, four studies (10 %) showed only null and negative effects of treatment on cognitive function: three vitamin B studies⁽Reference McMahon, Green and Skeaff⁷²^, Reference Pathansali, Mangoni and Creagh-Brown⁷³^, Reference Lewerin, Matousek and Steen⁷⁸⁾ and one flavonoid intervention⁽Reference Fournier, Ryan and Robison²⁷⁾. The vitamin B studies were carried out on older populations and appear to have used t tests on multiple tasks with no correction for type I error.

Of the thirty-nine studies included in the present review, the size of study populations ranged from sixteen in a functional magnetic resonance imaging cross-over study⁽Reference Francis, Head and Morris⁸⁷⁾ to 818⁽Reference Durga, van Boxtel and Schouten⁷⁷⁾. Seventeen studies had fewer than 100 participants, and five studies had forty participants or less⁽Reference Gleason, Carlsson and Barnet²⁰^–Reference Duffy, Wiseman and File²²^, Reference Elsabagh, Hartley and Ali²⁶^, Reference Howes, Bray and Lorenz⁸⁵⁾. Power calculations were carried out in only four RCT, all researching vitamin B, with populations of 179 or more⁽Reference Eussen, de Groot and Joosten⁷¹^, Reference McMahon, Green and Skeaff⁷²^, Reference van Uffelen, Chinapaw and van Mechelen⁷⁵^, Reference Aisen, Schneider and Sano⁷⁶⁾, but these were based largely on expected changes in physiological rather than cognitive markers. In other RCT, group size does not appear to have been driven by effect sizes found in previous studies and varies considerably across nutrient studies, for example Pathansali et al. ⁽Reference Pathansali, Mangoni and Creagh-Brown⁷³⁾ with groups of twelve participants, and Durga et al. ⁽Reference Durga, van Boxtel and Schouten⁷⁷⁾ with groups of over 400.

Participant ages were highly variable, ranging from 18 years to over 80 years of age, with twenty-nine studies (74 %) carried out on participants over the age of 50 years, including nine studies specifically carried out on adults of 65 years or more. Three further RCT included young and older adult populations⁽Reference Le Bars, Velasco and Ferguson²³^, Reference Basaria, Wisniewski and Dupree²⁵^, Reference Bryan, Calvaresi and Hughes⁷⁹⁾, two more studies focused on the 40–65 years age range⁽Reference Fournier, Ryan and Robison²⁷^, Reference Casini, Marelli and Papaleo⁸⁴⁾, and five others were conducted on 18- to 40-year-olds⁽Reference File, Jarrett and Fluck²¹^, Reference Elsabagh, Hartley and Ali²⁶^, Reference Murray-Kolb and Beard⁸¹^, Reference Francis, Head and Morris⁸⁷^, Reference Stough, Clarke and Lloyd⁹²⁾.

Single v. multiple cognitive domains

Among the thirty-nine RCT included in the present review, a variety of approaches was used to measure cognitive performance, mostly targeting multiple cognitive domains, with the exception of a flavonoid brain imaging study which used a single executive function ‘switching’ (or ‘shifting’) task⁽Reference Francis, Head and Morris⁸⁷⁾. Of the RCT testing multiple cognitive domains, the majority examined a range of specific memory processes and executive functions. The remaining RCT targeted general cognition rather than any specific domain (see Table 1).

Rationale for choice of cognitive tests

Twenty-one RCT based their choice of cognitive test(s) on findings from previous studies. Eleven cited sensitivity to the class of nutrient under investigation, such as B vitamins, flavonoids, other type of dietary manipulation, hormone replacement therapy or oestrogen⁽Reference File, Hartley and Elsabagh¹⁹^, Reference File, Jarrett and Fluck²¹^, Reference Duffy, Wiseman and File²²^, Reference Fournier, Ryan and Robison²⁷^, Reference Kreijkamp-Kaspers, Kok and Grobbee²⁸^, Reference Cockle, Haller and Kimber⁶⁵^, Reference Wouters-Wesseling, Wagenaar and Rozendaal⁶⁸^, Reference Eussen, de Groot and Joosten⁷¹^, Reference Pathansali, Mangoni and Creagh-Brown⁷³^, Reference Bryan, Calvaresi and Hughes⁷⁹^, Reference Smith, Clark and Nutt⁸³⁾. Seven studies instead used tasks sensitive to ageing⁽Reference van Dongen, van Rossum and Kessels²⁹^, Reference Durga, van Boxtel and Schouten⁷⁷^, Reference Howes, Bray and Lorenz⁸⁵⁾, brain disorders and pharmacological interventions⁽Reference Le Bars, Velasco and Ferguson²³^, Reference Aisen, Schneider and Sano⁷⁶^, Reference Maylor, Simpson and Secker⁸⁰⁾, or changes in functional magnetic resonance imaging blood oxygenation level-dependent (BOLD) signal⁽Reference Francis, Head and Morris⁸⁷⁾. Three RCT selected tasks from established computerised psychometric test series: Ryan et al. ⁽Reference Ryan, Croft and Mori⁹⁰⁾ used the Cognitive Drug Research^® battery⁽Reference Wesnes, Simpson and Christmas⁹³⁾; Murray-Kolb & Beard⁽Reference Murray-Kolb and Beard⁸¹⁾ the Cognitive Abilities Test battery⁽Reference Detterman⁹⁴⁾; and Mix & Crews⁽Reference Mix and Crews⁸⁸⁾ selected the Trail-Making Test on the basis that it appeared to be ‘one of the best measures of general cognitive functioning’ (p. 223) according to Reitan⁽Reference Reitan⁹⁵⁾. These tests were developed for use with multiple populations and settings; none was specifically designed with reference to micronutrient or phytochemical interventions. Revealingly, in the remaining eighteen studies⁽Reference Gleason, Carlsson and Barnet²⁰^, Reference Ho, Chan and Ho²⁴^–Reference Elsabagh, Hartley and Ali²⁶^, Reference Clarke, Harrison and Richards⁶⁶^, Reference McNeill, Avenell and Campbell⁶⁷^, Reference Wolters, Hickstein and Flintermann⁶⁹^, Reference Hvas, Juul and Lauritzen⁷⁰^, Reference McMahon, Green and Skeaff⁷²^, Reference Seal, Metz and Flicker⁷⁴^, Reference van Uffelen, Chinapaw and van Mechelen⁷⁵^, Reference Lewerin, Matousek and Steen⁷⁸^, Reference Kessler, Pajonk and Bach⁸²^, Reference Casini, Marelli and Papaleo⁸⁴^, Reference Kritz-Silverstein, Von Muhlen and Barrett-Connor⁸⁶^, Reference Woo, Lau and Ho⁸⁹^, Reference Santos, Galduróz and Barbieri⁹¹^, Reference Stough, Clarke and Lloyd⁹²⁾, no rationale for task choice was given, although four of these included dementia patients, so task selection was naturally restricted to measures appropriate to these populations⁽Reference Clarke, Harrison and Richards⁶⁶^, Reference Hvas, Juul and Lauritzen⁷⁰^, Reference Seal, Metz and Flicker⁷⁴^, Reference Kessler, Pajonk and Bach⁸²⁾.

Range of cognitive measures used

Across the thirty-nine RCT under investigation, 121 cognitive tasks were identified (see Table 2). After an analysis of the primary neuropsychological focus for each measure, it was calculated that thirty-seven memory tasks (for example, episodic, semantic and short-term), twenty-six executive function tasks, fourteen working memory tasks, nineteen psychomotor processing speed tasks, nine general or ‘global’ tasks, thirteen intelligence quotient (IQ) tasks (mostly to measure baseline between-group differences), two motor function tasks and one perception measure had been employed (see Table 2).

Table 2 Neuropsychological focus for 121 measures used in thirty-nine human chronic dietary intervention randomised controlled trial (RCT) studies

IQ, intelligence quotient; Y, yes; WMS, Wechsler Memory Scale; WAIS, Wechsler Adult Intelligence Scale; CAT, Cognitive Abilities Test; Exec Fn, Executive function; IDED, Intra Dimensional/Extra Dimensional Set Shifting Task; CANTAB, Cambridge Neuropsychological Test Automated Battery; CDR, cognitive drug research; HRNTB, Halstead–Reitan Neuropsychological Test Battery; CBT, Cognometer Battery of Tests; SCOLP, Speed and Capacity of Language-Processing Test; TP, Toulouse–Pieron Test.

Generally, there was little correspondence in measures between studies, with occasional notable exceptions. For instance, researchers from King's College London⁽Reference File, Hartley and Elsabagh¹⁹^, Reference File, Jarrett and Fluck²¹^, Reference Duffy, Wiseman and File²²⁾ used the same seven executive function and memory tasks in their flavonoid intervention studies on older populations as had previously been used in a group of 22-to 30-year-old subjects⁽Reference File, Hartley and Elsabagh¹⁹⁾. Two tasks which were non-significant in the earlier study were excluded. While they found the Common Objects Recall Test and the Cambridge Neuropsychological Test Automated Battery (CANTAB) Intra Dimensional/Extra Dimensional Set Shifting Task Rule Learning and Reversal tests to be sensitive to flavonoid treatment in all three studies, these tasks appear to have been rarely employed elsewhere (see Table 2).