Hostname: page-component-586b7cd67f-vdxz6 Total loading time: 0 Render date: 2024-11-25T21:13:50.814Z Has data issue: false hasContentIssue false

Harmonising dietary datasets for global surveillance: methods and findings from the Global Dietary Database

Published online by Cambridge University Press:  19 January 2024

Dimitra Karageorgou*
Affiliation:
Friedman School of Nutrition Science and Policy, Tufts University, 150 Harrison Ave, Boston, MA 02111, USA
Laura Lara Castor
Affiliation:
Friedman School of Nutrition Science and Policy, Tufts University, 150 Harrison Ave, Boston, MA 02111, USA
Victoria Padula de Quadros
Affiliation:
Food and Agriculture Organization of the United Nations, Rome, Italy
Rita Ferreira de Sousa
Affiliation:
Food and Agriculture Organization of the United Nations, Rome, Italy
Bridget Anna Holmes
Affiliation:
Food and Agriculture Organization of the United Nations, Rome, Italy
Sofia Ioannidou
Affiliation:
European Food Safety Authority, Parma, Italy
Dariush Mozaffarian
Affiliation:
Friedman School of Nutrition Science and Policy, Tufts University, 150 Harrison Ave, Boston, MA 02111, USA
Renata Micha
Affiliation:
Friedman School of Nutrition Science and Policy, Tufts University, 150 Harrison Ave, Boston, MA 02111, USA
*
*Corresponding author: Emails [email protected]; [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Objective:

The Global Dietary Database (GDD) expanded its previous methods to harmonise and publicly disseminate individual-level dietary data from nutrition surveys worldwide.

Design:

Analysis of cross-sectional data.

Setting:

Global.

Participants:

General population.

Methods:

Comprehensive methods to streamline the harmonisation of primary, individual-level 24-h recall and food record data worldwide were developed. To standardise the varying food descriptions, FoodEx2 was used, a highly detailed food classification and description system developed and adapted for international use by European Food Safety Authority (EFSA). Standardised processes were developed to: identify eligible surveys; contact data owners; screen surveys for inclusion; harmonise data structure, variable definition and unit and food characterisation; perform data checks and publicly disseminate the harmonised datasets. The GDD joined forces with FAO and EFSA, given the shared goal of harmonising individual-level dietary data worldwide.

Results:

Of 1500 dietary surveys identified, 600 met the eligibility criteria, and 156 were prioritised and contacted; fifty-five surveys were included for harmonisation and, ultimately, fifty two were harmonised. The included surveys were primarily nationally representative (59 %); included high- (39 %), upper-middle (21 %), lower-middle (27 %) and low- (13 %) income countries; usually collected multiple recalls/ records (64 %) and largely captured both sexes, all ages and both rural and urban areas. Surveys from low- and lower-middle v. high- and upper-middle income countries reported fewer nutrients (median 17 v. 30) and rarely included nutrients relevant to diet-related chronic diseases, such as n-3 fatty acids and Na.

Conclusions:

Diverse 24-h recalls/records can be harmonised to provide highly granular, standardised data, supporting nutrition programming, research and capacity development worldwide.

Type
Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of The Nutrition Society

Malnutrition in all its forms is a leading modifiable risk factor for mortality and morbidity globally(1Reference Mozaffarian, Hao and Rimm7). It is, therefore, essential that scientists, policy makers and other stakeholders are able to characterise the whole diet (food, beverage and nutrient intakes), estimate diet-related health, economic and environmental burdens and inform and implement evidence-based priorities(2,3,Reference Juma, Mohamed and Matanje Mwagomba8Reference Haddad, Hawkes and Webb12) . Reliable, comprehensive and regularly collected dietary data in all nations are critical for such work. Given that national patterns hide significant inequalities within countries and populations, it is essential that dietary data are further disaggregated by key subgroups, such as age, sex, ethnicity, socio-economic status and location (e.g. urban or rural)(13). Ideally, dietary data should be nationally representative and collected at the individual level, using validated diet assessment tools, such as 24-h dietary recalls, that collect detailed information on every food item consumed(Reference Micha, Coates and Leclercq14). A plethora of national, subnational and community-level nutrition surveys at the individual level have been conducted worldwide(Reference Miller, Singh and Onopa15). However, these are rarely harmonised, hence not comparable between and within countries, populations and over time, due to differences mainly in representativeness, diet assessment tools used and data analysis and reporting.

Over the past 10 years, dietary surveys from around the world have been collated and harmonised at the food group level, e.g. total fruit intake, as part of the Global Dietary Database (GDD). However, such efforts have not previously accounted for the often large variation in the classification and description of individual food items, for example related to differing food definitions, preparation methods, local food names and more. For example, a food may be described by a simple food name (e.g. chicken), but not capture multiple additional levels of detail (e.g. baked or fried; light or dark meat; with or without skin and with or without added oil or salt). Local food names can also denote different foods in varying countries or even within the same country (e.g. biscuit in the USA v. the UK). Such heterogeneity can lead to inconsistent food names, definitions, groupings, food matching (referring to the process by which a food is assigned to its corresponding nutrient content using a food composition table or database) and ultimately discrepancies in analysis and reporting. Furthermore, inaccurate or inconsistent food aggregation (or classification) of individual food items with similar characteristics into larger food groups/ categories in a hierarchical manner can compound the errors. This could include, for example, heterogeneity in classifying a food (e.g. avocado and tomato) as a fruit v. vegetable or in different levels of food groups (e.g. total vegetables and starchy vegetables). Accurate food classification is especially crucial for mixed dishes and packaged foods, which are an increasing part of the global food supply and must be disaggregated into their ingredients in a standardised fashion to capture intake from all sources. Moreover, even if standardised, such dietary data are rarely publicly available, limiting use by and impact for national and global nutrition communities(Reference Leclercq, Allemand and Balcerzak16).

To address these critical gaps and advance the quality and quantification of dietary intakes worldwide, the GDD, in collaboration with the European Food Safety Authority (EFSA) and the Food and Agriculture Organization of the United Nations (FAO), developed and implemented comprehensive and standardised methods to streamline the collation, harmonisation and public dissemination of primary individual-level dietary datasets worldwide.

Methods

Study overview

The methods for this work build and expand upon the approach of the GDD 2018 to identify, collate and standardise existing dietary data and create a comprehensive and comparable database of diet intakes globally(Reference Miller, Singh and Onopa15). In brief, GDD systematically searched for and identified surveys with quantitative intake data for any of fifty-four foods, beverages and nutrients of interest; extracted intakes using standardised definitions and units per dietary factor, jointly disaggregated by age (23 age groups from 0-6 months to 99+ years), sex, education (low, middle and high), urban/rural residence and pregnancy/lactation status, as available and incorporated these inputs along with relevant covariates into a Bayesian hierarchical prediction model to estimate stratum-specific mean intakes worldwide.

This current project expanded these methods and protocols(Reference Micha, Kalantarian and Wirojratana17Reference Micha, Khatibzadeh and Shi19) to identify, retrieve, harmonise and disseminate individual-level 24-h dietary recall or record data at their finest level (hereinafter referred to as microdata). Standardised criteria, processes and materials – as described in detail below – were developed (Fig. 1, see online supplementary material, Supplemental Table 1).

Fig. 1 The GDD harmonisation process: steps, actions, and developed material. The Global Dietary Database aimed to harmonise dietary surveys with quantitative 24-h recall or food record data from around the world. For details on material description and use, refer to Table S3

Collaborations

Given the shared goal of harmonising individual-level dietary data worldwide, the GDD joined forces with FAO and EFSA(20). In 2017, FAO and the WHO developed the FAO/WHO Global Individual Food consumption data Tool (FAO/WHO GIFT)(Reference Leclercq, Allemand and Balcerzak16,21) , aimed at harmonising and disseminating individual-level dietary microdata, as well as presenting summary statistics in the areas of food consumption, nutrition and food safety(Reference Leclercq, Allemand and Balcerzak16). EFSA, a European agency providing independent scientific advice on food-related risks, developed and maintains the FoodEx2 food classification and description system (FCDS) and has been working, for over 10 years, on the harmonisation of dietary microdata. Each initiative developed and used their own processes and materials with regards to data harmonisation and dissemination. Yet, through our collaboration, we aimed to maximise efficiency by exchanging resources, methodological approaches and complementary experience and expertise.

Eligibility and prioritisation of dietary surveys

Surveys were eligible if they had valid quantitative 24-h recall or food record data and aiming to assess the general population of any country (see online supplementary material, Supplemental Table 2). Surveys with less than 100 participants, evidence of strong selection bias (e.g. special population subsets such as with medical conditions), published before 1980, or not agreeing to make the harmonised microdata publicly available through the GDD platform were excluded. Surveys needed to include a minimum level of detail required for FoodEx2 mapping (i.e. use of the FoodEx2 FCDS to code all reported foods): specifically, foods should not be reported as ‘generic’ terms (e.g. fruit instead of apple, orange, etc.; legumes instead of beans, lentils, etc.).

Due to the time- and resource-intensive nature of FoodEx2 mapping (see below), the GDD aimed to ultimately include and harmonise 40–50 dietary datasets. Surveys were prioritised based on key characteristics, such as being nationally representative and from populous countries (see online supplementary material, Supplemental Table 3) and aiming to include low (LIC), lower-middle (LMIC), upper-middle (UMIC) and high (HIC) income countries(22). Sub-national and community-level surveys were considered if national data were unavailable for a priority country or if the FoodEx2 mapping had been already performed. Surveys that were more recent, included both men and women and diverse ages, and captured both rural and urban areas were further prioritised. For publicly available surveys, those with datasets in English or Spanish that could be readily managed by the GDD investigator team were prioritised.

Survey identification

The GDD searched for public and non-public dietary surveys in multiple electronic databases and through extensive direct communications with data owners, nutrition experts and collaborators worldwide. The survey lists of the GDD 2018(Reference Miller, Singh and Onopa15,23) , EFSA(20,24) and FAO/WHO GIFT database were screened(21). The GDD 2018 incorporated 1220 public and non-public surveys, including 286 with 24-h recall data covering all world regions. The EFSA database comprised sixty-nine surveys with 24-h recall/ food record data from European countries (fourteen in common with GDD 2017) for which harmonisation and FoodEx2 mapping had been completed, but the datasets were not publicly available(20). The FAO had identified 336 surveys with 24-h recall data (110 in common with GDD 2018) for harmonisation(21). Complementary online searches were performed in PubMed and Google to identify more recent surveys through December 2019. The main search terms included: ‘nutrition survey’ OR ‘diet survey’ OR ‘dietary survey’ OR ‘nutrient survey’ AND ’24-h recall’ OR ‘diet recall’ OR ‘food record’ OR ‘diet record’ OR ‘food diary’ AND ‘national’ OR ‘nationally representative’ AND [‘country of interest’].

Survey contacting, screening and inclusion

Standardised algorithms and email templates (see online supplementary material, Supplemental Table 1, Supplemental Figure 1) were developed for contacting the data owners of eligible surveys. To augment internal and external consistency and data validity, the GDD, in collaboration with FAO, aimed to establish contact with all prioritised surveys, including publicly available ones. Data owners were invited to participate, harmonise and share their data under one of three options: (1) structure their data per the GDD requirements (see next section for details) and perform the FoodEx2 mapping; (2) structure their data for the FoodEx2 mapping to be performed by the GDD investigators or (3) share their original raw microdata to be structured and mapped to FoodEx2 by the GDD investigators.

Interested data owners completed a nine-question Short Survey Questionnaire (see online supplementary material, Supplemental File 1) to confirm 24-h recall or food record data availability and structure. Questions assessed mixed dish disaggregation, English translation, food matching, FoodEx2 mapping, number of unique food items and mixed dishes and data sharing option (of the three described above). One investigator screened responses, with questions resolved by consensus. The Short Survey Questionnaire did not apply to already harmonised EFSA and FAO/WHO GIFT surveys. For eligible non-public datasets surveys, data owning institutions signed a data sharing agreement; for public datasets, eligibility for re-posting was determined by their terms of use.

Data owners were offered remuneration for harmonisation efforts of up to $7700 per harmonised survey; 57 % of data owners accepted remuneration (mean $2584 per survey). The survey-specific amount was determined based on an algorithm, which was informed by the Short Survey Questionnaire and accounted for the complexity of the required efforts.

Data retrieval and harmonisation

Standardised protocols and forms were developed to retrieve survey information and data in a systematic manner and achieve high-quality harmonisation. Survey information was retrieved using a metadata form – adapted from the FAO relevant one – and it captured, among other characteristics, survey name, country, year(s), sampling methods, representativeness, sample (size, age and sex), diet assessment method (seasons and days covered, single v. multiple, portion size estimation aids) and food composition table/database (FCT/FCDB) (see online supplementary material, Supplemental File 2). Individual-level participant and dietary data (termed microdata) were retrieved using a standardised codebook and extraction template, building on those developed by EFSA. The codebook included instructions on how data owners should structure their data; the variables (name, definition and values) to use for reporting participant data, such as socio-demographics (age, sex and education), anthropometry (weight, height and BMI), pregnancy/lactation status, breast-feeding status and physical activity (see online supplementary material, Supplemental Table 4) and the variables (name, definition, unit of measurement and values) to use for the dietary data, which included all food items consumed per participant, meal type (e.g. breakfast, lunch), time and place of consumption, amount consumed and contents of energy and thirty-nine nutrients (see online supplementary material, Supplemental Table 5).

Microdata harmonisation

Microdata harmonisation included standardising variable definitions and coding, data structure and food description. For each step, a standardised process (see online supplementary material, Supplemental Figure 2) and instructions (see online supplementary material, Supplemental Figure 3) were developed to maximise validity and consistency. Each variable in the participant and dietary datasets was characterised according to the standardised codebook previously mentioned (see online supplementary material, Supplemental Tables 4 and 5). The dietary data structure included individual food items consumed per participant, each presented in a separate row along with the corresponding amount and nutrient content. Mixed dishes were disaggregated, as possible, into all ingredients and corresponding amounts (e.g. lentils (200 g) = lentils (130 g), tomatoes (50g), onion (10g), olive oil (10g)); if disaggregation was not part of the data collection, data owners were asked to perform it post hoc, based on commonly used recipes among the reference population. Nutrients were assigned to each food item and disaggregated ingredients; food matching was performed by the data owners using the FCT/FCDB they considered as more appropriate for their needs. All reported data were retrieved in English. For datasets harmonised by EFSA and FAO, variables were renamed, recoded and retrieved (e.g. nutrient intakes for EFSA datasets and additional socio-demographics, such as education, for all datasets), as appropriate, so that all harmonised datasets included the GDD standardised set of variables.

FoodEx2 mapping

Each food item and disaggregated ingredient was coded according to FoodEx2(25), an FCDS including >4540-terms organised across twenty-one top-level food groups, each further split into up to seven levels of food groups or foods (see online supplementary material, Supplemental Table 6) and a twenty-eight-facet description system (see online supplementary material, Supplemental Table 7). FoodEx2 facets include, for example, cooking and other processes (frying, mincing, sugar-coating, etc.), sweetening agent (including low- or non-energetic sweeteners), production method (organic farming, aquaculture, etc.); yet, not all facets are relevant to food consumption (e.g. packaging material is relevant to food safety). All FoodEx2 foods and facet descriptors have a unique five-character alphanumeric code. For all datasets, each individual food item, after mixed dish disaggregation, was mapped to the appropriate FoodEx2 food term. For food items that were further described, as many facets possible were used to describe all additional characteristics reported (see example below).

Mapping was performed manually for each food item by a trained coder using the FoodEx2 Catalogue browser(26,Reference Ioannidou27) , a desktop application which includes all FoodEx2 foods and facets, allows iterative building of coding and accounts for multiple mapping rules. The GDD investigators received a rigorous 3-day in-person formal training at EFSA’s headquarters(28) and observed a data owner training session delivered by FAO. To enable data owners to also perform high-quality FoodEx2 mapping themselves, a structured training process and relevant materials (see online supplementary material, Supplemental Table 1) were developed; all were reviewed and approved by EFSA. The training materials included a presentation on FoodEx2 theory, concepts, rules and mapping examples (see online supplementary material, Supplemental Figure 3), and a thirty-two-food item exercise to practice the theory learned; EFSA’s FoodEx2 technical report(25) and webinars(29) were further shared as reference materials. Moreover, a 2-hour web call was held to provide clarifications on FoodEx2 mapping and map survey-specific foods. Success of training was validated by asking data owners to map 100 food items from their dataset and hand-checking the results (see online supplementary material, Supplemental Figure 2), with follow-up personalised training as needed.

Quality control

Data integrity and quality were assessed throughout the retrieval and harmonisation process. All metadata forms were reviewed by one investigator to confirm all fields were completed correctly; with random double checks by a second investigator. Microdata retrieval and harmonisation included specific actions and milestones (see online supplementary material, Supplemental Figure 2), which served as intermediate check points. This included whether datasets had all variables as dictated by the codebook, whether mixed dishes were fully disaggregated, assessment for outliers and errors in amounts consumed based on FoodEx2-code specific EFSA plausibility thresholds and visual inspection of nutrient intakes at the food item level. In addition, all FoodEx2 mapping performed by data owners was reviewed by one investigator for errors with random second reviews of 20–25 % of the foods by a second investigator. For FoodEx2 mapping performed by GDD, up to 50 % of the foods were reviewed randomly by a second investigator. Surveys already harmonised by EFSA and FAO were checked using each initiative’s internal protocols. Data checks were performed using Stata v14·0 (StataCorp LLC). Identified potential errors, inconsistencies or implausible values were discussed within the GDD team, followed by direct communication with data owners to obtain clarifications and additional consultation with EFSA for remaining uncertainties.

Dissemination

For each survey, the metadata, microdata and corresponding standardised codebook were prepared. All harmonised surveys are publicly available and free to download through the GDD website,(23) with parallel availability through the FAO/WHO GIFT website for consenting surveys; any future harmonised surveys will be added on a rolling basis. Three files are available for each harmonised survey: the participant dataset, the diet dataset and the codebook, which includes the citation of the harmonised survey, survey metadata, all data variables and instructions on how to use FoodEx2 to classify foods into food groups.

Results

Survey identification, screening and inclusion

Of 1500 surveys identified, 600 were eligible for inclusion (Fig. 2). Of those, GDD prioritised and contacted 156 surveys. Of the 156 surveys, twelve (8 %) were excluded based on the Short Survey Questionnaire, e.g. related to qualitative data collection or microdata no longer available; twenty three (15 %) refused to participate; thirty eight (25 %) never responded and twenty-eight surveys (18 %) responded but never made a final decision about participating or not. Overall, fifty-five surveys (35 %) were ultimately included for harmonisation, of which forty two (76 %) were not publicly available (Table 1). Of the fifty-five surveys, twenty four were retrieved directly from data owners, fifteen from EFSA and sixteen from FAO/WHO GIFT.

Fig. 2 Screening and selection process of dietary surveys with valid 24-h recall or food record data for GDD harmonisation. All public and non-public priority surveys were contacted, with the exception of the three NHANES surveys, for which the harmonisation could be performed independently. Of the 55 surveys included for harmonisation, the harmonisation has not been completed for three of these due to time and resource restrictions

Table 1 Characteristics and progress status of the surveys included for harmonisation

12hR, 12-h recall; 24hR, 24-h recall; ARG, Argentina; BGD, Bangladesh; BRA, Brazil; BGR, Bulgaria; BFA, Burkina Faso; CAN, Canada; COD, Democratic Republic of the Congo; DEU, Germany; EFSA, European Food Safety Authority; ECU, Ecuador; ESP, Spain; EST, Estonia; ETH, Ethiopia; F, female; FIN, Finland; FR, food record; GDD, Global Dietary Database; GTM, Guatemala; IND, India; IRN, Iran, Islamic Republic of; ISR, Israel; ITA, Italy; KEN, Kenya; KOR, Republic of Korea; LAO, Lao People’s Democratic Republic; M, male; MEX, Mexico; MOZ, Mozambique; MYS, Malaysia; PAK, Pakistan; PHL, Philippines; PRT, Portugal; ROU, Romania; SVK, Slovakia; SWE, Sweden; TWN, Taiwan; TZA, Tanzania; UGA, Uganda; USA, United States of America; ZMB, Zambia.

* As sub-nationals were considered the surveys which were representative of a major region of the country and as community-level the surveys which were representative of a single city, town or village.

Rolling programs refer to surveys whose administration is repeated every year or every few years; these surveys may or may not follow the same population.

The reported sample size, age range and sex refer to the participants with available dietary data and not to the overall survey sample.

§ Country-income level was based on the World Bank classification of countries into low-income (LIC), lower-middle income (LMIC), upper-middle income (UMIC) and high-income (HIC) countries.(22).

|| The number in the parenthesis is the number of recalls or records collected per participant. When multiple recall/ record is reported, it applies to either part of the sample or all participants.

Financial support was offered to the data owners of all surveys, with the exception of publicly available datasets and surveys whose harmonisation had been already completed by FAO.

** The leading initiative was responsible for harmonising the dietary data or for guiding the data owners to perform the harmonisation themselves.

*† Refers to the data sharing option selected by data owners, and it is only applicable to surveys whose harmonisation was led by GDD. Option A refers to data owners agreeing to structure their data per the GDD requirements and perform the FoodEx2 mapping; option B includes agreement to data structure only and GDD to perform the FoodEx2 mapping and option C refers to sharing data as is, without structure or FoodEx2 mapping and GDD performs the overall harmonisation. NA refers to surveys that have been harmonised by FAO or EFSA.

*‡ Refers to the harmonisation status of each survey. Three of the fifty-five surveys that were included for harmonisation were not harmonised due to time and resources restrictions.

EFSA oversaw the data structure and FoodEx2 mapping and GDD the addition of food matching.

*|| Data were collected in the country language (Bengali or Pilipino), but the English version of the datasets was also available in the original public source.

Food matching is missing. The nutrient content for the reported food items was not allowed to be made publicly available. For Spain, food matching was not performed due to the COVID-19 pandemic.

Data harmonisation

For the twenty-four surveys retrieved from data owners, eleven agreed to provide both data structure (including mixed dish disaggregation and food matching) and FoodEx2 mapping; five agreed to data structure only and the GDD performed the FoodEx2 mapping and eight provided their data as is, including seven publicly available ones, which the GDD fully harmonised (Table 2). For fifteen surveys retrieved through EFSA that were lacking the food matching, thirteen agreed to perform this following contact by the GDD. All sixteen surveys retrieved through the FAO/WHO GIFT platform were fully harmonised. Among data owners performing the harmonisation, the most resource-intensive action was mixed dish disaggregation (generally 2–6 months to complete), followed by FoodEx2 mapping (1–4 months) and food matching (1–3 months). With experience, such as in the GDD investigator team working with surveys over time, the harmonisation of each public dataset took from 1 to 3 months. Common challenges included understanding the reported information and data, especially related to variable dataset documentation, and FoodEx2 mapping for highly localised food items.

Table 2 Survey and dietary data characteristics of the included 24-h recall/record dietary surveys by country income category

HIC, high-income countries; LMIC, lower-middle income countries; LIC, low-income countries; UMIC, upper-middle income countries; FCDS, food classification and description system; IQR, interquartile range.

* The sum of surveys across the different assessment methods is more than the overall sum because three surveys used both 24-h recall and food record.

The institutional home refers to the data owning institution/organisation of the survey data. The data owning institution could be either governmental (e.g. ministry, statistical agency), an academic institution (e.g. university), other institution (e.g. non-governmental organisations, independent research institutes), or individuals who self-funded the survey data collection.

This applies only to the twenty-four GDD surveys. The surveys the GDD received from FAO and EFSA were already harmonised.

§ Values refer to the harmonised – not to the original – dataset. Only surveys whose harmonisation was complete (n 52 of fifty-five available) were used.

Survey characteristics

The fifty-five surveys included for harmonisation were from thirty-five countries (Fig. 3) and included twenty-four surveys from HIC, ten from UMIC, fifteen from LMIC and six from LIC (Table 1). Most surveys (n 32, 58 %) were nationally representative, fifteen (27 %) were sub-national, and eight (15 %) were at the community level, especially in LIC and LMIC where nationally representative 24-h recall data were limited (Table 2). Half of the surveys (n 27) were rolling programs. About two-thirds (n 34, 62 %) were performed after 2010 and few (n 2, 4 %) before 2000. The predominant diet assessment method was 24-h recall (n 43, 78 %), followed by food records (n 9, 16 %), whereas three (5 %) surveys used both methods. Most surveys (n 38, 69 %) collected multiple recall/ record data for a subset or all participants. Generally, LIC and LMIC had smaller-scale surveys (median 826 participants; IQR 434–1665), compared with HIC and UMIC (1832; 893–6860). Most surveys (n 44, 80 %) collected data for both sexes; most (n 49, 89 %), on children and adolescents (ages 0–19 years), including twenty-seven surveys with data on children under 5 years of age; and nearly two-thirds (n 32, 58 %), from both rural and urban areas. Twelve of the twenty-one surveys from LIC and LMIC were administered in children under 5 years of age and/or women of reproductive age. Data reporting language varied across and within datasets (twenty-eight languages and multiple dialects). Surveys were originally collected using pen and paper in forty-four surveys (80 %), with computer-assisted interviews only in HIC and UMIC. Survey data owners varied between governmental institutions (n 22), academic institutions (n 22), independent research or non-governmental organisations (n 9) and individual scientists (n 2).

Fig. 3 Availability of harmonised dietary surveys per country. The map presents the 55 surveys, from 35 countries, that have agreed to the GDD harmonisation. Of these, the harmonisation has not been completed for three surveys (one in Brazil, one in Taiwan and one in the USA) due to time and resource restrictions

Survey data

Surveys originally had either no (n 39) or a national or survey-specific (n 16) food classification system in place. The number of reported unique food items varied across datasets (median 719, IQR 232–1804, max 6063) (Fig. 4). All LIC surveys had less than 250 unique food items (149, 92–184); LMIC (269, 87–684) and UMIC (717, 458–1096) surveys included, in general, less than 1000 food items; and HIC surveys generally included over 1000 unique food items (1602, 1068–2784). Mixed dish disaggregation was originally available for twelve of the twenty-four surveys retrieved from data owners; for ten, disaggregation was performed as part of the harmonisation; for two publicly available surveys, this information was not available. For eight datasets, the food description remained in the original language, because FoodEx2 mapping, which enables translation into English, was performed by data owners.

Fig. 4. Number of unique food items per dietary survey by country income level. The top panel shows the distribution of surveys from low-income (LIC), lower-middle income (LMIC), upper-middle income (UMIC), and high-income countries (HIC) across 4 range categories of unique food items per survey. All 55 surveys included for harmonisation were used. The bottom panel shows the absolute number of unique food items per survey grouped by country income level. Boxplots are shown, with horizontal lines representing the median value; shaded bars representing the interquartile range (IQR); whiskers representing maximum (top) and minimum (bottom) value excluding outliers, x’s representing the mean value and circles representing outliers. Only surveys whose harmonisation was complete were included (n 52 of 55)

All datasets captured the whole diet; more than half (n 31) further captured water consumption. Food matching in HIC and UMIC relied mainly on national FCT/FCDB (twenty-five of twenty-nine surveys with nutrient intakes reported) that contained a large number of available foods (1626, 1040–4300) and nutrients. LIC and LMIC relied also primarily on local national FCT (fourteen of eighteen surveys with nutrient intakes reported), but with a lower number of available food items (343, 152–696) and nutrients; to incorporate additional nutrients into their dataset, data owners generally attempted to use large FCDB of HIC, such as the USDA Food Database (the metadata information that is published as part of the harmonised data includes the specific FCT/FCDB used for each survey).

Four surveys only reported on foods, of which three publicly available ones did not permit the nutrient composition to be made public and 1 did not manage to add the food matching due to constraints related to the COVID-19 pandemic. In the remaining forty-eight surveys, twenty-four (sd: ±8) nutrients were available on average per dataset (Table 2), with generally fewer available in LIC and LMIC (19, ±5) v. in HIC and UMIC (28, ±7). Energy was available in all forty-eight datasets reporting nutrient intakes, followed by total protein, carbohydrates and total fat (each N 47; Fig.5). The most frequently reported vitamins and minerals were Ca (n 47), Fe (n 46), vitamin B1 (n 46), vitamin B2 (n 46) and vitamin C (n 46) and the least, vitamin K (n 9) and iodine (n 9). Other less frequently reported dietary factors included protein subtypes (n 1–6 surveys) and plant n-3 fatty acids (n 6).

Fig. 5 Availability of energy and nutrients across harmonised dietary datasets, by survey representativeness. Only surveys whose harmonisation has been completed and they report nutrient intakes are presented (n 48 of 55). The figure presents the 40 nutrients requested; datasets may contain additional nutrients (e.g. total sucrose, retinol) that are not listed here

Public dissemination

As of October 2023, the final output of the harmonisation process are fifty-two surveys, which have been harmonised and made publicly available and free to download through the GDD website (www.globaldietarydatabase.org)(23); twenty seven of these are also available through the FAO/WHO GIFT website (www.fao.org/gift-individual-food-consumption)(21).

Discussion

This investigation reports on the process and results for identification, retrieval, harmonisation and public dissemination of individual-level dietary data at their finest level from dietary surveys around the world. The GDD identified and included for harmonisation fifty-five surveys – fifty two were ultimately harmonised – using 24-h recalls (78 %), food records (16 %) or both (5 %) from thirty-five countries. Most surveys (58 %) were nationally representative, and countries of all income levels were represented. The surveys largely captured both sexes and diverse ages from birth to late in life and generally collected data for both rural and urban areas. Notably, the majority of these surveys (76 %) were not previously publicly available, requiring extensive work to contact data owning institutions, access data and complete data sharing agreements for public dissemination. Thus far, these harmonised surveys represent the most comprehensive collection of comparable, standardised dietary data at their most granular level globally.

Survey methods for collecting and reporting data were variable, including dataset structure, reporting language, available level of detail in recorded foods and use of an FCDS. This meant that data harmonisation and analysis from diverse countries were particularly complex undertakings. For example, reporting language included over twenty-eight languages and dialects across the fifty-five datasets, stressing the importance of harmonisation to a common language and terminology across datasets. In addition, the combination of open-ended questions in a 24-h recall or record plus local food names resulted in similar items being reported differently across datasets and with varying detail (e.g. ‘grilled beef’ v. ‘steak, cow, barbequed’). Extensive efforts and clarifications by data owners were often required to fully understand the foods reported. Many surveys had not previously disaggregated their mixed dishes, preventing valid estimates of intakes of different food groups (e.g. fruits, vegetables and meats). Finally, most surveys had not used an FCDS prior to this effort, and those that did used a local or survey-specific one, not comparable across datasets. This effort demonstrated that such foundational challenges can be addressed with a coordinated effort to harmonise and share the original survey information and it greatly advanced data completeness, comparability and availability.

The availability of nutrient estimates following harmonisation and comprehensiveness of the FCT used also varied substantially. In general, HIC and UMIC reported more nutrients than LIC and LMIC (median 30 v. 17). LIC and LMIC typically focused on energy, total protein, carbohydrates, total fat and key vitamins and minerals relevant to specific micronutrient deficiencies in maternal and child health, such as Ca, Fe, Zn and vitamins A, B1, B2 and C. These surveys rarely reported nutrients relevant to diet-related chronic diseases, such as total polyunsaturated fat, plant or seafood n-3 fat, trans fat, Na or potassium. FCT were also typically more comprehensive in HIC and UMIC than in LIC and LMIC (median ∼1626 v. ∼343 food items). This partly reflects the greater diversity of available food items commonly consumed in high-income nations, such as specific packaged and processed foods, compared with low-income nations where staples and variety are much smaller. However, this also reflects less available local food composition data in middle- and low-income nations. Given the societal burdens of malnutrition in all its forms in every country worldwide, including diet-related chronic diseases(13), our findings highlight the need for more comprehensive nutrient assessments in LIC and LMIC worldwide, coupled with investment in expanded local FCT.

Our extensive searches only identified twenty-one 24-h recall or record surveys from LIC and LMIC available for inclusion in this effort. These surveys were usually administered within a specific region or community; were small-scale and aimed to evaluate specific nutrients and, as such, nutrient deficiencies, especially focused on maternal and young child nutrition. Such surveys also most often collected data via pen and paper, raising challenges for standardised data collection and future data manipulation(Reference Illner, Freisling and Boeing30). Half of the included surveys (49 %) were rolling programs, i.e. with planned additional cycles of dietary assessment. This raises the possibility of far more efficient, or even prospective, data harmonisation and FoodEx2 mapping in future survey cycles. However, most rolling programs were in HIC or UMIC (n 22), fewer were in LMIC (n 5), and none were identified in LIC. These findings stress such disparities in dietary data availability and quality across countries of different income levels, emphasizing the need for substantial new investments in dietary surveillance and harmonisation efforts in LIC. For example, ongoing major investments in supplementation programs, crop biofortification and food fortification will be more inefficient and less effective without reliable, valid surveillance data on the specific nutrient gaps in specific population subgroups and the specific food intakes that should be targets of fortification delivery intakes.

The current investigation has several strengths. Systematic searches were performed across multiple online databases, complemented by extensive contacts with data owners and nutrition experts worldwide to identify surveys with valid 24-h recall or food record data. The collaboration with FAO and EFSA advanced global coordination including sharing of expertise, experiences, methodological approaches and networks. Standardised methods, protocols and materials were developed for all steps of this effort in order to reduce the likelihood of error during data acquisition and achieve high-quality harmonisation. Individual-level dietary data at their finest level, as well as detailed socio-demographic information per participant, were collected and harmonised, which altogether capture the overall diet of the population, minimise errors in reporting and analysis and provide critical information on dietary heterogeneity within countries and populations. A common FCDS, FoodEx2, was used to address the challenge of food description and classification variations, improve data interpretability, allow for detailed analysis and enable comparability of estimated diet intakes within and between nations, further overcoming any language barriers. To ensure data integrity and maximise internal and external consistency, data owners were extensively trained on restructuring and harmonizing their datasets using FoodEx2 and rigorous data checks throughout the harmonisation process were performed. The outcome of this process, the harmonisation methods and final datasets were made publicly available and freely accessible. These can serve as a critical public resource for researchers, health agencies and governments to inform future dietary data collection efforts and promote global collaboration and capacity development. Such timely, reliable and comparable nutrition methods and data can be leveraged to understand, quantify and address the corresponding nutrition burden locally, nationally and across the world.

Potential limitations should be considered. Nationally representative 24-h recall/record data were less common in LIC and LMIC, requiring greater reliance on subnational surveys. Variability in primary data collection methods and details, as expected, precluded collecting all variables of interest, such as socio-demographics, dietary information and nutrients and leveraging all FoodEx2 facets for every food item. However, by using the rigorous protocols and materials developed, all available information in the primary datasets were converted or derived and standardised. While ultimately fifty-five surveys were included by the GDD for harmonisation, not all potentially eligible data owning institutions were able to participate. Interestingly, the majority of exclusions were due to non-response, no final decision, unwillingness to share their data, with only a few noting workload or resource challenges. This raises the question of whether strengthened global coordination, including prioritisation from national governments, could significantly increase such dietary harmonisation efforts with additional resource investment.

In summary, this effort demonstrates that new investments in institutional support and training can result in diverse global 24-h dietary recalls/records being harmonised to provide highly granular, standardised and publicly available dietary data from around the world. Such high-quality comparable data can promote and support nutrition programming, research and capacity development worldwide. This work also identifies key gaps in 24-h recall data collection, facilitates and promotes country capacity development that can alleviate many of the barriers that prevent routine harmonised collection of 24-h recall data and highlights the importance of collecting comprehensive harmonised dietary data systematically in all nations.

Acknowledgements

We would like to thank Agnieszka Balcerzak, Pauline Allemand and Teresa Bevere from FAO and Catherine Leclercq from CREA for their contribution to this work.

The Global Dietary Database harmonisation collaborators

Ute Alexy, University of Bonn; Jennifer Ali, Statistics Canada; Lilian Aluso, Consultative Group on International Agricultural Research; Rosangela Alves Pereira, Federal University of Rio de Janeiro; Imelda Angeles Agdeppa, Food and Nutrition Research Institute; Joanne E. Arsenault, University of California Davis; Laxmaiah Avula, ICMR-National Institute of Nutrition; Donka Baykova, Medical Center Markovs; Julia Boedecker, Consultative Group on International Agricultural Research; Ken Brown, University of California Davis; Zipporah Bukania, Kenya Medical Research Institute; Mario Capanzana, Food and Nutrition Research Institute; Hsing-Yi Chang, National Health Research Institutes; Justin Chileshe, Tropical Diseases Research Center; Alan de Brauw, International Food Policy Research Institute; Stefaan De Henaw, Ghent University; Amanda de Moura Souza, Universidad Federal de Rio de Janeiro; Josie Desnacido, Food and Nutrition Research Institute; Tshilenge S. Diba, University of Colorado; Daovieng Douangvichit, Lao Tropical and Public Health Institute; Vesselka Duleva, National Centre of Public Health and Analyses; Laila Eleraky, University of Hohenheim; Maijaliisa Erkkola, University of Helsinki; Getahun Ersino, University of Saskatchewan; Riitta Freese, University of Helsinki; Rebecca Goldsmith, Israel Center for Disease Control; Ron Gravel, Statistics Canada; K. Michael Hambidge, University of Colorado; Carol Henry, University of Saskatchewan; Christine Hotz; Meshram Indrapal, ICMR-National Institute of Nutrition; Veronika Janská, Public Health Authority of the Slovak Republic; Irmgard Jordan, Justus Liebig University Giessen; Tilakavati Karupaiah, Taylor’s University; Gudrun B. Keding, University of Goettingen; Lital Keinan-Boker, Israel Center for Disease Control; Umber S Khan, University of Colorado; Yeri Kombe, Kenya Medical Research Institute; Liisa M Korkalo, University of Helsinki; Nancy Krebs, University of Colorado; Maarja Kukk, National Institute for Health Development; Sanghui Kweon, Korea Centers for Disease Control and Prevention; Carl Lachat, Ghent University; Anna Karin Lindroos, National Food Agency; Julie Long, University of Colorado; Carla Lopes, Institute of Public Health - University of Porto; Chanthaly Luangphaxay, Lao Tropical and Public Health Institute; Maryam Maghroun, Isfahan Cardiovascular Research Institute; Yves Martin-Prevel, Institut de Recherche pour le Développement; Shivanand C. Mastiholi, University of Colorado; J.V. Meenakshi, University of Delhi; Noushin Mohammadifard, Isfahan Cardiovascular Research Institute; Carlos Monteiro, University of Sao Paulo; Lotta Moraeus, National Food Agency; Mourad Moursi, International Food Policy Research Institute; Moses Mwangi, Kenya Medical Research Institute; Sydney Mwanza, Tropical Diseases Research Center; Keiu Nelis, National Institute for Health Development; Anca Nicolau, Dunărea de Jos University of Galaţi; Laetitia Nikiema, WHO; Arlappa Nimmathota, ICMR-National Institute of Nutrition; Ute Nothlings, University of Bonn; Eha Nurk, National Institute for Health Development; Francis Oduor, Consultative Group on International Agricultural Research; Kyungwon Oh, Korea Centers for Disease Control and Prevention; Wen-Harn Pan, Academia Sinica; Myungsook Park, Korea Centers for Disease Control and Prevention; Jessica Petrelius Sipinen, National Food Agency; Stefka Petrova, National Centre of Public Health and Analyses; Raffaela Piccinelli, CREA-Alimenti e Nutrizione; Harikumar Rachakulla, ICMR-National Institute of Nutrition; Hemalatha Rajkumar, ICMR-National Institute of Nutrition; Ivonne Ramirez Silva, Instituto Nacional de Salud Publica; Lalka Rangelova, National Centre of Public Health and Analyses; Lourdes Ribas-Barba, Fundación para la Investigación Nutricional; Juan Rivera, Instituto Nacional de Salud Publica; Sonia Rodriguez Ramirez, Instituto Nacional de Salud Publica; Eva Roos, University of Helsinki; Alicia Rovirosa, Centro de Estudios sobre Nutrición Infantil; Constance Rybak, Leibniz Centre for Agricultural Landscape Research; Nizal Sarrafzadegan, Isfahan Cardiovascular Research Institute; Lluis Serra-Majem, University of Las Palmas de Gran Canaria; Stefania Sette, CREA-Alimenti e Nutrizione; Milton Severo, Institute of Public Health - University of Porto; Tal Shimony, Israel Center for Disease Control; Ward Siamusantu, National Food and Nutrition Commission; Rosely Sichieri, Universidade do Estado do Rio de Janeiro; Latsamy Siengsounthone, Lao Tropical and Public Health Institute; Wolfgang Stuetz, University of Hohenheim; Kalyana Sundram, Malaysian Palm Oil Council; Tina Tao, Statistics Canada; Tamene Taye Asayehu, Addis Ababa Science and Technology University; Gabriela Tejeda, University of Colorado; Celine Termote, Alliance of Bioversity International and CIAT; Duarte Torres, University of Porto; Cornelia Tudorie, Dunărea de Jos University of Galaţi; Aida Turrini, CREA-Alimenti e Nutrizione; Sirje Vaask, Tallinn University; Eva Warensjö Lemming, National Food Agency; Lydiah M. Waswa, University of Goettingen; Elizabeth Yakes-Jimenez, University of New Mexico; Maria Elisa Zapata, Universidad del Centro Educativo Latinoamericano; Gordon Zello, University of Saskatchewan; Corina Aurelia Zugravu, University of Medicine and Pharmacy Carol Davila.

Disclaimer for FAO

The views expressed in this publication are those of the author(s) and do not necessarily reflect the views or policies of the FAO of the UN.

Disclaimer for EFSA

The position and opinions presented in this article are those of the authors/s alone and do not necessarily represent the views/any official position or scientific work of EFSA.

Financial support

This study was supported by the Bill & Melinda Gates Foundation (grant OPP1176682 to Dr. Mozaffarian). The funder had no role in study design, data collection, data analysis, data interpretation or writing of the report.

Conflict of interest

D.K. reports contract with Development Initiatives, outside the submitted work. L.L.C. reports research funding from the Bill & Melinda Gates Foundation and Consejo Nacional de Ciencia y Tecnologia for this manuscript and from the American Heart Association outside the submitted work. D.M. reports research funding from the National Institutes of Health, the Gates Foundation, The Rockefeller Foundation, Vail Innovative Global Research and Kaiser Permanente; chapter royalties from UpToDate; consulting fees from Acasti Pharma, Barilla, Danone and Motif FoodWorks; advisory board member for Beren Therapeutics, Brightseed, Calibrate, Discern Dx, Elysium Health, Filtricine, HumanCo, January Inc., Perfect Day, Tiny Organics and (ended) Day Two and Season Health and stocks from Calibrate and HumanCo, all outside the submitted work. RM reports grants from National Institute of Health, Nestle and Danone and consulting fees from Development Initiatives, outside the submitted work. V.P.Q., R.F.S., B.A.H. and S.I. have no interests to declare.

Authorship

R.M. and D.M. conceptualised and designed the study. D.K. and L.L.C. were involved in the systematic searches, data collection, harmonisation material and process development, data harmonisation. V.P.Q., R.F.S. and B.A.H. (FAO/WHO GIFT) facilitated the introduction of GDD to the data owners they were collaborating with and reviewed and provided high-level input for the harmonisation processes and materials the GDD developed and used. S.I. (EFSA) reviewed and provided feedback on the FoodEx2 training materials developed by the GDD and supported, as needed, the FoodEx2 mapping process. D.K. conducted the analyses for the study. D.K. wrote the first draft, and D.K., R.M. and D.M. the subsequent drafts of the manuscript. All the authors read the final manuscript, provided feedback and approved its submission.

Supplementary material

For supplementary material accompanying this paper visit https://doi.org/10.1017/S1368980024000211

Ethics of human subject participation

This investigation was exempt from institutional review board review as it was based on publicly available or privately held de-identified datasets with no personally identifiable information (e.g. name, address). There was not public involvement in the study.

References

Collaborators GBDRF (2020) Global burden of 87 risk factors in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 396, 1223–49.CrossRefGoogle Scholar
World Health Organization (2010) Global Status Report on Noncommunicable Diseases 2010. Geneva: World Health Organization.Google Scholar
World Health Organization (2018) Noncommunicable Diseases Country Profiles 2018. Geneva: World Health Organization.Google Scholar
Pimpin, L, Kranz, S, Liu, E et al. (2019) Effects of animal protein supplementation of mothers, preterm infants, and term infants on growth outcomes in childhood: a systematic review and meta-analysis of randomized trials. Am J Clin Nutr 110, 410–29.CrossRefGoogle ScholarPubMed
Micha, R, Shulkin, ML, Penalvo, JL et al. (2017) Etiologic effects and optimal intakes of foods and nutrients for risk of cardiovascular diseases and diabetes: systematic reviews and meta-analyses from the Nutrition and Chronic Diseases Expert Group (NutriCoDE). PLOS ONE 12, e0175149.CrossRefGoogle Scholar
Mozaffarian, D, Wilson, PW, Kannel, WB (2008) Beyond established and novel risk factors: lifestyle risk factors for cardiovascular disease. Circulation 117, 3031–8.CrossRefGoogle ScholarPubMed
Mozaffarian, D, Hao, T, Rimm, EB et al. (2011) Changes in diet and lifestyle and long-term weight gain in women and men. N Engl J Med 364, 2392–404.CrossRefGoogle ScholarPubMed
Juma, PA, Mohamed, SF, Matanje Mwagomba, BL et al. (2018) Non-communicable disease prevention policy process in five African countries. BMC Public Health 18, 961.CrossRefGoogle ScholarPubMed
Mozaffarian, D & Capewell, S (2011) United Nations’ dietary policies to prevent cardiovascular disease. BMJ 343, d5747.CrossRefGoogle ScholarPubMed
Arnett, DK, Blumenthal, RS, Albert, MA et al. (2019) 2019 ACC/AHA guideline on the primary prevention of cardiovascular disease: executive summary: a report of the American college of cardiology/American heart association task force on clinical practice guidelines. Circulation 140, e563e95.Google Scholar
World Health Organization (2018) Global Nutrition Policy Review 2016–2017: Country Progress in Creating Enabling Policy Environments for Promoting Healthy Diets and Nutrition. Geneva: World Health Organization.Google Scholar
Haddad, L, Hawkes, C, Webb, P et al. (2016) A new global research agenda for food. Nature 540, 30–2.CrossRefGoogle ScholarPubMed
Development Initiatives (2020) 2020 Global Nutrition Report: Action on equity to end malnutrition. Bristol, UK: Development Initiatives. available at https://globalnutritionreport.org/reports/2020-global-nutrition-report/ (accessed February 2023).Google Scholar
Micha, R, Coates, J, Leclercq, C et al. (2018) Global dietary surveillance: data gaps and challenges. Food Nutr Bull 39, 175205.CrossRefGoogle ScholarPubMed
Miller, V, Singh, GM, Onopa, J et al. (2021) Global dietary database 2017: data availability and gaps on 54 major foods, beverages and nutrients among 5.6 million children and adults from 1220 surveys worldwide. BMJ Glob Health 6, e003585.Google Scholar
Leclercq, C, Allemand, P, Balcerzak, A et al. (2019) FAO/WHO GIFT (global individual food consumption data tool): a global repository for harmonised individual quantitative food consumption studies. Proc Nutr Soc 78, 484–95.CrossRefGoogle Scholar
Micha, R, Kalantarian, S, Wirojratana, P et al. (2012) Estimating the global and regional burden of suboptimal nutrition on chronic disease: methods and inputs to the analysis. Eur J Clin Nutr 66, 119–29.CrossRefGoogle ScholarPubMed
Khatibzadeh, S, Saheb Kashaf, M, Micha, R et al. (2016) A global database of food and nutrient consumption. Bull World Health Organ 94, 931–4.CrossRefGoogle ScholarPubMed
Micha, R, Khatibzadeh, S, Shi, P et al. (2015) Global, regional and national consumption of major food groups in 1990 and 2010: a systematic analysis including 266 country-specific nutrition surveys worldwide. BMJ Open 5, e008705.CrossRefGoogle ScholarPubMed
European Food Safety Authority (2018) The EFSA Comprehensive European Food Consumption Database. available at http://www.efsa.europa.eu/en/food-consumption/comprehensive-database (accessed June 2021).Google Scholar
Food and Agriculture Organization of the United Nations. Global Individual Food consumption data Tool (FAO/WHO GIFT) available at http://www.fao.org/gift-individual-food-consumption/en/ (accessed June 2021).Google Scholar
World Bank (2020) Country classification by income. available at        https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups (accessed August 2020).Google Scholar
Global Dietary Database. available at https://globaldietarydatabase.org/ (accessed December 2023).Google Scholar
European Food Safety Authority (2011) Use of the EFSA Comprehensive European Food Consumption Database in Exposure Assessment available at https://www.efsa.europa.eu/en/efsajournal/pub/2097 (accessed June 2021).Google Scholar
European Food Safety Authority (2015) The food classification and description system FoodEx 2 (revision 2). EFSA Supporting Publications 12, 804E.Google Scholar
European Food Safety Authority. The FoodEx2 Catalogue browser available at https://github.com/openefsa/catalogue-browser/wiki.Google Scholar
European Food Safety Authority, Ioannidou, S (2019) EFSA Catalogue Browser User Guide. available at https://www.efsa.europa.eu/en/supporting/pub/en-1726 (accessed February 2023).Google Scholar
European Food Safety Authority (2018) Training on foodEx2. EFSA 15, 1437E.Google Scholar
European Food Safety Authority (2018) Webinar: The FoodEx2 classification system and guidance on its harmonised use available at https://www.efsa.europa.eu/en/events/event/180926 (accessed February 2023).Google Scholar
Illner, AK, Freisling, H, Boeing, H et al. (2012) Review and evaluation of innovative technologies for measuring diet in nutritional epidemiology. Int J Epidemiol 41, 1187–203.CrossRefGoogle ScholarPubMed
Primer estudio sobre el estado nutricional y los hábitos alimentarios de la población adulta de Rosario2012–2013; Centro de Estudios sobre Nutricion Infantil Dr. Alejandro O’Donnell, Argentina.Google Scholar
Institut de Recherche pour le Développement (IRD), Institut de Recherche en Sciences de la Santé (IRSS), HarvestPlus, International Food Policy Research Institute (IFPRI) (2016) Food consumption and iron status survey in two provinces of rural Burkina Faso2010; Harvard Dataverse, V1; https://doi.org/10.7910/DVN/5CXCLX (accessed September 2018).CrossRefGoogle Scholar
HarvestPlus, International Food Policy Research Institute (IFPRI), Program in International and Community Nutrition, University of California, Davis (PICN, UCDavis), International Centre for Diarrhoeal Disease Research, Bangladesh (ICDDR, B) (2016) HarvestPlus Bangladesh Bio-fortified Rice Project: Baseline Dietary Survey2007–2008; Harvard Dataverse, V1; https://doi.org/10.7910/DVN/IHVR6L (accessed September 2018).CrossRefGoogle Scholar
Bangladesh Integrated Household Survey (BIHS) (2013) 2011–2012; In: International Food Policy Research I, ed. V4 ed: Harvard Dataverse; https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl:1902.1/21266 (accessed September 2018).Google Scholar
National Survey of Food Intake and Nutritional Status (NSFIN) (2004) National Nutrition Monitoring in Bulgaria.Google Scholar
Nutrition and nutritional status of children under (2007) 5 years in Bulgaria (NUTRICHILD).Google Scholar
Consumer Expenditure Survey (Brazilian National Dietary Survey) 2008–2009.Google Scholar
Estudo de Riscos Cardiovasculares em Adolescentes (ERICA) 2013–2014; Instituto de Estudos de Saúde Coletiva da Universidade Federal do Rio de Janeiro.Google Scholar
Women First Dietary Recall Data - Democratic Republic of the Congo (2014–2016); University of Colorado School of Medicine, Denver, CO, USA and Kinshasa School of Public Health, Kinshasa, Democratic Republic of the Congo; Sud-Ubangi, Democratic Republic of the Congo.Google Scholar
Dortmund Nutritional and Anthropometric Longitudinally Designed Study (DONALD Study) (2006–2008) University of Bonn, Nutritional Epidemiology, Germany.Google Scholar
Dortmund Nutritional and Anthropometric Longitudinally Designed Study (DONALD Study) (2015–2019) University of Bonn, Nutritional Epidemiology, Germany.Google Scholar
Encuesta Nacional de Salud y Nutricion (ENSANUT-ECU) 2011–2013; Ministerio de Salud Publica. Instituto Nacional de Estadistica y Censos. Quito, Ecuador.Google Scholar
ENKID Study: Dietary habits and nutritional status of Spanish schoolchildren and adolescents1998–2000; Fundación para la Investigación Nutricional (FIN) and University of Las Palmas de Gran Canaria (ULPGC), Spain.Google Scholar
The Baltic Nutrition and Health Survey1997; World Health Organisation.Google Scholar
The Estonian National Dietary Survey (RTU)2014; National Institute for Health Development.Google Scholar
Dietary behavior, food and nutrient intake of pregnant and non-pregnant women of Southern Ethiopia2013.Google Scholar
Ersino. G. Dietary Practices, Maternal Nutritional Status and Child Stunting: Comparative and Intervention Studies in Pulse and Non-pulse Growing Rural Communities in Ethiopia, 2013.Google Scholar
Increased Health and Wellbeing in Preschools (DAGIS) survey2015–2016; University of Helsinki, Finland and Folkhälsan Research Center, Finland.Google Scholar
Women First Dietary Recall Data - Guatemala2014–2016; University of Colorado School of Medicine, Denver, CO, USA and Institute of Nutrition in Central America and Panama (INCAP), Guatemala City, Guatemala; Chimaltenango, Guatemala.Google Scholar
Diet and Nutritional Status of Rural Population, Prevalence of Hypertension and Diabetes among Adults, and Infant and Young child feeding practices2009–2012; Indian Council of Medical Research, National Institute of Nutrition, Hyderabad.Google Scholar
Women First Dietary Recall Data - India2014–2015; University of Colorado School of Medicine, Denver, CO, USA and KLE Academy of Higher Education and Research’s Jawaharlal Nehru Medical College, Belagavi, India.Google Scholar
The Isfahan Healthy heart Program (IHHP (2007) Isfahan Cardiovascular Research Center. Isfahan, Iran: Cardiovascular Research Institute, Isfahan University of Medical Sciences.Google Scholar
MABAT First Israeli National Health and Nutrition Survey ages 25–641999–2001; Israel Center for Disease Control, Ministry of Health, State of Israel.Google Scholar
MABAT Youth First Israeli National Health and Nutrition Survey in 7th-12th grade students2003–2004; Israel Center for Disease Control, Ministry of Health, State of Israel.Google Scholar
MABAT Zahav National Health and Nutrition Survey ages 65 and over2005–2006; Israel Center for Disease Control, Ministry of Health, State of Israel.Google Scholar
The third national Food Consumption Survey (INRAN-SCAI)2005–2006; Italian Ministry of Agriculture, Italy “Council for Agricultural Research and Economics, Research Centre for Food and Nutrition” (former National Research Institute on Food and Nutrition).Google Scholar
Kenya National Micronutrient Survey (KNMS)2011; Ministry of Health, Kenya.Google Scholar
Improving Nutrition through Local Agrobiodiversity Survey (INULA study)2012–2014; Bioversity International, Nairobi, Kenya.Google Scholar
Innovative, participatory tools for dietary assessment and nutrition education in Turkana County – Diagnostic survey2016; Bioversity International, Kenya.Google Scholar
Improving access to and benefits from a wealth of diverse seeds to support on-farm biodiversity for healthy people in resilient landscapes – Baseline survey2018; Bioversity International, Kenya.Google Scholar
The Seventh Korea National Health and Nutrition Examination Survey (KNHANES VII-2) 2017; Korea Centers for Disease Control and Prevention.Google Scholar
Lao Food Consumption Survey (LFCS)2016–2017; Lao Tropical and Public Health Institute.Google Scholar
Mexican National Health and Nutrition Survey (2012) National Institute of Public Health, Mexico.Google Scholar
Estudo do Estado Nutricional e da Dieta em Raparigas Adolescentes na Zambézia (ZANE): Mozambique2010; Department of Food and Nutrition, University of Helsinki, Finland.Google Scholar
The Malaysia Lipid Study2015; Universiti Kebangsaan Malaysia and Malaysian Palm Oil Council.Google Scholar
Women First Dietary Recall Data - Pakistan2014–2016; University of Colorado School of Medicine, Denver, CO, USA and Aga Khan University, Karachi, Pakistan; Thatta, Pakistan.Google Scholar
Cebu Longitudinal Health Nutrition Survey (CLHNS (2002) Carolina Population Center University of North Carolina at Chapel Hill.Google Scholar
6th National Nutrition Survey2003; Department of Science and Technology - Food and Nutrition Research Institute.Google Scholar
Cebu Longitudinal Health Nutrition Survey (CLHNS (2005) Carolina Population Center University of North Carolina at Chapel Hill.Google Scholar
The Portuguese National Food, Nutrition, and Physical Activity Survey (IAN-AF)2015–2016; University of Porto.Google Scholar
DIETA-PILOT Survey2012; Dunarea de Jos University of Galaţi, Romania.Google Scholar
National nutrition survey in Slovakia (SK MON) (2008); Food Research Institute and Public Health Authority.Google Scholar
National dietary survey in adults in Sweden (1997–1998) Riksmaten adults; Swedish Food Agency.Google Scholar
National dietary survey in children in Sweden (2003) Riksmaten children; Swedish Food Agency.Google Scholar
National dietary survey in adults in Sweden (2010–2011) Riksmaten adults; Swedish Food Agency.Google Scholar
Nutrition and Health Survey in Taiwan: the nutrition knowledge (2005–2008) attitude and behavior of 19–64 years old adults.Google Scholar
Scale-N Nutrition Survey 2016; University of Hohenheim, Germany, Sokoine University of Agriculture, Tanzania, and ZALF, Müncheberg, Germany.Google Scholar
The HarvestPlus REU/A2Z-AED Uganda 24-Hour Recall Survey2007–2008; HarvestPlus, International Food Policy Research Institute (IFPRI); Academy for Educational Development (AED), 2017, https://doi.org/10.7910/DVN/FOYZBL, Harvard Dataverse, V1.CrossRefGoogle Scholar
National Health and Nutrition Examination Survey Data (NHANES) (2013–2014) Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS). Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention. available at https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2013 (accessed September 2018).Google Scholar
National Health and Nutrition Examination Survey Data (NHANES) (2015–2016) Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS). Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention. available at https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2015 (accessed September 2018).Google Scholar
National Health and Nutrition Examination Survey Data (NHANES) (2017–2018) Centers for Disease Control and Prevention (CDC), National Center for Health Statistics (NCHS). Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention. available at https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2017 (accessed August 2020).Google Scholar
HarvestPlus, International Food Policy Research Institute (IFPRI), Tropical Diseases Research Centre (TDRC) (2009) The 2009 Food consumption and Vitamin A status survey in Zambia.Google Scholar
Figure 0

Fig. 1 The GDD harmonisation process: steps, actions, and developed material. The Global Dietary Database aimed to harmonise dietary surveys with quantitative 24-h recall or food record data from around the world. For details on material description and use, refer to Table S3

Figure 1

Fig. 2 Screening and selection process of dietary surveys with valid 24-h recall or food record data for GDD harmonisation. All public and non-public priority surveys were contacted, with the exception of the three NHANES surveys, for which the harmonisation could be performed independently. Of the 55 surveys included for harmonisation, the harmonisation has not been completed for three of these due to time and resource restrictions

Figure 2

Table 1 Characteristics and progress status of the surveys included for harmonisation

Figure 3

Table 2 Survey and dietary data characteristics of the included 24-h recall/record dietary surveys by country income category

Figure 4

Fig. 3 Availability of harmonised dietary surveys per country. The map presents the 55 surveys, from 35 countries, that have agreed to the GDD harmonisation. Of these, the harmonisation has not been completed for three surveys (one in Brazil, one in Taiwan and one in the USA) due to time and resource restrictions

Figure 5

Fig. 4. Number of unique food items per dietary survey by country income level. The top panel shows the distribution of surveys from low-income (LIC), lower-middle income (LMIC), upper-middle income (UMIC), and high-income countries (HIC) across 4 range categories of unique food items per survey. All 55 surveys included for harmonisation were used. The bottom panel shows the absolute number of unique food items per survey grouped by country income level. Boxplots are shown, with horizontal lines representing the median value; shaded bars representing the interquartile range (IQR); whiskers representing maximum (top) and minimum (bottom) value excluding outliers, x’s representing the mean value and circles representing outliers. Only surveys whose harmonisation was complete were included (n 52 of 55)

Figure 6

Fig. 5 Availability of energy and nutrients across harmonised dietary datasets, by survey representativeness. Only surveys whose harmonisation has been completed and they report nutrient intakes are presented (n 48 of 55). The figure presents the 40 nutrients requested; datasets may contain additional nutrients (e.g. total sucrose, retinol) that are not listed here

Supplementary material: File

Karageorgou et al. supplementary material

Karageorgou et al. supplementary material
Download Karageorgou et al. supplementary material(File)
File 1.6 MB