Nutrient profiling (NP) is the classifying of foods according to their nutritional composition, for reasons related to promoting health. NP is therefore heavily dependent on the completeness of food composition tables. Potential applications of NP include food reformulation and the Nestlé Nutritional Profiling System (NNPS) has been designed specifically for that purpose(Reference Vlassopoulos1).
Here, we assessed whether national food composition databases- the UK McCance & Widdowson, Chinese and Brazilian Food Composition databases and the US Food and Nutrient Database for Dietary Studies (FNDDS) - have the data completeness required to validate NP systems designed to guide food reformulation (for Energy, Total Fat, Saturated Fatty Acids, Sodium, Total Sugars and Added Sugars content). A sensitivity analysis was performed to assess the impact of using incomplete databases to validate NP systems using the NNPS as a case study (impact on the NNPS outcome Pass/Fail) (1). Three data handling scenarios were modelled – 1. exclusion of incomplete food items, 2. imputation of nutrient content “high” or 3. imputation of nutrient content “low” (for McCance & Widdowson only).
Granularity was assessed on the absolute number of food items amenable to reformulation available in each database. The FNDDS was the most detailed database, followed by the McCance & Widdowson (Table 1). In most databases, one category represented the majority of food items with the remaining categories unequally populated. Data completeness was high for Energy, Total Fat and Sodium. SFA completeness was low in the Chinese database and Total/Added Sugars completeness was critical in the Brazilian and Chinese databases (Table 1). Data incompleteness was linked to high likelihood of introducing bias as the same approach (assume nutrient content high in case of missing value) can have opposing impact on the NP outcome (imputation scenario led to 3-fold increase in NNPS pass: NNPS fail ratio for “center of plate” foods vs 3-fold decrease in NNPS Pass: NNPS fail ratio for foods in the “Cakes, Cookies & Desserts” category compared to the scenario rejecting foods with missing values)
Database quality based on granularity and data completeness for nutrient content is variable. The low level of information and detail available for foods belonging to less represented categories (i.e. savoury snacks) renders the databases a blunt tool to effectively segment the food supply and derive data-driven targets for reformulation and food policies.
* Added sugar value calculated from total sugar, fructose, lactose, galactose, maltose values
Data completeness is very poor in some cases and even in more complete databases, missing nutrient information can be linked to the introduction of heterogeneous biases which can impact the performance of NP models.