In recent years, growing scholarly attention(Reference Cummins and Macintyre1–Reference Holsten3) has been paid to the relationships between food retail environments (e.g. access, availability and price) and obesity. Fast food is high in energy and low in micronutrient density, and fast-food consumption may contribute to the obesity epidemic(Reference Bowman and Vinyard4–Reference Davis and Carpenter6). Although research on the possible interactions between obesity and characteristics of the food environment is at an early stage and the evidence is currently too weak to draw firm conclusions(Reference Schäfer-Elinder and Jansson7), US studies suggest that the food environment influences a healthy diet and obesity levels in North America(Reference Diez-Roux, Nieto and Muntaner8–Reference Moore, Roux and Nettleton10).
The clearly observable presence of processed foods, fast-food restaurants, supermarkets and other food outlets as factors determining the rising prevalence of overweight and obesity is a topic of research that is particularly relevant in the context of developed countries. Similar to the changing food retailing landscape in the post-World War II era in Western countries(Reference Tordjman11, Reference Wrigley12), the food retailing sector in China is experiencing dramatic changes half a century later. Over the past decade, supermarkets have been spreading faster in China than anywhere else in the world(Reference Hu, Reardon and Rozelle13). The impact of the food retail environment on consumers’ dietary behaviour is a complex issue(Reference Hawkes14). Some researchers argue that supermarkets in developing countries typically provide cheap, processed and junk foodstuffs, which suggests that the growth of supermarkets is a driver of the transition towards diets high in energy and nutrient-poor foods(Reference Asfauw15, Reference Reardon, Timmer and Berdegue16). Likewise, fast food is gaining popularity among younger generations and is widely spreading over both traditional and non-traditional food outlets(Reference Creel, Sharkey and McIntosh17, Reference Dave, An and Jeffrey18). Since entering China in the 1980s, the fast-food industry in China has attained an annual growth rate of 15 %. Despite a strand of research that documents the nutritional transition in China and its accompanying obesity epidemic(Reference Dagevos, He and Zhang19–Reference Shankar22), few studies have explicitly focused on the relationships between obesity and the food environment in China(Reference Li, Dibley and Sibbritt23). The present paper investigates the relationships between consumers’ fast-food preferences, the food retail environment and overweight and obesity in China in order to contribute to the relevant international debates on the issue of interactions between environmental factors and overweight/obesity.
The purpose of the study is threefold: (i) to identify segments of consumers on the basis of their fast-food preferences; (ii) to discover segments of the food retail environment and examine whether they are influencing consumers’ preference patterns; and (iii) to examine whether fast-food preference patterns affect consumers’ overweight and obesity as measured by BMI. At the end, consumers’ demographic characteristics, including their nutritional knowledge, are used to profile the consumer patterns.
Methods and procedures
Sample
We utilized data from the China Health and Nutrition Survey (CHNS)(Reference Popkin, Du and Zhai24), which covers a sample of nine provinces in China. This Sino–US longitudinal survey has been widely used in China's dietary studies(Reference Dagevos, He and Zhang19, Reference Bell, Ge and Popkin25, Reference Shankar26). Since the first round of the survey in 1989, the questionnaire has been administered seven times, and approximately 4400 households (with a total of 19 000 individuals) have participated. Further detailed information about this survey can be found at http://www.cpc.unc.edu/projects/china.
In the present study we used data from two questionnaires: the 2006 Adult Questionnaire and the 2006 Community Questionnaire. The 2006 Adult Questionnaire has been administered to a total sample of 9788 adults (age 18 years and older). The response rate in 2006 was 88 %, among the highest when compared with all other years. A total of 218 communities participated in the 2006 Community Questionnaire. Most community questions were directed to community heads and community accountants since they are the most informed resource people, while some were directed to local food safety inspectors.
Measurements
BMI measurement
BMI was calculated by dividing weight in kilograms by the square of height in metres. The measures of weight and height can be found in Dearth-Wesley et al.(Reference Dearth-Wesley, Wang and Popkin27). We used the WHO-defined cut-off points to classify participants into four categories: underweight (< 18·5 kg/m2), normal weight (18·5 to 24·9 kg/m2), overweight (25·0 to 29·9 kg/m2) and obesity (≥ 30·0 kg/m2).
Fast-food preferences
Food preferences were measured in the 2006 Adult Questionnaire by asking ‘How much do you like this food?’ in relation to five different food categories: (i) fast food (Kentucky Fried Chicken, pizza, hamburgers, etc.); (ii) salty snack foods (potato chips, pretzels, French fries, etc.); (iii) fruits; (iv) vegetables; and (v) soft drinks and sugared fruit drinks. Respondents were asked to choose one of the six answers: 1 = ‘dislike very much’; 2 = ‘dislike’; 3 = ‘neutral’; 4 = ‘like’; 5 = ‘like very much’; 9 = ‘do not eat this’. To simplify the data, we collapsed the six preference categories into four categories by merging 1 with 2 and 4 with 5. As such, four categories remained: 1 = ‘dislike’; 2 = ‘neutral’; 3 = ‘like’; 9 = ‘do not eat this’.
Food retail environment
We utilized the number of food retailing businesses and the penetration of supermarkets in each community as indicators of the food retail environment. The question ‘How many of these businesses are currently operating in this village/neighbourhood?’ was directed to the community heads and community accountants in the 2006 Community Questionnaire. A total of ten types of food businesses were listed and presented. We took the sum of the ten types of food businesses in the community as an indicator of the density of the food business operation in the communitiesFootnote *. In addition, three questions were used to measure the penetration of supermarkets or hypermarkets in each community, namely the number of supermarkets within a 30 min bus ride from the community, the number of the floors as well as the total areas (square metres) of each supermarket.
Dietary knowledge
Consumers’ dietary knowledge has been found to be an important predictor of healthy eating(Reference Wardle, Parmenter and Waller28, Reference White, Bunting and Raybould29). In the present study we investigated whether consumers’ dietary knowledge is associated with their food preference patterns. The respondents were asked to provide their judgements on the health effects of twelve dietary knowledge statements. These statements were measured on a 5-point Likert scale from ‘strongly disagree’ to ‘strongly agree’.
Other factors
Other factors used in the analysis include gender, age, region, education level and social status. Unfortunately, 88 % of the respondents refused to reveal their income. Therefore, income was left out from the analysis.
We used primary occupation and the means of transportation to and from work or school as proxies for social status. The primary occupations were originally recorded in twelve categories. We merged them into three categories to illustrate status: manual worker (farmers, fisherman, etc.), service occupation (housekeeper, waiter, etc.) and professional work (doctor, midwife, etc.).
Analysis
Although an emerging field of research focuses on relationships between the food retail environment and individual health status and dietary intake, only a small number of studies have explicitly examined the multilevel structure of the data. These studies(Reference Diez-Roux, Nieto and Muntaner8, Reference Diez-Roux, Nieto and Caulfield30, Reference Yen and Kaplan31) applied hierarchical linear modelling or logistic regression. In the current study, we take a latent class (LC) approach and simultaneously identify consumer segments at both the individual level and the food retail environmental level. Such models yield powerful methodological improvements over traditional approaches by recognizing existing market segments and thus providing insights into policy interventions on the basis of segmentation results.
The conceptual relationships of the modelling are presented in Fig. 1. First, we applied an LC cluster model(Reference Magidson and Vermunt32) on consumers’ fast-food preferences. The goal of LC clustering is the same as other clustering techniques, i.e. to identify homogeneous groups. However, the LC cluster model has advantages over other, more traditional types of clustering techniques, such as the model selection criteria and probability-based classification(Reference Vermunt and Magidson33). In their comparative study, Magidson and Vermunt(Reference Magidson and Vermunt34) clearly demonstrate that LC clustering, which uses maximum likelihood as a criterion, is superior to traditional K-means clustering. Second, we included BMI as a concomitant variable in the same model to determine whether it is related to the fast-food preference segments.
Because our data have a multilevel character, with consumers within food retail environment in communities, we finally add a multilevel LC (MLLC) cluster model for the retail environmental data in addition to the standard LC cluster model. The MLLC cluster model takes a non-parametric random-coefficient approach, which allows the heterogeneity to be taken into account not only at the respondent level, but also at the community level(Reference Vermunt35).
We used the computer program Latent GOLD 4·5 (Statistical Innovation Inc., Belmont, MA, USA) to fit both the simple LC cluster model and the MLLC cluster model. Furthermore, we applied principal component analysis for the twelve statements of dietary knowledge to identify the underlying factors. The resulting factor scores were used to profile the identified consumer clusters. The principal component analysis was performed using the SPSS statistical software package version 17·0 (SPSS Inc., Chicago, IL, USA).
Results
Summary statistics for the sample characteristics are presented in Table 1. The sample consisted of 9788 respondents, with an average age of 48·84 years. The mean BMI was 23·42 kg/m2. Based on the WHO definition, approximately 23·2 % of the sample was overweight and the prevalence of obesity was 4·9 %.
*Because of sometimes large numbers of missing data, the valid cases for each characteristic are presented.
†The four transport methods do not sum to 100 % since participants were asked to answer ‘Yes’ or ‘No’ separately. The numbers shown in the table are ‘Yes’ answers.
Examining the food preference data, we found a high percentage of answers in the category ‘like’ for both fruits and vegetables (above 80 %), which implies that the preferences for these two food categories hardly differ between individual consumers. Therefore, we deleted the two food categories fruit and vegetables and included only the three fast-food items in the analysis.
A four-cluster solution was selected from our LC cluster analysis to identify segments with different fast-food preference patterns, since it had the lowest Bayesian information criterion (BIC; 56 086·94) compared with solutions with one, two, three and five clusters (see Appendix). Table 2 contains segment-specific proportions for the different fast-food preference categories in each of the four identified segments.
* Bold font indicates the highest proportion per food category in each cluster.
The first row of numbers in Table 2 shows the size of each segment, i.e. the prior probabilities. The cluster size shows that the first two clusters are dominant and account for 80 % of the total sample. The last two clusters are rather small and make up the remainder (20 %) of the sample. When examining the segment-specific proportions, consumers in Cluster 1 had an extremely high chance (e.g. 98·89 % for fast food) of not eating products from any of the three food product categories. Thus, we labelled Cluster 1 as ‘Don't eat’ fast food. In Cluster 2, the category with the highest probability was ‘Dislike’ for all three products. Therefore, Cluster 2 was labelled as ‘Dislike’ fast food. Cluster 3 was called ‘Neutral’ since consumers in this cluster held a neutral position towards fast food, snack food and soft drinks. The last cluster was coined as ‘Like all’ since the highest probability for all three product categories was the indicator ‘Like’. We also found that the segmentation was more dominated by the first two food categories, ‘Fast food’ and ‘Snack food’, and less dominated by the food product ‘Soft drinks’, since the latter's segment-specific proportions were more modest.
The results after including BMI as a concomitant variable are presented in Table 3, which provides segment-specific proportions for each BMI category. The model with BMI as a concomitant variable performs much better in terms of BIC (52 684·50) than the model without BMI. The model improvement is highly significant (P < 0·001). The segment-specific proportions indicate the chances of each BMI category given one's cluster membership. By combining the probabilities of overweight and obesity for each cluster, we obtained the overweight/obesity rates of 29 %, 30 %, 20 % and 18 % for the four clusters, respectively. The results seem contrary to the common wisdom of the interaction of fast food and the rise of overweight/obesity. Why do those who ‘Dislike’ (Cluster 2) and ‘Don't eat’ (Cluster 1) fast food, snack food and soft drinks have the highest overweight/obesity rate, while those who ‘Like’ (Cluster 4) and are ‘Neutral’ (Cluster 3) have the lowest? Bearing this question in mind, we searched for an answer from other perspectives such as the food retail environment and sociodemographics.
We began the MLLC cluster model with a one-group class model (assuming one segment at the community level), and then continued with two group classes and three group classes. The model with two group classes produced the lowest BIC (see Appendix), which was also lower than the BIC for the two previous models (Tables 2 and 3). Therefore, we identified an MLLC model with four consumer clusters embedded in two food retail environments.
The food retail environment characteristics of the two identified group classes can be further profiled (Table 4). Due to the count characteristics of the data, we tested the differences assuming a Poisson distribution. The last column provides the significance of the likelihood ratio χ 2. The number of food businesses in Group Class 1 communities is significantly higher (with an average of sixty-five businesses) than in Group Class 2 communities, which only have about half as many food businesses. In addition, the average number of supermarkets or hypermarkets within a 30 min bus ride from Group Class 1 communities was nearly four, while that in Group Class 2 communities was approximately 1·5, indicating a significantly different degree of supermarket penetration in the two groups of communities. There was also a significant difference for the average floor area of the supermarket/hypermarket in which most residents in the community shop. The only non-significant measure is the average number of floors of the supermarket/hypermarket in which most residents shop. On the basis of these results, we named Group Class 1 the ‘Saturated food retail environment’ and Group Class 2 the ‘Deprived food retail environment’.
Table 5 shows the results from the MLLC model, which contains the size of each group class and the probability of being in each cluster given one's communities group class. The estimated individual-level cluster probabilities of the food category preferences showed only small changes compared with the two previously presented models and are therefore not shown. Thus, these individual-level clusters are basically the same as the previously identified individual-level clusters.
* Bold font indicates the highest proportion per food retail environment in each cluster.
The multilevel model identified two group classes at the community's food environment level with a size of 58 % and 42 % each. The conditional probabilities revealed that the ‘Saturated food retail environment’ consisted of 25 % of members of Cluster 1 (‘Don't eat’), 48 % of Cluster 2 (‘Dislike’), 15 % of Cluster 3 (‘Neutral’) and 12 % of Cluster 4 (‘Like all’). Cluster 1 (‘Don't Eat’) makes up 74 % of the ‘Deprived food retail environment’, while Cluster 2 (‘Dislike’) makes up approximately half of the ‘Saturated food retail environment’. Cluster 3 (‘Neutral’) and Cluster 4 (‘Like all’) have the highest chances of belonging to the ‘Saturated food retail environment’, although together they make up only about a quarter of it.
Having done the modelling work, we wished to start profiling consumers’ preference patterns by their nutritional knowledge. We aimed to determine whether consumers’ dietary knowledge can explain differences in overweight and obesity. Principal component analysis for the twelve dietary knowledge statements results in a three-factor solution (Kaiser–Meyer–Olkin index of sampling adequacy = 0·934, total variance explained = 61·86 %).
The Varimax-rotated factor loadings are presented in Table 6. Items 4, 2, 3 and 6 loaded onto Factor 1. This factor therefore appears to reflect a type of health and dietary belief that favours high consumption of animal fats along with higher sugar and fats content in general. Thus, this factor was named as ‘Pro fat and sugar diet’. Factor 2 was labelled as ‘Pro bean and dairy diet’ since items 9 and 8 loaded onto this factor. The final factor, Factor 3, was not in favour of sweaty sports but desired a heavy body. It reflected an outlook with no interest in athletics but favoured a full figure Therefore we chose the name ‘Full-figure diet’. The mean factor scores for each cluster as well as the means and percentages of the sociodemographic characteristics for each cluster are presented in Table 7.
* Bold font indicates the higher factor loadings for each factor.
*Of the χ 2 value for testing independence in cross-tabulations for categorical characteristics and of the F value for testing equality of means in ANOVA for metric characteristics.
Table 7 shows that members in the ‘Don't eat’ cluster were likely to be the oldest (52 years old) among all four groups, with most of them living in the countryside. Compared with the other three clusters, they attained the lowest educational degree and their occupations were primarily manual work. They walked to and from work. Their dietary knowledge was unique. They considered that a diet containing high fat, more sugar and many animal products was good for their health, while intense physical activities were not good for their health. They perceived a heavy body as a healthy image. Members in this cluster lived in the second group of communities, ‘Deprived food retail environment’, where the number of food business operations and supermarkets was lower. We may conclude that the non-availability of fast food and snack food was the main reason for their ‘Don't eat’ behaviour.
Members in Cluster 2, ‘Don't like’, tended to be middle-aged men living in the urban areas. They attained their middle-level school degree and performed manual work in the urban areas. Approximately half of them walked to work, but some utilized the bus and motorcycles. Their perception of dietary knowledge was opposite to that of Cluster 1. They disagreed that sweaty sports are not good for one's health. They did not perceive that a heavy body is healthy, although they were quite heavy themselves. They did not consider fat and sugar, beans and dairy products as good for one's health. They seemed to dislike everything despite living in the saturated food business community.
Members in Cluster 3, ‘Neutral’, were typically middle-aged women, primarily living in the urban areas. They received relatively higher education or technical degrees and engaged in service occupations. More than one quarter of them took a bus or a car to work. Although they lived in the ‘Saturated food retail environment’, they managed to obtain a relatively lower overweight and obesity rate.
Consumers in Cluster 4, ‘Like all’, had the lowest overweight and obesity rates. They represent the youngest urban women and had a relatively higher level of education. They lived in the ‘Saturated food retail environment’ and were employed in occupations such as service worker, office staff, as well as journalist professional workers. They had the highest percentages of taking a vehicle to work.
Discussion
Using an MLLC model, we have simultaneously identified four segments of consumers, namely ‘Don't eat’, ‘Dislike’, ‘Neutral’ and ‘Like all’, on the basis of fast food, snack food and soft drinks preferences. These four segments were embedded in two different food retail environments: saturated and deprived. Furthermore, we found that the two segments of consumers with high overweight and obesity rates were not unique in terms of fast food or snack preferences, but segmentation criteria were found in dietary knowledge, food retail environment and sociodemographics.
Our research reveals that the higher overweight and obesity rates from the ‘Don't eat’ segment were not related to fast food or snack food preferences since members in this segment did not have access to these products given the retail environment in which they resided. The main reasons for their heavy bodies may be explained by their dietary knowledge, which encourages them to eat more and become heavier. When considering the Chinese history in food security, this group suffered severely during the Great Chinese Famine in 1959–1961. Wang et al.(Reference Wang, Wang and Kong36) also found an association between early nutritional status during the famine and the risk of overweight and obesity in Chongqing females in China. This disaster occurred mainly in the Chinese countryside and cost approximately 30 million Chinese lives. After such a catastrophe and following decades of chronic food shortage in the Chinese countryside, this group has a strong desire to devour more with the abundant supply of food today. Another point of worry with respect to this generation is that they have now entered into the role of grandparents. Grandparents provide the main childcare for the third generation in most of the countryside. Their ideas and thoughts may influence their grandchildren, which might create a new generation with limited dietary knowledge, accompanied by risks with regard to gaining weight and reaching BMI above 25 kg/m2. For this group, their relatively low levels of nutritional knowledge about healthy food and food patterns is a more serious issue at present than the ‘Westernization’ of the food consumption environment in today's China. Public health for this segment is better served by paying attention to improve consumers’ nutritional knowledge than to direct the attention to the food environment.
The other three segments all scored lower on the three factors of dietary knowledge, an indication of the opposite beliefs in comparison to Cluster 1. In particular, members in Cluster 2, ‘Dislike’, were strongly against the idea that intense physical exercise and a skinny body are unhealthy. This can be explained by a relatively large percentage of urban members in this cluster, who may have the desire of visiting a fitness centre and body building. The main reason for the high overweight and obesity rates in Cluster 2 may be explained by the relatively lower social status in the cities and the food retail environment. Consumers in Cluster 2 may perceive Western fast food as luxurious and expensive and thus ‘Dislike’ it because of its unaffordability. Alternatively, they purchase other local food products to satisfy their appetite since they live in a saturated food retail environment, where consumers have abundant access to both supermarkets and other food stores. The lower BMI in Cluster 3 (‘Neutral’) and Cluster 4 (‘Like all’) may be explained by their relatively higher social status as well as their proper dietary knowledge toward less sugar/fat and more physical exercise.
The research results generate important implications for policy interventions on preventing and reducing obesity in China. However, the target for the two higher BMI groups is quite different. For the ‘Don't eat’ segment, nutritional education and correct knowledge on a proper diet are urgently needed. Communication methods must be carefully designed since the individuals in this group are often scattered throughout the countryside and have the least education. Considering their nurturing roles for the third generation, schools should play an active role in providing students with information on healthy diets and preventing childhood obesity.
For the ‘Don't like’ segment, it is not useful to simply blame the saturated food environment. The rapid development of the retail industry, particularly supermarket development, is part of the agricultural structural transformation(Reference Timmer37). The central issue that challenges policy makers is how to direct the food retail industry in such ways that it becomes part of the solution rather than part of the problem generated by an affluent diet and urban lifestyles.
With respect to the ‘Neutral’ and ‘Like all’ clusters, the interaction with the food environment is largely counter-intuitive. In general, the present study shows that unlike the fast-changing ‘Westernization’ of China, it is too simplistic to study the Chinese obesity epidemic from a ‘Western’ perspective. As a consequence, it cannot be concluded that major changes in Westernized community food environments in many parts of today's China are causing higher levels of overweight and obesity at the consumer level. This underscores rather than diminishes the importance of and urgency for future research that focuses on understanding overweight and obesity among contemporary (Chinese) consumers in the context of (physical, social, informational and political) environmental factors, the so-called ‘ecological approach’(Reference Nayga38).
The methodological contribution of the present paper is that the MLLC cluster model succeeded in disentangling community-level effects from individual consumer-level effects. The identification of two classes at the community level indicated that individual consumers were not independent but nested in two different food retail environments.
We are aware that CHNS does not contain a nationally representative sample. Major cities such as Beijing and Shanghai were not included in the survey regions. Although these surveyed regions are experiencing the transitions of the retailing environment, the sample may still limit us from capturing the progressive nature of the most rapidly changing regions in China, where fast-food consumption is more prevalent. In addition, we used the number of food operating businesses as an indicator of the food retail environment in each community and ignored their type of operation and the types of food that they provide. Future obesity research should pay more attention to these front-running regions and to specific aspects of the changing food environments.
Acknowledgements
This study was supported by the Wageningen University and Research Centre strategic research programme ‘Plant and Animal for Human Health’, which is funded by the Dutch Ministry of Agriculture, Nature and Food Quality. No conflict of interest was declared. X.Z. designed the study, analysed the data and drafted the manuscript. I.v.d.L. advised on the methodology and edited the manuscript. H.D. commented on and edited the manuscript.
Appendix 1
Latent class (LC) modelresults for fast-food preference*
LL, log likelihood; BIC, Bayesian information criterion.
*The value in bold denotes the lowest BIC value among alternative models.
Appendix 2
Multilevel latentclass (MLLC) model results*
LL, log likelihood; BIC, Bayesian information criterion.
*The value in bold denotes the lowest BIC value among alternative models.