A epidemiological transition has occurred in Korea, characterised by rapid increases in the incidence and mortality of chronic disease(Reference Kim, Moon and Popkin1). In order to understand the association between diet and chronic disease during this transitional period, there has been an increased demand for both prospective and retrospective epidemiological studies in Korea(Reference Yoo, Shin, Chang, Choi, Hong, Kim, Kang, Cho, Shin and Jin2). An important issue in conducting such studies is establishing valid dietary assessment tools for the estimation of average long-term intake of food and nutrients in individuals rather than for precise measurements of absolute dietary intake(Reference Andersen, Veierod, Johansson, Sakhi, Solvoll and Drevon3–Reference Willett5). A number of questionnaires, mainly food-based FFQ (FFFQ), have been widely employed in epidemiological studies in both Western countries(Reference Block, Woods, Potosky and Clifford6–Reference Willett, Sampson, Stampfer, Rosner, Bain, Witschi, Hennekens and Speizer8) and Asian countries(Reference Tsubono, Takamori, Kobayashi, Takahashi, Iwase, Iitoi, Akabane, Yamaguchi and Tsugane9, Reference Date, Fukui and Yamamoto10), including Korea(Reference Kim, Lee and Ahn11, Reference Ahn, Lee, Paik, Lee, Jo and Kim12). However, the dietary practices of Koreans are quite different from those of Europeans and Americans, which are typically based on the consumption of a single food. Koreans eat many kinds of mixed dishes with various seasonings and cooking oils; thus, a FFFQ is not sufficient to evaluate the effects of the seasonings and cooking oils in the Korean diet. The use of a FFFQ could underestimate the proportion of certain micronutrients, including antioxidant vitamins, fatty acids and phyto-oestrogens. Therefore, the development of a dish-based FFQ (DFFQ) could account for the different dietary practices of Koreans.
The purpose of the present study was to test the usefulness of ninety-five dish items selected in developing a DFFQ for the epidemiological study of Koreans. The 2001 Korean National Health and Nutrition Examination Survey (KNHANES) data were used as representative data from the Korean population.
Materials and methods
Data source
The dietary data of 6817 subjects aged 20 years and older, from the 2001 KNHANES, were used in the analysis. Detailed information about the 2001 KNHANES is described elsewhere(13, Reference Kim, Lee and Kim14). In brief, the nationwide KNHANES is conducted every 3 years, and information is collected from a stratified multistage probability sample of South Korean households representing the civilian, non-institutionalised population. The KNHANES is comprised of four parts: the Health Interview Survey, the Health Behavior Survey, the Health Examination Survey, and the Nutrition Survey. There were 246 097 primary sampling units, with sixty households in each unit. Sampling units were randomly selected from the primary sampling units using a stratified multistage probability method. A total of 600 sampling units encompassing 13 200 households were selected. A total of 37 769 subjects (aged 1 year and older) from the 13 200 households were selected and participated in the Health Interview Survey. The remaining three surveys were conducted on randomly selected 200 sampling units encompassing 4015 households from the original 600 sampling units. All household members participating in the remaining three surveys were examined to determine health status, and were interviewed for the dietary survey using a 24 h dietary recall (9968 subjects).
The 24 h dietary recall questionnaire requested detailed descriptions of the type and amount of all foods consumed on the day before the interview, including snacks, beverages, and ingredient information for all sauces and condiments. Additionally, information regarding the time and the place of each meal was recorded. Subjects were also asked to assess whether or not the time period in question represented typical dietary intake. If it was not a typical day (for example, birthday party, family gathering, eating at a restaurant), the subjects were asked to report 2 d before the interview date, or the most recent typical day. The 24 h dietary recall began with the first meal or beverage consumed at waking until midnight of the reporting day. Trained dietitians administered the 24 h dietary recall using measuring cups, portion-size booklets and photographs. All completed records were checked by a research dietitian for accuracy. Nutrient intake of each food item was calculated using the Diet Analysis program (version 4.0; Nutritional Epidemiology Laboratory, National Cancer Center) for a total of seventeen nutrients (total energy, protein, fat, carbohydrate, fibre, Ca, P, Fe, Na, K, vitamin A, retinol, carotene, vitamin B1, vitamin B2, niacin and vitamin C). Nutrient intake was calculated based on the 5th Edition Food Composition Table(15) and a database of processed and fast foods(16). Individuals aged less than 20 years (2990 subjects) and individuals with extreme total energy intake (defined as 2·93–25·1 MJ/d for males and 2·51–16·7 MJ/d for females) (forty males and 121 females) were excluded, resulting in 6817 study participants for the analysis.
Initial selection of the ninety-five dish items for developing the dish-based food-frequency questionnaire
The selection of the dish items was implemented using Willett's method(Reference Willett5), with the 24 h dietary recall data of the 2001 KNHANES(13), and is described by flow chart in Fig. 1. For the initial selection of foods items contributing to total nutrient intake, the contribution analysis was performed. The percentage contribution of each nutrient by the 629 dishes was computed as the arithmetic mean of the individual percentage contribution of each nutrient by dish. The 411 dish items that were selected accounted for 90 % of the cumulative percentage contribution (cPC). Multiple regression analysis (MRA) was performed with the total intake of each nutrient as the dependent variable and the overall amounts of each nutrient from each of the 411 items as the independent variables for each participant. This estimated the degree of explanation of between-person variation. The items were selected until the cumulative multiple regression coefficients (cumulative partial R 2) reached 0·90. Finally, similar dishes were aggregated into groups based on the nutrient content per portion eaten, the cooking method and the food ingredients in the dish; they were coordinated when a name, a main food ingredient, or more than 90 % of recipe ingredients for a dish were similar to another dish. Several seasonal food items, for example, seasonal fruits (strawberry, watermelon, etc.) and celebration foods, were not covered by the survey period of the 2001 KNHANES, but were included in the DFFQ. The composition values of the dishes were derived from standard recipes based on a publication by the Korean Ministry of Health and Welfare(16).
Statistical approach
The comparison of general characteristics between males and females was performed using the t test for continuous variables and the χ2 test for categorical variables. Stepwise regression models were used to determine the list of dishes to be included in the DFFQ, taking into consideration the between-person variability added by each item(Reference Willett5). The data processing was carried out using SAS (version 8.0; SAS Institute Inc., Cary, NC, USA) statistical software.
Results
Characteristics of study subjects
The characteristics of the study subjects (3158 men and 3659 women) are shown in Table 1. The mean age of the male subjects was 43·7 years and that of the female subjects was 44·9 years. The residential areas of the subjects were categorised as metropolitan city (46·6 %), middle or small city (30·7 %) and rural (22·7 %). Among the study subjects, 37·7 % of the men and 23·5 % of the women had education levels of more than 12 years. The BMI for the men and women were 23·7 and 23·4 kg/m2, respectively.
R.E., retinol equivalents.
* P values for comparison between sex by t test.
Selected ninety-five dish items for developing the dish-based food-frequency questionnaire
The initial analysis yielded a total of 993 dish items consumed by the 6817 participants. Dishes with very low frequencies of consumption were excluded (n 364) and dishes with minimal contribution to the dietary acquisition of the aforementioned seventeen nutrients (n 218) were excluded. This left a final count of 411 dishes, which accounted for 90 % of the cPC. The next stage of selection among the 411 items was carried out based on the degree of explanation for between-person variation. Therefore, 359 dish items were retained. Finally, after grouping and aggregation of 359 dishes, a total of ninety-five dish items were selected for the DFFQ (Fig. 1). The publication categories(16) of the ninety-five dish items included (Table 2) in the DFFQ were rice and cereals (seven items), noodles (three items), breads (seven items), soups or stews (eleven items), fish and shellfish (ten items), meats (ten items), eggs (two items), pulses (three items), vegetables (thirteen items), kimchies (six items), milk and milk products (two items), fruits (six items), beverages (nine items) and snacks (six items).
MRC, multiple regression coefficient; PC, percentage contribution.
* Ranking order for the top twenty dishes.
Intake frequency and portion size
The food intake frequency in the semi-quantitative FFQ was classified into nine categories: almost never, once per month, two or three times per month, once or twice times per week, three or four times per week, five or six times per week, once per d, twice per d, and three times per d. In short, the standard portion size of each dish item per meal was determined using the mean amount, the typical or standard value, or the natural unit and also referred to the Korean Ministry of Health and Welfare portion-size booklet(16). Portion size in the semi-quantitative FFQ was divided into three categories: small (half the medium portion), medium and large (1·5 times or greater than the medium portion). The medium intake was determined as the mean amount from the study subjects. For an example of the semi-quantitative FFQ, see the Appendix.
Usefulness of a diet-based food-frequency questionnaire
We wished to quantify the usefulness of the ninety-five selected dishes and the top twenty items in explaining between-person variations in nutrient consumption for energy and nutrients as estimated by cumulative multiple regression coefficient (R 2) and the coverage of total nutrient consumption as estimated by cPC (%) (Table 2).
The top twenty dish items accounted for 77, 70, 91 and 82 % of the between-person variation for total energy, protein, carbohydrate and fat intake, respectively. Among the twenty items, rice, the main staple for Korean people, was shown to account for 32 and 52 % of the between-person variation for total energy and carbohydrate intake, respectively. Rice also covered 45·3 and 59·7 % of the total intake of energy and carbohydrate, respectively. Roasted pork, spicy pan-fried pork and roasted beef accounted for 45 % of the between-person variation of fat intake.
A relatively limited number of dish items can explain the > 90 % of the between-person variation of vitamins, compared with those of macronutrients and minerals. Only one item accounted for 96 and 79 % of the between-person variation for retinol (grilled fatty fish) and vitamin C (orange-coloured fruit) intake. Furthermore, the top twelve items reached almost 100 % of the between-person variation for retinol, and between seven and sixteen items reached 90 % of the between-person variation for vitamin C and carotene, respectively. Grilled fatty fish accounted for 96 % of the between-person variation for retinol. The analysis suggested that the top twelve dish items were sufficient enough to cover 100 % of the between-person variation.
The top twenty dish items accounted for 85, 67, 81, 83 and 71 % of the between-person variations for total intakes of Ca, P, Fe, Na and K, respectively. Soyabean paste soup was the top contributor in terms of Ca and Fe intake, and the major contributors to Na intake were Korean cabbage kimchi, radish kimchi and soyabean paste stew (Table 2).
The number of dish items required to meet the 90 % of consumed foods as well as the between-person variation of the seventeen nutrients are listed in Table 3. The number of items that covered 90 % of the total intake of each nutrient, as estimated by cPC, varied from thirty-three for vitamin C to seventy-two for retinol. The number of items that accounted for 90 % of the between-person variability also varied from one for retinol to forty-one for K.
Table 4 summarises the values of the ninety-five selected dish items in terms of coverage of total nutrient intake and between-person variation. The ninety-five selected items covered an average of 92·3 % of the intake of the seventeen nutrients by the study subjects, ranging from 88·6 % for retinol to 95·4 % for vitamin C. The ninety-five items could also explain 99·9 % of the between-person variation for the consumption of the seventeen nutrients in the study subjects.
Discussion
The major functions of a rational FFQ are whether it is a useful tool for measuring average long-term dietary intake and for ranking the typical nutrient intakes of individuals. It does not precisely estimate the absolute consumption of nutrients for an entire study population. The use of a food item must vary from person to person for food discrimination according to dietary intake. A more important issue in the design of FFQ is the selection of food items with high between-person variations identified through multiple regression analyses for each nutrient compared with the total nutrient intake in order to identify food items with the most discrimination. Recently, many dietary assessment questionnaires(Reference Ahn, Lee, Paik, Lee, Jo and Kim12, Reference Kim, Kim, Ahn, Paik, Ahn, Tokudome, Hamajima, Inoue and Tajima17, Reference Lee, Lee, Ha, Kye, Kim, Lee and Yoon18) have been developed and several among them were validated for use in Korea(Reference Kim, Lee and Ahn11, Reference Ahn, Kwon, Shim, Park, Joo, Kimm, Park and Kim19). Nevertheless, most existing FFFQ(Reference Ahn, Lee, Paik, Lee, Jo and Kim12) with a few exceptions(Reference Kim, Lee and Ahn11) lacked the appropriate analyses for food discrimination. In compiling a food list of FFQ, the food consumed frequently by Koreans and the foods with substantial nutrient content were selected based on the data of the Report on the National Nutrition Survey(20). However, selection of foods was not based on between-person variations.
A DFFQ could have advantages over the several FFFQ developed and validated in Korea(Reference Kim, Lee and Ahn11, Reference Ahn, Lee, Paik, Lee, Jo and Kim12, Reference Kim, Kim, Ahn, Paik, Ahn, Tokudome, Hamajima, Inoue and Tajima17, Reference Lee, Lee, Ha, Kye, Kim, Lee and Yoon18), as it could include ingredients such as seasonings, spices and cooking oils. This would allow for a more accurate calculation of nutrient intake, especially for micronutrients such as antioxidant vitamins, phytochemicals and fatty acids. The relatively low coverage rates of FFFQ were most likely secondary to a food list that did not include seasonings and oils, which are among the main sources of fat, Na and β-carotene in the Korean diet(Reference Ahn, Lee, Paik, Lee, Jo and Kim12, Reference Shim, Paik, Moon and Kim21). The estimated coverage of the intake of seventeen nutrients selected by this DFFQ was much higher (92·3 % overall, ranging from 88·6 to 95·4 %) than those of FFFQ(Reference Tsubono, Takamori, Kobayashi, Takahashi, Iwase, Iitoi, Akabane, Yamaguchi and Tsugane9, Reference Ahn, Lee, Paik, Lee, Jo and Kim12). Additionally, our DFFQ was developed on the basis of national representative data (KNHANES) with a substantially larger sample size (6817 subjects) as compared with previous developed FFQ in Korea(Reference Kim, Lee and Ahn11, Reference Kim, Kim, Ahn, Paik, Ahn, Tokudome, Hamajima, Inoue and Tajima17, Reference Lee, Lee, Ha, Kye, Kim, Lee and Yoon18) and other countries(Reference Pellegrini, Salvatore and Valtuena4, Reference Tsubono, Takamori, Kobayashi, Takahashi, Iwase, Iitoi, Akabane, Yamaguchi and Tsugane9, Reference Date, Fukui and Yamamoto10, Reference Keating, Bogen and Chan22).
The main reduction from the initial 993 dishes compiled in the 24 h recall to the ninety-five dishes in the FFQ resulted from aggregating 359 items to ninety-five usable dishes in the last step. Therefore, there was a minimal loss of information, but all the nutrients provided by 359 foods and dishes remained in the calculation. The regression analysis resulted in the greatest reduction in the number of items (87 % reduction to 359 items) as compared with the contribution analysis (65 % reduction to 411 items). The number of dish items that covered 90 % of the total nutrients consumed by the study subjects varied from thirty-three items for vitamin C to seventy-two items for retinol. To test the function of ranking individuals using the DFFQ, cumulative regression coefficients were estimated. As a result, the top twenty dish items among the ninety-five items accounted for 67 to 100 % of the between-person variation of the main seventeen nutrients consumed by the entire study subject population. Each nutrient composition defined the number of dish items required to meet an underestimation factor of less than 10 %. Some micronutrients, including retinol and vitamin C, had a limited number of dish items accounting for more than 90 % of the between-person variation in intake, and were assessed relatively easily via a small number of items in the DFFQ. In contrast, thirty-one to thirty-nine items were needed to reach the 90 % level for the between-person variation in energy, protein and fat intake, which is similar to other studies(Reference Tsubono, Takamori, Kobayashi, Takahashi, Iwase, Iitoi, Akabane, Yamaguchi and Tsugane9, Reference Ahn, Lee, Paik, Lee, Jo and Kim12, Reference Shahar, Shai, Vardi, Brener-Azrad and Fraser23). Even some items with a limited amount of specific nutrients may still be effective in discriminating between individuals. In this population, grilled fatty fish accounted for 96 % of the between-person variability in retinol intake, but only contributed 17·9 % of the total retinol intake, indicating that this dish was not a major contributor of retinol. However, this explained a relatively large amount of the variability within populations. As for vitamin C intake, only eight items were needed to account for over 90 % of the between-person variability.
The cPC estimation results imply that there are limited dish and food sources for some nutrients such as vitamin C and carbohydrate in the Korean diet, whereas there is a variety of dish and food sources for other nutrients such as fat and retinol. Therefore, the length of the dish list used during the development of a DFFQ could depend on a nutrient of interest in relation to a disease.
The present study has several limitations originating from the KNHANES data we used. First, the use of the single day 24 h dietary recall meant that it was not possible to estimate within-person variations. Second, seasonal variation of food was not included in the survey of the 2001 KNHANES, as it was conducted from November through December. For example, the contributions of fruits more readily available during the survey period might be overestimated, and the contributions of other fruits that were less available might be underestimated. Therefore, several seasonal food items, for example, seasonal fruits (strawberry, watermelon, etc) and celebration foods not covered by the survey period of the 2001 KNHANES were included in the DFFQ, taking into consideration the seasonal variation of the Korean diet.
In conclusion, the selected ninety-five dish items that were included in the DFFQ covered 90 % of the total seventeen nutrients consumed by subjects, as well as 99 % of the between-person variation for the nutrients of interest. Therefore, the ninety-five selected items could be useful in developing a DFFQ for use in large-scale epidemiological studies of Koreans, within less than 10 % underestimation.
Acknowledgements
The present study was supported in part by a Korea Science and Engineering Foundation (KOSEF) grant funded by the Korean government (MOST) (no. R01–2006–000–10 621–0). The authors declare that there are no potential conflicts of interest. The authors' responsibilities were as follows: K. M. K. designed the study and obtained funding; Y. O. K., S.-A. L. and K. M. K. contributed to the study design, interpretation of the results and manuscript preparation; S. S. contributed to the study design and interpretation of the results; Y. M. Y. carried out the statistical analysis of data. All authors approved the final manuscript.
Appendix
Sample questionnaire for staple food in a developed dish-based food-frequency questionnaire.