Higher consumption of fruit and vegetables has been associated with lower risk of several diseases, including heart disease and several cancers(Reference Bazzano, Serdula and Liu1–Reference van't Veer, Jansen and Klerk4). Fruits and vegetables contain numerous phytochemicals that may reduce disease risk. High fruit and vegetable intakes may also be a marker for an overall diet or for other lifestyle factors associated with reduced disease risk. Observations regarding disease risk in relation to broad and heterogeneous categories, such as ‘fruits’ and ‘vegetables’, have direct implications for public health recommendations, such as ‘two cups of fruit and cups of vegetables per day are recommended for a reference 2000-energy intake’ according to the US Department of Agriculture Dietary Guidelines for Americans 2005(5). Evaluation of fruits and vegetables classified on the basis of their nutrient content, such as on the basis of particular vitamins or other phytochemicals, is useful for generating hypotheses regarding the biological mechanisms of associations of fruit and vegetable intakes with disease risk.
Many studies on the association between fruit and vegetable intakes and disease risk have been conducted in Western populations; however, fewer data are available on other population groups. The amount and patterns of fruit and vegetable consumption may differ across populations. For example, fruit and vegetable intakes in South-East Asia are lower than those of Western populations(Reference Pomerleau, Lock, McKee and Altmann6). A difference in the amount or pattern of consumption may affect the validity and interpretation of dietary intake data.
Our primary objective was to evaluate the validity of fruit and vegetable intakes obtained from an FFQ by evaluating various classification schemes in relation to plasma carotenoid and vitamin C concentrations. Additional analyses were conducted to evaluate dietary intake and plasma concentrations in relation to season of interview and year of interview to evaluate potential bias that might be introduced by these variables.
Experimental methods
Participants and study procedures
Participants (n 2031) comprised women who had participated in a case–control study of diet and breast-related diseases nested within a randomized trial of breast self-examination among textile workers in Shanghai, China (n 266 064; Fig. 1)(Reference Li, Ray and Lampe7). Figure 1 illustrates the selection of participants for the analysis conducted for the present study from the larger randomized trial. Recruitment into the trial took place between 1989 and 1991, and follow-up for breast-related diseases continued through July 2000. Women born between 1925 and 1958 were eligible for participation in the trial (aged 30–64 years at enrolment). Further details regarding eligibility for the randomized trial and baseline data collection are published elsewhere(Reference Thomas, Gao and Ray8). Dietary data were collected from selected women in the cohort with benign or malignant breast disease diagnosed between 1 September 1995 and 31 July 2000 and from women without breast diseases who were randomly selected from the trial cohort as controls(Reference Li, Ray and Lampe9). Controls were frequency-matched to cases on the basis of age. Women with breast-related diseases were eligible as cases for the case–control study if they were diagnosed at one of the three hospitals serving the Shanghai Textile Industry Bureau (STIB). Non-fasting blood samples were collected for the analysis of several nutrients, including carotenoids and vitamin C. Attempts were made to schedule the diet interview and blood sample collection on or before the date of the scheduled breast biopsy for women with benign or malignant breast disease. Women were excluded from the present analysis if their blood sample collection was completed and their FFQ was administered >1 week apart or if plasma vitamin C or any of the plasma carotenoids were missing. The Institutional Review Boards at the Fred Hutchinson Cancer Research Center and STIB approved the study, and informed consent from all participants was obtained before their participation.
FFQ and classification of food groups
The FFQ consisted of 107 food items and was administered during an in-person interview. Women were asked ‘During most of your adult life how often did you usually eat?’ for each of the FFQ food items. Women responded by citing the number of times per day, week, month or year, or they indicated that they had never consumed the item. Foods were classified using a three-grouping system: the traditional group, the botanical group and the high-nutrient group (containing foods high in the nutrient). The botanical groups were included because certain phytochemicals may aggregate within the botanical groups distinct from the high-nutrient groups. The traditional groups were mutually exclusive and consisted of fruit, vegetables, red meat, poultry, seafood, eggs, milk and milk products, soya foods, legumes excluding soya and grains excluding corn. Fruits included apples, pears, oranges/tangerines, lychee, bananas, peaches, persimmon, pineapple, grapes, apricots and watermelon. Vegetables included salted mustard greens, other salted vegetables, bok choy, spinach, cabbage, Chinese cabbage, watercress, broccoli, Chinese broccoli, green asparagus, cauliflower, celery, eggplant, wild rice stem, winter squash, lettuce, yellow sweet potatoes or yam, other potatoes, wax gourd, gherkins, carrots, pumpkin, mushrooms, red or green pepper, tomatoes, bamboo shoots, radish/turnip, lotus rhizomes, taro root, corn, onions and chives, garlic stalk and seaweed. Table 1 provides examples of foods included in each botanical group. Foods included in the botanical and high-nutrient groups have been published in detail elsewhere(Reference Wu, Ray and Lin10). The high-nutrient groups were not mutually exclusive with respect to included foods, and included groups for high α-carotene (carrots, pumpkin, string beans, winter squash), high β-carotene (apricots, bok choy, broccoli, carrots, Chinese broccoli, Chinese cabbage, persimmon, pumpkin, salted mustard greens, spinach, string beans, watercress, winter squash, yellow sweet potato or yam), high β-cryptoxanthin (corn, oranges/tangerines, persimmon, watermelon), high lutein + zeaxanthin (broccoli, Chinese broccoli, corn, hyacinth beans, salted mustard greens, seaweed, spinach, watercress), high lycopene (persimmon, tomatoes, watermelon) and high total carotenoids (apricots, bok choy, broccoli, carrots, Chinese broccoli, Chinese cabbage, persimmon, pumpkin, salted mustard greens, red or green peppers, seaweed, spinach, tomato, watercress, watermelon, yellow sweet potato or yam). The botanical groups were mutually exclusive with respect to included foods and included araliaceae (fresh ginseng, white ginseng powder or extract, red ginseng powder or extract), compositae (sunflower seeds, lettuce), convolvulaceae/dioscoreaceae (yellow sweet potatoes, yam), cruciferae (bok choy broccoli, cabbage, cabbage, cauliflower, Chinese broccoli, Chinese cabbage, radish or turnip, watercress), cucurbitaceae (gherkin, pumpkin, watermelon, wax gourd, winter squash), ebenaceae (persimmon), laminariaceae (seaweed), leguminosae (fresh fava beans, fried bean curd, fried bean curd puff, green or kidney beans, hyacinth beans, mung bean sprouts, mung beans, other dried beans, other soyabean foods, pea or cow peas, peanuts, peanut butter, red pea or green bean groups, soyabean milk, soyabeans, string beans, szuki beans), liliaceae (asparagus, Chinese chives, chives, garlic, garlic stalk, onions, scallions), rosaceae (apples, apricots, peaches, pears), rutaceae (oranges, tangerines), sapindaceae (lychee), solanceae (eggplant, hot pepper, other potato, red or green pepper, tomato), umbelliferae (carrots, celery), vitaceae (grapes) and zingiberaceae (ginger root). Servings per week were calculated for each food group. The average serving size obtained from the Chinese population was used to estimate a serving from the reported frequency for each food item. Each reported frequency was assigned the average serving size for that food item. Average serving sizes were obtained from the 1992 China Health and Nutrition Survey(11).
*Fruits included: apples, apricots, bananas, grapes, lychee, oranges or tangerines, peaches, pears, persimmon, pineapple and watermelon.
†Vegetables included: unsalted vegetables (asparagus, bok choy, broccoli, cabbage, Chinese broccoli, Chinese cabbage, spinach and watercress) and salted and cured vegetables (fermented bean curd, salted mustard greens and other salted vegetables).
Plasma collection and nutrient analysis
Blood samples were collected into foil-covered EDTA 10 ml vacutainers and centrifuged within 45 min of collection or refrigerated and centrifuged within 2 h of collection. The samples were centrifuged for 15 min at 1300g. Plasma was transferred into 2 ml cryovials and frozen at −20°C for 1 month or at −70°C for longer periods until they were shipped to Seattle. Specimens were shipped to Seattle on dry ice and immediately stored at −70°C.
Ascorbic acid concentration measurements have been described in detail elsewhere(Reference Dijkstra, Lampe and Ray12). The intra-assay per cent CV (%CV) was 6·3 and 1·3 at 0·83 and 1·65 mg/dl, respectively. The inter-assay %CV was 8·1 and 5·0 at 0·78 and 1·65 mg/dl, respectively. The extraction of analytes from plasma, the quality control parameters and the HPLC methods for measurements of plasma carotenoids and tocopherols have been published previously(Reference Kalman, Goodman and Omenn13, Reference Zhang, Williams and Sanchez14). The HPLC method in the present study included a profile of carotenoids, carotenes, retinols and tocopherols in a single HPLC run and has been described elsewhere(Reference Dijkstra, Lampe and Ray12). The %CV for the pooled quality control samples for all analytes was ≤10 %.
Statistical methods
Mean and sd were calculated as summary statistics. Because distributions were not normally distributed, data were log transformed before analyses for plasma concentrations of β-cryptoxanthin, lycopene, α-carotene, β-carotene, retinyl palmitate and vitamin C. Exponentiations of the mean value of the log-transformed nutrients to obtain the geometric mean were performed for presenting the data in the results. Dietary intake was divided into quartiles for analysis. Linear regression was also used to evaluate differences in mean plasma nutrient concentrations across quartiles of intake, adjusted for each woman's age, case–control classification, season of interview and year of interview. The quartile of intake was calculated using the median and 25th and 75th percentiles of each food group (Table 1) and was included as a single categorical term in the linear regression models. Season was defined as four 3-month categories (December–February, March–May, June–August and September–November). Year of recruitment was divided into three categories: 1995–1996, 1997–1998 and 1999–2001. Case–control classification was divided into three groups: controls, women with benign breast disease and women with breast cancer. In order to quantify the differences across seasons and year of interview, dietary intake and plasma nutrients were evaluated in relation to season of interview and year of interview using ANOVA and regression analyses, respectively, adjusted for each woman's age, case–control classification and, for dietary nutrients, total energy (kJ) intake. Analyses using controls only were evaluated, and differences in overall interpretation of the results were unremarkable; therefore, the results of analyses for all participants are presented. A sampling of plots of predicted values compared with standardized residuals was evaluated for heteroscedasticity; plots were consistent with meeting the assumptions of homoscedasicity. Statistical significance was considered at P < 0·05. Analyses were conducted using the STATA statistical software package version 11·0 (StataCorp., College Station, TX, USA).
Results
Women reported consuming an average of approximately six servings of fruit and sixteen servings of vegetables per week (Table 1). Fresh vegetables contributed to the majority of vegetable intake. The correlation between fruit and vegetable intakes was r = 0·39. Cruciferae, cucurbitaceae, liliaceae, rosaceae and solanaceae were the most frequently consumed fruit and vegetable botanical groups. High β-carotene, total carotenoid, lycopene and β-cryptoxanthin foods were the most frequently consumed high-nutrient groups. Weekly servings of high β-carotene and high total carotenoid foods were highly correlated (r = 0·91); therefore, further analyses were conducted with high total carotenoid foods only. Although similar foods were in the high lycopene and high β-cryptoxanthin groups, weekly servings of these foods were not strongly correlated enough to be considered collinear (r = 0·58).
The distribution of plasma nutrients is presented in Table 2. Wide variation in the plasma concentrations of nutrients was observed among the women.
LOQ, limit of quantitation.
Fruit intake was significantly and positively associated with plasma concentrations of α-tocopherol, β-cryptoxanthin, retinyl palmitate, lycopene, α-carotene, β-carotene and vitamin C (Table 3). Inverse associations of γ-tocopherol and lutein + zeaxanthin with fruit intake were observed. Vegetable consumption was significantly and positively associated with γ-tocopherol and β-cryptoxanthin. The botanical and high-nutrient groups were associated with plasma nutrients in a manner distinct from association with the fruit and vegetable groups. Almost all groups were positively associated with vitamin C concentrations. Some groups (e.g. rosaceae) were associated with most nutrient concentrations, whereas other groups (e.g. solanceae) were associated with only a few nutrient concentrations.
* Geometric means for β-cryptoxanthin, lycopene, α-carotene, β-carotene, retinyl palmitate and vitamin C.
Season and year of interview were significantly associated with diet and most plasma nutrients (Table 4). Women reported lower lifetime fruit consumption when interviewed during the summer and autumn months and lower vegetable consumption when interviewed during the autumn months. Fruit and vegetable intakes and mean plasma concentrations of lutein + zeaxanthin, γ-tocopherol and β-cryptoxanthin decreased over the course of the study. To evaluate whether decreased fruit and vegetable consumption over time was balanced by increased consumption of other foods, we examined the mean intake of other main food groups in relation to year of interview, adjusted for total enegy (kJ), case–control classification, season of interview and age at interview. Consumption of red meat (mean intake for 1995–1996, 1997–1998 and 1999–2001 was 5·6, 5·3 and 4·8 servings/week, respectively) and legumes excluding soya (3·5, 3·2 and 2·9 servings/week for the respective corresponding time periods) also decreased significantly in relation to year of interview (P ≤ 0·001). Consumption of poultry (0·8, 1·0 and 1·0 servings/week), eggs (3·2, 3·2 and 3·7 servings/week), milk and milk products (3·2, 3·1 and 4·1 servings/week) and grains excluding corn (5·5, 5·2 and 6·9 servings/week) increased significantly (P ≤ 0·001). No change in consumption of seafood (3·4, 3·3 and 3·3 servings/week) or soya foods (5·6, 5·2 and 5·4 servings/week) was observed (P > 0·3).
* Geometric means for β-cryptoxanthin, lycopene, α-carotene, β-carotene, retinyl palmitate and vitamin C.
† Winter: December, January and February; Spring: March, April and May; Summer: June, July and August; Autumn: September, October and November.
‡ P value for difference between categories calculated from ANOVA.
§ P value for linear trend across categories calculated from linear regression.
Discussion
Serum and plasma carotenoids and vitamin C are proposed as biomarkers of fruit and vegetable intakes(Reference Brevik, Andersen and Karlsen15–Reference van Kappel, Steghens and Zeleniuch-Jacquotte17). Carotenoids are stored in the liver and blood concentrations provide a marker of long-term exposure. Blood concentration of vitamin C, which is not stored, serves as a short-term marker of exposure(Reference Shils, Olson and Shike18). In the present study, we observed that fruit intake was associated with most plasma carotenoids and vitamin C. Inverse associations of γ-tocopherol and lutein + zeaxanthin with fruit intake were observed; it is remarkable that there was no positive association because γ-tocopherol is found in high amounts in vegetable oils and lutein + zeaxanthin is found in eggs and corn, as well as in green and yellow fruits and vegetables. However, the inverse association may have been observed by chance or may reflect a higher consumption of other foods, such as vegetables, that replaces fruit consumption. Vegetable intake was associated with two of the ten plasma nutrients evaluated, β-cryptoxanthin and γ-tocopherol. These observations suggest that the fruit group provides a broad classification of exposure to carotenoids and that the vegetable group provides an index of exposure for select nutrients.
We observed that the high total carotenoid group was positively associated with plasma concentrations of α-carotene, β-cryptoxanthin, lutein + zeathanin, γ-tocopherol and vitamin C. This pattern was distinct from the high lycopene and high β-cryptoxanthin food groups. Among other plasma nutrients, the high lycopene group was positively associated with plasma lycopene concentrations and the high β-cryptoxanthin group was positively associated with plasma β-cryptoxanthin concentrations. These observations suggest that the high total carotenoid group provides a valid assessment of several plasma carotenoids and that the specific carotenoid food groups adequately reflect their specific plasma nutrients. A strength of using these high-nutrient classifications is that, although they are a finer stratification compared with fruit and vegetables, there is adequate variation in intake. A limitation of using these classifications is that it may be more difficult to provide public health recommendations regarding consumption of the high-nutrient groups compared with public health recommendations for increased consumption of fruit and vegetables in general.
Fruits and vegetables can be classified on the basis of their botanical family groups, such as cruciferae, or on the basis of their highly specific phytochemical content, such as isoflavones in soya or glucosinolates in cruciferous vegetables. Evaluating the associations between these groups and disease risk is useful for generating and evaluating hypotheses regarding risk associated with particular phytochemicals found in high amounts in these botanical families. We observed patterns of association distinct from the other classifications with regard to plasma carotenoids for the five categories of botanical groups evaluated. In general, the patterns of association were consistent with what might be expected. For example, cruciferae was positively associated with β-carotene, lutein + zeaxanthin, γ-tocopherol and vitamin C, and solanceae, which includes eggplant and peppers, was positively associated with lycopene and vitamin C. This observation suggests that these groups are useful for evaluating particular botanical families of interest. However, rosaceae (apples, pears, peaches and apricots) was associated with most plasma nutrients, suggesting that this botanical family may be serving as a marker of consumption of other foods.
Season of interview was associated with fruit and vegetable intakes and with plasma nutrient concentrations. Consumption of overall and particular fruits and vegetables is usually seasonal, based on availability and pricing, and therefore changes in plasma nutrients likely reflect changes in consumption. For example, plasma lycopene was higher in summer months and lutein + zeaxanthin was lower in autumn months, which may reflect seasonal changes in consumption of high lycopene foods (e.g. persimmon, tomato and watermelon) or high lutein + zeaxanthin foods (e.g. perishable greens). The FFQ queried participants about their usual intake of particular foods during their adult life. Association of season of interview with fruit and vegetable intakes likely reflects a source of error in dietary exposure measurement, in which participants extrapolate current exposure to lifetime exposure. As a result, season of interview is associated with exposure (diet) and could introduce non-differential or differential bias when evaluating an exposure–outcome association, depending on the relationship of season of interview with outcome(Reference Armstrong and White19).
Year of interview was also associated with fruit and vegetable intakes and with plasma nutrient concentrations. Plasma β-cryptoxanthin, γ-tocopherol and vitamin C were associated with vegetable intake. β-Cryptoxanthin and γ-tocopherol, which are stored in the liver and reflect a longer period of exposure, were also inversely associated with year of interview, whereas vitamin C, which reflects a shorter exposure period, was not associated. Red meat and legume consumption also decreased over the course of the study, whereas consumption of poultry, milk and milk products and grain increased. Overall, these observations suggest that the reported decreased fruit and vegetable intakes may represent a real trend of changing consumption patterns. There are many possible reasons for a decrease in fruit and vegetable consumption with time, including increased urbanization, although this has not been predicted to have a remarkable influence on China's vegetable and fruit consumption(Reference Hsu, Chern and Gale20), and changing availability and cost of fruits and vegetables in response to shifts in import and export practices during 1999–2001(Reference Shields and Huang21). However, it is also possible that the lower fruit and vegetable consumption in later years was observed by chance.
The present study has several potential limitations. One limitation is that the average serving size for the population was used to calculate dietary intake for all participants. Willett summarized results from several studies that indicated that individuals were relatively inaccurate in recalling portion sizes and that the majority of intake variability was from frequency of consumption rather than from serving size(Reference Armstrong and White19). Thus, not collecting serving size information likely reduced the intake values but did not markedly alter the overall ranking of individuals by intake. This could be expected to be a minor source of bias across quartiles of intake. However, if a reduction did occur in the variability in dietary intake, this would represent a source of non-differential bias and would have reduced the ability to detect an association between dietary intake and plasma nutrient concentrations. In this circumstance, our results would be conservative. Because a single plasma sample, rather than multiple samples, was collected over time and the single plasma sample was compared with usual adult intake, we expect a decreased ability to detect an association. Overall, these sources of potential non-differential bias in our study suggest that the observed associations between plasma nutrients and dietary fruit and vegetable intakes may be conservative.
Our primary objective was to evaluate the validity of fruit and vegetable intakes obtained from an FFQ by evaluating various classification schemes in relation to plasma carotenoid and vitamin C concentrations. From this evaluation, we observed several outcomes relevant to the study design and analysis. First, various classifications of fruit and vegetable intakes were associated with plasma carotenoids and vitamin C, suggesting that intakes estimated from the FFQ provide a valid assessment of exposure. Second, the fruit and vegetable, high-nutrient and botanical groups were distinctly associated with plasma nutrients. Third, season of interview should be considered as a potential confounder, or cases and controls should be matched to season of interview. We also observed a significant decrease in fruit and vegetable consumption over time in this population of Chinese women and, given the positive associations of fruit and vegetable intakes with health, this should be evaluated in additional studies.
Acknowledgements
The present study was supported by the US National Institutes of Health (R01CA75332 and T32CA009168). The authors have no conflict of interest to declare. C.L.F. contributed to statistical analyses, manuscript preparation and manuscript review; J.W.L. contributed to study design, laboratory analyses, manuscript preparation and manuscript review; J.S. contributed to study design, data collection, dietary analyses and manuscript review; D.L.G. was the primary physician contact in Shanghai and also contributed to study design, implementation of study procedures, supervision of data acquisition and manuscript review; W.L. contributed to statistical analyses and manuscript review; R.M.R. oversaw data management and statistical analyses and actively participated in manuscript review; C.C. and I.B.K. were involved in laboratory analyses and manuscript review; D.B.T. was the principal investigator for the breast self-examination trial and for other related protocols that funded the current study and was involved in study design and manuscript review.