Obesity is becoming increasingly prevalent worldwide. Historically, BMI and waist:hip ratio (WHR) have been used as indices of obesity and related disease risk in epidemiological studies(Reference Hartz, Rupley and Rimm1). However, waist circumference (WC) is a better indicator of body fat distribution than BMI(Reference Lean and Han2) and it has been demonstrated that central deposition of adipose tissue can better predict risk of disease(Reference Janssen, Kartzmarzyk and Ross3). Additionally, WC is a more precise marker of central adiposity than WHR as WHR will remain constant if WC and hip circumference (HC) increase proportionally. Furthermore, HC has been found to be independently associated with disease, with larger HC providing a protective effect for a given WC(Reference Hartz, Rupley and Rimm1, Reference Lissner, Björkelund and Heitmann4). Therefore measurement of both WC and HC can act as simple indicators for disease risk.
Anthropometric measurements such as WC and HC are traditionally collected by a trained technician; however, this method can be time-consuming and costly to conduct. For this reason epidemiological studies often rely on self-administered questionnaires to obtain data. Previous studies have demonstrated reliability of self-measures of anthropometry(Reference Roberts, Wilder and Jackson5–Reference Prince, Janssen and Tranmer8) but vary in terms of over- and underestimation of body girths. Correlations between self- and technician measurements have ranged from 0·79 to 0·98(Reference Tehard, van Liere and Com Nougué7, Reference Prince, Janssen and Tranmer8) for WC and from 0·74 to 0·96(Reference Spencer, Roddam and Key9, Reference Kushi, Kaye and Folsom10) for HC; mean errors have ranged from 0·14 cm to 5·2 cm(Reference Koh-Banerjee, Chu and Spiegelman11, Reference Cullum, McCarthy and Gunnell12) for WC and from 0·56 cm to 1·8 cm(Reference Spencer, Roddam and Key9, Reference Kushi, Kaye and Folsom10) for HC; and limits of agreement have ranged from −5·0, 4·2 cm to −14·9, 8·9 cm(Reference Han and Lean6, Reference Bigaard, Spanggaard and Thomsen13). These studies therefore support the utilisation of self-measurements in providing a simple and inexpensive method to enable reliable data collection in epidemiological studies.
Previous studies examining the accuracy of self-measured body circumferences compared with technician measurements have predominantly provided written instructions to participants. However, instruction provided via video technology, with directive voice-over, may improve accuracy. Providing instruction in a dual format, i.e. audio and visual, can improve the effectiveness of working memory(Reference Chandler, Cooper and Pollock14). A recent study(Reference Elliott15) described a computer-based video tutorial for teaching self-measurement of WC, compared with technician measurement, and found this method to be reliable and effective for teaching WC to untrained individuals. Compared with written instruction, video instruction has been shown to improve skill, performance and understanding in a range of applications including electric toothbrush usage(Reference Renton-Harper, Addy and Warren16), laboratory experiments(Reference Burewicz and Miranowicz17) and patient education(Reference Tait, Voepel-Lewis and Moscucci18).
Within developed countries computer ownership and Internet access is rising annually. Of UK households, 72 % owned a home computer in 2008 and 70 % had Internet access in 2009(19, 20). Computers can therefore provide the majority of the population with low-cost and easy access to a range of educational materials(Reference Norris21), providing a simple means with which to follow video instruction provided as part of epidemiological studies.
To date, only one published study has examined the accuracy of self-measurements of body circumferences using video instruction(Reference Elliott15). No previous studies have compared instruction formats. The aim of the present study therefore was to investigate if video instruction can improve the accuracy of self-measurement of WC and HC compared with written instruction. It was hypothesised that using computer-based video technology instructions for self-measures of anthropometry would improve accuracy compared with written instruction. Secondary objectives were to assess whether reliability was affected by sex, age, BMI and WC. The present study is the first one known to compare different methods of teaching girth measurement technique to assess whether instruction methods differ in their ability to enable accurate and reliable measurements by participants.
Method
Participants
Participants were recruited via posters, an email bulletin distributed in a large educational establishment and word of mouth. Inclusion criteria were: (i) a high level of English language, i.e. native English speaker or minimum IELTS (International English Language Testing System) or TOEFL® (Test of English as a Foreign Language™) score of 6·0 or 550, respectively; and (ii) no prior training in anthropometry. All volunteers received an information sheet detailing the study background, process, and clothing requirements (loose, light clothing) prior to providing written consent to participate. The participants were randomised to receive either written instructions (‘written method’) or video instructions (‘video method’).
Anthropometry
Participants were advised to follow instructions carefully; to read/watch them as many times as required until sufficiently confident to take measurements; to measure each parameter in duplicate to the nearest 0·1 cm; and to document measurements on the questionnaire provided, including how many times they read/watched each set of instructions. In addition, participants also completed a questionnaire detailing age, sex and education level.
A flexible steel tape measure (Lufkin W606PM; Cooper Industries, Lexington, SC, USA) and a mirror were provided to participants; the technician ensured participants could clearly see their reflection in the mirror. Written or video instructions, based on an established protocol(22), were provided.
Written instructions were adapted from a previous study(Reference Mavroeidi, Macdonald and Stewart23) and were drafted to a Flesch–Kincaid Grade Level of 7·4. A diagram was included with the written instructions highlighting the approximate location of measurements. Professional video instructions were developed and incorporated into a PowerPoint® 2007 (Microsoft Corporation, Redmond, WA, USA) presentation. Two separate videos were included, WC (32 s duration; Fig. 1) and HC (35 s duration; Fig. 2), and included audio and visual instructions of measurement technique. Participants in the video method were verbally instructed to click on the computer screen to start each video but were left alone to complete their measurements; the technician was available should the participants encounter any problems. Once complete, the technician returned to the room and measured participants’ WC and HC in duplicate according to the same protocol. The technician was blinded to participants’ self-measurements during the experimental process. WC was measured at the narrowest point of the abdomen and HC measured at the level of the greatest posterior protuberance of the buttocks(22). Body weight was measured with shoes removed, to the nearest 0·5 kg, using reliable weighing scales (Seca Mechanical Scales; Seca, Hamburg, Germany or Soehnle Mechanical Scales, Soehnle, Backnang, Germany). Height was measured with shoes removed, to the nearest 0·1 cm, using a portable stadiometer (Leicester Height Measure; Seca Ltd, Birmingham, UK). BMI was subsequently calculated. All measurements were taken by a single, trained researcher (technician) with experience of undertaking anthropometric measurements in both clinical and academic settings prior to the study. In addition, the technician received additional training and assessment by a Level 4 Criterion Anthropometrist, accredited by the International Society for the Advancement of Kinanthropometry (ISAK), to determine inter- and intra-tester accuracy and reliability.
Data analysis
The mean of duplicate measurements for both self- and technician measurements were used in the analysis. Participants failing to complete the measurement procedure were excluded from analysis. Where only single measurements were provided for both WC and HC these measurements were used for analysis. Where participants used the wrong side of the tape measure to record their measurements (imperial rather than metric), this was converted to metric units for inclusion in the analysis.
Participants were stratified into sub-categories relating to sex, age, BMI and WC in order to identify potential differences in accuracy which may exist according to these characteristics. For the purpose of analysis participants were split into two groups for age (≤26 years, >26 years), BMI (<25 kg/m2, ≥25 kg/m2) and WC (<80 cm, ≥80 cm for women and <94 cm, ≥94 cm for men). Cut-off points for BMI and WC were those specified in current guidelines(24).
Data were screened for normality and Mann–Whitney U tests and χ 2 analysis used to assess distribution of the data across instruction methods. The Mann–Whitney U test was used to assess whether self- and technician-measured circumferences differed within sex, age, BMI and WC sub-categories within each instruction method. Spearman's rank correlation coefficients examined the relationship between accuracy of measurement and number of times instructions were reviewed. The Wilcoxon signed-rank test was used to assess differences between self- and technician-measured circumferences. Due to the skewed nature of the data, the average differences between self- and technician measurements (biases) are reported in terms of median error. Agreement between measurements was examined in relation to instruction method, sex, age, BMI and WC using Bland and Altman's(Reference Bland and Altman25, Reference Bland and Altman26) non-parametric approach to the limits of agreement method. Statistical significance was accepted at P < 0·05. All analyses were performed using the SPSS for Windows statistical software package version 15·0 (SPSS Inc., Chicago, IL, USA).
The study was approved by the Institutional Ethical Review Panel in February 2010. All information obtained was confidential and participants were free to withdraw from the study at any time.
Results
Intra-tester precision of the trained researcher was assessed using the technical error of measurement, which was 0·2 % for WC and 0·14 % for HC.
Participant characteristics
Table 1 shows both the mean and median values for participant characteristics. Following stratification of participants based on instruction method, χ 2 analysis confirmed that the two groups did not differ significantly in terms of sex, age group, BMI group and WC group. Owing to the homogeneity of the sample in terms of educational status, this variable was not considered in subsequent analysis. Only one participant failed to complete the measurements and was therefore excluded from subsequent analysis.
WC, waist circumference; HC, hip circumference.
†Mann–Whitney U test for the difference between written and video methods.
Waist circumference
Figure 3 illustrates the biases and limits of agreement for all participants within the written and video methods. Table 2 shows the biases and limits of agreement between self- and technician measurements for different participant characteristics within each instruction method. The mean WC was overestimated in all sub-categories regardless of instruction method. There were no significant differences between the biases within the sub-categories for sex, age, BMI or WC for either instruction method. Accuracy was greater for WC in the video method and no significant differences were observed between self- and technician measurements for participants within any sub-category in the video method only.
WC, waist circumference; LOA, limits of agreement.
Significance of the difference between self- and technician WC measurements: **P < 0·01, ***P < 0·005.
†Median error, the difference between self- and technician measurements (self-measured minus technician-measured). Positive indicates overestimation; negative indicates underestimation.
‡Wilcoxon signed-rank test for agreement between self- and technician measurements.
§5th and 95th percentiles, non-parametric LOA(Reference Bland and Altman26).
The number of times instructions were read/watched for WC correlated significantly with WC bias in the video method (r S = 0·387; P = 0·042) and with the number of times instructions were read/watched for HC in the written method (r S = 0·494; P = 0·009) and the video method (r S = 0·600; P = 0·001).
Hip circumference
Figure 4 illustrates the biases and limits of agreement for all participants within the written and video methods. Table 3 shows the biases and limits of agreement between self- and technician measurements for different participant characteristics within each instruction method. HC was underestimated in all sub-categories regardless of instruction method. Accuracy of HC did not differ significantly within sub-categories for sex, age, BMI or WC for either instruction method.
WC, waist circumference; LOA, limits of agreement.
Significance of the difference between self- and technician HC measurements: *P < 0·05, **P < 0·01.
†Median error, the difference between self- and technician measurements (self-measured minus technician-measured). Positive indicates overestimation; negative indicates underestimation.
‡Wilcoxon signed-rank test for agreement between self- and technician measurements.
§5th and 95th percentiles, non-parametric LOA(Reference Bland and Altman26).
Participants in the written method read/watched instructions for HC significantly more times than those in the video method (P = 0·009).
Discussion
In the present study WC was predominantly overestimated and HC predominantly underestimated by participants, findings consistent with previous studies(Reference Tehard, van Liere and Com Nougué7, Reference Spencer, Roddam and Key9, Reference Koh-Banerjee, Chu and Spiegelman11, Reference Elliott15, Reference Weaver, Kushi and McGovern27, Reference Dekkers, van Wier and Hendriksen28). Although significant biases existed within the video method, fewer significant biases were observed for both WC and HC, suggesting improved accuracy using video instruction.
Overall biases observed for WC and HC, in both the written and the video methods in the current study, are smaller than those observed in several previous studies(Reference Spencer, Roddam and Key9, Reference Cullum, McCarthy and Gunnell12, Reference Elliott15, Reference Weaver, Kushi and McGovern27), suggesting improved accuracy using the ISAK measurement protocol(22). ISAK is an organisation which has developed international standards for anthropometric assessment and an international anthropometry accreditation scheme which aims to assist with global standardisation of anthropometry(22, 29).
Many studies have used correlational analysis in reporting reliability of self-measurements. However the use of correlations in examining accuracy of measurement may be misleading as correlation ignores any systematic bias which may be present between the measurements and therefore does not identify the level of agreement between two methods(Reference Bland and Altman25). For this reason several studies have also used limits of agreement to examine the accuracy of self-measurement(Reference Han and Lean6, Reference Prince, Janssen and Tranmer8, Reference Koh-Banerjee, Chu and Spiegelman11, Reference Bigaard, Spanggaard and Thomsen13, Reference Elliott15, Reference Dekkers, van Wier and Hendriksen28). The limits of agreement in the current study fall within the range of those observed in previous studies. However, few previous studies(Reference Freudenheim and Darrow30, Reference Mavroeidi, Stewart and Macdonald31) have used the narrowest waist as a measurement location and no previous studies where this specific location has been used have examined limits of agreement; therefore any comparison of results needs to be interpreted with caution. Similarly, no previous studies have investigated limits of agreement for HC. The wide limits of agreement observed in the current study are influenced by the inclusion of outliers in the analysis. Participants who displayed extreme measurement errors were included as they represent a valid segment of the population; in epidemiological research it is likely that any population examined will comprise individuals who may demonstrate greater difficulty in understanding and implementation of instructions. Indeed, the homogeneity of the study sample towards a higher educational level may be a confounding factor. For this reason, the exclusion of these outliers from the analysis would result in a misrepresentation of the population and removal of participants from analysis due to higher or lower than expected values is not recommended(Reference Bland and Altman26, Reference Petrie and Sabin32).
Previous studies, predominantly looking at self-reported height and weight, have demonstrated that reporting of anthropometric measures is affected by body size, age, sex and social status(Reference Bostrom and Diderichsen33–Reference Spencer, Appleby and Davey35). Reduced accuracy has been reported in younger(Reference Dekkers, van Wier and Hendriksen28), older(Reference Han and Lean6, Reference Spencer, Roddam and Key9) and male(Reference Dekkers, van Wier and Hendriksen28) individuals. Several studies have also found greater misreporting in those with a higher body weight(Reference Cullum, McCarthy and Gunnell12, Reference Weaver, Kushi and McGovern27), higher WC(Reference Spencer, Roddam and Key9, Reference Bigaard, Spanggaard and Thomsen13, Reference Weaver, Kushi and McGovern27) and HC(Reference Spencer, Roddam and Key9) and higher BMI(Reference Spencer, Roddam and Key9, Reference Cullum, McCarthy and Gunnell12, Reference Bigaard, Spanggaard and Thomsen13). However, in the present study, insufficient participant numbers and homogeneity in terms of BMI and WC do not permit accurate conclusions to be made with regard to accuracy according to body girth size and BMI, and future studies using larger and more heterogeneous samples are required to assess any potential differences in accuracy according to these characteristics.
In a study of a Dutch, overweight, working population(Reference Dekkers, van Wier and Hendriksen28) the authors reported a greater degree of accuracy among females and older participants when measuring WC. The current study provides some support for these findings; the variability observed among certain sub-categories suggests that distinct populations perform differently depending on instruction format and further research is warranted to investigate if these differences persist in a more heterogeneous population.
Many previous studies have observed a tendency to underestimate WC, particularly in females, which could lead to misclassification of fat distribution and a subsequent underestimation of disease risk(Reference Hall and Young36). However, the current study demonstrates a tendency for participants to overestimate WC. This finding may be due to the measurement protocol, as the location of ‘narrowest waist’ significantly minimises the likelihood of underestimation.
In general, video instruction improved accuracy. However, improved accuracy of HC measurements with written instruction in certain sub-categories suggests some individuals may perform better using written instruction. The varied results may be suggestive of the potential benefit of a combined instruction format such as that used in previous research(Reference Elliott15), in which written and video instruction were combined into a computer-based tutorial for WC measurement, which may maximise accuracy by providing participants with a mixed instruction format. The video format used in the current study resulted in smaller bias than the prior video study(Reference Elliott15) in all sub-categories analysed, suggesting easier measurement technique using the narrowest waist as a point of reference. The location of the midpoint halfway between the lowest rib and the iliac crest was used as the reference point in devising the healthy cut-off points(Reference Lean, Han and Morrison37). For this reason, cut-off points for measurements taken at the narrowest waist may need to be reviewed to enable accurate prediction of risk; studies have shown significant differences between measurements taken at the narrowest waist and other commonly used reference points(Reference Wang, Thornton and Bari38).
Within the video method the number of times participants watched the instructions was positively, but weakly, associated with the bias for WC, suggesting that error increased with increased viewing of the instructions – an unexpected finding. This may be indicative of a poorer understanding of the appropriate anthropometric technique and should be further explored in future research. The finding that instructions for HC measurement were viewed significantly more in the written method is suggestive of improved comprehension of the measurement technique with video instruction. However, the number of times instructions were read/watched may be due to participants’ willingness to devote only a given amount of time to measurements rather than representing their comprehension of the technique and must therefore be interpreted cautiously. Indeed, a significant advantage of the video instruction used in the current study, in terms of maintaining participant attention, is the short duration.
There are many potential explanations for the biases and variability of errors observed in the current study. Although all participants were provided with instructions and the appropriate materials to carry out measurements, it cannot be ascertained whether they followed the correct procedure. Errors may have resulted from misreading, incorrect positioning or ‘cinching’(Reference Hall and Young36) of the tape. Preconceptions of the location of the ‘waist’ and ‘hips’ may have biased results as several participants reported that they thought WC was lower and HC higher. Previous studies have used the alternative terms ‘torso’ or ‘abdomen’ and ‘buttocks’(Reference Roberts, Wilder and Jackson5, Reference Kushi, Kaye and Folsom10) in an attempt to minimise the effect of such preconceptions. Additionally, the type of clothing worn by participants may have affected measurements as some did not wear the requested loose, light clothing. However, this did not have an effect on WC as measurements were taken on bare skin. It may have had an impact on HC as both participants and the technician may have been unable to accurately identify the exact location of the widest protrusion of the buttocks, resulting in underestimation of girth. The observed variability of error for HC between instruction formats in different sub-categories is supportive of this.
The use of a standardised, easy-to-follow video instruction format such as that used in the present study provides a simple, inexpensive method of enabling individuals to provide accurate, reliable measurements in epidemiological studies and previous studies have demonstrated good reliability of repeated measurements by individuals(Reference Kushi, Kaye and Folsom10, Reference Rimm, Stampfer and Colditz39). Additionally, the use of video instruction based on a standardised technique may be beneficial in educating health-care professionals about appropriate measurement technique and healthy cut-off points. Indeed, a previous study(Reference Dunkley, Stone and Patel40) found there was a high level of variation among health-care professionals on the suggested specific values for a healthy waist (74–88 cm for females and 90–100 cm for males), demonstrating the need to develop more robust educational tools to support awareness-raising of central adiposity and its associated risks. The resulting standardisation would be beneficial to epidemiologists who rely on health-care professionals’ records to monitor populations and to patients in ascertaining an accurate indication of their health risks.
Conclusions
To the authors’ knowledge, the present study is the first to compare the influence of instruction format on reliability of self-measures of anthropometry. Although previous studies have shown that the use of written instruction enables individuals to reliably self-measure body girths, the findings of the current study confirm that video instruction can improve reliability and accuracy of self-measurement of waist girth compared with written instruction, thus providing a cheap and easy-to-use method for inclusion in epidemiological research. Future studies using larger and more diverse samples are required to provide further insight into whether video instruction can improve accuracy across a more varied population.
Acknowledgements
Source of funding: The present research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors. Statement of acceptance of conditions: This submission represents original work that has not been published previously and is not being considered by another journal. Being accepted for Public Health Nutrition, it will not be published elsewhere in the same form, in English or in any other language, without the written consent of the Nutrition Society. Each author has seen and approved the contents of the submitted manuscript. Conflicts of interest: None. Author contributions: D.F.M. was responsible for the study design, data collection and analysis, and writing of the manuscript. S.C.L. was responsible for the research proposal, supervision of the study design, data collection and analysis, and writing of the manuscript. Acknowledgements: We would like to thank Dr Arthur Stewart (Deputy Director, Centre for Obesity Research and Epidemiology, Robert Gordon University, Aberdeen) for providing training in appropriate anthropometric techniques; and Mr Sid Marrekech (Manager, Educational Media Services, DELTA, Robert Gordon University, Aberdeen) and his team for their invaluable help in creating the video instructions.