American Journal of Clinical Nutrition, Vol. 87, No. 2, 279-291,
February 2008
© 2008 American Society for Nutrition
Estimating activity energy expenditure: how valid are physical activity questionnaires?1,2,3
Heather K Neilson,
Paula J Robson,
Christine M Friedenreich and
Ilona Csizmadi
1 From the Division of Population Health and Information, Alberta Cancer Board, Calgary, Canada
2 CMF was supported by a Canadian Institutes of Health Research New Investigator Award and an Alberta Heritage Foundation for Medical Research Health Scholar Award.
3 Address reprint requests and correspondence to HK Neilson, Division of Population Health and Information, Alberta Cancer Board, 1331-29 St NW, Calgary T2N 4N2, Canada. E-mail: heathnei{at}cancerboard.ab.ca.
 |
ABSTRACT
|
|---|
Activity energy expenditure (AEE) is the modifiable component of total energy expenditure (TEE) derived from all activities, both volitional and nonvolitional. Because AEE may affect health, there is interest in its estimation in free-living people. Physical activity questionnaires (PAQs) could be a feasible approach to AEE estimation in large populations, but it is unclear whether or not any PAQ is valid for this purpose. Our aim was to explore the validity of existing PAQs for estimating usual AEE in adults, using doubly labeled water (DLW) as a criterion measure. We reviewed 20 publications that described PAQ-to-DLW comparisons, summarized study design factors, and appraised criterion validity using mean differences (AEEPAQ – AEEDLW, or TEEPAQ – TEEDLW), 95% limits of agreement, and correlation coefficients (AEEPAQ versus AEEDLW or TEEPAQ versus TEEDLW). Only 2 of 23 PAQs assessed most types of activity over the past year and indicated acceptable criterion validity, with mean differences (TEEPAQ – TEEDLW) of 10% and 2% and correlation coefficients of 0.62 and 0.63, respectively. At the group level, neither overreporting nor underreporting was more prevalent across studies. We speculate that, aside from reporting error, discrepancies between PAQ and DLW estimates may be partly attributable to 1) PAQs not including key activities related to AEE, 2) PAQs and DLW ascertaining different time periods, or 3) inaccurate assignment of metabolic equivalents to self-reported activities. Small sample sizes, use of correlation coefficients, and limited information on individual validity were problematic. Future research should address these issues to clarify the true validity of PAQs for estimating AEE.
Key Words: Energy expenditure motor activity physical activity metabolic equivalents questionnaires retrospective studies doubly labeled water validation studies epidemiologic methods adults
 |
INTRODUCTION
|
|---|
The amount of energy expended during volitional and nonvolitional activities in humans is an emerging area of interest in the fields of disease prevention and health promotion. Activity energy expenditure (AEE), or activity thermogenesis (1), expands on the classic notions of physical activity and exercise in humans (2, 3) because it refers to thermogenesis from all activities associated with day-to-day living, not just planned and structured exercise.
Recently, higher levels of AEE have been reported to decrease the risk of all-cause mortality in elderly people (4) and blood pressure in younger adults (5). Higher levels of energy expenditure from nonexercise activities may also prevent weight gain (6, 7). However, owing to the inherent difficulties in assessing the duration, frequency, and intensity of all types of activities undertaken by free-living participants in large population studies (8, 9), the amount of AEE required for disease prevention and health promotion remains unclear.
In the continued absence of inexpensive, readily available, relatively noninvasive, valid and reliable technology for measuring AEE in large numbers of free-living humans, researchers may, by necessity, rely on estimates of AEE derived from physical activity questionnaires (PAQs; Figure 1
). Although a number of PAQs have been designed to capture various activity parameters, many have shown limited reliability and validity (12). Moreover, it is not entirely clear whether or not any are valid for estimating AEE at the individual level or even at the group level.

View larger version (7K):
[in this window]
[in a new window]
|
FIGURE 1. Total energy expenditure (TEE) and activity energy expenditure (AEE) can be derived by using doubly labeled water (DLW) or, possibly, physical activity questionnaires. Values for intensity (ie, metabolic cost of volitional activities) can be assigned by using the Compendium of Physical Activities (10, 11) or another similar source.
|
|
The purpose of this review, therefore, was to explore the validity of existing PAQs for estimating AEE in free-living adult populations. We were particularly interested in the potential for PAQs to capture usual AEE in large, population-based, etiologic studies of chronic disease. Various criterion measures, such as cardiorespiratory fitness (13-22), motion sensors (13, 17, 23-28), heart rate monitors (21, 22, 29), activity records (13, 16, 26, 27, 30, 31), 24-h physical activity recalls (32), and other PAQs (17, 30, 33) have been used to determine the relative validity of PAQs for capturing various aspects of activity. The most suitable approach for this task, however, is the doubly labeled water (DLW) method (9, 34). DLW measures total energy expenditure (TEE) at the individual level, and AEE is derived by subtraction: AEE = TEE –(resting metabolic rate under near basal conditions + dietary thermogenesis) (Figure 1
). Resting metabolic rate under near basal conditions (RMR), also referred to in the literature as basal metabolic rate (BMR), can be estimated by using prediction equations based on weight or weight and height (35), whereas dietary thermogenesis is usually assumed to be 10% of TEE (36, 37). Although there are potential limitations to DLW studies (34, 38-42), the method is widely regarded as the "gold standard" for estimating TEE in free-living individuals (9, 34, 38). This review, therefore, is limited to those studies that have compared DLW estimates of AEE or TEE with measures derived from PAQs.
 |
SUBJECTS AND METHODS
|
|---|
Publication search strategy
In October 2006 we searched the published literature using the electronic databases PubMed (National Institutes of Health, Bethesda, MD) and EMBASE: Excerpta Medica (1980 to week 42; 2006). To locate studies we combined search terms using the following 3 search strategies (* indicates all variations of a word stem): 1) (valid* or predict*) and (isotop* or doubly) and (activity or energy expenditure or energy or "food frequency" or food intake or dietary intake), 2) (doubly) and (physical activity) and (self report* or questionnaire* or survey* or record* or recall*), and 3) (energy requirement* and energy balance or energy expenditure) and (physical activity). We searched the dietary literature to account for studies that validated activity and dietary questionnaires simultaneously. Reference lists from relevant publications were also examined to identify relevant studies.
We included studies that aimed to 1) validate a PAQ against DLW, or 2) predict DLW-derived energy expenditure using a PAQ. We defined PAQs as instruments requiring retrospective activity recall beyond 24 h. The search was limited to human studies published in full text, in English, and with no restrictions on publication year. All subjects needed to be adults (aged
19 y) studied under free-living conditions. We excluded studies that were exclusive to athletes, pregnant or lactating women, or individuals with acute or chronic disease. We also excluded studies of PAQs designed for, and studied in, one ethnic minority subgroup because we were interested in PAQs that could, in theory, be applied more broadly to the general adult population.
Appraisal of study design
Using criteria described by Rennie and Wareham (43), we summarized study design characteristics that could affect the quality of PAQ validation studies. To appraise study designs, we extracted the following information from each article: mean age and body mass index (BMI; in kg/m2) of the respondents, sample size, length of DLW phase (defined as the number of days between DLW administration and the last day of urine collection), mode of PAQ administration, and the timing of each PAQ relative to its corresponding DLW phase.
Appraisal of questionnaire design
We summarized PAQ attributes to assess face validity for estimating usual AEE. We extracted information on the types and parameters of activities ascertained, time periods recalled, PAQ format, and the outcome summary measures. If this information was not provided for a particular PAQ, we consulted additional publications to complete our data collection. Physical activity duration, frequency, and intensity were classified as "self-reported parameters" only if respondents were asked explicitly to report them. Although a variety of activities were queried across PAQs, we limited our discussion to major activity types, defined as occupational, household, and leisure-time. Using criteria described previously in the literature (8, 38, 44, 45), we classified the format of each PAQ as global, recall, or quantitative.
Summary of analytic results
We quantified PAQ criterion validity in terms of the magnitude of agreement and correlation between DLW and PAQ estimates of TEE (TEEDLW, TEEPAQ) or AEE (AEEDLW, AEEPAQ). Mean differences (eg, AEEPAQ – AEEDLW) and Pearson's or Spearman's correlation coefficients (eg, AEEPAQ versus AEEDLW) were reported as group measures of agreement in kcal/d. We chose these statistics because no single measure is without limitation (46, 47) and because they are often used to estimate the validity of physical activity assessment tools (48, 49). We also reported SEM differences, P values for correlation coefficients, and 95% limits of agreement (95% LOA), where 95% LOA = mean difference [(ie, AEEPAQ – AEEDLW or TEEPAQ – TEEDLW) ± 2 x SD of the mean difference; 50)] as a measure of between-subject variability. If these statistics were not reported by authors, or were not reported as kcal/d, we derived them if enough data were provided in the article.
 |
RESULTS
|
|---|
Study design
A total of 20 publications ("DLW studies") were identified as eligible for inclusion in our review. Of these, 36 separate comparisons were described between PAQs and DLW-derived energy expenditure. Characteristics of the various study designs are summarized in Table 1
. It should be noted that Philippaerts et al (62) described several comparisons between TEEDLW and various indexes within the modified Stanford Usual Activity (Five City) Questionnaire; however, we focused on the 7-d index because its content was not limited to moderate or vigorous activities, and it attempted to estimate energy expenditure as opposed to a unitless score. Moreover, 2 publications by Lof et al (59, 60) described the same DLW study in the same respondents, with the exception of 3 women. Thus, for the purposes of this review, we referred to these publications in combination.
View this table:
[in this window]
[in a new window]
|
TABLE 1 Study design factors for comparisons made between physical activity questionnaires (PAQs) and doubly labeled water (DLW)1
|
|
Study population
In Table 1
, the largest study involved 80 women (51), and the smallest studies involved only 10 (63) or 13 (56, 57, 64) participants. Of 20 studies, 8 included both sexes (52, 56, 58, 65-67, 70, 71), 4 included only males (53-55, 62), and 8 included only females (51, 57, 59-61, 63, 64, 68, 69). Six studies were conducted in the elderly, ie, mean age >65 y (53, 56, 63, 65-67). Of all the studies, one (64) intentionally focused on obese individuals (ie, mean BMI
30), whereas 13 (51, 52, 54, 55, 58, 61, 65-71) included generally overweight subjects (ie, mean BMI of 25 to <30), but only 2 (69, 70) did so intentionally.
PAQ administration
Most studies (12 of 20) used interviewer-administered PAQs (53, 54, 56-60, 62, 63, 66, 67, 70, 71), whereas only 5 used self-administered PAQs (51, 55, 61, 68, 69). For 3 studies, the mode was not reported (52, 64, 65).
Only 4 of 36 comparisons (57, 59, 60, 62, 71) involved PAQs that covered exactly the same period of activity as the DLW phase, with the 4 periods ranging from 7 to 14 d (Table 1
). Ten of 36 comparisons (in 7 publications) (52, 53, 55, 56, 62, 67, 69) involved past year PAQs, with corresponding DLW phases ranging from 10 to 14 d. Twenty-one of the 36 comparisons presented in Table 1
(in 10 publications: 51, 53-55, 57, 59, 60, 62, 64, 68, 71) involved PAQs administered at the end of the DLW phase, when the period of recall would have included the period of DLW measurement. The timing of PAQ administration was not reported for 3 of 36 comparisons (52, 56, 58).
Questionnaire design
Within the 20 publications included in our review, we identified 23 distinct PAQs (Table 2
). Eleven of the 23 PAQs (51, 55, 58, 61, 62, 68, 69, 71, 76, 77) covered all major types of activity (ie, occupational, household, and leisure time) to varying degrees, and 3 others (18, 60, 70) included all types of activity in terms of intensity. Regarding the time period recalled, 6 of 23 PAQs inquired about activity over the past year (32, 52, 62, 69, 75, 77). One other PAQ asked about occupational activities over the past year and leisure activities over the past month (55), whereas another PAQ asked about sports and leisure activities over the past year and other activities as they are usually performed (73). Of 23 PAQs, we classified 7 as quantitative (15, 51, 55, 68, 69, 75, 77), 13 as recall (18, 32, 52, 60-63, 70-73, 76), and 2 as global (74, 78). We were unable to classify the format of one PAQ (58) because of lack of information. Fifteen of 23 PAQs were used to derive estimates of AEE or TEE (15, 18, 51, 52, 55, 60-62, 68-70, 73, 75, 77), and all were deemed to be recall or quantitative PAQs. Across all PAQs, intensity was assigned using information provided in the Compendium of Physical Activities (10, 11) or another approach (15, 18, 25, 61, 64, 73, 75, 77, 83, 84).
View this table:
[in this window]
[in a new window]
|
TABLE 2 Design of physical activity questionnaires (PAQs) as used in comparative studies with doubly labeled water (DLW)
|
|
Four PAQs [Questionnaire d'Activité Physique Saint-Etienne (QAPSE; 77), Tecumseh Community Health Study (TCHS; 62), Tecumseh Occupational Activity and past month Minnesota Leisure-Time Questionnaire (MLTQ; 55), and Tecumseh Occupational Activity and past year MLTQ (69)] asked about all major types of activities, covered activities over the past year, and provided estimates of energy expenditure.
Analytic results
To appraise criterion validity, summary statistics are presented and arranged according to PAQ outcome measure and period of recall in Table 3
. Across all 20 publications, a total of 10 comparisons were made between DLW and either a unitless PAQ score (53, 62, 63, 65) or the duration of activity derived from a PAQ (58, 71). Because the 10 PAQs involved in these comparisons estimated neither AEE nor TEE, we excluded these 10 comparisons from our appraisal, which left only 26 comparisons (16 publications) in Table 3
.
View this table:
[in this window]
[in a new window]
|
TABLE 3 Summary of results from comparison studies between physical activity questionnaires (PAQs) and doubly labeled water (DLW)1
|
|
Group level agreement
To assess the level of agreement between PAQs and DLW, we summarized the mean differences for 26 comparisons. As shown in Table 3
, some comparisons were made in terms of AEE (ie, AEEPAQ – AEEDLW), whereas others focused on TEE (ie, TEEPAQ –TEEDLW, where TEEPAQ was derived by summation; Figure 1
). Two studies compared AEEPAQ with AEEDLW and TEE PAQ with TEEDLW; however, we chose to report on only one comparison from each study: AEE in the first instance (70) and TEE in the other (68) because more data were reported on TEE in this publication. Overall, the highest percentage difference in mean values was 113% for the 7-d MLTQ (51), whereas 8 other comparisons resulted in percentage differences of
10% (55, 59, 60, 62, 64, 66, 67, 70). In terms of kcal/d, the magnitude of the mean difference (AEEPAQ – AEEDLW) ranged from –10 kcal/d (Yale PAQ; 67) to 952 kcal/d (7-d MLTQ; 51). The mean difference (TEEPAQ – TEEDLW) ranged in magnitude from 17 kcal/d (7-d Physical Activity Recall; 66) to 1589 kcal/d (Harvard/College Alumni; 61). Twelve of 26 comparisons (52, 53, 56, 57, 61, 62, 67, 68, 70) showed negative mean differences (ie, group underreporting), 13 (51, 53-55, 61, 62, 64, 66, 69) showed positive mean differences (ie, group overreporting), and one (59) showed positive and negative differences (mean difference = 130 kcal/d, n = 34; mean difference = –160 kcal/d, n = 37; 60).
In a comparison of AEEPAQ with AEEDLW, 3 separate evaluations of the past year Minnesota Leisure Time PAQ resulted in underestimation of AEE at the group level, with mean differences of –208 kcal/d (–37%) (56), –313 kcal/d (–39%) (53), and, in the third study, –487 kcal/d (–56%) and –752 kcal/d (–62%) for females and males, respectively (67). Conversely, 5 evaluations of TEEPAQ from the 7-d Physical Activity Recall questionnaire (53, 54, 61, 64, 66) showed group overreporting, with mean differences ranging from 17 kcal/d (0.7%) (66) to 989 kcal/d (31%) (54). However, the same questionnaire resulted in group underreporting in 2 (57, 70) of 3 studies (51, 57, 70) that compared this PAQ with AEEDLW.
In 8 comparisons, the direction of bias [ie, AEEPAQ – AEEDLW (51) or TEEPAQ – TEEDLW (54, 55, 59, 61)] became more positive with increasing mean AEE [ie, average of AEEPAQ and AEEDLW (51)] or TEE [ie, average of TEEPAQ and TEEDLW (54, 55, 59, 61)]. In other words, there was a positive trend between the individual differences and the means. Conversely, a slight negative trend, in which the direction of bias became more negative with increasing AEE, was reported for one comparison (70). Two other comparisons also showed negative trends (57, 67); however, in these instances the differences (AEEPAQ– AEEDLW)were plotted against AEEDLW rather than as the average of the 2 measures, which is recommended (86).
To assess validity further, we examined correlation coefficients reported for 19 comparisons in Table 3
(51, 53-56, 61, 62, 68-70). Coefficients ranged from 0.05 (7-d MLTQ in 80 middle-aged females; 51) to 0.83 (past year MLTQ in 13 elderly males and females; 56). Five coefficients were notably >0.60 (55, 56, 62, 68).
We also evaluated PAQ validity by considering mean differences and correlations, simultaneously, for 19 of the 26 comparisons in Table 3
. Of these, only 3 comparisons resulted in mean percentage differences of
10% and a correlation of
0.60 [modified Stanford Usual Activity 7-d index (62), TCHS (62), and Tecumseh Occupational Activity and past month MLTQ (55)]. All 3 comparisons were conducted in middle-aged men with BMIs ranging from values indicating normal weight to overweight, on average. Two of these PAQs—the Tecumseh Occupational Activity and past month MLTQ (55) and the TCHS PAQ (62)—also showed the greatest potential for capturing AEE in our earlier appraisal of questionnaire design (Table 2
). The former PAQ was self-administered (55), whereas the latter was administered in a face-to-face interview (62).
Individual level agreement
Inconsistent reporting precluded us from comparing individual level results across most studies and, thus, individual agreement was not summarized for this review. Only 7 of the 20 publications were explicit in reporting individual level agreement. Results were expressed as the proportions of positive and negative differences (ie, AEEPAQ – AEEDLW; 51), as the number of reporters within a certain percentage of their DLW estimate [within 10% (55, 70); TEEPAQ
5%, 5–10%, 10–20%, or >20% of TEEDLW (54)], or as tables of individual results (52, 56, 64).
We judged between-subject variability in terms of the 95% LOA for 22 of the 26 comparisons in Table 3
. The widths of 95% LOA (ie, upper limit minus lower limit) for the mean difference (AEEPAQ – AEEDLW) spanned from 817 kcal/d (past year MLTQ (56) to 4096 kcal/d (7-d MLTQ; 51). For the mean difference (TEEPAQ – TEEDLW) the widths of 95% LOA ranged from 1133 kcal/d (Tecumseh Occupational Activity and past year MLTQ; 69) to 17 948 kcal/d (Cross-Cultural Activity Participation Study 4-wk recall; 61).
 |
DISCUSSION
|
|---|
We found that the vast majority of PAQs previously compared with DLW did not have sufficient face validity for estimating usual AEE, judging from the types of activities, time period recalled, and summary measures derived from each PAQ. In a comparison of AEEPAQ with AEEDLW or of TEEPAQ with TEEDLW, the percentage difference in means was <10% in nearly one-third of the comparisons and about one-third of all correlations was >0.60. However, high correlations and small differences in means rarely coincided. Most validation study results were reported at the group level, with much less information on individual reporting. No study involved >80 subjects and many focused on overweight, female, and elderly subgroups. Most of the DLW experiments did not assess the same time period recalled in PAQs.
One common feature of the PAQs in this review was their inclusion of exercise, defined as planned, structured, and repetitive bodily movements intended to improve or maintain one or more components of physical fitness (87). In this review, however, our interest applied more broadly to AEE, or the energy expended from all exercise and nonexercise activities, both volitional and nonvolitional. We also wanted to explore PAQ validity in the context of population-based, etiologic studies of chronic disease. For this reason we were interested in "usual" AEE; in other words, relatively stable patterns of activity that, if prolonged, could contribute to disease risk.
Only 4 of the 23 PAQs in our review contained all of the basic design elements required for estimating usual AEE (Table 2
): the QAPSE PAQ, TCHS PAQ, Tecumseh Occupational Activity and past month MLTQ, and the Tecumseh Occupational Activity and past year MLTQ. In addition to queries regarding the "major" activity types, these PAQs also inquired about personal care, climbing stairs, walking, transportation, or sedentary activities. In theory, each activity could contribute to PAQ validity for estimating AEE. For instance, lower intensity activities (88) and posture during sedentary activities can influence AEE (7, 89, 90). Although all 4 PAQs inquired about sedentary activity, the Tecumseh Occupational Activity and past month MLTQ contrasted sitting versus standing activities, inquired about sleeping, and asked about "general activities" such as childcare, reading, and watching television (55). Incidentally, this PAQ also had higher criterion validity (Table 3
) and was quantitative in format. In contrast, 3 separate evaluations of the past year Minnesota Leisure Time PAQ showed underestimation of AEE by 37% to 62% at the group level (53, 56, 67). Although we are uncertain why underestimation occurred, it is possible that the omission of key activities (eg, occupational) from this PAQ contributed to its discrepancies with DLW.
Face validity is important not only when interpreting PAQ validation studies, but also in etiologic research. Recently, the issue of PAQ validity was raised (91) when significant inverse associations were found between all-cause mortality and AEE estimated from DLW (4). Self-reported stair climbing and work for pay were more likely to be reported by participants with a higher AEE. In contrast, the proportion of individuals who self-reported high-intensity exercise, walking for exercise, or walking for reasons other than for exercise did not change significantly across AEE tertiles. It was speculated that self-report errors may be to blame for the latter (91). We propose, however, that these findings may have arisen in part because these activities were not significant contributors to AEE in this population. Before discrediting self-reporting, therefore, we will need to understand the contributions of various types of activity to AEE. Only then can we determine whether or not PAQs have sufficient face validity for testing AEE-related hypotheses.
Before drawing any conclusions about PAQ validity, the quality of each validation study must be taken into account. First, the generalizability of study findings in this review was limited by tightly constrained study conditions and small sample sizes. Because the mode of PAQ administration (76, 92) and respondent characteristics (51, 70, 93) could affect PAQ validity, it may be that PAQs that appear to be valid (or not valid) under the conditions in which they were studied would perform differently in other contexts. Presumably, in some cases, small sample sizes were due to the prohibitive monetary cost of DLW experiments at the time the studies were undertaken. Regardless, the decision to "scale down" validation studies clearly comes at the expense of an inability to generalize results to other populations and provides less precise estimates of PAQ validity. Correlation coefficients, for example, become less precise when based on smaller samples because of an increased SE [ie, SE = [(1 – r2)/(n – 2)]1/2) (47, 94). Unfortunately, SEs were rarely provided in the articles, whereas P values (Ho:
= 0) were common. It should be noted that less than half of the correlation coefficients in Table 3
differed significantly from zero, but this result may have arisen simply as a consequence of smaller sample sizes, at least in some instances. Even statistically significant correlations may be imprecise if based on very few observations. Thus, all correlations in Table 3
should be interpreted with caution.
Even with sufficient sample sizes, the limitations of correlation coefficients are well documented (47, 49, 50, 94), with the literature emphasizing 2 potential pitfalls. First, correlations depend on the degree of between-subject variability in a given study population, so an acceptable correlation found in one PAQ validation study may not apply to groups with a different range of energy expenditure levels (49). Second, correlations are measures of association as opposed to agreement. A method with known systematic bias can correlate quite strongly with an unbiased reference measure (47, 49, 50), thereby masking a lack of agreement between measures. Despite these limitations, correlation coefficients have thus far been the most commonly used statistics in the PAQ validation study literature (49). We recommend, as have others (47, 49, 94), that PAQ validity not be judged solely on the basis of correlations, but rather on several statistical methods that would each compensate for the other's unique limitations.
Unfortunately, we found that important details of PAQ validation studies were sometimes omitted from the published literature, which made it difficult to generalize findings to other populations. In this review we noted missing information on the mode of PAQ administration (52, 64, 65), the BMI of validation study participants (53, 63), the timing of PAQs relative to DLW phases (52, 56, 58), and the length of DLW phase (61). Thus, authors are encouraged to be more specific when reporting on PAQ validation studies.
Bland-Altman plots, although not always presented by authors, were more informative. Some plots showed mean differences (ie AEEPAQ – AEEDLW or TEEPAQ – TEEDLW) or group level bias that increased with the level of energy expenditure, typically in the positive direction. These positive trends may imply that populations with higher levels of AEE or TEE tend to overreport their activities in PAQs. Although this tendency was apparent in some groups, it is noteworthy that not every individual with a high level of energy expenditure would overreport their activities. The group level, proportional bias we observed suggests systematic error in physical activity reporting that could be corrected, perhaps using regression calibration techniques (95), or with a modified PAQ design based on the determinants of misreporting, once they are better understood.
Alternatively, it is possible that the positive trends observed in several Bland-Altman plots were the result of random error. In a recent simulation study (49), a spurious, positive trend resulted in a Bland-Altman plot when one hypothetical unbiased measure of physical activity (M) was compared with a second, unbiased reference measure (R). The authors created the trend by simulating greater random error in measure M, thereby violating the assumption of equal variances between M and R which is inherent to Bland-Altman plots (86). This explanation seems plausible in the context of a PAQ validation study, in which DLW is a very objective, precise method with less random error than PAQs. Judging from Bland-Altman plots alone, it is unclear how much of the positive trend was attributable to random and systematic error, respectively. Regardless, this potential weakness of the Bland-Altman method could have important implications for PAQ validation research.
Across studies we observed an overall tendency to report agreement at the group level as opposed to the individual level. Individual validity cannot be inferred from group level validity (96) and in fact serves a different purpose. In this review, the mean differences across studies indicated that neither under- nor overreporting of activity was more common at the group level. This finding differs from dietary research in which underreporting is universally more common (96), which implies that different factors may be involved in misreporting of activities and diet, respectively. Because only a few articles reported results at the individual level, we were unable to evaluate fully the degree of individual level validity for many of the PAQs. However, the 95% LOA proposed by Bland and Altman (50) allowed for speculation by examining between-subject variability. The 95% LOA for the mean difference (AEEPAQ– AEEDLW) from one PAQ, for example, ranged from –464 to 645 kcal/d (Yale PAQ; 53). By definition (48, 97), this result implies that there is a 95% probability that an individual from the same population would underestimate AEE by no more than 464 kcal/d and overestimate AEE by no more than 645 kcal/d. If this level of error was not acceptable for practical purposes, the PAQ could not be used as a surrogate for AEEDLW. In fact, no 95% limit in our review was within 100 kcal/d (10% if AEE = 1000 kcal/d) of the mean difference of AEEPAQ – AEEDLW and or within 250 kcal/d (10% if TEE = 2500 kcal/d) of TEEPAQ – TEEDLW, which suggests that the PAQs in this review may be of limited use for estimating individual AEE.
A related matter for concern is the validity of DLW for individuals. At the group level, the DLW method is widely accepted as the gold standard for estimating free-living energy expenditure in adults (9, 98-100). In a review on DLW validity (101), the percentage error in TEE estimation averaged
2% or 8%, depending on the equation used to calculate DLW results. This level of error is acceptable for evaluating PAQ validity at the group level, as we have assumed. For individuals, however, the precision of DLW (ie, the SD of individual percentage errors) in the same review article (101) was 8–9%, which meant that some individual estimates will deviate substantially from the average. Furthermore, in comparisons based on AEE, DLW estimates must be derived by subtraction (ie, AEE = TEE –RMR –TEF), which could introduce more error if RMR is based on prediction equations. In a review of prediction equations (35), the proportion of healthy adults with valid predicted RMR (ie, within 10% RMR measured by calorimetry) ranged from 45% to 81% in nonobese individuals and from 38% to 70% in obese subjects. Under a worst-case scenario, therefore, a PAQ validation study using one of these equations to predict AEE would inevitably suggest disagreement, which may be wrongly attributed to the PAQ. Moreover, statistical measures of validity normally deemed "moderate" may be, under this scenario, as high as could be expected. Although an important consideration, of the 20 publications we reviewed, only 6 compared PAQs with DLW-derived AEE (10 PAQ comparisons in Table 3
; 51, 53, 56, 57, 67, 70), and only one of those used an RMR prediction equation to derive AEE from DLW (51). Thus, for 34 of the 36 PAQ-to-DLW comparisons in our review [2 PAQs in Adams et al (51)] RMR-related error of this magnitude was probably not an issue. Otherwise, it is possible that some individual differences reflected in the 95% LOA may have arisen in part from DLW-related error.
Concurrency between PAQ and DLW administration is another factor to consider when evaluating PAQ validity (43). Ideally, in validation studies, the criterion measure and PAQ should observe the same time frame of reference. In this review, however, the majority of method comparisons (32 of 36) did not cover exactly the same time periods. Free-living energy expenditure, when assessed using objective measures, is known to vary over days of the week (102), over weeks (103), and over seasons (104, 105) in adults. In one study of habitual activity, Levin et al (104) periodically measured physical activity patterns in 77 adults in the United States over 1 y. On the basis of intraindividual variability, they determined that six 48-h accelerometer sessions were needed to achieve 80% reliability in estimating mean annual physical activity in MET (metabolic equivalents)-minutes per day (104). Although analytic error does contribute to estimates of within-subject variability (102, 106), so does actual change in activity levels over time. The latter might partly explain some of the differences we observed between PAQ and DLW estimates of energy expenditure. In our review, 10 of 36 DLW comparisons involved past year PAQs and single 1- or 2-wk DLW phases. An alternative approach would have been to repeat DLW phases to coincide better with PAQs, but none of the 10 studies reported to have done this.
A final consideration for interpreting any PAQ-to-DLW comparison is the potential for error in converting self-reported physical activity into units of energy expenditure. In large epidemiologic studies, it is usually not feasible to measure energy costs of activities for each individual. Hence, the Compendium of Physical Activities (10, 11) has become a widely accepted extrapolation tool. The Compendium provides a convenient 5-digit coding scheme that can be used to classify activities according to rate of energy expenditure or METs. By definition, 1 MET is approximately equal to 1 kcalh–1 · kg body wt–1 (11). Clearly, an assumed body weight or RMR will rarely reflect that of a given individual. One important limitation of the Compendium, therefore, is the reliance on group averages that may not apply to individuals (1, 10, 11, 107, 108).
In conclusion, despite the numerous validation studies already published, the validity of PAQs for AEE estimation remains unclear. Weaknesses in the design and reporting of studies, combined with a paucity of information on the original intent of many PAQ designers, mean that it is difficult to draw any firm conclusions about the validity of existing PAQs for the assessment of usual AEE in large population-based studies. Nevertheless, our review highlights some important considerations for scrutinizing PAQ validation studies. First, there is a need to consider each PAQ's design and its expected level of agreement with DLW, which measures all activities (freely available PAQs would facilitate this; 109). Furthermore, if a PAQ is to be used to estimate individual AEE, then its validity must be supported by the appropriate statistical analyses. Results on individual level validity were generally lacking across the articles we reviewed. We speculate that some discrepancies found previously between PAQ and DLW estimates could have resulted, in part, because PAQs did not include key activities relating to AEE or, possibly, because the PAQ period of recall and DLW phase did not coincide. Issues related to small sample size, use of correlation coefficients, and conversion of self-reported activity into energy expenditure, all continue to be problematic. Future research and development efforts should address these issues to clarify the true validity of PAQs in this context.
 |
ACKNOWLEDGMENTS
|
|---|
We thank Pietro Ferrari for his helpful comments.
The authors' responsibilities were as follows—HKN: conducted the review and wrote the first draft of the manuscript; IC and PJR: helped write subsequent drafts of the manuscript; IC, PJR, and HKN: conceptualized the review and interpreted the results; and CMF: critically appraised the manuscript. None of the authors had a conflict of interest with respect to this manuscript.
 |
REFERENCES
|
|---|
- Levine JA. Non-exercise activity thermogenesis. Proc Nutr Soc 2003;62:667–79.[Medline]
- Howley ET. Type of activity: resistance, aerobic and leisure versus occupational physical activity. Med Sci Sports Exerc 2001;33(suppl):S364–9.[Medline]
- US Department of Health and Human Services. Physical activity and health: A Report of the Surgeon General. Atlanta, GA: US Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, 1996:20–2.
- Manini TM, Everhart JE, Patel KV, et al. Daily activity energy expenditure and mortality among older adults. JAMA 2006;296:171–9.[Abstract/Free Full Text]
- Luke A, Kramer H, Adeyemo A, et al. Relationship between blood pressure and physical activity assessed with stable isotopes. J Hum Hypertens 2005;19:127–32.[Medline]
- Dong L, Block G, Mandel S. Activities contributing to total energy expenditure in the United States: results from the NHAPS Study. Int J Behav Nutr Phys Act 2004;1:4.[Medline]
- Levine JA, Lanningham-Foster LM, McCrady SK, et al. Interindividual variation in posture allocation: possible role in human obesity. Science 2005;307:584–6.[Abstract/Free Full Text]
- Lamonte MJ, Ainsworth BE. Quantifying energy expenditure and physical activity in the context of dose response. Med Sci Sports Exerc 2001;33(suppl):S370–8.
- Melanson EL Jr, Freedson PS. Physical activity assessment: a review of methods. Crit Rev Food Sci Nutr 1996;36:385–96.[Medline]
- Ainsworth BE, Haskell WL, Leon AS, et al. Compendium of physical activities: classification of energy costs of human physical activities. Med Sci Sports Exerc 1993;25:71–80.
- Ainsworth BE, Haskell WL, Whitt MC, et al. Compendium of physical activities: an update of activity codes and MET intensities. Med Sci Sports Exerc 2000;32(suppl):S498–504.
- Shephard RJ. Limits to the measurement of habitual physical activity by questionnaires. Br J Sports Med 2003;37:197–206.[Abstract/Free Full Text]
- Ainsworth BE, Jacobs DR Jr, Leon AS, Richardson MT, Montoye HJ. Assessment of the accuracy of physical activity questionnaire occupational data. J Occup Med 1993;35:1017–27.[Medline]
- Bonnefoy M, Kostka T, Berthouze SE, Lacour JR. Validation of a physical activity questionnaire in the elderly. Eur J Appl Physiol Occup Physiol 1996;74:528–33.[Medline]
- Dipietro L, Caspersen CJ, Ostfeld AM, Nadel ER. A survey for assessing physical activity among older adults. Med Sci Sports Exerc 1993;25:628–42.
- Dishman RK, Steinhardt M. Reliability and concurrent validity for a 7-d re-call of physical activity in college students. Med Sci Sports Exerc 1988;20:14–25.
- Jacobs DR Jr, Ainsworth BE, Hartman TJ, Leon AS. A simultaneous evaluation of 10 commonly used physical activity questionnaires. Med Sci Sports Exerc 1993;25:81–91.
- Kohl HW, Blair SN, Paffenbarger RS Jr, Macera CA, Kronenfeld JJ. A mail survey of physical activity habits as related to measured physical fitness. Am J Epidemiol 1988;127:1228–39.[Abstract/Free Full Text]
- Lamb KL, Brodie DA. Leisure-time physical activity as an estimate of physical fitness: a validation study. J Clin Epidemiol 1991;44:41–52.[Medline]
- Roeykens J, Rogers R, Meeusen R, Magnus L, Borms J, de MK. Validity and reliability in a Flemish population of the WHO-MONICA Optional Study of Physical Activity Questionnaire. Med Sci Sports Exerc 1998;30:1071–5.
- Wareham NJ, Jakes RW, Rennie KL, Mitchell J, Hennings S, Day NE. Validity and repeatability of the EPIC-Norfolk Physical Activity Questionnaire. Int J Epidemiol 2002;31:168–74.[Abstract/Free Full Text]
- Wareham NJ, Jakes RW, Rennie KL, et al. Validity and repeatability of a simple index derived from the short physical activity questionnaire used in the European Prospective Investigation into Cancer and Nutrition (EPIC) study. Public Health Nutr 2003;6:407–13.[Medline]
- Dinger MK, Oman RF, Taylor EL, Vesely SK, Able J. Stability and convergent validity of the Physical Activity Scale for the Elderly (PASE). J Sports Med Phys Fitness 2004;44:186–92.[Medline]
- Iqbal R, Rafique G, Badruddin S, Qureshi R, Gray-Donald K. Validating MOSPA questionnaire for measuring physical activity in Pakistani women. Nutr J 2006;5:18.[Medline]
- Kriska AM, Knowler WC, LaPorte RE, et al. Development of questionnaire to examine relationship of physical activity and diabetes in Pima Indians. Diabetes Care 1990;13:401–11.[Abstract]
- Whitt MC, Levin S, Ainsworth BE, Dubose KD. Evaluation of a two-part survey item to assess moderate physical activity: the Cross-Cultural Activity Participation Study. J Women's Health 2003;12:203–12.
- Friedenreich CM, Courneya KS, Neilson HK, et al. Reliability and validity of the Past Year Total Physical Activity Questionnaire. Am J Epidemiol 2006;163:959–70.[Abstract/Free Full Text]
- Johnson-Kozlow M, Sallis JF, Gilpin EA, Rock CL, Pierce JP. Comparative validation of the IPAQ and the 7-Day PAR among women diagnosed with breast cancer. Int J Behav Nutr Phys Act 2006;3:7.[Medline]
- Bernstein MS, Costanza MC, Morabia A. Association of physical activity intensity levels with overweight and obesity in a population-based sample of adults. Prev Med 2004;38:94–104.[Medline]
- Matthews CE, Shu XO, Yang G, et al. Reproducibility and validity of the Shanghai Women's Health Study physical activity questionnaire. Am J Epidemiol 2003;158:1114–22.[Abstract/Free Full Text]
- Pols MA, Peeters PH, Bueno-De-Mesquita HB, et al. Validity and repeatability of a modified Baecke questionnaire on physical activity. Int J Epidemiol 1995;24:381–8.[Abstract/Free Full Text]
- Voorrips LE, Ravelli AC, Dongelmans PC, Deurenberg P, Van Staveren WA. A physical activity questionnaire for the elderly. Med Sci Sports Exerc 1991;23:974–9.
- Taylor-Piliae RE, Norton LC, Haskell WL, et al. Validation of a new brief physical activity survey among men and women aged 60–69 years. Am J Epidemiol 2006;164:598–606.[Abstract/Free Full Text]
- Schoeller DA. Measurement of energy expenditure in free-living humans by using doubly labeled water. J Nutr 1988;118:1278–89.[Abstract/Free Full Text]
- Frankenfield D, Roth-Yousey L, Compher C. Comparison of predictive equations for resting metabolic rate in healthy nonobese and obese adults: a systematic review. J Am Diet Assoc 2005;105:775–89.[Medline]
- Danforth E Jr. Diet and obesity. Am J Clin Nutr 1985;41:1132–45.[Free Full Text]
- Van Zant RS. Influence of diet and exercise on energy expenditure—a review. Int J Sport Nutr 1992;2:1–19.[Medline]
- Montoye HJ, Kemper HCG, Saris WHM, Washburn RA. Measuring physical activity and energy expenditure. Champaign, IL: Human Kinetics, 1996.
- Speakman JR. Principles, problems and a paradox with the measurement of energy expenditure of free-living subjects using doubly-labelled water. Stat Med 1990;9:1365–80.[Medline]
- Coward WA. Stable isotopic methods for measuring energy expenditure. The doubly-labelled-water (2H2(18)O) method: principles and practice. Proc Nutr Soc 1988;47:209–18.[Medline]
- IDECG Working Group. The doubly-labelled water method for measuring energy expenditure: a consensus report by the IDECG Working Group. In: Prentice AM, ed. Vienna, Austria: International Atomic Energy Agency, 1990.
- Jones PJ. Correction approaches for doubly labeled water in situations of changing background water abundance. Obes Res 1995;3(suppl):41–8.
- Rennie KL, Wareham NJ. The validation of physical activity instruments for measuring energy expenditure: problems and pitfalls. Public Health Nutr 1998;1:265–71.[Medline]
- LaPorte RE, Montoye HJ, Caspersen CJ. Assessment of physical activity in epidemiologic research: problems and prospects. Public Health Rep 1985;100:131–46.[Medline]
- Ainsworth BE, Coleman KJ. Physical activity measurement. In: McTiernan A, ed. Cancer prevention and management through exercise and weight control. Boca Raton, FL: CRC Press, 2006:13–23.
- Luiz RR, Szklo M. More than one statistical strategy to assess agreement of quantitative measurements may usefully be reported. J Clin Epidemiol 2005;58:215–6.[Medline]
- Bellach B. Remarks on the use of Pearson's correlation coefficient and other association measures in assessing validity and reliability of dietary assessment methods. Eur J Clin Nutr 1993;47(suppl):S42–5.
- Atkinson G, Nevill AM. Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Med 1998;26:217–38.[Medline]
- Schmidt ME, Steindorf K. Statistical methods for the validation of questionnaires–discrepancy between theory and practice. Methods Inf Med 2006;45:409–13.[Medline]
- Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;1:307–10.[Medline]
- Adams SA, Matthews CE, Ebbeling CB, et al. The effect of social desirability and social approval on self-reports of physical activity. Am J Epidemiol 2005;161:389–98.[Abstract/Free Full Text]
- Barnard JA, Tapsell LC, Davies PS, Brenninger VL, Storlien LH. Relationship of high energy expenditure and variation in dietary intake with reporting accuracy on 7 day food records and diet histories in a group of healthy adult volunteers. Eur J Clin Nutr 2002;56:358–67.[Medline]
- Bonnefoy M, Normand S, Pachiaudi C, Lacour JR, Laville M, Kostka T. Simultaneous validation of ten physical activity questionnaires in older men: a doubly labeled water study. J Am Geriatr Soc 2001;49:28–35.[Medline]
- Conway JM, Seale JL, Jacobs DR Jr, Irwin ML, Ainsworth BE. Comparison of energy expenditure estimates from doubly labeled water, a physical activity questionnaire, and physical activity records. Am J Clin Nutr 2002;75:519–25.[Abstract/Free Full Text]
- Conway JM, Irwin ML, Ainsworth BE. Estimating energy expenditure from the Minnesota Leisure Time Physical Activity and Tecumseh Occupational Activity questionnaires—a doubly labeled water validation. J Clin Epidemiol 2002;55:392–9.[Medline]
- Goran MI, Poehlman ET. Total energy expenditure and energy requirements in healthy elderly persons. Metabolism 1992;41:744–53.[Medline]
- Leenders NY, Sherman WM, Nagaraja HN, Kien CL. Evaluation of methods to assess physical activity in free-living conditions. Med Sci Sports Exerc 2001;33:1233–40.
- Livingstone MB, Strain JJ, Prentice AM, et al. Potential contribution of leisure activity to the energy expenditure patterns of sedentary populations. Br J Nutr 1991;65:145–55.[Medline]
- Lof M, Hannestad U, Forsum E. Comparison of commonly used procedures, including the doubly-labelled water technique, in the estimation of total energy expenditure of women with special reference to the significance of body fatness. Br J Nutr 2003;90:961–8.[Medline]
- Lof M, Forsum E. Validation of energy intake by dietary recall against different methods to assess energy expenditure. J Hum Nutr Diet 2004;17:471–80.[Medline]
- Mahabir S, Baer DJ, Giffen C, et al. Comparison of energy expenditure estimates from 4 physical activity questionnaires with doubly labeled water estimates in postmenopausal women. Am J Clin Nutr 2006;84:230–6.[Abstract/Free Full Text]
- Philippaerts RM, Westerterp KR, Lefevre J. Doubly labelled water validation of three physical activity questionnaires. Int J Sports Med 1999;20:284–9.[Medline]
- Reilly JJ, Lord A, Bunker VW, et al. Energy balance in healthy elderly women. Br J Nutr 1993;69:21–7.[Medline]
- Racette SB, Schoeller DA, Kushner RF. Comparison of heart rate and physical activity recall with doubly labeled water in obese women. Med Sci Sports Exerc 1995;27:126–33.
- Schuit AJ, Schouten EG, Westerterp KR, Saris WH. Validity of the Physical Activity Scale for the Elderly (PASE): according to energy expenditure assessed by the doubly labeled water method. J Clin Epidemiol 1997;50:541–6.[Medline]
- Seale JL, Klein G, Friedmann J, Jensen GL, Mitchell DC, Smiciklas-Wright H. Energy expenditure measured by doubly labeled water, activity recall, and diet records in the rural elderly. Nutrition 2002;18:568–73.[Medline]
- Starling RD, Matthews DE, Ades PA, Poehlman ET. Assessment of physical activity in older individuals: a doubly labeled water study. J Appl Physiol 1999;86:2090–6.[Abstract/Free Full Text]
- Staten LK, Taren DL, Howell WH, et al. Validation of the Arizona Activity Frequency Questionnaire using doubly labeled water. Med Sci Sports Exerc 2001;33:1959–67.
- Walsh MC, Hunter GR, Sirikul B, Gower BA. Comparison of self-reported with objectively assessed energy expenditure in black and white women before and after weight loss. Am J Clin Nutr 2004;79:1013–9.[Abstract/Free Full Text]
- Washburn RA, Jacobsen DJ, Sonko BJ, Hill JO, Donnelly JE. The validity of the Stanford Seven-Day Physical Activity Recall in young adults. Med Sci Sports Exerc 2003;35:1374–80.
- Yao M, McCrory MA, Ma G, Li Y, Dolnikowski GG, Roberts SB. Energy requirements of urban Chinese adults with manual or sedentary occupations, determined using the doubly labeled water method. Eur J Clin Nutr 2002;56:575–84.[Medline]
- Baecke JA, Burema J, Frijters JE. A short questionnaire for the measurement of habitual physical activity in epidemiological studies. Am J Clin Nutr 1982;36:936–42.[Abstract/Free Full Text]
- Paffenbarger RS Jr, Wing AL, Hyde RT. Physical activity as an index of heart attack risk in college alumni. Am J Epidemiol 1978;108:161–75.[Abstract/Free Full Text]
- Ainsworth BE, Jacobs DR Jr, Leon AS. Validity and reliability of self-reported physical activity status: the Lipid Research Clinics questionnaire. Med Sci Sports Exerc 1993;25:92–8.
- Taylor HL, Jacobs DR Jr, Schucker B, Knudsen J, Leon AS, Debacker G. A questionnaire for the assessment of leisure time physical activities. J Chronic Dis 1978;31:741–55.[Medline]
- Washburn RA, Smith KW, Jette AM, Janney CA. The Physical Activity Scale for the Elderly (PASE): development and evaluation. J Clin Epidemiol 1993;46:153–62.[Medline]
- Berthouze SE, Minaire PM, Chatard JC, Boutet C, Castells J, Lacour JR. A new tool for evaluating energy expenditure: the "QAPSE" development and validation. Med Sci Sports Exerc 1993;25:1405–14.
- Sallis JF, Haskell WL, Wood PD, et al. Physical activity assessment methodology in the Five-City Project. Am J Epidemiol 1985;121:91–106.[Abstract/Free Full Text]
- Ainsworth BE, Irwin ML, Addy CL, Whitt MC, Stolarczyk LM. Moderate physical activity patterns of minority women: the Cross-Cultural Activity Participation Study. J Women's Health Gend Based Med 1999;8:805–13.[Medline]
- Dallosso HM, Morgan K, Bassey EJ, Ebrahim SB, Fentem PH, Arie TH. Levels of customary physical activity among the old and the very old living at home. J Epidemiol Community Health 1988;42:121–7.[Abstract/Free Full Text]
- Kriska AM, Caspersen CJ. A collection of physical activity questionnaires for health-related research. Med Sci Sports Exerc 1997;29(suppl).
- Lof M, Hannestad U, Forsum E. Assessing physical activity of women of childbearing age. Ongoing work to develop and evaluate simple methods. Food Nutr Bull 2002;23:30–3.[Medline]
- Reiff GG, Montoye HJ, Remington RD, Napier JA, Metzner HL, Epstein FH. Assessment of physical activity by questionnaire and interview. J Sports Med Phys Fitness 1967;7:135–42.[Medline]
- Blair SN, Haskell WL, Ho P, et al. Assessment of habitual physical activity by a seven-day recall in a community survey and controlled experiments. Am J Epidemiol 1985;122:794–804.[Abstract/Free Full Text]
- Hebert JR, Ebbeling CB, Matthews CE, et al. Systematic errors in middle-aged women's estimates of energy intake: comparing three self-report measures to total energy expenditure from doubly labeled water. Ann Epidemiol 2002;12:577–86.[Medline]
- Bland JM, Altman DG. Comparing methods of measurement: why plotting difference against standard method is misleading. Lancet 1995;346:1085–7.[Medline]
- Caspersen CJ, Powell KE, Christenson GM. Physical activity, exercise, and physical fitness: definitions and distinctions for health-related research. Public Health Rep 1985;100:126–31.[Medline]
- Westerterp KR. Pattern and intensity of physical activity. Nature 2001;410:539.
- Institute of Medicine of the National Academies. Dietary reference intakes for energy, carbohydrate, fiber, fat, fatty acids, cholesterol, protein and amino acids. Washington, DC: The National Academies Press, 2005:107–264.
- Kotz CM, Levine JA. Role of nonexercise activity thermogenesis (NEAT) in obesity. Minn Med 2005;88:54–7.[Medline]
- Blair SN, Haskell WL. Objectively measured physical activity and mortality in older adults. JAMA 2006;296:216–8.[Free Full Text]
- Vuillemin A, Oppert JM, Guillemin F, et al. Self-administered questionnaire compared with interview to assess past-year physical activity. Med Sci Sports Exerc 2000;32:1119–24.
- Irwin ML, Ainsworth BE, Conway JM. Estimation of energy expenditure from physical activity measures: determinants of accuracy. Obes Res 2001;9:517–25.[Medline]
- Hebert JR, Miller DR. The inappropriateness of conventional use of the correlation coefficient in assessing validity and reliability of dietary assessment methods. Eur J Epidemiol 1991;7:339–43.[Medline]
- Rosner B, Willett WC, Spiegelman D. Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. Stat Med 1989;8:1051–69.[Medline]
- Livingstone MB, Black AE. Markers of the validity of reported energy intake. J Nutr 2003;133(suppl):895S–920S.[Abstract/Free Full Text]
- Bland JM, Altman DG. Applying the right statistics: analyses of measurement studies. Ultrasound Obstet Gynecol 2003;22:85–93.[Medline]
- Ainslie PN, Reilly T. Physiology of accidental hypothermia in the mountains: a forgotten story. Br J Sports Med 2003;37:548–50.[Abstract/Free Full Text]
- Schoeller DA. Recent advances from application of doubly labeled water to measurement of human energy expenditure. J Nutr 1999;129:1765–8.[Abstract/Free Full Text]
- Vanhees L, Lefevre J, Philippaerts R, et al. How to assess physical activity? How to assess physical fitness? Eur J Cardiovasc Prev Rehabil 2005;12:102–14.[Medline]
- Speakman JR. The history and theory of the doubly labeled water technique. Am J Clin Nutr 1998;68(suppl):932S–8S.[Abstract]
- Gretebeck RJ, Montoye HJ. Variability of some objective measures of physical activity. Med Sci Sports Exerc 1992;24:1167–72.
- Black AE, Cole TJ. Within- and between-subject variation in energy expenditure measured by the doubly-labelled water technique: implications for validating reported dietary energy intake. Eur J Clin Nutr 2000;54:386–94.[Medline]
- Levin S, Jacobs DR Jr, Ainsworth BE, Richardson MT, Leon AS. Intra-individual variation and estimates of usual physical activity. Ann Epidemiol 1999;9:481–8.[Medline]
- Plasqui G, Westerterp KR. Seasonal variation in total energy expenditure and physical activity in Dutch young adults. Obes Res 2004;12:688–94.[Medline]
- Schoeller DA, Taylor PB, Shay K. Analytic requirements for the doubly labeled water method. Obes Res 1995;3(suppl)1:15–20.
- Bassett DR Jr, Ainsworth BE, Swartz AM, Strath SJ, O'Brien WL, King GA. Validity of four motion sensors in measuring moderate intensity physical activity. Med Sci Sports Exerc 2000;32(suppl):S471–80.
- Hendelman D, Miller K, Baggett C, Debold E, Freedson P. Validity of accelerometry for the assessment of moderate intensity physical activity in the field. Med Sci Sports Exerc 2000;32(suppl):S442–9.
- Schilling LM, Kozak K, Lundahl K, Dellavalle RP. Inaccessible novel questionnaires in published medical research: hidden methods, hidden costs. Am J Epidemiol 2006;164:1141–4.[Abstract/Free Full Text]
Received for publication May 2, 2007.
Accepted for publication July 26, 2007.
This article has been cited by other articles:

|
 |

|
 |
 
C. M. Ulrich and R. S. Holmes
Shedding Light on Colorectal Cancer Prognosis: Vitamin D and Beyond
J. Clin. Oncol.,
June 20, 2008;
26(18):
2937 - 2939.
[Full Text]
[PDF]
|
 |
|