Orthogonal polynomial contrasts and trends
See the National Center for Health Statistics Guidelines for Analysis of Trends for more information.
Confidence Intervals
Korn and Graubard confidence intervals, along with confidence interval widths, sample size, and degrees of freedom are standards for determining the reliability of estimated proportions. For more information, see the Reliability of Estimates module. Other information on Korn and Graubard confidence intervals for proportions can be found in the National Center for Health Statistics Data Presentation Standards for Proportions.
Code included for:
- Age-standardization
- Population counts or estimates
Specific notes related to NHANES data analysis:
Age-Adjusted Prevalence Rates and Means
Age standardization, sometimes referred to as age adjustment, is a method that applies observed age-specific rates to a standard age distribution. The method adjusts for the confounding effect of age. Standard age proportions are calculated by dividing the age-specific Census population (P) by the total Census population number (T). The standardizing proportions (P/T) should sum to 1.
There are two steps:
- Choose a standard population. The example code provided uses the 2000 Census data.
- The age-specific prevalence from the study population is multiplied by the proportion of people in that age group in the standard population, and results summed up to get the age-adjusted estimates.
Example:
Standard Proportions for 20-year Age Groups Based on the 2000 U.S. Census Standard Population
Age Group |
Age-Specific Census Population (in thousands) |
Total Census Population (in thousands) |
Standard Age Proportions |
|
P |
T |
P/T |
20-39 |
77,670 |
195,850 |
.396579 |
40-59 |
72,816 |
195,850 |
.371795 |
60+ |
45,364 |
195,850 |
.231626 |
Total: |
195,850 |
Sum: |
1 |
IMPORTANT NOTE
See source census data (Table 2 in report). Age groups can be combined to reflect the age range of the population used in the specific analysis.
Population counts
The non-institutionalized population totals are used to calculate the final sample weights for the NHANES survey. However, it is NOT RECOMMENDED to use the sum of the final sample weights for sample persons with the health condition of interest in order to calculate population estimates of, or number of people with, the health condition. This is because, if there are exclusions or missing data for a health condition, summing the weights will underestimate the population estimate. Consequently, the steps below are recommended for calculating the population count or number of people with a given condition from NHANES:
- Calculate the crude percentage who have the outcome or characteristic overall and by subgroups of interest.
- Obtain the relevant population totals for the NHANES survey cycle(s) being used.
- Combine population totals (if desired). Ages, sexes, or race/Hispanic origin subgroups can be combined. It also is possible to combine NHANES survey cycles. For example, to combine two survey cycles (e.g., 2015-2016 and 2017-2018), the midpoint of each cycle is used:
½ (NHANES 2015-2016 population totals) + ½ (NHANES 2017-2018 population totals)
.
- In the last step, the prevalence estimates are multiplied with corresponding population totals to estimate the total number of civilian, non-institutionalized U.S. residents affected with the health condition. Percent prevalence estimates as well as lower and upper 95% confidence limits will be multiplied to the corresponding population total for that subgroup. Confidence intervals for the population estimates of those affected with the health condition are estimated using Wald, and Korn and Graubard methods. To calculate age-, sex-, or race/ethnicity- specific population estimates, multiply the prevalence of the health condition in each sub-domain by the population total for the respective sub-domain.
IMPORTANT NOTE
Age standardization of the prevalence estimates is NOT performed because the population counts should be based on the crude (unadjusted) prevalence in the population.