Table of Contents

Component Description

The National Health and Nutrition Examination Survey (NHANES) conducts interviews and health examinations on a nationally representative sample of approximately 5,000 people each year. NHANES operated continuously from 1999 until March 2020, when field operations were suspended due to the coronavirus disease 2019 (COVID-19) pandemic. Because data collection for the 2019–2020 cycle was interrupted, the resulting data from 2019-March 2020 was not nationally representative. Therefore, data from the partially completed 2019-2020 cycle were combined with the full 2017-2018 data to create a nationally representative 2017-March 2020 pre-pandemic data file. These data are included in the present dataset.

During each complete 2-year cycle through 2017-2018, NHANES visited 15 locations each year. No information about geographic location was released to protect the identification of NHANES participants. State and county information could be obtained through the NCHS Research Data Center (RDC). It is desirable, however, to use a finer level of geography to spatially analyze the NHANES data and to answer various important research questions about the effect of geography on health.

Therefore, National Center for Health Statistics (NCHS) entered into an agreement with U.S. Department of Housing and Urban Development (HUD) to use its geocoding services to assign geographic codes to the NHANES address data. HUD geocoded the 1999-March 2020 NHANES data to the 2020 census.

Eligible Sample

All survey participants have a geocoded record.

Interview Setting and Mode of Administration

Addresses were collected for each household with at least one survey participant by trained interviewers using the Computer-Assisted Personal Interview (CAPI) system in the participant’s home or by telephone.

Quality Assurance & Quality Control

After geocoding the NHANES data, HUD provides QC documentation (see Appendix 1).

Data Processing and Editing

The NHANES participant address data were submitted to HUD and, geo-enabled with latitude/longitude coordinates and other legal, statistical, and administrative geographies including census, postal, and other attributes, using the HUD Geocode Service Center (GSC) system.

Analytic Notes

Due to disclosure concerns, this dataset is not available publicly. Access may be obtained through the NCHS Research Data Center.

Only 30 locations are included in each 2-year NHANES cycle, analysts are urged to combine cycles to achieve greater geographic coverage and to assess if additional cycles are needed to achieve adequate statistical power. Generally, obtaining stable estimates for rare outcomes is not possible for sub-national geographic areas.

Please note the Census year for the geocoded file. Geographic linkage is done using the most recent decennial Census data available. Although most analyses are likely to use the most recent data available, some analysts may choose geocoded data linked to older Census data. The documentation accompanying the file should be carefully reviewed because Census variable names differ between geographic linkage file versions.

Codebook and Frequencies

SEQN - Respondent sequence number

Variable Name:
SEQN
SAS Label:
Respondent sequence number
English Text:
Respondent sequence number.
Target:
Both males and females 0 YEARS - 150 YEARS

RC2KY - Census 2020 Geocoder General Return Code

Variable Name:
RC2KY
SAS Label:
Census 2020 Geocoder General Return Code
English Text:
Reference Census 2020 Geocoder RCs (Return Codes) tab
Target:
Both males and females 0 YEARS - 150 YEARS

STM2KY - Census 2020 Geocoder Street Matcher RC

Variable Name:
STM2KY
SAS Label:
Census 2020 Geocoder Street Matcher RC
English Text:
Census 2020 Geocoder Street Matcher Return Code
Target:
Both males and females 0 YEARS - 150 YEARS

LVL2KY - Census 2020 Geocoder LAT/LONG Geocoding

Variable Name:
LVL2KY
SAS Label:
Census 2020 Geocoder LAT/LONG Geocoding
English Text:
Reference Census 2020 Geocoder RCs (Return Codes) tab
Target:
Both males and females 0 YEARS - 150 YEARS

LAT - Latitude (Decimal)

Variable Name:
LAT
SAS Label:
Latitude (Decimal)
English Text:
Latitude in decimal format with up to 6 decimal precision
Target:
Both males and females 0 YEARS - 150 YEARS

LON - Longitude (Decimal)

Variable Name:
LON
SAS Label:
Longitude (Decimal)
English Text:
Longitude in decimal format with up to 6 decimal precision
Target:
Both males and females 0 YEARS - 150 YEARS

STATE2KY - Census 2020 FIPS State Code

Variable Name:
STATE2KY
SAS Label:
Census 2020 FIPS State Code
English Text:
Census 2020 FIPS State Code (2-digit numeric with leading zeros significant)
Target:
Both males and females 0 YEARS - 150 YEARS

CNTY2KY - Census 2020 FIPS County Code

Variable Name:
CNTY2KY
SAS Label:
Census 2020 FIPS County Code
English Text:
Census 2020 FIPS County Code (3-digit numeric with leading zeros significant)
Target:
Both males and females 0 YEARS - 150 YEARS

CBSA - Core Based Statistical Area (CBSA)

Variable Name:
CBSA
SAS Label:
Core Based Statistical Area (CBSA)
English Text:
CBSA Lowest Level Code. Contains the first of: Metropolitan Division; Micropolitan Area; Metropolitan Area; in this order
Target:
Both males and females 0 YEARS - 150 YEARS

UR2KY - Urban/Rural Indicator

Variable Name:
UR2KY
SAS Label:
Urban/Rural Indicator
English Text:
Urban/Rural Indicator (U = Urban, R = Rural, blank = unknown) [TIGER/Line RcdType 'S'; UR value]
Target:
Both males and females 0 YEARS - 150 YEARS

TRACT2KY - Census 2020 Tract

Variable Name:
TRACT2KY
SAS Label:
Census 2020 Tract
English Text:
Census 2020 Tract (contains leader zeros with the decimal point implied)
Target:
Both males and females 0 YEARS - 150 YEARS

BG2KY - Census 2020 Block Group

Variable Name:
BG2KY
SAS Label:
Census 2020 Block Group
English Text:
Census 2020 Block Group
Target:
Both males and females 0 YEARS - 150 YEARS

BLOCK2KY - Census 2020 Block ID

Variable Name:
BLOCK2KY
SAS Label:
Census 2020 Block ID
English Text:
Census 2020 Block ID - first character is the Census Block Group
Target:
Both males and females 0 YEARS - 150 YEARS

Appendix 1. 1999-March 2020 GCP Match Census 2020

CENSUS 2020 GEOCODER GENERAL RETURN CODE
RC2KY Frequency Percent Cumulative Frequency Cumulative Percent
35 0.03 35 0.03
5 2301 2.14 2336 2.17
9 1956 1.82 4292 3.99
S 103330 96.01 107622 100

 

CENSUS 2020 GEOCODER LAT/LONG GEOCODING LEVEL RETURN CODE
LVL2KY Frequency Percent Cumulative Frequency Cumulative Percent
35 0.03 35 0.03
4 1102 1.02 1137 1.06
B 854 0.79 1991 1.85
R 103330 96.01 105321 97.86
T 2301 2.14 107622 100