Technical Info

Important Note

There are three breaks in the NEISS-Work data.
  1. 2011 to 2012 is a series break. Comparisons of injury events and sources should not be made between years that cross this break.
    • 1998-2011 data use OIICS v.1.01 for event and source codes.
    • 2012 to present data use OIICS v.2.01 for event and source codes.
  2. 2014 to 2015 is a series break related to inclusion/exclusion of most work-related illnesses. Caution should be taken when comparing estimates of total cases between years that cross this break.
    • 1998-2014 NEISS-Work data include illnesses that began at work (e.g., heart attacks and strokes), chronic conditions, and exacerbations of long-term injuries. These cases, classified as "illnesses," represent 5-10% of total annual case estimates.
    • 2015-onward data do not capture most illnesses. See case exclusions for details.
  3. 2017 to 2018 is a break related to the available detail for event, source, and secondary source.
    • Event, source, and secondary source for 2018 forward will only be available at one- and two-digit code levels.
    • This change is due to the use of a machine-learning model to assign two-digit event and source codes to each case based on OIICS v.2.01.

Data Source, Design, & Case Definition

Estimates & Errors

Data Elements & Reports

Data source

Work-RISQS is an online data access system for NIOSH surveillance data on nonfatal work-related injuries treated in U.S. hospital emergency departments (EDs). NIOSH collaborates with the U.S. Consumer Product Safety Commission (CPSC) to collect these data through an occupational supplement to the CPSC National Electronic Injury Surveillance System (NEISS) that does not require consumer product involvement. This occupational supplement is known as NEISS-Work.

 Top of Page

Sample design

NEISS-Work is a national stratified probability sample of hospitals in the U.S. and its territories that have a minimum of six beds and operate a 24-hour emergency department (ED). Hospitals in the sample were selected from the approximately 5,300 rural and urban U.S. hospitals after stratification by total annual ED visits. The occupational injury hospital sample is a 2/3 subset of the hospital sample used by CPSC to capture product-related injuries. Nominally, 67 geographically distributed sample hospitals capture work-related injuries every day of the year. Hospitals abstractors capture all treated cases that are identifiable as work-related, and code case information from the medical record.

Each case is assigned a statistical weight based on the inverse probability of selection. National estimates are obtained by summing weights for all cases or the selected set of cases. Statistical weights are adjusted within a sample year to account for hospital mergers, closings, or withdrawals from NEISS-Work (resulting in fewer than 67 hospitals reporting information), and for incomplete reporting. Statistical weights are adjusted annually based on the number of U.S. hospitals and their total number of ED visits as determined by a census of U.S. hospitals one year prior to the data year.

 Top of Page

Case definition

NEISS-Work cases used for Work-RISQS estimates involve nonfatal work-related injuries among civilian, non-institutionalized workers, treated in the emergency departments of participating hospitals. Workers are not restricted by age, employer size, or type of employer or industry. An injury is considered work-related if the patient was working for pay or other compensation, performing agricultural production activities, or volunteering with an organized group (e.g., a volunteer fire department).

 Top of Page

Case exclusions

The following types of cases are excluded from NEISS-Work data:

  • Injuries to active-duty military (Air Force, Army, Coast Guard, Marines, National Guard, Navy, or Reserve Officers' Training Corps [ROTC])
  • Injuries of institutionalized persons, such as those in prisons or psychiatric institutions
  • From 2015 onwards, most illness are excluded. Illnesses that remain in the data (e.g., dermatitis and asthma exacerbations) have documented links to work-related exposures. Prior to 2015, most illnesses that happened at work (excluding common illnesses such as colds and flus) were included.
  • Drug and alcohol screenings
  • Revisits to EDs for injuries previously treated in EDs
 Top of Page

Case identification

Hospital abstractors identify work-related cases from admissions information and emergency department chart review. NEISS-Work does not require a workers' compensation claim for inclusion.

 Top of Page

Injury estimates

NEISS-Work is designed to produce national estimates and is not suitable for regional, state, or local injury estimates. Extrapolating the number of cases seen in the approximately 67 hospitals by using the statistical weight of each case produces a national estimate. The statistical weight varies depending upon the size of the hospital and the number of patients typically treated in its ED. Each case captured in a sample hospital may represent from 20 to more than 100 cases seen in other U.S. hospitals. Summing the weights for similar cases gives a national estimate for all injuries, or for a specific demographic group, type of injury, or injury circumstance. In Work-RISQS, national estimates are expressed in thousands of injuries for a given year. For example, in 1998 there were an estimated 3,559.5 thousand (3,559,500) occupational injuries and illnesses treated in U.S. hospital emergency departments.

 Top of Page

Injury estimate confidence interval

Because the NEISS-Work system is a sample of hospitals and does not count every injury treated in every hospital, the national estimates may be in error. However, because this sample was statistically selected, Work-RISQS can calculate the potential error. Error estimates are based on the 95% confidence interval and are expressed as a value that should be subtracted from the estimate to get the lower confidence bound and added to it to get the upper bound. For example, in 1998, there were an estimated 3,559.5 thousand work-related injuries and illnesses with a 95% confidence interval of ±662.7 thousand cases. In other words, the true value from this system is likely to be in the range of 2,896,800 to 4,222,200 injuries and illnesses, or about three to four million nonfatal occupational injuries and illnesses treated in U.S. emergency departments in 1998.

The 95% confidence interval is an approximation based on the classical formula for variance of a total from a stratified sample. The confidence bounds are approximations of the general magnitude of error regarding estimates, and are not precise values.

The variance estimating formula currently used by Work-RISQS is:

contact us for accessible equations

Where:

  • contact us for accessible equations = Number of strata in the sample during the given time period
  • contact us for accessible equations = Number of hospitals in the sampling frame for stratum h
  • contact us for accessible equations = Number of hospitals selected for the sample for stratum h
  • contact us for accessible equations = Number of in-scope hospitals in the sample for stratum h
  • contact us for accessible equations = Number of hospitals participating in stratum h for the given time period
  • contact us for accessible equations = Number of occupational cases for a specific type of injury reported by hospital i in stratum h for the given time period
  • contact us for accessible equations = Weight of hospital i in stratum h
  • contact us for accessible equations
  • contact us for accessible equations

The 95% confidence interval, expressed as an amount to be subtracted or added to the national estimate, is then calculated as:

contact us for accessible equations

Where:

  • contact us for accessible equations = Variance of the injury estimate x
  • contact us for accessible equations = Injury estimate

To minimize issues with negative variances, contact us for accessible equations is calculated as a fractional contact us for accessible equations value where contact us for accessible equations equals the sum of the number of months that all hospitals within a strata reported during the calendar period divided by 12 months.

The confidence interval does not account for biases (nonsampling error) in the estimates that arise from the way in which data are collected or defined, the ability to identify all work-related cases, or mistakes in data collection or coding.

 Top of Page

Worker population

Work-RISQS numbers are derived from the NEISS-Work population: civilian, non-institutionalized workers of all ages and all employer or industry types and sizes.

To calculate a meaningful rate, the estimate for the selected worker population (denominator) should be the best possible match available for the Work-RISQS estimate (numerator). Users can obtain employed worker population estimates from the Employed Labor Force (ELF) query system. See How to Calculate Rates in the Work-RISQS Help section for more information about options for selecting a population in ELF.

Certain volunteer workers are included in the Work-RISQS injury counts. This creates an insignificantly small bias in rate estimates when the denominator population does not include volunteers.

 Top of Page

Treatment year

Definition: Calendar year in which the patient first received medical treatment in an emergency department for the injury.

Users may query injuries according to the date of first treatment in an ED, for one calendar year at a time, beginning with 1998. Users cannot query multiple years simultaneously.

Note that data years 1998-2011 use BLS Occupational Injury and Illness Classification System (OIICS) v1.01 for Event, Source, and Secondary Source parameter selection in the Incident Characteristics section, and data years 2012 and onward use OIICS v. 2.01. The query system defaults to the use of the appropriate version based on the year selected.

 Top of Page

Treatment month

Definition: Calendar month or quarter of year in which the patient first received medical treatment in an emergency department.

Users may select one or more months or quarters for queries. The months or quarters do not need to be consecutive. Injury estimates are based on the actual number of cases treated within the selected calendar month(s).

 Top of Page

Age group

Definition: Inclusive range of ages of patients (in years).

Injured workers are grouped by age ranges (primarily five-year ranges). Users may select one or more age groups. Age is unknown for a small proportion of cases. There are no age restrictions for cases captured in NEISS-Work.

 Top of Page

Sex

Data for 2021 onward:

Definition: The distinction between male, female, and non-binary/other.

A worker's sex is reported in hospital data as “Male,” “Female,” “Non-binary/other,” or “Not stated.” In any given year's data, the number of cases for which sex is “Not stated” is very small (typically < 0.01% of all cases). Sex parameters for queries are All, Male, and Female; users may select only one per query.

Data prior to 2021:

Definition: The distinction between male, and female.

A worker's sex is reported in hospital data as “Male,” “Female,” or “Not stated.” In any given year's data, the number of cases for which sex is “Not stated” is very small (typically < 0.01% of all cases). Sex parameters for queries are All, Male, and Female; users may select only one per query.

 Top of Page

Race and Ethnicity

Data for 2019 onward:

Beginning with the 2019 data, race and ethnicity have been separated into two variables. While this should be considered a break in series, it is possible to combine race and ethnicity to create groupings like those used prior to 2019.

Definitions:

  • Race classifies people with similar biological, social, and cultural heritage into racial groups (White, Black, American Indian/Alaska Native, Asian/Pacific Islander, and Other) according to information in the ED chart.
  • Ethnicity is coded as Hispanic when the ED record indicates the patient is Hispanic, Latino/Latina, or of Spanish origin. Hispanic Ethnicity includes people of Cuban, Mexican, Puerto Rican, South or Central American, or other Spanish culture or origin regardless of race.

Race is collected in two parts in the data. In the first part, hospital abstractors select "White," "Black/African American," "Asian," "American Indian/Alaskan Native," "Native Hawaiian/Pacific Islander," "Other," or "Not stated" for race. The second part is a free text field, where abstractors may specifically describe races coded as "Other." Race and ethnicity are not always uniformly reported because NEISS-Work hospitals do not collect race and ethnicity data the same way, if they collect or report it at all.

WARNING: Because of the outlined problems and the fact that race is missing for approximately 40% of cases and ethnicity is missing for approximately 37% of cases, Work-RISQS estimates for specific racial or ethnic worker populations should be used with caution.

Data prior to 2019:

For data prior to 2019, race and ethnicity were combined into one variable.

Definition:

  • The combined race/ethnicity variable classifies people with similar biological, social, and cultural heritage into racial groups (White [non-Hispanic], Black [non-Hispanic], American Indian/Alaska Native, and Asian/Pacific Islander) and one ethnic group (Hispanic), according to information in the ED chart.

Race and ethnicity are collected in two parts in the original data. In the first part, hospital abstractors select "White," "Black/African American," "Asian," "American Indian/Alaskan Native," "Native Hawaiian/Pacific Islander," "Other," or "Not stated" for race. The second part is a free text field, where abstractors may more specifically describe races coded as "Other" or note ethnicity (such as "Hispanic" or "Latina"). Race and ethnicity are not always uniformly reported because NEISS-Work hospitals do not collect race and ethnicity data the same way, if they collect or report it at all, and because of the manner in which race and ethnicity are captured in the NEISS-Work data. For example, a Hispanic patient might be indicated in NEISS-Work as: (1) "White" or "Black" with a blank free text field (only race reported); (2) "White" or "Black" with "Hispanic" entered in the free text field (both race and ethnicity reported); or (3) as "Other" with "Hispanic" in the free text field (only ethnicity reported).

Work-RISQS recodes race and ethnicity under a single combined data structure, designated Race/ethnicity. Within this structure, Hispanic ethnicity is indicated preferentially over either White or Black race, so that workers identified in hospital records as "White" or "Black" and "Hispanic" or "Latino/Latina" are recoded as Hispanic.

WARNING: Because of the outlined problems and the fact that race and ethnicity information is missing for approximately 37% of cases, Work-RISQS estimates for specific racial or ethnic worker populations should be used with caution.
 Top of Page

Diagnosis

Definition: The nature of the most severe injury as described by the attending physician or other medical staff. If a medical diagnosis is missing from the ED record, the Diagnosis is typically the patient’s chief complaint.

The diagnosis categories used by NEISS-Work hospital abstracters have been condensed into 25 categories for Work-RISQS queries. Categories with few cases (e.g., “Burn: electrical” and “Burn: not specified”) have been combined into a larger reportable group (e.g., Burn: other). Diagnoses of "Ingested foreign object," "Aspirated foreign object," "Dental injury," "Other," and "Submersion" have been combined in Other, n.e.c.

The coding of specific injury types may not always be uniform because of ambiguous terminology and multiple diagnoses listed in the ED record. For example, two workers in different cities may strain their arms while lifting crates. One ED chart may note this as “patient strained arm,” coded as Strain or sprain, while another chart may describe the case only as "patient presented with arm pain,” coded as Other, n.e.c. To the extent it is possible to do so, these types of issues are minimized in the data.

Definitions of individual diagnoses typically follow standard medical usage. Anoxia, “Aspirated foreign object,” “Ingested foreign object,” and Poisoning have specific definitions to help distinguish particular injuries:

  • Anoxia is used when the patient cannot obtain sufficient oxygen, due to hampered breathing or lack of oxygen in the breathable atmosphere. For example, this category is used when the diagnosis is strangulation, suffocation, or asphyxia; or when the patient has inhaled products of combustion such as carbon monoxide (CO), smoke, or soot.
  • “Aspirated foreign object” is used when an insoluble object causes choking; or is caught in the nose, lungs, or airway between nose and lungs; and the object does not lead to poisoning or anoxia. The esophagus is not considered part of the airway.
  • “Ingested foreign object” is used when a patient swallows an insoluble, solid object that is not likely to cause poisoning.
  • Poisoning is used when a patient swallows liquid or soluble chemicals, drugs, or medications; inhales vapors, fumes, or gases (other than carbon monoxide, smoke, or other products of combustion, in which case the Diagnosis is Anoxia instead of Poisoning); or reacts systemically to a product absorbed through the skin.
 Top of Page

Part of body affected

Definition: The region or part of the body most seriously harmed.

The part of the body primarily or most severely affected by the injury is the one coded. However, the injured body part may overlap two coding categories, such as a laceration to a finger and palm, which one hospital abstractor may list as “Finger” and another as “Hand.” Similarly, an injury to the eye may be coded as “Eyeball” or “Face” depending upon the information provided and the wording in the ED chart. Users should be cautious when comparing Work-RISQS estimates of body parts to adjoining body parts and when interpreting estimates of a specific part within a larger body region. Certain injury types have coding rules to minimize this issue:

  • Back injuries are coded as Upper trunk when the injury is above the waist or navel, and Lower trunk when below. A diagnosis of unspecified "back injury" is coded as Upper trunk.
  • Burn injuries involving less than 25% of the body are coded as the most severely burned body part. More extensive burn injuries are coded as 25 to 50% of the body or All parts of the body (i.e., more than 50% of the body).
  • Eye injuries are coded as “Eyeball” when the injury occurred directly to the eye itself, whereas injuries to the eyelids, eyebrows, or the area immediately around the eyes are coded as Face.
  • Facial injuries were unspecified or more generally involved a broader extent of the face are classified as Face even though there may have been damage to one or both eyes or to the mouth.
  • Head injuries with general diagnoses of "head trauma," "closed head injury," or "blunt head injury,” instead of a specific diagnosis such as “Laceration” or “Concussion,” are coded as Internal organ.
  • Injuries with diagnoses of Anoxia, Electric shock, Poisoning, or “Submersion” are coded as All parts of the body.
  • When the diagnosis is “Aspirated foreign object” or “Ingested foreign object,” the Part of body is coded as Internal organ.
 Top of Page

Disposition of patient

Definition: The status of a patient after treatment in the ED, categorized as Treated and released, Treated and transferred/hospitalized, or Unknown.

About half of Treated and transferred/hospitalized patients are hospitalized in the same facility as the ED. The other half are transferred to a different facility, generally a specialty-services hospital (e.g., burn center or pediatric hospital) or other higher-level care facility. Many of the transferred patients are hospitalized in the new facility, but their final disposition is not always known.

 Top of Page

Event or exposure

Definition: The manner in which the injury was produced or inflicted, as coded in the BLS Occupational Injury and Illness Classification System (OIICS).

The Event or Exposure is classified in a hierarchical structure with general divisions of aggregate single-digit codes that narrow into more specific categories with two-, three-, and four-digit codes. Aggregate categories, which are identified by asterisks after the code digits (such as 2* or 63*), include all subcategories and individual codes listed below the aggregate category.

Analysis of more detailed codes may not produce reportable results because the selected Event may occur only rarely, or because a lack of description in ED records may not allow for more detailed codes. For example, using OIICS v2.01, a case in which a worker fell seven feet off the edge of a loading dock to the ground below would be coded as 4332 Other fall to lower level 6 to 10 feet if the ED record and the abstractor have provided that level of detail in the final NEISS-Work data. However, the same injury might be described in the ED chart simply as "fell at work." In that instance, the Event would be coded as 420 Fall on same level, unspecified (the default when the height of a nonfatal fall is unknown), or 430 Fall to lower level, unspecified (if the severity of the resulting injuries suggested a fall from elevation). Injury estimates for specific Events may therefore significantly underestimate the true number, and should be considered minimum estimates.

Note that data years 1998-2011 use OIICS v1.01 for Event, Source, and Secondary Source, and data years 2012 and onward use OIICS v. 2.01. The query system defaults to the appropriate version based on the year selected. There is a break in series between the versions, and users should not compare Event, Source, and Secondary Source results between years that use different versions (e.g., 2011 and 2012).

For data years 1998 through 2017, each case was manually coded to the most specific level of detail (two-, three-, or four-digit codes) possible by a trained coder using information abstracted from the ED record.

Starting with data year 2018, event or exposure is classified in a hierarchical structure with general divisions of broad single-digit codes, but these codes only narrow into more specific two-digit codes. Furthermore, the data for 2018 forward are being coded using a machine-learning model to assign OIICS two-digits event or exposure codes to each case. The model predicts event or exposure codes based on the narrative description of the injury and the worker's age and sex. Event or exposure codes that do not meet specified threshold scores, reflecting the certainty at which a code was assigned, are flagged for review by a trained coder.

 Top of Page

Source of injury

Definition: The object, substance, person, bodily motion, or exposure which directly produced or inflicted the injury, as coded in the BLS Occupational Injury and Illness Classification System (OIICS).

Source, like Event, is classified in a hierarchical structure with general divisions of single-digit codes that narrow into more specific categories with two-, three-, and four-digit codes. Aggregate categories, which are identified by asterisks after the code digits (such as 2* or 63*), include all subcategories and individual codes listed below the aggregate category. As with Event, the more detailed levels may not produce tables with reportable results because the selected Source may only rarely appear in NEISS-Work cases, or because a lack of description in ED records may not allow for more detailed codes.

There are far more codes available for Source than Event because of the diversity of objects, substances, environments, and exertions that can produce injury. For example, using OIICS v 2.01, if a worker was struck in the eye by a flying nail, the Source code would be 4212 Nails, brads, tacks, nuts, bolts, washers. Other fasteners (e.g., rivets, screws, and staples) each have separate 4-digit source codes. As for Event, estimates of specific Sources are difficult to consistently identify and should be considered minimum estimates.

As with Event, the query system defaults to the appropriate version of OIICS based on the year selected. There is a break in series between the versions, and users should not compare Event, Source, and Secondary Source results between years that use different versions (e.g., 2011 and 2012).

For data years 1998-2017, each case was manually coded to the most specific level of detail (two-, three-, or four-digit codes) possible by a trained coder using information abstracted from the ED record.

Starting with data year 2018, source is classified in a hierarchical structure with general divisions of broad single-digit codes, but these codes only narrow into more specific two-digit codes. Furthermore, the data for 2018 forward are being coded using a machine-learning model to assign OIICS two-digit source codes to each case. The model predicts source codes based primarily on the narrative description of the injury and the worker's age and sex. Source codes that do not meet specified threshold scores, reflecting the certainty at which a code was assigned, are flagged for review by a trained coder.

 Top of Page

Secondary source of injury

Definition: The object, substance, or person that generated the source of the injury or that contributed to the event or exposure, as coded in the BLS Occupational Injury and Illness Classification System (OIICS).

The Secondary Source is classified using the same structure and with the same codes as the Source. An example of a secondary source would be a tool that set into motion a flying object which then struck a worker. If the worker noted above who was injured by a flying nail was hammering the nail at the time, the hammer would be the secondary source. Another example of a secondary source would be spilled water that caused a worker to slip and fall to the floor.

About 80% of cases have no Secondary Source coded, either because the lack of description in a record prevented identification of a secondary source or because there simply was no secondary source for that case.

For data years 1998-2017, each case was manually coded to the most specific level of detail (two-, three-, or four-digit codes) possible by a trained coder using information abstracted from the ED record.

Starting with data year 2018, event and primary source codes are assigned via machine learning. However, machine learning could not be used to assign secondary source codes. Thus, two-digit secondary source codes are applied by a trained NIOSH coder for select events: intentional injury by person (11), falls on same level (42), exposure to other harmful substances (55), and struck by object or equipment (62).

 Top of Page

Data reports

A Work-RISQS query results in a table of national injury estimates. Each table lists the query parameters selected for that particular estimate and the results of the query. The table columns include a national estimate (in thousands), the confidence bounds (in thousands) expressed as a value to be added to and subtracted from the national estimate to give the 95% confidence interval, and the percentage of the total estimated injuries for the year.

 Top of Page

Minimum reporting requirements

Work-RISQS uses three criteria to determine if a national estimate is reportable. First, estimates must be based on a minimum number of cases among the small sample of U.S. hospitals within NEISS-Work. Second, the national estimate must exceed a minimum number of injuries. Third, the error associated with an estimate must be small enough to indicate that the estimate would be reasonably reproducible (i.e., the coefficient of variation is less than or equal to 30%). To be reportable, a national estimate must meet all three criteria. Work-RISQS automatically verifies that results meet the reporting requirements.

 Top of Page

Limitations

NEISS-Work data have several limitations that should be considered when interpreting and reporting results. First, 1998-2011 NEISS-Work data use OIICS v.1.01 for event and source codes. NEISS-Work data from 2012 to present use OIICS v.2.01. Comparisons between event and source should not made between years that cross this break. Second, 1998-2014 NEISS-Work data include illnesses that began at work (e.g., heart attacks and strokes), chronic conditions, and exacerbations of long-term injuries. These cases, classified as “illnesses,” represent 5-10% of the total estimates for these years. NEISS-Work data from 2015 onwards do not capture most illnesses. Caution should be taken when comparing estimates between years that cross this break. Third, the small number of hospitals in the NEISS-Work sample contributes to large confidence intervals. Fourth, NEISS-Work data represent only injuries treated in EDs, omitting injuries treated in other medical venues and self-treated injuries. Finally, case capture relies on information found in the ED medical record identifying the case as work-related, and inaccuracies may occur due to the lack of information provided by the worker, incomplete medical records, or errors in abstracting the information from the record.

 Top of Page

Disclaimer

Although NIOSH extends considerable effort to insure reasonable data quality for Work-RISQS estimates, there are no warranties expressed or implied with these data. The underlying data for queries are subject to change without notice as errors, inconsistencies, or other data issues arise. The objective of Work-RISQS is to provide public access to occupational injury data for use in workplace-safety and injury-prevention activities where understanding the general magnitude of injuries, worker characteristics, and injury events is important. Use of these data for other purposes should be done with caution.

 Top of Page
Page last reviewed: December 9, 2023
Content source: National Institute for Occupational Safety and Health (NIOSH) Division of Safety Research