
Datasets · v2026.05

Public Company Federal Compliance

U.S. workplace enforcement records joined to SEC parent-company financials for 42,302 establishments operated by 714 publicly traded companies — OSHA, WHD, NLRB, and EPA activity rolled up to the registrant entity with revenue, net income, and total assets attached.

Published Apr 2026 · Refreshed May 2026 · Covers 2005–present · 42,302 establishments · CC-BY-4.0

Also available on: Hugging Face · Kaggle · Zenodo

Methodology

Overview

This dataset joins federal workplace enforcement records with SEC parent-company financial data at the establishment level. The unit of observation is one operating establishment with its parent's most recent financial facts attached. The dataset covers 42,302 establishments across 714 publicly traded parents. The 970 unique parent legal-name variants resolve to those 714 unique SEC CIKs, because some parents operate under multiple legal names that share a single SEC filer.

The joining is the point. OSHA inspection records, WHD wage cases, NLRB labor relations filings, and EPA environmental compliance records are each available separately, as are SEC parent financials. Joining them at the establishment level and rolling them up to the parent company surfaces patterns that single-source searches miss.
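As a minimal sketch of what the parent-company rollup looks like from a user's side, the snippet below sums a few establishment-level columns up to the parent. It groups on parent_cik rather than parent_name, since several legal-name variants can share one CIK. The rows are made-up illustrative values, not records from the dataset, and rollup_by_parent is a hypothetical helper name; the same logic can be expressed as a group-by in any dataframe library.

```python
from collections import defaultdict

# Illustrative rows only: values are invented, column names follow the schema.
rows = [
    {"parent_cik": 1001, "osha_inspections": 3, "osha_total_penalties": 12000.0, "whd_cases": 1},
    {"parent_cik": 1001, "osha_inspections": 0, "osha_total_penalties": 0.0, "whd_cases": 2},
    {"parent_cik": 2002, "osha_inspections": 1, "osha_total_penalties": 500.0, "whd_cases": 0},
]

SUM_COLUMNS = ("osha_inspections", "osha_total_penalties", "whd_cases")


def rollup_by_parent(rows):
    """Sum establishment-level columns up to the parent CIK (hypothetical helper)."""
    totals = defaultdict(lambda: {"establishments": 0, **{c: 0 for c in SUM_COLUMNS}})
    for row in rows:
        agg = totals[row["parent_cik"]]
        agg["establishments"] += 1
        for col in SUM_COLUMNS:
            agg[col] += row.get(col) or 0  # treat missing or None as 0 when summing
    return dict(totals)


parent_totals = rollup_by_parent(rows)
```

Rows with no resolved parent would need to be filtered out or grouped separately before calling a helper like this.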

Data sources

  • OSHA — Federal OSHA Inspection and Violation Records (IMIS).
  • WHD — Wage and Hour Division Compliance Action Database (WHISARD).
  • EPA — Enforcement and Compliance History Online (ECHO) facility-level enforcement and Quarterly Non-Compliance Reports.
  • NLRB — National Labor Relations Board case filings.
  • SAM.gov — System for Award Management federal exclusions list.
  • SEC EDGAR — Public-company submissions index, used to enumerate registrants and their primary identifiers (CIK, ticker), and the company-facts feed, used to source the most recent reported revenue, net income, and total assets per registrant.

All sources are public records and are refreshed at FastDOL on a monthly cadence.

Methodology

Federal enforcement data is aggregated from the agency sources above. Entity resolution uses normalized employer name, state, and ZIP to assign each establishment a stable cluster identity within the FastDOL pipeline. Parent-company rollup uses a curated seed table augmented with SEC CIK matches; resolved parents are then joined to the SEC company-facts feed for their most-recent reported revenue, net income, and total assets.
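The exact normalization rules used in the FastDOL pipeline are not described here. Purely as an illustration of the general idea of keying establishments on normalized employer name, state, and ZIP, a sketch might look like the following. The suffix list, the function names, and the example employer are all assumptions made for this example.

```python
import re

# Hypothetical suffix list; the real pipeline's rules are not documented here.
LEGAL_SUFFIXES = {"inc", "llc", "corp", "co", "ltd", "lp"}


def normalize_name(name: str) -> str:
    """Lowercase, replace punctuation with spaces, and drop trailing legal suffixes."""
    tokens = re.sub(r"[^a-z0-9 ]+", " ", name.lower()).split()
    while tokens and tokens[-1] in LEGAL_SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def cluster_key(name: str, state: str, zip_code: str) -> tuple:
    """Build a matching key from normalized name, state code, and 5-digit ZIP."""
    return (normalize_name(name), state.strip().upper(), zip_code.strip()[:5])


# Two spellings of the same invented employer collapse to one key.
a = cluster_key("Acme Widgets, Inc.", "oh", "44101-1234")
b = cluster_key("ACME WIDGETS INC", "OH", "44101")
```

An exact-match key like this also shows why generic LLC names and DBAs are hard: two unrelated employers can normalize to the same name, and one employer can trade under names that share no tokens.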

The parent_facts_as_of column carries the period-end date of those financial facts so downstream users can scope analysis appropriately when combining with quarter- or year-specific external data.
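One way to use this column is to keep only rows whose parent financials fall inside the period being studied. The sketch below assumes parent_facts_as_of arrives as an ISO date string of the form YYYY-MM-DD (some Parquet readers may instead return a date object, in which case the parsing step is unnecessary). The rows and the facts_within helper are hypothetical.

```python
from datetime import date


def facts_within(rows, start: date, end: date):
    """Keep rows whose parent financial facts have a period end in [start, end]."""
    kept = []
    for row in rows:
        raw = row.get("parent_facts_as_of")
        if not raw:
            continue  # no financial facts attached to this row
        if start <= date.fromisoformat(raw) <= end:
            kept.append(row)
    return kept


# Illustrative rows only; names and dates are invented.
rows = [
    {"employer_name": "Plant A", "parent_facts_as_of": "2025-12-31"},
    {"employer_name": "Plant B", "parent_facts_as_of": "2023-06-30"},
    {"employer_name": "Plant C", "parent_facts_as_of": None},
]

fy2025 = facts_within(rows, date(2025, 1, 1), date(2025, 12, 31))
```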

Known limitations

  • Counts reflect what federal agencies investigated and recorded. Reporting practices vary by operator and industry, and small establishments are more likely to be undercounted than large ones.
  • Entity resolution is high-confidence but imperfect, especially for generic LLC names and DBAs. False positives (an establishment attributed to the wrong parent) and false negatives (an establishment unattributed) both occur.
  • OSHA columns reflect federal OSHA inspections only. Enforcement by state-plan states (about half of all U.S. states) is reported separately and is not joined here.
  • "0" in any enforcement column means no record at the matched cluster identity for that agency, not an affirmation of perfect compliance.
  • The parent_*_latest financial fields reflect the registrant's most recently reported annual figures, which may pre-date the enforcement activity in the same row by several years for slow-filing entities.
  • Companies that have de-listed or merged are reflected at their most recent filing state; historical enforcement that pre-dates a merger remains attributed to the disclosed entity.

Use cases

  • ESG and stewardship research at the parent-company level — surfacing workplace and environmental enforcement that's distributed across many operating establishments.
  • Financial research correlating compliance footprint with parent revenue, net income, or total assets.
  • Workers' comp and casualty underwriting for public-company exposures.
  • ML feature engineering using risk scores, parent financials, and cross-agency activity counts.
  • Investigative journalism on the relationship between publicly disclosed corporate structures and on-the-ground compliance records.
  • Academic study of corporate-structure complexity and its relationship to enforcement visibility.

Schema

38 columns. Types as serialized in the Parquet file.

Column | Type | Description
--- | --- | ---
employer_name | string | Legal name of the establishment as recorded by federal agencies.
city | string | City of the establishment.
state | string | USPS two-letter state code (includes territories).
zip | number | ZIP code of the establishment, stored as float64; nulls preserved.
naics_code | number | NAICS industry classification code, stored as float64; nulls preserved.
naics_description | string | Human-readable NAICS industry description.
parent_name | string | Resolved publicly traded parent company (FastDOL parent rollup).
parent_cik | number | SEC Central Index Key of the resolved parent (numeric form).
parent_ticker | string | Primary trading ticker of the resolved parent, where one is listed.
osha_inspections | integer | Total federal OSHA inspections at this establishment.
osha_violations | integer | Total federal OSHA violations cited at this establishment.
osha_total_penalties | number | Total OSHA penalties assessed at this establishment, in U.S. dollars.
osha_fatalities | integer | OSHA-investigated workplace fatalities recorded at this establishment.
osha_hospitalizations | integer | OSHA-recorded hospitalizations at this establishment.
osha_insp_accident | integer | OSHA inspections triggered by an accident.
osha_insp_complaint | integer | OSHA inspections triggered by a worker complaint.
osha_insp_referral | integer | OSHA inspections triggered by a referral from another agency or party.
osha_severe_violator_program | string | OSHA Severe Violator Enforcement Program flag, stored as 't'/'f'.
whd_cases | integer | Wage and Hour Division enforcement cases on record at this establishment.
whd_backwages_total | number | Total WHD-assessed back-wages at this establishment, in U.S. dollars.
whd_employees_affected | integer | Total employees recorded as affected across this establishment's WHD cases.
nlrb_cases | integer | Total NLRB case filings at this establishment.
nlrb_ulp_cases | integer | NLRB unfair labor practice cases at this establishment.
nlrb_representation_cases | integer | NLRB representation cases at this establishment.
epa_inspection_count | integer | EPA inspections recorded at this establishment.
epa_formal_action_count | integer | EPA formal enforcement actions recorded at this establishment.
epa_qtrs_noncompliance | integer | Number of quarters in EPA non-compliance status (ECHO QNCR).
epa_total_penalties | number | EPA-assessed penalties at this establishment, in U.S. dollars.
epa_compliance_status | string | EPA compliance status label (six-class enumeration from ECHO).
parent_revenue_latest | number | Parent company most recent reported annual revenue, in U.S. dollars.
parent_net_income_latest | number | Parent company most recent reported annual net income, in U.S. dollars.
parent_total_assets_latest | number | Parent company most recent reported total assets, in U.S. dollars.
parent_facts_as_of | date | Period-end date for the parent_*_latest financial facts (ISO date string).
risk_tier | string | FastDOL composite risk tier: one of LOW, MEDIUM, ELEVATED, HIGH.
risk_score | number | FastDOL composite risk score (0–100, continuous).
agencies_with_records | integer | Distinct federal agencies with any recorded activity at this establishment.
peer_violation_percentile | integer | Establishment's violation percentile within its NAICS peer set (0–100).
debarred | string | SAM.gov federal exclusion flag, stored as 't'/'f'.
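A few of these serialized types usually need light cleanup before analysis. Because zip is stored as float64, a ZIP such as 02139 arrives as 2139.0 and loses its leading zero; parent_cik is numeric, while EDGAR URLs use a 10-digit zero-padded form; and the flag columns are 't'/'f' strings rather than booleans. The helpers below are a hypothetical sketch of one way to handle this, assuming nulls arrive as None or NaN.

```python
import math


def _is_null(value) -> bool:
    """True for None or a float NaN, the two null forms assumed here."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def zip_to_str(value):
    """Convert a float64 ZIP (e.g. 2139.0) to a zero-padded 5-character string."""
    return None if _is_null(value) else f"{int(value):05d}"


def cik_to_str(value):
    """Zero-pad a numeric CIK to the 10-digit form used in EDGAR URLs."""
    return None if _is_null(value) else f"{int(value):010d}"


def flag_to_bool(value):
    """Map 't'/'f' string flags to booleans; any other value becomes None."""
    return {"t": True, "f": False}.get(value)


# Illustrative values only.
cleaned = {
    "zip": zip_to_str(2139.0),
    "parent_cik": cik_to_str(320193.0),
    "debarred": flag_to_bool("f"),
}
```

The same integer-then-string conversion applies to naics_code, without the fixed-width padding.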

Cite this dataset

Plain text

Turner, Ben (2026). Public Company Federal Compliance (Version 2026.05) [Data set]. FastDOL. https://doi.org/10.5281/zenodo.20031894

BibTeX

@dataset{turner_federalenforcement_2026,
  author    = {Turner, Ben},
  title     = {Public Company Federal Compliance},
  year      = {2026},
  version   = {2026.05},
  publisher = {FastDOL},
  doi       = {10.5281/zenodo.20031894},
  url       = {https://www.fastdol.com/datasets/public-company-federal-compliance}
}

Changelog

2026.05 — 2026-05-01

  • Initial public release on FastDOL.
  • Mirrored to Hugging Face at FastDOLz/public-company-federal-compliance.
  • Source data refreshed through 2026-04-30 across all contributing agencies.
  • SEC EDGAR Exhibit 21 disclosures parsed through the most recent fiscal-year 10-K filings.
