Datasets · v2026.05
Public Company Federal Compliance
U.S. workplace enforcement records joined to SEC parent-company financials for 42,302 establishments operated by 714 publicly traded companies — OSHA, WHD, NLRB, and EPA activity rolled up to the registrant entity with revenue, net income, and total assets attached.
Published Apr 2026 · Refreshed May 2026 · Covers 2005–present · 42,302 establishments · CC-BY-4.0
Also available on: Hugging Face · Kaggle · Zenodo
Methodology
Overview
This dataset joins federal workplace enforcement records with SEC parent-company financial data at the establishment level. The unit of observation is one operating establishment with its parent's most-recent financial facts attached. 42,302 establishments across 714 publicly traded parents are covered. 970 unique parent legal-name variants resolve to those 714 unique SEC CIKs — some parents operate under multiple legal names that share a single SEC filer.
The joining is the point. OSHA inspection records, WHD wage cases, NLRB labor relations filings, and EPA environmental compliance are each available separately. SEC parent financials are available separately. Joining them at the establishment level with parent-company rollup applied surfaces patterns that single-source searches miss.
Data sources
- OSHA — Federal OSHA Inspection and Violation Records (IMIS).
- WHD — Wage and Hour Division Compliance Action Database (WHISARD).
- EPA — Enforcement and Compliance History Online (ECHO) facility-level enforcement and Quarterly Non-Compliance Reports.
- NLRB — National Labor Relations Board case filings.
- SAM.gov — System for Award Management federal exclusions list.
- SEC EDGAR — Public-company submissions index, used to enumerate registrants and their primary identifiers (CIK, ticker), and to source the most-recent reported revenue, net income, and total assets per registrant.
All sources are public records and are refreshed at FastDOL on a monthly cadence.
Methodology
Federal enforcement data is aggregated from the agency sources above. Entity resolution uses normalized employer name, state, and ZIP to assign each establishment a stable cluster identity within the FastDOL pipeline. Parent-company rollup uses a curated seed table augmented with SEC CIK matches; resolved parents are then joined to the SEC company-facts feed for their most-recent reported revenue, net income, and total assets.
The parent_facts_as_of column carries the period-end date of those
financial facts so downstream users can scope analysis appropriately when
combining with quarter- or year-specific external data.
Known limitations
- Counts reflect what federal agencies investigated and recorded. Reporting practices vary by operator and industry, and small establishments are more likely to be undercounted than large ones.
- Entity resolution is high-confidence but imperfect, especially for generic LLC names and DBAs. False positives (an establishment attributed to the wrong parent) and false negatives (an establishment unattributed) both occur.
- OSHA columns reflect federal-OSHA inspections only. State-plan state enforcement (about half of all U.S. states) is reported separately and is not joined here.
- "0" in any enforcement column means no record at the matched cluster identity for that agency, not an affirmation of perfect compliance.
- The
parent_*_latestfinancial fields reflect the registrant's most recently reported annual figures, which may pre-date the enforcement activity in the same row by several years for slow-filing entities. - Companies that have de-listed or merged are reflected at their most recent filing state; historical enforcement that pre-dates a merger remains attributed to the disclosed entity.
Use cases
- ESG and stewardship research at the parent-company level — surfacing workplace and environmental enforcement that's distributed across many operating establishments.
- Financial research correlating compliance footprint with parent revenue, net income, or total assets.
- Workers' comp and casualty underwriting for public-company exposures.
- ML feature engineering using risk scores, parent financials, and cross-agency activity counts.
- Investigative journalism on the relationship between publicly disclosed corporate structures and on-the-ground compliance records.
- Academic study of corporate-structure complexity and its relationship to enforcement visibility.
Schema
38 columns. Types as serialized in the Parquet file.
| Column | Type | Description |
|---|---|---|
| employer_name | string | Legal name of the establishment as recorded by federal agencies. |
| city | string | City of the establishment. |
| state | string | USPS two-letter state code (includes territories). |
| zip | number | ZIP code of the establishment, stored as float64; nulls preserved. |
| naics_code | number | NAICS industry classification code, stored as float64; nulls preserved. |
| naics_description | string | Human-readable NAICS industry description. |
| parent_name | string | Resolved publicly traded parent company (FastDOL parent rollup). |
| parent_cik | number | SEC Central Index Key of the resolved parent (numeric form). |
| parent_ticker | string | Primary trading ticker of the resolved parent, where one is listed. |
| osha_inspections | integer | Total federal OSHA inspections at this establishment. |
| osha_violations | integer | Total federal OSHA violations cited at this establishment. |
| osha_total_penalties | number | Total OSHA penalties assessed at this establishment, in U.S. dollars. |
| osha_fatalities | integer | OSHA-investigated workplace fatalities recorded at this establishment. |
| osha_hospitalizations | integer | OSHA-recorded hospitalizations at this establishment. |
| osha_insp_accident | integer | OSHA inspections triggered by an accident. |
| osha_insp_complaint | integer | OSHA inspections triggered by a worker complaint. |
| osha_insp_referral | integer | OSHA inspections triggered by a referral from another agency or party. |
| osha_severe_violator_program | string | OSHA Severe Violator Enforcement Program flag, stored as 't'/'f'. |
| whd_cases | integer | Wage and Hour Division enforcement cases on record at this establishment. |
| whd_backwages_total | number | Total WHD-assessed back-wages at this establishment, in U.S. dollars. |
| whd_employees_affected | integer | Total employees recorded as affected across this establishment's WHD cases. |
| nlrb_cases | integer | Total NLRB case filings at this establishment. |
| nlrb_ulp_cases | integer | NLRB unfair labor practice cases at this establishment. |
| nlrb_representation_cases | integer | NLRB representation cases at this establishment. |
| epa_inspection_count | integer | EPA inspections recorded at this establishment. |
| epa_formal_action_count | integer | EPA formal enforcement actions recorded at this establishment. |
| epa_qtrs_noncompliance | integer | Number of quarters in EPA non-compliance status (ECHO QNCR). |
| epa_total_penalties | number | EPA-assessed penalties at this establishment, in U.S. dollars. |
| epa_compliance_status | string | EPA compliance status label (six-class enumeration from ECHO). |
| parent_revenue_latest | number | Parent company most recent reported annual revenue, in U.S. dollars. |
| parent_net_income_latest | number | Parent company most recent reported annual net income, in U.S. dollars. |
| parent_total_assets_latest | number | Parent company most recent reported total assets, in U.S. dollars. |
| parent_facts_as_of | date | Period-end date for the parent_*_latest financial facts (ISO date string). |
| risk_tier | string | FastDOL composite risk tier — one of LOW, MEDIUM, ELEVATED, HIGH. |
| risk_score | number | FastDOL composite risk score (0–100, continuous). |
| agencies_with_records | integer | Distinct federal agencies with any recorded activity at this establishment. |
| peer_violation_percentile | integer | Establishment's violation percentile within its NAICS peer set (0–100). |
| debarred | string | SAM.gov federal exclusion flag, stored as 't'/'f'. |
Cite this dataset
Plain text
Turner, Ben (2026). Public Company Federal Compliance (Version 2026.05) [Data set]. FastDOL. https://doi.org/10.5281/zenodo.20031894
BibTeX
@dataset{turner_federalenforcement_2026,
author = {Turner, Ben},
title = {Public Company Federal Compliance},
year = {2026},
version = {2026.05},
publisher = {FastDOL},
doi = {10.5281/zenodo.20031894},
url = {https://www.fastdol.com/datasets/public-company-federal-compliance}
}Changelog
2026.05 — 2026-05-01
- Initial public release on FastDOL.
- Mirrored to Hugging Face at FastDOLz/public-company-federal-compliance.
- Source data refreshed through 2026-04-30 across all contributing agencies.
- SEC EDGAR Exhibit 21 disclosures parsed through the most recent fiscal-year 10-K filings.
Related datasets
- Cross-Agency Federal Violations
U.S. employers cited by two or more federal agencies — OSHA, WHD, MSHA, EPA, and NLRB joined into a single per-employer view of multi-agency workplace, wage, environmental, and labor-relations violations.
- WHD Wage Theft Enforcement Actions by Employer
Employer-level wage theft enforcement records from the U.S. Department of Labor's Wage and Hour Division — back-wages assessed, employees affected, and case counts joined to OSHA enforcement context.