The data contains the date, location and a description for 4000 fatalities over five years. I created columns for state, zipcode, number of people and cause.
The most common interesting words in these descriptions are
- 813 fell
- 708 struck
- 642 truck
- 452 falling
- 382 crushed
- 352 head
- 263 roof
- 261 tree
- 258 electrocuted
- 244 ladder
- 238 vehicle
- 226 trailer
- 197 machine
- 186 collapsed
- 180 forklift
Not common but interesting
- 10 lightning
- 48 shot
- 4 dog
- 2 bees
and here is a map I made of the states where they happen
I have created a repository to try augment the OSHA data and clean it up when errors are found.
The repository is on github here.
If you use it I'll give you edit rights and you can help improve it