Refinement: Redundant Execution
Slow workers significantly lengthen completion time:
- Other jobs consuming resources on machine
- Bad disks with soft errors transfer data very slowly
- Weird things: processor caches disabled (!!)
Solution: Near end of phase, spawn backup copies of tasks
- Whichever one finishes first "wins"
Effect: Dramatically shortens job completion time
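
A minimal sketch of the backup-task idea, assuming a thread pool stands in for the cluster and a sleep stands in for worker speed. The names `run_task`, `run_phase`, `backup_delay`, and `backup_threshold` are hypothetical and chosen for illustration; a real scheduler would decide when a phase is "near the end" by its own policy.

```python
import time
from concurrent.futures import ThreadPoolExecutor, wait, FIRST_COMPLETED


def run_task(task_id, delay):
    """Simulate one attempt at a task; `delay` models how slow the worker is."""
    time.sleep(delay)
    return task_id * task_id


def run_phase(task_delays, backup_delay=0.01, backup_threshold=0.75):
    """Run every task once, then launch backup copies of the unfinished
    tasks once `backup_threshold` of the phase has completed.

    task_delays maps task_id -> simulated delay of its primary worker.
    Returns (results, winners); winners records which attempt finished first.
    """
    results, winners = {}, {}
    # Enough workers for one primary and one backup attempt per task.
    pool = ThreadPoolExecutor(max_workers=2 * len(task_delays))
    attempts = {
        pool.submit(run_task, t, d): (t, "primary")
        for t, d in task_delays.items()
    }
    pending = set(attempts)
    backups_started = False
    while len(results) < len(task_delays):
        done, pending = wait(pending, return_when=FIRST_COMPLETED)
        for fut in done:
            task_id, kind = attempts[fut]
            if task_id not in results:  # first attempt to finish "wins"
                results[task_id] = fut.result()
                winners[task_id] = kind
        near_end = len(results) >= backup_threshold * len(task_delays)
        if near_end and not backups_started:
            backups_started = True
            for task_id in task_delays:
                if task_id not in results:
                    fut = pool.submit(run_task, task_id, backup_delay)
                    attempts[fut] = (task_id, "backup")
                    pending.add(fut)
    # Do not block on the losing attempts; their results are ignored.
    pool.shutdown(wait=False)
    return results, winners


if __name__ == "__main__":
    # Task 4 sits on a slow worker; its backup copy should finish first.
    results, winners = run_phase({1: 0.01, 2: 0.01, 3: 0.01, 4: 1.0})
    print(results, winners)
```

In this sketch the losing attempt is simply ignored rather than killed, since Python threads cannot be interrupted. The phase returns as soon as every task has one finished attempt.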