Task Granularity And Pipelining
Fine-granularity tasks: many more map tasks than machines
- Minimizes time for fault recovery
- Can pipeline shuffling with map execution
- Better dynamic load balancing
Often use 200,000 map tasks and 5,000 reduce tasks on 2,000 machines
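
The load-balancing point can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the scheduler of any real system: the `makespan` helper, the 10-machine cluster with one half-speed machine, and the 100 units of work are all hypothetical. It assumes equal-sized tasks that idle workers pull from a shared queue. The last two lines work out the per-machine task counts implied by the figures above.

```python
import heapq

def makespan(total_work, num_tasks, worker_speeds):
    """Finish time when idle workers pull equal-sized tasks from a shared queue."""
    task_size = total_work / num_tasks
    # Min-heap of (time the worker becomes free, worker index).
    free_at = [(0.0, w) for w in range(len(worker_speeds))]
    heapq.heapify(free_at)
    finish = 0.0
    for _ in range(num_tasks):
        # The next task goes to whichever worker becomes idle first.
        t, w = heapq.heappop(free_at)
        done = t + task_size / worker_speeds[w]
        finish = max(finish, done)
        heapq.heappush(free_at, (done, w))
    return finish

# Hypothetical cluster: 9 normal machines and one running at half speed.
speeds = [1.0] * 9 + [0.5]

# Coarse: one task per machine, so the slow machine holds everyone up.
coarse = makespan(total_work=100.0, num_tasks=10, worker_speeds=speeds)
# Fine: 100 tasks per machine, so fast machines absorb the slow one's share.
fine = makespan(total_work=100.0, num_tasks=1000, worker_speeds=speeds)

# Ratios implied by the task and machine counts quoted above.
map_tasks_per_machine = 200_000 / 2000      # 100 map tasks per machine
reduce_tasks_per_machine = 5000 / 2000      # 2.5 reduce tasks per machine
```

With one task per machine the job finishes only when the slow machine does. With 100 tasks per machine the finish time stays close to the ideal of total work divided by total speed. The same ratio helps fault recovery: the tasks of a failed machine can be spread over many other machines instead of being rerun as one large unit.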