If you've spent any time working with distributed data processing, you've probably encountered a job where most of the tasks finish quickly and one task runs for what seems like forever, holding up ...