spark-instrumented-optimizer

History

Kay Ousterhout 2b807e4f2f Fix bug where scheduler could hang after task failure. When a task fails, we need to call reviveOffers() so that the task can be rescheduled on a different machine. In the current code, the state in ClusterTaskSetManager indicating which tasks are pending may be updated after revive offers is called (there's a race condition here), so when revive offers is called, the task set manager does not yet realize that there are failed tasks that need to be relaunched.	2013-11-14 13:33:11 -08:00
..
src	Fix bug where scheduler could hang after task failure.	2013-11-14 13:33:11 -08:00
pom.xml	Add a zookeeper compile dependency to fix build in maven	2013-10-11 16:31:47 +08:00

Kay Ousterhout 2b807e4f2f Fix bug where scheduler could hang after task failure.

When a task fails, we need to call reviveOffers() so that the
task can be rescheduled on a different machine. In the current code,
the state in ClusterTaskSetManager indicating which tasks are
pending may be updated after revive offers is called (there's a
race condition here), so when revive offers is called, the task set
manager does not yet realize that there are failed tasks that need
to be relaunched.

2013-11-14 13:33:11 -08:00

src

Fix bug where scheduler could hang after task failure.

2013-11-14 13:33:11 -08:00

pom.xml

Add a zookeeper compile dependency to fix build in maven

2013-10-11 16:31:47 +08:00