spark-instrumented-optimizer

History

Liwei Lin a4851ed050 [SPARK-15963][CORE] Catch `TaskKilledException` correctly in Executor.TaskRunner	2016-06-24 10:09:04 -05:00
src	[SPARK-15963][CORE] Catch `TaskKilledException` correctly in Executor.TaskRunner	2016-06-24 10:09:04 -05:00
pom.xml	[SPARK-15851][BUILD] Fix the call of the bash script to enable proper run in Windows	2016-06-15 20:11:23 -07:00

Liwei Lin a4851ed050 [SPARK-15963][CORE] Catch TaskKilledException correctly in Executor.TaskRunner

## The problem

Before this change, if either of the following cases happened to a task, the task would be marked as `FAILED` instead of `KILLED`:
- the task was killed before it was deserialized
- `executor.kill()` marked `taskRunner.killed`, but before calling `task.killed()` the worker thread threw the `TaskKilledException`

The reason is that the `catch` block of the current [Executor.TaskRunner](https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/executor/Executor.scala#L362) implementation mistakenly catches:
```scala
case _: TaskKilledException | _: InterruptedException if task.killed => ...
```
whose semantics are:
- **(**`TaskKilledException` **OR** `InterruptedException`**)** **AND** `task.killed`

So when `TaskKilledException` is thrown but `task.killed` is not yet set, the guard fails and the task is marked as `FAILED` when it should be `KILLED`.
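The guard binding can be checked in isolation. The following is a minimal sketch, not Spark's code: `TaskKilledException` is a local stand-in class, `oldClassify` is a hypothetical helper, and the `killed` parameter plays the role of `task.killed`.

```scala
// Local stand-in for Spark's exception so the sketch is self-contained.
class TaskKilledException extends RuntimeException

object OldGuardDemo {
  // Hypothetical helper mirroring the old catch condition.
  // A guard written after `A | B` applies to the whole alternative, not only to B.
  def oldClassify(e: Throwable, killed: Boolean): String = e match {
    case _: TaskKilledException | _: InterruptedException if killed => "KILLED"
    case _ => "FAILED"
  }

  def main(args: Array[String]): Unit = {
    // TaskKilledException thrown before the flag is set: the guard rejects it.
    println(oldClassify(new TaskKilledException, killed = false)) // prints FAILED
    println(oldClassify(new TaskKilledException, killed = true))  // prints KILLED
  }
}
```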

## What changes were proposed in this pull request?

This patch alters the catch condition's semantics from:
- **(**`TaskKilledException` **OR** `InterruptedException`**)** **AND** `task.killed`

to

- `TaskKilledException` **OR** **(**`InterruptedException` **AND** `task.killed`**)**

so that `TaskKilledException` is always caught and the task is marked as `KILLED`, regardless of whether `task.killed` has been set.
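One way to express the new condition in Scala is to split the alternative into two cases so that the guard attaches only to `InterruptedException`. This is a sketch under assumptions, not necessarily the exact code of the patch: `TaskKilledException` is a local stand-in class, `newClassify` is a hypothetical helper, and `killed` stands for `task.killed`.

```scala
// Local stand-in for Spark's exception so the sketch is self-contained.
class TaskKilledException extends RuntimeException

object NewGuardDemo {
  // Hypothetical helper mirroring the new condition:
  // TaskKilledException OR (InterruptedException AND killed).
  def newClassify(e: Throwable, killed: Boolean): String = e match {
    case _: TaskKilledException            => "KILLED" // no guard: always a kill
    case _: InterruptedException if killed => "KILLED" // interrupt counts only if killed
    case _                                 => "FAILED"
  }

  def main(args: Array[String]): Unit = {
    println(newClassify(new TaskKilledException, killed = false))  // prints KILLED
    println(newClassify(new InterruptedException, killed = false)) // prints FAILED
  }
}
```

An `InterruptedException` that arrives without the kill flag still counts as a failure, which preserves the old behavior for interrupts unrelated to a kill request.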

## How was this patch tested?

Added a unit test that failed before the change; ran the new test 1000 times manually.

Author: Liwei Lin <lwlin7@gmail.com>

Closes #13685 from lw-lin/fix-task-killed.

2016-06-24 10:09:04 -05:00
