73747ecb97
### What changes were proposed in this pull request? Currently there are no speculation metrics available for Spark either at application/job/stage level. This PR is to add some basic speculation metrics for a stage when speculation execution is enabled. This is similar to the existing stage level metrics tracking numTotal (total number of speculated tasks), numCompleted (total number of successful speculated tasks), numFailed (total number of failed speculated tasks), numKilled (total number of killed speculated tasks) etc. With this new set of metrics, it helps further understanding speculative execution feature in the context of the application and also helps in further tuning the speculative execution config knobs. Screenshot of Spark UI with speculation summary: ![Screen Shot 2021-09-22 at 12 12 20 PM](https://user-images.githubusercontent.com/8871522/135321311-db7699ad-f1ae-4729-afea-d1e2c4e86103.png) Screenshot of Spark UI with API output: ![Screen Shot 2021-09-22 at 12 10 37 PM](https://user-images.githubusercontent.com/8871522/135321486-4dbb7a67-5580-47f8-bccf-81c758c2e988.png) ### Why are the changes needed? Additional metrics for speculative execution. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Unit tests added and also deployed in our internal platform for quite some time now. Lead-authored by: Venkata krishnan Sowrirajan <vsowrirajanlinkedin.com> Co-authored by: Ron Hu <rhulinkedin.com> Co-authored by: Thejdeep Gudivada <tgudivadalinkedin.com> Closes #33253 from venkata91/speculation-metrics. Authored-by: Venkata krishnan Sowrirajan <vsowrirajan@linkedin.com> Signed-off-by: Kousuke Saruta <sarutak@oss.nttdata.com>
140 lines
2.2 KiB
Plaintext
140 lines
2.2 KiB
Plaintext
target
|
|
cache
|
|
.gitignore
|
|
.gitattributes
|
|
.project
|
|
.classpath
|
|
.mima-excludes
|
|
.generated-mima-excludes
|
|
.generated-mima-class-excludes
|
|
.generated-mima-member-excludes
|
|
.rat-excludes
|
|
.*md
|
|
derby.log
|
|
licenses/*
|
|
licenses-binary/*
|
|
LICENSE
|
|
NOTICE
|
|
TAGS
|
|
RELEASE
|
|
control
|
|
docs
|
|
slaves
|
|
spark-env.cmd
|
|
bootstrap.bundle.min.js
|
|
bootstrap.bundle.min.js.map
|
|
bootstrap.min.css
|
|
bootstrap.min.css.map
|
|
jquery-3.5.1.min.js
|
|
d3.min.js
|
|
dagre-d3.min.js
|
|
graphlib-dot.min.js
|
|
sorttable.js
|
|
vis-timeline-graph2d.min.js
|
|
vis-timeline-graph2d.min.css
|
|
dataTables.bootstrap4.1.10.20.min.css
|
|
dataTables.bootstrap4.1.10.20.min.js
|
|
dataTables.rowsGroup.js
|
|
jquery.blockUI.min.js
|
|
jquery.cookies.2.2.0.min.js
|
|
jquery.dataTables.1.10.20.min.css
|
|
jquery.dataTables.1.10.20.min.js
|
|
jquery.mustache.js
|
|
jsonFormatter.min.css
|
|
jsonFormatter.min.js
|
|
.*\.avsc
|
|
.*\.txt
|
|
.*\.json
|
|
.*\.data
|
|
.*\.log
|
|
pyspark-coverage-site/*
|
|
cloudpickle/*
|
|
join.py
|
|
SparkExprTyper.scala
|
|
SparkILoop.scala
|
|
SparkILoopInit.scala
|
|
SparkIMain.scala
|
|
SparkImports.scala
|
|
SparkJLineCompletion.scala
|
|
SparkJLineReader.scala
|
|
SparkMemberHandlers.scala
|
|
SparkReplReporter.scala
|
|
sbt
|
|
sbt-launch-lib.bash
|
|
plugins.sbt
|
|
work
|
|
.*\.q
|
|
.*\.qv
|
|
golden
|
|
test.out/*
|
|
.*iml
|
|
service.properties
|
|
db.lck
|
|
build/*
|
|
dist/*
|
|
.*out
|
|
.*ipr
|
|
.*iws
|
|
logs
|
|
.*scalastyle-output.xml
|
|
.*dependency-reduced-pom.xml
|
|
known_translations
|
|
json_expectation
|
|
app-20180109111548-0000
|
|
app-20161115172038-0000
|
|
app-20161116163331-0000
|
|
application_1516285256255_0012
|
|
application_1553914137147_0018
|
|
application_1555004656427_0144
|
|
stat
|
|
local-1422981759269
|
|
local-1422981780767
|
|
local-1425081759269
|
|
local-1426533911241
|
|
local-1426633911242
|
|
local-1430917381534
|
|
local-1430917381535_1
|
|
local-1430917381535_2
|
|
DESCRIPTION
|
|
NAMESPACE
|
|
test_support/*
|
|
.*\.Rd
|
|
help/*
|
|
html/*
|
|
INDEX
|
|
.lintr
|
|
gen-java.*
|
|
.*\.avpr
|
|
.*\.parquet
|
|
spark-deps-.*
|
|
.*\.csv
|
|
.*\.tsv
|
|
.*\.sql
|
|
.Rbuildignore
|
|
META-INF/*
|
|
spark-warehouse
|
|
structured-streaming/*
|
|
kafka-source-initial-offset-version-2.1.0.bin
|
|
kafka-source-initial-offset-future-version.bin
|
|
announce.tmpl
|
|
vote.tmpl
|
|
SessionManager.java
|
|
SessionHandler.java
|
|
GangliaReporter.java
|
|
application_1578436911597_0052
|
|
config.properties
|
|
local-1596020211915
|
|
app-20200706201101-0003
|
|
application_1628109047826_1317105
|
|
py.typed
|
|
_metadata
|
|
_SUCCESS
|
|
part-00000
|
|
.*\.res
|
|
flights_tiny.txt.1
|
|
over1k
|
|
over10k
|
|
exported_table/*
|
|
ansible-for-test-node/*
|
|
node_modules
|