ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
yi.wu	7f937730ff	[SPARK-33741][FOLLOW-UP][CORE] Rename the min threshold time speculation config ### What changes were proposed in this pull request? This's a follow-up of https://github.com/apache/spark/pull/30710. Rename the conf from `spark.speculation.min.threshold` to `spark.speculation.minTaskRuntime`. ### Why are the changes needed? To follow the [config naming policy](https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/internal/config/ConfigEntry.scala#L21). ### Does this PR introduce _any_ user-facing change? No (since Spark 3.2 hasn't been released). ### How was this patch tested? Pass existing tests. Closes #33037 from Ngone51/spark-33741-followup. Authored-by: yi.wu <yi.wu@databricks.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2021-06-23 13:29:58 +00:00
Chris Thomas	ceb8122c40	[SPARK-35399][DOCUMENTATION] State is still needed in the event of executor failure ### What changes were proposed in this pull request? Fix incorrect statement that state is no longer needed in the event of executor failure and document that it is needed in the case of a flaky app causing occasional executor failure. SO [discussion](https://stackoverflow.com/questions/67466878/can-spark-with-external-shuffle-service-use-saved-shuffle-files-in-the-event-of/67507439#67507439). ### Why are the changes needed? To fix the documentation and guide users as to additional use case for the Shuffle Service. ### Does this PR introduce _any_ user-facing change? Documentation only. ### How was this patch tested? N/A. Closes #32538 from chrisheaththomas/shuffle-service-and-executor-failure. Authored-by: Chris Thomas <chrisheaththomas@hotmail.com> Signed-off-by: Sean Owen <srowen@gmail.com>	2021-05-17 08:58:46 -05:00
Kousuke Saruta	2b6640a169	[SPARK-35229][WEBUI] Limit the maximum number of items on the timeline view ### What changes were proposed in this pull request? This PR proposes to introduces three new configurations to limit the maximum number of jobs/stages/executors on the timeline view. ### Why are the changes needed? If the number of items on the timeline view grows +1000, rendering can be significantly slow. https://issues.apache.org/jira/browse/SPARK-35229 The maximum number of tasks on the timeline is already limited by `spark.ui.timeline.tasks.maximum` so l proposed to mitigate this issue with the same manner. ### Does this PR introduce _any_ user-facing change? Yes. the maximum number of items shown on the timeline view is limited. I proposed the default value 500 for jobs and stages, and 250 for executors. A executor has at most 2 items (added and removed) 250 is chosen. ### How was this patch tested? I manually confirm this change works with the following procedures. ``` # launch a cluster $ bin/spark-shell --conf spark.ui.retainedDeadExecutors=300 --master "local-cluster[4, 1, 1024]" // Confirm the maximum number of jobs (1 to 1000).foreach { _ => sc.parallelize(List(1)).collect } // Confirm the maximum number of stages var df = sc.parallelize(1 to 2) (1 to 1000).foreach { i => df = df.repartition(i % 5 + 1) } df.collect // Confirm the maximum number of executors (1 to 300).foreach { _ => try sc.parallelize(List(1)).foreach { _ => System.exit(0) } catch { case e => }} ``` Screenshots here. ![jobs_limited](https://user-images.githubusercontent.com/4736016/116386937-3e8c4a00-a855-11eb-8f4c-151cf7ddd3b8.png) ![stages_limited](https://user-images.githubusercontent.com/4736016/116386990-49df7580-a855-11eb-9f71-8e129e3336ab.png) ![executors_limited](https://user-images.githubusercontent.com/4736016/116387009-4f3cc000-a855-11eb-8697-a2eb4c9c99e6.png) Closes #32381 from sarutak/mitigate-timeline-issue. Authored-by: Kousuke Saruta <sarutak@oss.nttdata.com> Signed-off-by: Gengliang Wang <ltnwgl@gmail.com>	2021-05-11 20:53:11 +08:00
Shardul Mahadik	83f753e4e1	[SPARK-34472][YARN] Ship ivySettings file to driver in cluster mode ### What changes were proposed in this pull request? In YARN, ship the `spark.jars.ivySettings` file to the driver when using `cluster` deploy mode so that `addJar` is able to find it in order to resolve ivy paths. ### Why are the changes needed? SPARK-33084 introduced support for Ivy paths in `sc.addJar` or Spark SQL `ADD JAR`. If we use a custom ivySettings file using `spark.jars.ivySettings`, it is loaded at `b26e7b510b/core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala (L1280)`. However, this file is only accessible on the client machine. In YARN cluster mode, this file is not available on the driver and so `addJar` fails to find it. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Added unit tests to verify that the `ivySettings` file is localized by the YARN client and that a YARN cluster mode application is able to find to load the `ivySettings` file. Closes #31591 from shardulm94/SPARK-34472. Authored-by: Shardul Mahadik <smahadik@linkedin.com> Signed-off-by: Thomas Graves <tgraves@apache.org>	2021-04-20 13:35:57 -05:00
Kent Yao	5692aa0c2c	[SPARK-34894][CORE] Use 'io.connectionTimeout' as a hint instead of 'spark.network.timeout' for lost connections ### What changes were proposed in this pull request? Currently, when a connection for TransportClient is marked as idled and closed, we suggest users adjust `spark.network.timeout` for all transport modules. As a lot of timeout configs will fallback to the `spark.network.timeout`, this could be a piece of overkill advice, we should give a more targeted one with `spark.${moduleName}.io.connectionTimeout` ### Why are the changes needed? better advise for overloaded network traffic cases ### Does this PR introduce _any_ user-facing change? yes, when a connection is zombied and closed by spark internally, users can use a more targeted config to tune their jobs ### How was this patch tested? Just log and doc. Passing Jenkins and GA Closes #31990 from yaooqinn/SPARK-34894. Authored-by: Kent Yao <yao@apache.org> Signed-off-by: Kent Yao <yao@apache.org>	2021-03-30 09:58:24 +08:00
Dongjoon Hyun	499cc79344	[SPARK-34503][DOCS][FOLLOWUP] Document available codecs for event log compression ### What changes were proposed in this pull request? This PR is a follow-up of https://github.com/apache/spark/pull/31618 to document the available codecs for event log compression. ### Why are the changes needed? Documentation. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Manual. Closes #31695 from dongjoon-hyun/SPARK-34503-DOC. Authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2021-03-01 15:42:10 -08:00
Dongjoon Hyun	2e31e2c5f3	[SPARK-34503][CORE] Use zstd for spark.eventLog.compression.codec by default ### What changes were proposed in this pull request? Apache Spark 3.0 introduced `spark.eventLog.compression.codec` configuration. For Apache Spark 3.2, this PR aims to set `zstd` as the default value for `spark.eventLog.compression.codec` configuration. This only affects creating a new log file. ### Why are the changes needed? The main purpose of event logs is archiving. Many logs are generated and occupy the storage, but most of them are never accessed by users. 1. Save storage resources (and money) In general, ZSTD is much smaller than LZ4. For example, in case of TPCDS (Scale 200) log, ZSTD generates about 3 times smaller log files than LZ4. \| CODEC \| SIZE (bytes) \| \|---------\|-------------\| \| LZ4 \| 184001434\| \| ZSTD \| 64522396\| And, the plain file is 17.6 times bigger. ``` -rw-r--r-- 1 dongjoon staff 1135464691 Feb 21 22:31 spark-a1843ead29834f46b1125a03eca32679 -rw-r--r-- 1 dongjoon staff 64522396 Feb 21 22:31 spark-a1843ead29834f46b1125a03eca32679.zstd ``` 2. Better Usability We cannot decompress Spark-generated LZ4 event log files via CLI while we can for ZSTD event log files. Spark's LZ4 event log files are inconvenient to some users who want to uncompress and access them. ``` $ lz4 -d spark-d3deba027bd34435ba849e14fc2c42ef.lz4 Decoding file spark-d3deba027bd34435ba849e14fc2c42ef Error 44 : Unrecognized header : file cannot be decoded ``` ``` $ zstd -d spark-a1843ead29834f46b1125a03eca32679.zstd spark-a1843ead29834f46b1125a03eca32679.zstd: 1135464691 bytes ``` 3. Speed The following results are collected by running [lzbench](https://github.com/inikep/lzbench) on the above Spark event log. Note that - This is not a direct comparison of Spark compression/decompression codec. - `lzbench` is an in-memory benchmark. So, it doesn't show the benefit of the reduced network traffic due to the small size of ZSTD. Here, - To get ZSTD 1.4.8-1 result, `lzbench` `master` branch is used because Spark is using ZSTD 1.4.8. - To get LZ4 1.7.5 result, `lzbench` `v1.7` branch is used because Spark is using LZ4 1.7.1. ``` Compressor name Compress. Decompress. Compr. size Ratio Filename memcpy 7393 MB/s 7166 MB/s 1135464691 100.00 spark-a1843ead29834f46b1125a03eca32679 zstd 1.4.8 -1 1344 MB/s 3351 MB/s 56665767 4.99 spark-a1843ead29834f46b1125a03eca32679 lz4 1.7.5 1385 MB/s 4782 MB/s 127662168 11.24 spark-a1843ead29834f46b1125a03eca32679 ``` ### Does this PR introduce _any_ user-facing change? - No for the apps which doesn't use `spark.eventLog.compress` because `spark.eventLog.compress` is disabled by default. - No for the apps using `spark.eventLog.compression.codec` explicitly because this is a change of the default value. - Yes for the apps using `spark.eventLog.compress` without setting `spark.eventLog.compression.codec`. In this case, previously `spark.io.compression.codec` value was used whose default is `lz4`. So this JIRA issue, SPARK-34503, is labeled with `releasenotes`. ### How was this patch tested? Pass the updated UT. Closes #31618 from dongjoon-hyun/SPARK-34503. Authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2021-02-23 16:37:29 -08:00
schintap	bd5039fc35	[SPARK-33741][CORE] Add min threshold time speculation config ### What changes were proposed in this pull request? Add min threshold time speculation config ### Why are the changes needed? When we turn on speculation with default configs we have the last 10% of the tasks subject to speculation. There are a lot of stages where the stage runs for few seconds to minutes. Also in general we don't want to speculate tasks that run within a minimum threshold. By setting a minimum threshold for speculation config gives us better control for speculative tasks ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Unit test Closes #30710 from redsanket/SPARK-33741. Lead-authored-by: schintap <schintap@verizonmedia.com> Co-authored-by: Sanket Chintapalli <chintapalli.sanketreddy@gmail.com> Signed-off-by: Thomas Graves <tgraves@apache.org>	2021-01-13 08:57:56 -06:00
Dongjoon Hyun	47d1aa4e93	[SPARK-33891][DOCS][CORE] Update dynamic allocation related documents ### What changes were proposed in this pull request? This PR aims to update the followings. - Remove the outdated requirement for `spark.shuffle.service.enabled` in `configuration.md` - Dynamic allocation section in `job-scheduling.md` ### Why are the changes needed? To make the document up-to-date. ### Does this PR introduce _any_ user-facing change? No, it's a documentation update. ### How was this patch tested? Manual. BEFORE ![Screen Shot 2020-12-23 at 2 22 04 AM](https://user-images.githubusercontent.com/9700541/102986441-ae647f80-44c5-11eb-97a3-87c2d368952a.png) ![Screen Shot 2020-12-23 at 2 22 34 AM](https://user-images.githubusercontent.com/9700541/102986473-bcb29b80-44c5-11eb-8eae-6802001c6dfa.png) AFTER ![Screen Shot 2020-12-23 at 2 25 36 AM](https://user-images.githubusercontent.com/9700541/102986767-2df24e80-44c6-11eb-8540-e74856a4c313.png) ![Screen Shot 2020-12-23 at 2 21 13 AM](https://user-images.githubusercontent.com/9700541/102986366-8e34c080-44c5-11eb-8054-1efd07c9458c.png) Closes #30906 from dongjoon-hyun/SPARK-33891. Authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-12-23 23:43:21 +09:00
yangjie01	92bfbcb2e3	[SPARK-33631][DOCS][TEST] Clean up spark.core.connection.ack.wait.timeout from configuration.md ### What changes were proposed in this pull request? SPARK-9767 remove `ConnectionManager` and related files, the configuration `spark.core.connection.ack.wait.timeout` previously used by `ConnectionManager` is no longer used by other Spark code, but it still exists in the `configuration.md`. So this pr cleans up the useless configuration item spark.core.connection.ack.wait.timeout` from `configuration.md`. ### Why are the changes needed? Clean up useless configuration from `configuration.md`. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Pass the Jenkins or GitHub Action Closes #30569 from LuciferYang/SPARK-33631. Authored-by: yangjie01 <yangjie01@baidu.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>	2020-12-02 12:58:41 -08:00
HyukjinKwon	1a042cc414	[SPARK-33530][CORE] Support --archives and spark.archives option natively ### What changes were proposed in this pull request? TL;DR: - This PR completes the support of archives in Spark itself instead of Yarn-only - It makes `--archives` option work in other cluster modes too and adds `spark.archives` configuration. - After this PR, PySpark users can leverage Conda to ship Python packages together as below: ```python conda create -y -n pyspark_env -c conda-forge pyarrow==2.0.0 pandas==1.1.4 conda-pack==0.5.0 conda activate pyspark_env conda pack -f -o pyspark_env.tar.gz PYSPARK_DRIVER_PYTHON=python PYSPARK_PYTHON=./environment/bin/python pyspark --archives pyspark_env.tar.gz#environment ``` - Issue a warning that undocumented and hidden behavior of partial archive handling in `spark.files` / `SparkContext.addFile` will be deprecated, and users can use `spark.archives` and `SparkContext.addArchive`. This PR proposes to add Spark's native `--archives` in Spark submit, and `spark.archives` configuration. Currently, both are supported only in Yarn mode: ```bash ./bin/spark-submit --help ``` ``` Options: ... Spark on YARN only: --queue QUEUE_NAME The YARN queue to submit to (Default: "default"). --archives ARCHIVES Comma separated list of archives to be extracted into the working directory of each executor. ``` This `archives` feature is useful often when you have to ship a directory and unpack into executors. One example is native libraries to use e.g. JNI. Another example is to ship Python packages together by Conda environment. Especially for Conda, PySpark currently does not have a nice way to ship a package that works in general, please see also https://hyukjin-spark.readthedocs.io/en/stable/user_guide/python_packaging.html#using-zipped-virtual-environment (PySpark new documentation demo for 3.1.0). The neatest way is arguably to use Conda environment by shipping zipped Conda environment but this is currently dependent on this archive feature. NOTE that we are able to use `spark.files` by relying on its undocumented behaviour that untars `tar.gz` but I don't think we should document such ways and promote people to more rely on it. Also, note that this PR does not target to add the feature parity of `spark.files.overwrite`, `spark.files.useFetchCache`, etc. yet. I documented that this is an experimental feature as well. ### Why are the changes needed? To complete the feature parity, and to provide a better support of shipping Python libraries together with Conda env. ### Does this PR introduce _any_ user-facing change? Yes, this makes `--archives` works in Spark instead of Yarn-only, and adds a new configuration `spark.archives`. ### How was this patch tested? I added unittests. Also, manually tested in standalone cluster, local-cluster, and local modes. Closes #30486 from HyukjinKwon/native-archive. Authored-by: HyukjinKwon <gurwls223@apache.org> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-12-01 13:43:02 +09:00
Josh Soref	485145326a	[MINOR] Spelling bin core docs external mllib repl ### What changes were proposed in this pull request? This PR intends to fix typos in the sub-modules: * `bin` * `core` * `docs` * `external` * `mllib` * `repl` * `pom.xml` Split per srowen https://github.com/apache/spark/pull/30323#issuecomment-728981618 NOTE: The misspellings have been reported at `706a726f87 (commitcomment-44064356)` ### Why are the changes needed? Misspelled words make it harder to read / understand content. ### Does this PR introduce _any_ user-facing change? There are various fixes to documentation, etc... ### How was this patch tested? No testing was performed Closes #30530 from jsoref/spelling-bin-core-docs-external-mllib-repl. Authored-by: Josh Soref <jsoref@users.noreply.github.com> Signed-off-by: Takeshi Yamamuro <yamamuro@apache.org>	2020-11-30 13:59:51 +09:00
Thomas Graves	acfd846753	[SPARK-33288][SPARK-32661][K8S] Stage level scheduling support for Kubernetes ### What changes were proposed in this pull request? This adds support for Stage level scheduling to kubernetes. Kubernetes can support dynamic allocation via the shuffle tracking option which means we can support stage level scheduling by getting new executors. The main changes here are having the k8s cluster manager pass the resource profile id into the executors and then the ExecutorsPodsAllocator has to request executors based on the individual resource profiles. I tried to keep code changes here to a minimum. I specifically choose to leave the ExecutorPodsSnapshot the way it was and construct the resource profile to pod states on the fly, with a fast path when not using other resource profiles, to keep the impact to a minimum. This results in the main changes required are just wrapping the allocation logic in a for loop over each profile. The other main change is in the basic feature step we have to look at the resources in the ResourceProfile to request pods with the correct resources. Much of the other logic like in the executor life cycle manager doesn't need to be resource profile. This also adds support for [SPARK-32661]Spark executors on K8S should request extra memory for off-heap allocations because the stage level scheduling api has support for this and it made sense to make consistent with YARN. This was started with PR https://github.com/apache/spark/pull/29477 but never updated so I just did it here. To do this I moved a few functions around that were now used by both YARN and kubernetes so you will see some changes in Utils. ### Why are the changes needed? Add the feature to Kubernetes based on customer feedback. ### Does this PR introduce _any_ user-facing change? Yes the feature now works with K8s, but not underlying API changes. ### How was this patch tested? Tested manually on kubernetes cluster and with unit tests. Closes #30204 from tgravescs/stagek8sOrigSnapshotsRebase. Lead-authored-by: Thomas Graves <tgraves@apache.org> Co-authored-by: Thomas Graves <tgraves@nvidia.com> Signed-off-by: Thomas Graves <tgraves@apache.org>	2020-11-13 16:04:13 -06:00
Kent Yao	4335af075a	[MINOR][DOC] spark.executor.memoryOverhead is not cluster-mode only ### What changes were proposed in this pull request? Remove "in cluster mode" from the description of `spark.executor.memoryOverhead` ### Why are the changes needed? fix correctness issue in documentaion ### Does this PR introduce _any_ user-facing change? yes, users may not get confused about the description `spark.executor.memoryOverhead` ### How was this patch tested? pass GA doc generation Closes #30311 from yaooqinn/minordoc. Authored-by: Kent Yao <yaooqinn@hotmail.com> Signed-off-by: Takeshi Yamamuro <yamamuro@apache.org>	2020-11-12 18:53:06 +09:00
Gengliang Wang	2b6dfa5f7b	[SPARK-20044][UI] Support Spark UI behind front-end reverse proxy using a path prefix Revert proxy url ### What changes were proposed in this pull request? Allow to run the Spark web UI behind a reverse proxy with URLs prefixed by a context root, like www.mydomain.com/spark. In particular, this allows to access multiple Spark clusters through the same virtual host, only distinguishing them by context root, like www.mydomain.com/cluster1, www.mydomain.com/cluster2, and it allows to run the Spark UI in a common cookie domain (for SSO) with other services. ### Why are the changes needed? This PR is to take over https://github.com/apache/spark/pull/17455. After changes, Spark allows showing customized prefix URL in all the `href` links of the HTML pages. ### Does this PR introduce _any_ user-facing change? Yes, all the links of UI pages will be contains the value of `spark.ui.reverseProxyUrl` if it is configurated. ### How was this patch tested? New HTML Unit tests in MasterSuite Manual UI testing for master, worker and app UI with an nginx proxy Spark config: ``` spark.ui.port 8080 spark.ui.reverseProxy=true spark.ui.reverseProxyUrl=/path/to/spark/ ``` nginx config: ``` server { listen 9000; set $SPARK_MASTER http://127.0.0.1:8080; # split spark UI path into prefix and local path within master UI location ~ ^(/path/to/spark/) { # strip prefix when forwarding request rewrite /path/to/spark(/.*) $1 break; #rewrite /path/to/spark/ "/" ; # forward to spark master UI proxy_pass $SPARK_MASTER; proxy_intercept_errors on; error_page 301 302 307 = handle_redirects; } location handle_redirects { set $saved_redirect_location '$upstream_http_location'; proxy_pass $saved_redirect_location; } } ``` Closes #29820 from gengliangwang/revertProxyURL. Lead-authored-by: Gengliang Wang <gengliang.wang@databricks.com> Co-authored-by: Oliver Köth <okoeth@de.ibm.com> Signed-off-by: Gengliang Wang <gengliang.wang@databricks.com>	2020-11-01 23:57:57 +08:00
Thomas Graves	72ad9dcd5d	[SPARK-32037][CORE] Rename blacklisting feature ### What changes were proposed in this pull request? this PR renames the blacklisting feature. I ended up using "excludeOnFailure" or "excluded" in most cases but there is a mix. I renamed the BlacklistTracker to HealthTracker, but for the TaskSetBlacklist HealthTracker didn't make sense to me since its not the health of the taskset itself but rather tracking the things its excluded on so I renamed it to be TaskSetExcludeList. Everything else I tried to use the context and in most cases excluded made sense. It made more sense to me then blocked since you are basically excluding those executors and nodes from scheduling tasks on them. Then can be unexcluded later after timeouts and such. The configs I changed the name to use excludeOnFailure which I thought explained it. I unfortunately couldn't get rid of some of them because its part of the event listener and history files. To keep backwards compatibility I kept the events and some of the parsing so that the history server would still properly read older history files. It is not forward compatible though - meaning a new application write the "Excluded" events so the older history server won't properly read display them as being blacklisted. A few of the files below are showing up as deleted and recreated even though I did a git mv on them. I'm not sure why. ### Why are the changes needed? get rid of problematic language ### Does this PR introduce _any_ user-facing change? Config name changes but the old configs still work but are deprecated. ### How was this patch tested? updated tests and also manually tested the UI changes and manually tested the history server reading older versions of history files and vice versa. Closes #29906 from tgravescs/SPARK-32037. Lead-authored-by: Thomas Graves <tgraves@nvidia.com> Co-authored-by: Thomas Graves <tgraves@apache.org> Signed-off-by: Thomas Graves <tgraves@apache.org>	2020-10-30 17:16:53 -05:00
Dongjoon Hyun	cc06266ade	[SPARK-33019][CORE] Use spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version=1 by default ### What changes were proposed in this pull request? Apache Spark 3.1's default Hadoop profile is `hadoop-3.2`. Instead of having a warning documentation, this PR aims to use a consistent and safer version of Apache Hadoop file output committer algorithm which is `v1`. This will prevent a silent correctness regression during migration from Apache Spark 2.4/3.0 to Apache Spark 3.1.0. Of course, if there is a user-provided configuration, `spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version=2`, that will be used still. ### Why are the changes needed? Apache Spark provides multiple distributions with Hadoop 2.7 and Hadoop 3.2. `spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version` depends on the Hadoop version. Apache Hadoop 3.0 switches the default algorithm from `v1` to `v2` and now there exists a discussion to remove `v2`. We had better provide a consistent default behavior of `v1` across various Spark distributions. - [MAPREDUCE-7282](https://issues.apache.org/jira/browse/MAPREDUCE-7282) MR v2 commit algorithm should be deprecated and not the default ### Does this PR introduce _any_ user-facing change? Yes. This changes the default behavior. Users can override this conf. ### How was this patch tested? Manual. BEFORE (spark-3.0.1-bin-hadoop3.2) ```scala scala> sc.version res0: String = 3.0.1 scala> sc.hadoopConfiguration.get("mapreduce.fileoutputcommitter.algorithm.version") res1: String = 2 ``` AFTER ```scala scala> sc.hadoopConfiguration.get("mapreduce.fileoutputcommitter.algorithm.version") res0: String = 1 ``` Closes #29895 from dongjoon-hyun/SPARK-DEFAUT-COMMITTER. Authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2020-09-29 12:02:45 -07:00
waleedfateem	8749b2b6fa	[SPARK-32701][CORE][DOCS] mapreduce.fileoutputcommitter.algorithm.version default value The current documentation states that the default value of spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version is 1 which is not entirely true since this configuration isn't set anywhere in Spark but rather inherited from the Hadoop FileOutputCommitter class. ### What changes were proposed in this pull request? I'm submitting this change, to clarify that the default value will entirely depend on the Hadoop version of the runtime environment. ### Why are the changes needed? An application would end up using algorithm version 1 on certain environments but without any changes the same exact application will use version 2 on environments running Hadoop 3.0 and later. This can have pretty bad consequences in certain scenarios, for example, two tasks can partially overwrite their output if speculation is enabled. Also, please refer to the following JIRA: https://issues.apache.org/jira/browse/MAPREDUCE-7282 ### Does this PR introduce _any_ user-facing change? Yes. Configuration page content was modified where previously we explicitly highlighted that the default version for the FileOutputCommitter algorithm was v1, this now has changed to "Dependent on environment" with additional information in the description column to elaborate. ### How was this patch tested? Checked changes locally in browser Closes #29541 from waleedfateem/SPARK-32701. Authored-by: waleedfateem <waleed.fateem@gmail.com> Signed-off-by: Sean Owen <srowen@gmail.com>	2020-08-27 09:05:50 -05:00
Thomas Graves	e926d419d3	[SPARK-30322][DOCS] Add stage level scheduling docs ### What changes were proposed in this pull request? Document the stage level scheduling feature. ### Why are the changes needed? Document the stage level scheduling feature. ### Does this PR introduce _any_ user-facing change? Documentation. ### How was this patch tested? n/a docs only Closes #29292 from tgravescs/SPARK-30322. Authored-by: Thomas Graves <tgraves@nvidia.com> Signed-off-by: Thomas Graves <tgraves@apache.org>	2020-07-29 13:46:28 -05:00
HyukjinKwon	4ad9bfd53b	[SPARK-32138] Drop Python 2.7, 3.4 and 3.5 ### What changes were proposed in this pull request? This PR aims to drop Python 2.7, 3.4 and 3.5. Roughly speaking, it removes all the widely known Python 2 compatibility workarounds such as `sys.version` comparison, `__future__`. Also, it removes the Python 2 dedicated codes such as `ArrayConstructor` in Spark. ### Why are the changes needed? 1. Unsupport EOL Python versions 2. Reduce maintenance overhead and remove a bit of legacy codes and hacks for Python 2. 3. PyPy2 has a critical bug that causes a flaky test, SPARK-28358 given my testing and investigation. 4. Users can use Python type hints with Pandas UDFs without thinking about Python version 5. Users can leverage one latest cloudpickle, https://github.com/apache/spark/pull/28950. With Python 3.8+ it can also leverage C pickle. ### Does this PR introduce _any_ user-facing change? Yes, users cannot use Python 2.7, 3.4 and 3.5 in the upcoming Spark version. ### How was this patch tested? Manually tested and also tested in Jenkins. Closes #28957 from HyukjinKwon/SPARK-32138. Authored-by: HyukjinKwon <gurwls223@apache.org> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-07-14 11:22:44 +09:00
Holden Karau	90ac9f975b	[SPARK-32004][ALL] Drop references to slave ### What changes were proposed in this pull request? This change replaces the world slave with alternatives matching the context. ### Why are the changes needed? There is no need to call things slave, we might as well use better clearer names. ### Does this PR introduce _any_ user-facing change? Yes, the ouput JSON does change. To allow backwards compatibility this is an additive change. The shell scripts for starting & stopping workers are renamed, and for backwards compatibility old scripts are added to call through to the new ones while printing a deprecation message to stderr. ### How was this patch tested? Existing tests. Closes #28864 from holdenk/SPARK-32004-drop-references-to-slave. Lead-authored-by: Holden Karau <hkarau@apple.com> Co-authored-by: Holden Karau <holden@pigscanfly.ca> Signed-off-by: Holden Karau <hkarau@apple.com>	2020-07-13 14:05:33 -07:00
yi.wu	54e702c0dd	[SPARK-31970][CORE] Make MDC configuration step be consistent between setLocalProperty and log4j.properties ### What changes were proposed in this pull request? This PR proposes to use "mdc.XXX" as the consistent key for both `sc.setLocalProperty` and `log4j.properties` when setting up configurations for MDC. ### Why are the changes needed? It's weird that we use "mdc.XXX" as key to set MDC value via `sc.setLocalProperty` while we use "XXX" as key to set MDC pattern in log4j.properties. It could also bring extra burden to the user. ### Does this PR introduce _any_ user-facing change? No, as MDC feature is added in version 3.1, which hasn't been released. ### How was this patch tested? Tested manually. Closes #28801 from Ngone51/consistent-mdc. Authored-by: yi.wu <yi.wu@databricks.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>	2020-06-14 14:26:11 -07:00
Izek Greenfield	eaf7a2a4ed	[SPARK-8981][CORE][TEST-HADOOP3.2][TEST-JAVA11] Add MDC support in Executor ### What changes were proposed in this pull request? Added MDC support in all thread pools. ThreaddUtils create new pools that pass over MDC. ### Why are the changes needed? In many cases, it is very hard to understand from which actions the logs in the executor come from. when you are doing multi-thread work in the driver and send actions in parallel. ### Does this PR introduce any user-facing change? No ### How was this patch tested? No test added because no new functionality added it is thread pull change and all current tests pass. Closes #26624 from igreenfield/master. Authored-by: Izek Greenfield <igreenfield@axiomsl.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2020-05-20 07:41:00 +00:00
Xingbo Jiang	b7cde42b04	[SPARK-31619][CORE] Rename config "spark.dynamicAllocation.shuffleTimeout" to "spark.dynamicAllocation.shuffleTracking.timeout" ### What changes were proposed in this pull request? The "spark.dynamicAllocation.shuffleTimeout" configuration only takes effect if "spark.dynamicAllocation.shuffleTracking.enabled" is true, so we should re-namespace that configuration so that it's nested under the "shuffleTracking" one. ### How was this patch tested? Covered by current existing test cases. Closes #28426 from jiangxb1987/confName. Authored-by: Xingbo Jiang <xingbo.jiang@databricks.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-05-01 11:46:17 +09:00
Kent Yao	5ba467ca1d	[SPARK-31550][SQL][DOCS] Set nondeterministic configurations with general meanings in sql configuration doc ### What changes were proposed in this pull request? ```scala spark.sql.session.timeZone spark.sql.warehouse.dir ``` these 2 configs are nondeterministic and vary with environments Besides, reflect code in `gen-sql-config-docs.py` via https://github.com/apache/spark/pull/28274#discussion_r412893096 and `configuration.md` via https://github.com/apache/spark/pull/28274#discussion_r412894905 ### Why are the changes needed? doc fix ### Does this PR introduce any user-facing change? no ### How was this patch tested? verify locally ![image](https://user-images.githubusercontent.com/8326978/80179099-5e7da200-8632-11ea-803f-d47a93151869.png) Closes #28322 from yaooqinn/SPARK-31550. Authored-by: Kent Yao <yaooqinn@hotmail.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-04-27 17:08:52 +09:00
Kent Yao	2c2062ea7c	[SPARK-31498][SQL][DOCS] Dump public static sql configurations through doc generation ### What changes were proposed in this pull request? Currently, only the non-static public SQL configurations are dump to public doc, we'd better also add those static public ones as the command `set -v` This PR force call StaticSQLConf to buildStaticConf. ### Why are the changes needed? Fix missing SQL configurations in doc ### Does this PR introduce any user-facing change? NO ### How was this patch tested? add unit test and verify locally to see if public static SQL conf is in `docs/sql-config.html` Closes #28274 from yaooqinn/SPARK-31498. Authored-by: Kent Yao <yaooqinn@hotmail.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2020-04-22 10:16:39 +00:00
Takeshi Yamamuro	e42dbe7cd4	[SPARK-31429][SQL][DOC] Automatically generates a SQL document for built-in functions ### What changes were proposed in this pull request? This PR intends to add a Python script to generates a SQL document for built-in functions and the document in SQL references. ### Why are the changes needed? To make SQL references complete. ### Does this PR introduce any user-facing change? Yes; ![a](https://user-images.githubusercontent.com/692303/79406712-c39e1b80-7fd2-11ea-8b85-9f9cbb6efed3.png) ![b](https://user-images.githubusercontent.com/692303/79320526-eb46a280-7f44-11ea-8639-90b1fb2b8848.png) ![c](https://user-images.githubusercontent.com/692303/79320707-3365c500-7f45-11ea-9984-69ffe800fb87.png) ### How was this patch tested? Manually checked and added tests. Closes #28224 from maropu/SPARK-31429. Lead-authored-by: Takeshi Yamamuro <yamamuro@apache.org> Co-authored-by: HyukjinKwon <gurwls223@apache.org> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-04-21 10:55:13 +09:00
beliefer	0fc859b4d5	[SPARK-31269][DOC][FOLLOWUP][MINOR] Add version head of GraphX table ### What changes were proposed in this pull request? HyukjinKwon have ported back all the PR about version to branch-3.0. I make a double check and found GraphX table lost version head. This PR will fix the issue. HyukjinKwon, please help me merge this PR to master and branch-3.0 ### Why are the changes needed? Add version head of GraphX table ### Does this PR introduce any user-facing change? 'No'. ### How was this patch tested? Jenkins test. Closes #28149 from beliefer/fix-head-of-graphx-table. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-04-08 12:25:06 +09:00
Dongjoon Hyun	3886442332	[SPARK-27963][DOCS][FOLLOWUP] Update requirements for spark.dynamicAllocation.enabled ### What changes were proposed in this pull request? This PR fixes the outdated requirement for `spark.dynamicAllocation.enabled=true`. ### Why are the changes needed? This is found during 3.0.0 RC1 document review and testing. As described at `spark.dynamicAllocation.shuffleTracking.enabled` in the same table, we can enabled Dynamic Allocation without external shuffle service. ### Does this PR introduce any user-facing change? Yes. (Doc.) ### How was this patch tested? Manually generate the doc by `SKIP_API=1 jekyll build` BEFORE ![Screen Shot 2020-04-05 at 2 31 23 PM](https://user-images.githubusercontent.com/9700541/78510472-29c0ae00-774a-11ea-9916-ba80015fae82.png) AFTER ![Screen Shot 2020-04-05 at 2 29 25 PM](https://user-images.githubusercontent.com/9700541/78510434-ea925d00-7749-11ea-8db8-018955507fd5.png) Closes #28132 from dongjoon-hyun/SPARK-DA-DOC. Authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-04-06 11:04:21 +09:00
Thomas Graves	55dea9be62	[SPARK-29153][CORE] Add ability to merge resource profiles within a stage with Stage Level Scheduling ### What changes were proposed in this pull request? For the stage level scheduling feature, add the ability to optionally merged resource profiles if they were specified on multiple RDD within a stage. There is a config to enable this feature, its off by default (spark.scheduler.resourceProfile.mergeConflicts). When the config is set to true, Spark will merge the profiles selecting the max value of each resource (cores, memory, gpu, etc). further documentation will be added with SPARK-30322. This also added in the ability to check if an equivalent resource profile already exists. This is so that if a user is running stages and combining the same profiles over and over again we don't get an explosion in the number of profiles. ### Why are the changes needed? To allow users to specify resource on multiple RDD and not worry as much about if they go into the same stage and fail. ### Does this PR introduce any user-facing change? Yes, when the config is turned on it now merges the profiles instead of errorring out. ### How was this patch tested? Unit tests Closes #28053 from tgravescs/SPARK-29153. Lead-authored-by: Thomas Graves <tgraves@apache.org> Co-authored-by: Thomas Graves <tgraves@nvidia.com> Signed-off-by: Thomas Graves <tgraves@apache.org>	2020-04-02 08:30:18 -05:00
beliefer	18b73a5b59	[SPARK-31269][DOC] Supplement version for configuration only appear in configuration doc ### What changes were proposed in this pull request? The `configuration.md` exists some config not organized by `ConfigEntry`. This PR supplements version for configuration only appear in configuration doc. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.app.name \| 0.9.0 \| None \| 994f080f8ae3372366e6004600ba791c8a372ff0#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.driver.resource.{resourceName}.amount \| 3.0.0 \| SPARK-27760 \| d30284b5a51dd784f663eb4eea37087b35a54d00#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.driver.resource.{resourceName}.discoveryScript \| 3.0.0 \| SPARK-27488 \| 74e5e41eebf9ed596b48e6db52a2a9c642e5cbc3#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.driver.resource.{resourceName}.vendor \| 3.0.0 \| SPARK-27362 \| 1277f8fa92da85d9e39d9146e3099fcb75c71a8f#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.executor.resource.{resourceName}.amount \| 3.0.0 \| SPARK-27760 \| d30284b5a51dd784f663eb4eea37087b35a54d00#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.executor.resource.{resourceType}.discoveryScript \| 3.0.0 \| SPARK-27024 \| db2e3c43412e4a7fb4a46c58d73d9ab304a1e949#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.executor.resource.{resourceName}.vendor \| 3.0.0 \| SPARK-27362 \| 1277f8fa92da85d9e39d9146e3099fcb75c71a8f#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.local.dir \| 0.5.0 \| None \| 0e93891d3d7df849cff6442038c111ffd42a5243#diff-17fd275d280b667722664ed833c6402a \| spark.logConf \| 0.9.0 \| None \| d8bcc8e9a095c1b20dd7a17b6535800d39bff80e#diff-364713d7776956cb8b0a771e9b62f82d \| spark.master \| 0.9.0 \| SPARK-544 \| 2573add94cf920a88f74d80d8ea94218d812704d#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.driver.defaultJavaOptions \| 3.0.0 \| SPARK-23472 \| f83000597f250868de9722d8285fed013abc5ecf#diff-a78ecfc6a89edfaf0b60a5eaa0381970 \| spark.executor.defaultJavaOptions \| 3.0.0 \| SPARK-23472 \| f83000597f250868de9722d8285fed013abc5ecf#diff-a78ecfc6a89edfaf0b60a5eaa0381970 \| spark.executorEnv.[EnvironmentVariableName] \| 0.9.0 \| None \| 642029e7f43322f84abe4f7f36bb0b1b95d8101d#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.python.profile \| 1.2.0 \| SPARK-3478 \| 1aa549ba9839565274a12c52fa1075b424f138a6#diff-d6fe2792e44f6babc94aabfefc8b9bce \| spark.python.profile.dump \| 1.2.0 \| SPARK-3478 \| 1aa549ba9839565274a12c52fa1075b424f138a6#diff-d6fe2792e44f6babc94aabfefc8b9bce \| spark.python.worker.memory \| 1.1.0 \| SPARK-2538 \| 14174abd421318e71c16edd24224fd5094bdfed4#diff-d6fe2792e44f6babc94aabfefc8b9bce \| spark.jars.packages \| 1.5.0 \| SPARK-9263 \| 34335719a372c1951fdb4dd25b75b086faf1076f#diff-63a5d817d2d45ae24de577f6a1bd80f9 \| spark.jars.excludes \| 1.5.0 \| SPARK-9263 \| 34335719a372c1951fdb4dd25b75b086faf1076f#diff-63a5d817d2d45ae24de577f6a1bd80f9 \| spark.jars.ivy \| 1.3.0 \| SPARK-5341 \| 3b7acd22ab4a134c74746e3b9a803dbd34d43855#diff-63a5d817d2d45ae24de577f6a1bd80f9 \| spark.jars.ivySettings \| 2.2.0 \| SPARK-17568 \| 3bc2eff8880a3ba8d4318118715ea1a47048e3de#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.jars.repositories \| 2.3.0 \| SPARK-21403 \| d8257b99ddae23f702f312640a5335ddb4554403#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.shuffle.io.maxRetries \| 1.2.0 \| SPARK-4188 \| c1ea5c542f3267c0b23a7775887e3a6ece793fe3#diff-d2ce9b38bdc38ca9d7119f9c2cf79907 \| spark.shuffle.io.numConnectionsPerPeer \| 1.2.1 \| SPARK-4740 \| 441ec3451730c7ae3dbef8952e313071d6147ab6#diff-d2ce9b38bdc38ca9d7119f9c2cf79907 \| spark.shuffle.io.preferDirectBufs \| 1.2.0 \| SPARK-4188 \| c1ea5c542f3267c0b23a7775887e3a6ece793fe3#diff-d2ce9b38bdc38ca9d7119f9c2cf79907 \| spark.shuffle.io.retryWait \| 1.2.1 \| None \| 5e5d8f469a1bea9bbe606f772ccdcab7c184c651#diff-d2ce9b38bdc38ca9d7119f9c2cf79907 \| spark.shuffle.io.backLog \| 1.1.1 \| SPARK-2468 \| 66b4c81db7e826c00f7fb449b8a8af810cf7dd9a#diff-bdee8e601924d41e93baa7287189e878 \| spark.shuffle.service.index.cache.size \| 2.3.0 \| SPARK-21501 \| 1662e93119d68498942386906de309d35f4a135f#diff-97d5edc927a83a678e013ae00343df94 \| spark.shuffle.maxChunksBeingTransferred \| 2.3.0 \| SPARK-21175 \| 799e13161e89f1ea96cb1bc7b507a05af2e89cd0#diff-0ac65da2bc6b083fb861fe410c7688c2 \| spark.sql.ui.retainedExecutions \| 1.5.0 \| SPARK-8861 and SPARK-8862 \| ebc3aad272b91cf58e2e1b4aa92b49b8a947a045#diff-81764e4d52817f83bdd5336ef1226bd9 \| spark.streaming.ui.retainedBatches \| 1.0.0 \| SPARK-1386 \| f36dc3fed0a0671b0712d664db859da28c0a98e2#diff-56b8d67d07284cfab165d5363bd3500e \| spark.default.parallelism \| 0.5.0 \| None \| e5c4cd8a5e188592f8786a265c0cd073c69ac886#diff-0544ebf7533fa70ff5103e0fe1f0b036 \| spark.files.fetchTimeout \| 1.0.0 \| None \| f6f9d02e85d17da2f742ed0062f1648a9293e73c#diff-d239aee594001f8391676e1047a0381e \| spark.files.useFetchCache \| 1.2.2 \| SPARK-6313 \| a2a94a154bdd00753b8d5e344d712664c7151050#diff-d239aee594001f8391676e1047a0381e \| spark.files.overwrite \| 1.0.0 \| None \| 84670f2715392859624df290c1b52eb4ed4a9cb1#diff-d239aee594001f8391676e1047a0381e \| Exists in branch-1.0, but the version of pom is 0.9.0-incubating-SNAPSHOT spark.hadoop.cloneConf \| 1.0.3 \| SPARK-2546 \| 6d8f1dd15afdc7432b5721c89f9b2b402460322b#diff-83eb37f7b0ebed3c14ccb7bff0d577c2 \| spark.hadoop.validateOutputSpecs \| 1.0.1 \| SPARK-1677 \| 8100cbdb7546e8438019443cfc00683017c81278#diff-f70e97c099b5eac05c75288cb215e080 \| spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version \| 2.2.0 \| SPARK-20107 \| edc87d76efea7b4d19d9d0c4ddba274a3ccb8752#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.rpc.io.backLog \| 3.0.0 \| SPARK-27868 \| 09ed64d795d3199a94e175273fff6fcea6b52131#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.network.io.preferDirectBufs \| 3.0.0 \| SPARK-24920 \| e103c4a5e72bab8862ff49d6d4c1e62e642fc412#diff-0ac65da2bc6b083fb861fe410c7688c2 \| spark.port.maxRetries \| 1.1.1 \| SPARK-3565 \| 32f2222e915f31422089139944a077e2cbd442f9#diff-d239aee594001f8391676e1047a0381e \| spark.core.connection.ack.wait.timeout \| 1.1.1 \| SPARK-2677 \| bd3ce2ffb8964abb4d59918ebb2c230fe4614aa2#diff-f748e95f2aa97ed715afa53ddeeac9de \| spark.scheduler.listenerbus.eventqueue.shared.capacity \| 3.0.0 \| SPARK-28574 \| c212c9d9ed7375cd1ea16c118733edd84037ec0d#diff-eb519ad78cc3cf0b95839cc37413b509 \| spark.scheduler.listenerbus.eventqueue.appStatus.capacity \| 3.0.0 \| SPARK-28574 \| c212c9d9ed7375cd1ea16c118733edd84037ec0d#diff-eb519ad78cc3cf0b95839cc37413b509 \| spark.scheduler.listenerbus.eventqueue.executorManagement.capacity \| 3.0.0 \| SPARK-28574 \| c212c9d9ed7375cd1ea16c118733edd84037ec0d#diff-eb519ad78cc3cf0b95839cc37413b509 \| spark.scheduler.listenerbus.eventqueue.eventLog.capacity \| 3.0.0 \| SPARK-28574 \| c212c9d9ed7375cd1ea16c118733edd84037ec0d#diff-eb519ad78cc3cf0b95839cc37413b509 \| spark.scheduler.listenerbus.eventqueue.streams.capacity \| 3.0.0 \| SPARK-28574 \| c212c9d9ed7375cd1ea16c118733edd84037ec0d#diff-eb519ad78cc3cf0b95839cc37413b509 \| spark.task.resource.{resourceName}.amount \| 3.0.0 \| SPARK-27760 \| d30284b5a51dd784f663eb4eea37087b35a54d00#diff-76e731333fb756df3bff5ddb3b731c46 \| spark.stage.maxConsecutiveAttempts \| 2.2.0 \| SPARK-13369 \| 7b5d873aef672aa0aee41e338bab7428101e1ad3#diff-6a9ff7fb74fd490a50462d45db2d5e11 \| spark.{driver\\|executor}.rpc.io.serverThreads \| 1.6.0 \| SPARK-10745 \| 7c5b641808740ba5eed05ba8204cdbaf3fc579f5#diff-d2ce9b38bdc38ca9d7119f9c2cf79907 \| spark.{driver\\|executor}.rpc.io.clientThreads \| 1.6.0 \| SPARK-10745 \| 7c5b641808740ba5eed05ba8204cdbaf3fc579f5#diff-d2ce9b38bdc38ca9d7119f9c2cf79907 \| spark.{driver\\|executor}.rpc.netty.dispatcher.numThreads \| 3.0.0 \| SPARK-29398 \| 2f0a38cb50e3e8b4b72219c7b2b8b15d51f6b931#diff-a68a21481fea5053848ca666dd3201d8 \| spark.r.driver.command \| 1.5.3 \| SPARK-10971 \| 9695f452e86a88bef3bcbd1f3c0b00ad9e9ac6e1#diff-025470e1b7094d7cf4a78ea353fb3981 \| spark.r.shell.command \| 2.1.0 \| SPARK-17178 \| fa6347938fc1c72ddc03a5f3cd2e929b5694f0a6#diff-a78ecfc6a89edfaf0b60a5eaa0381970 \| spark.graphx.pregel.checkpointInterval \| 2.2.0 \| SPARK-5484 \| f971ce5dd0788fe7f5d2ca820b9ea3db72033ddc#diff-e399679417ffa6eeedf26a7630baca16 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? 'No'. ### How was this patch tested? Jenkins test Closes #28035 from beliefer/supplement-configuration-version. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-31 12:32:04 +09:00
beliefer	bed21770af	[SPARK-31215][SQL][DOC] Add version information to the static configuration of SQL ### What changes were proposed in this pull request? Add version information to the static configuration of `SQL`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.sql.warehouse.dir \| 2.0.0 \| SPARK-14994 \| 054f991c4350af1350af7a4109ee77f4a34822f0#diff-32bb9518401c0948c5ea19377b5069ab \| spark.sql.catalogImplementation \| 2.0.0 \| SPARK-14720 and SPARK-13643 \| 8fc267ab3322e46db81e725a5cb1adb5a71b2b4d#diff-6bdad48cfc34314e89599655442ff210 \| spark.sql.globalTempDatabase \| 2.1.0 \| SPARK-17338 \| 23ddff4b2b2744c3dc84d928e144c541ad5df376#diff-6bdad48cfc34314e89599655442ff210 \| spark.sql.sources.schemaStringLengthThreshold \| 1.3.1 \| SPARK-6024 \| 6200f0709c5c8440decae8bf700d7859f32ac9d5#diff-41ef65b9ef5b518f77e2a03559893f4d \| 1.3 spark.sql.filesourceTableRelationCacheSize \| 2.2.0 \| SPARK-19265 \| 9d9d67c7957f7cbbdbe889bdbc073568b2bfbb16#diff-32bb9518401c0948c5ea19377b5069ab \| spark.sql.codegen.cache.maxEntries \| 2.4.0 \| SPARK-24727 \| b2deef64f604ddd9502a31105ed47cb63470ec85#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.codegen.comments \| 2.0.0 \| SPARK-15680 \| f0e8738c1ec0e4c5526aeada6f50cf76428f9afd#diff-8bcc5aea39c73d4bf38aef6f6951d42c \| spark.sql.debug \| 2.1.0 \| SPARK-17899 \| db8784feaa605adcbd37af4bc8b7146479b631f8#diff-32bb9518401c0948c5ea19377b5069ab \| spark.sql.hive.thriftServer.singleSession \| 1.6.0 \| SPARK-11089 \| 167ea61a6a604fd9c0b00122a94d1bc4b1de24ff#diff-ff50aea397a607b79df9bec6f2a841db \| spark.sql.extensions \| 2.2.0 \| SPARK-18127 \| f0de600797ff4883927d0c70732675fd8629e239#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.queryExecutionListeners \| 2.3.0 \| SPARK-19558 \| bd4eb9ce57da7bacff69d9ed958c94f349b7e6fb#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.streaming.streamingQueryListeners \| 2.4.0 \| SPARK-24479 \| 7703b46d2843db99e28110c4c7ccf60934412504#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.ui.retainedExecutions \| 1.5.0 \| SPARK-8861 and SPARK-8862 \| ebc3aad272b91cf58e2e1b4aa92b49b8a947a045#diff-81764e4d52817f83bdd5336ef1226bd9 \| spark.sql.broadcastExchange.maxThreadThreshold \| 3.0.0 \| SPARK-26601 \| 126310ca68f2f248ea8b312c4637eccaba2fdc2b#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.subquery.maxThreadThreshold \| 2.4.6 \| SPARK-30556 \| 2fc562cafd71ec8f438f37a28b65118906ab2ad2#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.event.truncate.length \| 3.0.0 \| SPARK-27045 \| e60d8fce0b0cf2a6d766ea2fc5f994546550570a#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.legacy.sessionInitWithConfigDefaults \| 3.0.0 \| SPARK-27253 \| 83f628b57da39ad9732d1393aebac373634a2eb9#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.defaultUrlStreamHandlerFactory.enabled \| 3.0.0 \| SPARK-25694 \| 8469614c0513fbed87977d4e741649db3fdd8add#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.streaming.ui.enabled \| 3.0.0 \| SPARK-29543 \| f9b86370cb04b72a4f00cbd4d60873960aa2792c#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.streaming.ui.retainedProgressUpdates \| 3.0.0 \| SPARK-29543 \| f9b86370cb04b72a4f00cbd4d60873960aa2792c#diff-5081b9388de3add800b6e4a6ddf55c01 \| spark.sql.streaming.ui.retainedQueries \| 3.0.0 \| SPARK-29543 \| f9b86370cb04b72a4f00cbd4d60873960aa2792c#diff-5081b9388de3add800b6e4a6ddf55c01 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? 'No'. ### How was this patch tested? Exists UT Closes #27981 from beliefer/add-version-to-sql-static-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-31 12:31:25 +09:00
beliefer	35d286bafb	[SPARK-31228][DSTREAMS] Add version information to the configuration of Kafka ### What changes were proposed in this pull request? Add version information to the configuration of Kafka. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.streaming.kafka.consumer.cache.enabled \| 2.2.1 \| SPARK-19185 \| 02cf178bb2a7dc8b4c06eb040c44b6453e41ed15#diff-c465bbcc83b2ecc7530d1c0128e4432b \| spark.streaming.kafka.consumer.poll.ms \| 2.0.1 \| SPARK-12177 \| 3134f116a3565c3a299fa2e7094acd7304d64280#diff-4597d93a0e951f7199697dba7dd0dc32 \| spark.streaming.kafka.consumer.cache.initialCapacity \| 2.0.1 \| SPARK-12177 \| 3134f116a3565c3a299fa2e7094acd7304d64280#diff-4597d93a0e951f7199697dba7dd0dc32 \| spark.streaming.kafka.consumer.cache.maxCapacity \| 2.0.1 \| SPARK-12177 \| 3134f116a3565c3a299fa2e7094acd7304d64280#diff-4597d93a0e951f7199697dba7dd0dc32 \| spark.streaming.kafka.consumer.cache.loadFactor \| 2.0.1 \| SPARK-12177 \| 3134f116a3565c3a299fa2e7094acd7304d64280#diff-4597d93a0e951f7199697dba7dd0dc32 \| spark.streaming.kafka.maxRatePerPartition \| 1.3.0 \| SPARK-4964 \| a119cae48030520da9f26ee9a1270bed7f33031e#diff-26cb4369f86050dc2e75cd16291b2844 \| spark.streaming.kafka.minRatePerPartition \| 2.4.0 \| SPARK-25233 \| 135ff16a3510a4dfb3470904004dae9848005019#diff-815f6ec5caf9e4beb355f5f981171f1f \| spark.streaming.kafka.allowNonConsecutiveOffsets \| 2.3.1 \| SPARK-24067 \| 1d598b771de3b588a2f377ae7ccf8193156641f2#diff-4597d93a0e951f7199697dba7dd0dc32 \| spark.kafka.producer.cache.timeout \| 2.2.1 \| SPARK-19968 \| f6730a70cb47ebb3df7f42209df7b076aece1093#diff-ac8844e8d791a75aaee3d0d10bfc1f2a \| spark.kafka.producer.cache.evictorThreadRunInterval \| 3.0.0 \| SPARK-21869 \| 7bff2db9ed803e05a43c2d875c1dea819d81248a#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.consumer.cache.capacity \| 3.0.0 \| SPARK-27687 \| efa303581ac61d6f517aacd08883da2d01530bd2#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.consumer.cache.jmx.enable \| 3.0.0 \| SPARK-25151 \| 594c9c5a3ece0e913949c7160bb4925e5d289e44#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.consumer.cache.timeout \| 3.0.0 \| SPARK-25151 \| 594c9c5a3ece0e913949c7160bb4925e5d289e44#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.consumer.cache.evictorThreadRunInterval \| 3.0.0 \| SPARK-25151 \| 594c9c5a3ece0e913949c7160bb4925e5d289e44#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.consumer.fetchedData.cache.timeout \| 3.0.0 \| SPARK-25151 \| 594c9c5a3ece0e913949c7160bb4925e5d289e44#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.consumer.fetchedData.cache.evictorThreadRunInterval \| 3.0.0 \| SPARK-25151 \| 594c9c5a3ece0e913949c7160bb4925e5d289e44#diff-ea8349d528fe8d1b0a8ffa2840ff4bcd \| spark.kafka.clusters.${cluster}.auth.bootstrap.servers \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.target.bootstrap.servers.regex \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.security.protocol \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.sasl.kerberos.service.name \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.ssl.truststore.location \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.ssl.truststore.password \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.ssl.keystore.location \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.ssl.keystore.password \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.ssl.key.password \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| spark.kafka.clusters.${cluster}.sasl.token.mechanism \| 3.0.0 \| SPARK-27294 \| 2f558094257c38d26650049f2ac93be6d65d6d85#diff-7df71bd47f5a3428ebdb05ced3c31f49 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? 'No'. ### How was this patch tested? Exists UT Closes #27989 from beliefer/add-version-to-kafka-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-26 20:11:15 +09:00
beliefer	a0cf972985	[SPARK-31141][DSTREAMS][DOC] Add version information to the configuration of Dstreams ### What changes were proposed in this pull request? Add version information to the configuration of `Dstreams`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.streaming.backpressure.enabled \| 1.5.0 \| SPARK-9967 and SPARK-10099 \| 392bd19d678567751cd3844d9d166a7491c5887e#diff-1b584c4ed88a9022abb11d594f760997 \| spark.streaming.backpressure.initialRate \| 2.0.0 \| SPARK-11627 \| 7218c0eba957e0a079a407b79c3a050cce9647b2#diff-c64d571ef32d2dbf76e965ecd04a9f52 \| spark.streaming.blockInterval \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-54d85b29e4349628a0de525c119399b5 \| spark.streaming.receiver.maxRate \| 1.0.2 \| SPARK-1341 \| ca19cfbcd5cfac9ad731350dfeea14355aec87d6#diff-c64d571ef32d2dbf76e965ecd04a9f52 \| spark.streaming.receiver.writeAheadLog.enable \| 1.2.1 \| SPARK-4482 \| ce5ea0fd611ce560f6e1fac83562469bdb97091e#diff-0607b70e4e79cbbc1a128c45784cb813 \| spark.streaming.unpersist \| 0.9.0 \| None \| 08b9fec93d00ff0ebb49af4d9ac72d2806eded02#diff-bcf5f84f78d23ebde7d532bea756bc57 \| spark.streaming.stopGracefullyOnShutdown \| 1.4.0 \| SPARK-7776 \| a17a5cb302c5fa6a4d3e9e3e0fa2100c0b5436d6#diff-8a7f0e3f26c15ba484e6312c3caf033d \| spark.streaming.kafka.maxRetries \| 1.3.0 \| SPARK-4964 \| a119cae48030520da9f26ee9a1270bed7f33031e#diff-26cb4369f86050dc2e75cd16291b2844 \| spark.streaming.ui.retainedBatches \| 1.0.0 \| SPARK-1386 \| f36dc3fed0a0671b0712d664db859da28c0a98e2#diff-56b8d67d07284cfab165d5363bd3500e \| spark.streaming.driver.writeAheadLog.closeFileAfterWrite \| 1.6.0 \| SPARK-11324 \| 4f030b9e82172659d250281782ac573cbd1438fc#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.receiver.writeAheadLog.closeFileAfterWrite \| 1.6.0 \| SPARK-11324 \| 4f030b9e82172659d250281782ac573cbd1438fc#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.receiver.writeAheadLog.class \| 1.4.0 \| SPARK-7056 \| 1868bd40dcce23990b98748b0239bd00452b1ca5#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.receiver.writeAheadLog.rollingIntervalSecs \| 1.4.0 \| SPARK-7056 \| 1868bd40dcce23990b98748b0239bd00452b1ca5#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.receiver.writeAheadLog.maxFailures \| 1.2.0 \| SPARK-4028 \| 234de9232bcfa212317a8073c4a82c3863b36b14#diff-8cec1a581eebcad673dc8930b1a2801c \| spark.streaming.driver.writeAheadLog.class \| 1.4.0 \| SPARK-7056 \| 1868bd40dcce23990b98748b0239bd00452b1ca5#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.driver.writeAheadLog.rollingIntervalSecs \| 1.4.0 \| SPARK-7056 \| 1868bd40dcce23990b98748b0239bd00452b1ca5#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.driver.writeAheadLog.maxFailures \| 1.4.0 \| SPARK-7056 \| 1868bd40dcce23990b98748b0239bd00452b1ca5#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.driver.writeAheadLog.allowBatching \| 1.6.0 \| SPARK-11141 \| dccc4645df629f35c4788d50b2c0a6ab381db4b7#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.driver.writeAheadLog.batchingTimeout \| 1.6.0 \| SPARK-11141 \| dccc4645df629f35c4788d50b2c0a6ab381db4b7#diff-a1b3ec72e8d7cc91433a1cc64fe6e91d \| spark.streaming.sessionByKey.deltaChainThreshold \| 1.6.0 \| SPARK-11290 \| daa74be6f863061221bb0c2f94e70672e6fcbeaa#diff-e0a40541298f885606a2361ff9c5af6c \| spark.streaming.backpressure.rateEstimator \| 1.5.0 \| SPARK-8977 \| 819be46e5a73f2d19230354ebba30c58538590f5#diff-5dcaea3a4eca07f898fa88fe6d69e5c3 \| spark.streaming.backpressure.pid.proportional \| 1.5.0 \| SPARK-8979 \| 0a1d2ca42c8b31d6b0e70163795f0185d4622f87#diff-5dcaea3a4eca07f898fa88fe6d69e5c3 \| spark.streaming.backpressure.pid.integral \| 1.5.0 \| SPARK-8979 \| 0a1d2ca42c8b31d6b0e70163795f0185d4622f87#diff-5dcaea3a4eca07f898fa88fe6d69e5c3 \| spark.streaming.backpressure.pid.derived \| 1.5.0 \| SPARK-8979 \| 0a1d2ca42c8b31d6b0e70163795f0185d4622f87#diff-5dcaea3a4eca07f898fa88fe6d69e5c3 \| spark.streaming.backpressure.pid.minRate \| 1.5.0 \| SPARK-9966 \| 612b4609bdd38763725ae07d77c2176aa6756e64#diff-5dcaea3a4eca07f898fa88fe6d69e5c3 \| spark.streaming.concurrentJobs \| 0.7.0 \| None \| c97ebf64377e853ab7c616a103869a4417f25954#diff-839f06302b2d648a85436486fc13c85d \| spark.streaming.internal.batchTime \| 1.4.0 \| SPARK-6862 \| 1b7106b867bc0aa4d64b669d79b646f862acaf47#diff-25124e4f06a1da237bf486eceb1f7967 \| It's not a configuration, it's a property spark.streaming.internal.outputOpId \| 1.4.0 \| SPARK-6862 \| 1b7106b867bc0aa4d64b669d79b646f862acaf47#diff-25124e4f06a1da237bf486eceb1f7967 \| It's not a configuration, it's a property spark.streaming.clock \| 0.7.0 \| None \| cae894ee7aefa4cf9b1952038a48be81e1d2a856#diff-839f06302b2d648a85436486fc13c85d \| spark.streaming.gracefulStopTimeout \| 1.0.0 \| SPARK-1332 \| 94cbe2329021296b660d88f3e8ef3734374020d2#diff-2f8c5c038fda47b9875e10785fdd2498 \| spark.streaming.manualClock.jump \| 0.7.0 \| None \| fc3d0b602a08fdd182c2138506d1cd9952631f95#diff-839f06302b2d648a85436486fc13c85d \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? 'No' ### How was this patch tested? Exists UT Closes #27898 from beliefer/add-version-to-dstream-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-23 13:01:44 +09:00
beliefer	ae0699d4b5	[SPARK-31002][CORE][DOC][FOLLOWUP] Add version information to the configuration of Core ### What changes were proposed in this pull request? This PR follows up #27847, #27852 and https://github.com/apache/spark/pull/27913. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.storage.localDiskByExecutors.cacheSize \| 3.0.0 \| SPARK-27651 \| fd2bf55abaab08798a428d4e47d4050ba2b82a95#diff-6bdad48cfc34314e89599655442ff210 \| spark.storage.memoryMapLimitForTests \| 2.3.0 \| SPARK-3151 \| b8ffb51055108fd606b86f034747006962cd2df3#diff-abd96f2ae793cd6ea6aab5b96a3c1d7a \| spark.barrier.sync.timeout \| 2.4.0 \| SPARK-24817 \| 388f5a0635a2812cd71b08352e3ddc20293ec189#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.blacklist.unschedulableTaskSetTimeout \| 2.4.1 \| SPARK-22148 \| 52e9711d01694158ecb3691f2ec25c0ebe4b0207#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.barrier.maxConcurrentTasksCheck.interval \| 2.4.0 \| SPARK-24819 \| bfb74394a5513134ea1da9fcf4a1783b77dd64e4#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.barrier.maxConcurrentTasksCheck.maxFailures \| 2.4.0 \| SPARK-24819 \| bfb74394a5513134ea1da9fcf4a1783b77dd64e4#diff-6bdad48cfc34314e89599655442ff210 \| spark.unsafe.exceptionOnMemoryLeak \| 1.4.0 \| SPARK-7076 and SPARK-7077 and SPARK-7080 \| f49284b5bf3a69ed91a5e3e6e0ed3be93a6ab9e4#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.unsafe.sorter.spill.read.ahead.enabled \| 2.3.0 \| SPARK-21113 \| 1e978b17d63d7ba20368057aa4e65f5ef6e87369#diff-93a086317cea72a113cf81056882c206 \| spark.unsafe.sorter.spill.reader.buffer.size \| 2.1.0 \| SPARK-16862 \| c1937dd19a23bd096a4707656c7ba19fb5c16966#diff-93a086317cea72a113cf81056882c206 \| spark.plugins \| 3.0.0 \| SPARK-29397 \| d51d228048d519a9a666f48dc532625de13e7587#diff-6bdad48cfc34314e89599655442ff210 \| spark.cleaner.periodicGC.interval \| 1.6.0 \| SPARK-8414 \| 72da2a21f0940b97757ace5975535e559d627688#diff-75141521b1d55bc32d72b70032ad96c0 \| spark.cleaner.referenceTracking \| 1.0.0 \| SPARK-1103 \| 11eabbe125b2ee572fad359c33c93f5e6fdf0b2d#diff-364713d7776956cb8b0a771e9b62f82d \| spark.cleaner.referenceTracking.blocking \| 1.0.0 \| SPARK-1103 \| 11eabbe125b2ee572fad359c33c93f5e6fdf0b2d#diff-364713d7776956cb8b0a771e9b62f82d \| spark.cleaner.referenceTracking.blocking.shuffle \| 1.1.1 \| SPARK-3139 \| 5cf1e440137006eedd6846ac8fa57ccf9fd1958d#diff-75141521b1d55bc32d72b70032ad96c0 \| spark.cleaner.referenceTracking.cleanCheckpoints \| 1.4.0 \| SPARK-2033 \| 25998e4d73bcc95ac85d9af71adfdc726ec89568#diff-440e866c5df0b8386aff57f9f8bd8db1 \| spark.executor.logs.rolling.strategy \| 1.1.0 \| SPARK-1940 \| 4823bf470ec1b47a6f404834d4453e61d3dcbec9#diff-2b4575e096e4db7165e087f9429f2a02 \| spark.executor.logs.rolling.time.interval \| 1.1.0 \| SPARK-1940 \| 4823bf470ec1b47a6f404834d4453e61d3dcbec9#diff-2b4575e096e4db7165e087f9429f2a02 \| spark.executor.logs.rolling.maxSize \| 1.4.0 \| SPARK-5932 \| 2d222fb39dd978e5a33cde6ceb59307cbdf7b171#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.executor.logs.rolling.maxRetainedFiles \| 1.1.0 \| SPARK-1940 \| 4823bf470ec1b47a6f404834d4453e61d3dcbec9#diff-2b4575e096e4db7165e087f9429f2a02 \| spark.executor.logs.rolling.enableCompression \| 2.0.2 \| SPARK-17711 \| 26e978a93f029e1a1b5c7524d0b52c8141b70997#diff-2b4575e096e4db7165e087f9429f2a02 \| spark.master.rest.enabled \| 1.3.0 \| SPARK-5388 \| 6ec0cdc14390d4dc45acf31040f21e1efc476fc0#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.master.rest.port \| 1.3.0 \| SPARK-5388 \| 6ec0cdc14390d4dc45acf31040f21e1efc476fc0#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.master.ui.port \| 1.1.0 \| SPARK-2857 \| 12f99cf5f88faf94d9dbfe85cb72d0010a3a25ac#diff-366c88f47e9b5cfa4d4305febeb8b026 \| spark.io.compression.snappy.blockSize \| 1.4.0 \| SPARK-5932 \| 2d222fb39dd978e5a33cde6ceb59307cbdf7b171#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.io.compression.lz4.blockSize \| 1.4.0 \| SPARK-5932 \| 2d222fb39dd978e5a33cde6ceb59307cbdf7b171#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.io.compression.codec \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-df9e6118c481ceb27faa399114fac0a1 \| spark.io.compression.zstd.bufferSize \| 2.3.0 \| SPARK-19112 \| 444bce1c98c45147fe63e2132e9743a0c5e49598#diff-df9e6118c481ceb27faa399114fac0a1 \| spark.io.compression.zstd.level \| 2.3.0 \| SPARK-19112 \| 444bce1c98c45147fe63e2132e9743a0c5e49598#diff-df9e6118c481ceb27faa399114fac0a1 \| spark.io.warning.largeFileThreshold \| 3.0.0 \| SPARK-28366 \| 26d03b62e20d053943d03b5c5573dd349e49654c#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.compression.codec \| 3.0.0 \| SPARK-28118 \| 47f54b1ec717d0d744bf3ad46bb1ed3542b667c8#diff-6bdad48cfc34314e89599655442ff210 \| spark.buffer.size \| 0.5.0 \| None \| 4b1646a25f7581cecae108553da13833e842e68a#diff-eaf125f56ce786d64dcef99cf446a751 \| spark.locality.wait.process \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-264da78fe625d594eae59d1adabc8ae9 \| spark.locality.wait.node \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-264da78fe625d594eae59d1adabc8ae9 \| spark.locality.wait.rack \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-264da78fe625d594eae59d1adabc8ae9 \| spark.reducer.maxSizeInFlight \| 1.4.0 \| SPARK-5932 \| 2d222fb39dd978e5a33cde6ceb59307cbdf7b171#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.reducer.maxReqsInFlight \| 2.0.0 \| SPARK-6166 \| 894921d813a259f2f266fde7d86d2ecb5a0af24b#diff-eb30a71e0d04150b8e0b64929852e38b \| spark.broadcast.compress \| 0.6.0 \| None \| efc5423210d1aadeaea78273a4a8f10425753079#diff-76170a9c8f67b542bc58240a0a12fe08 \| spark.broadcast.blockSize \| 0.5.0 \| None \| b8ab7862b8bd168bca60bd930cd97c1099fbc8a8#diff-271d7958e14cdaa46cf3737cfcf51341 \| spark.broadcast.checksum \| 2.1.1 \| SPARK-18188 \| 06a56df226aa0c03c21f23258630d8a96385c696#diff-4f43d14923008c6650a8eb7b40c07f74 \| spark.broadcast.UDFCompressionThreshold \| 3.0.0 \| SPARK-28355 \| 79e204770300dab4a669b9f8e2421ef905236e7b#diff-6bdad48cfc34314e89599655442ff210 \| spark.rdd.compress \| 0.6.0 \| None \| efc5423210d1aadeaea78273a4a8f10425753079#diff-76170a9c8f67b542bc58240a0a12fe08 \| spark.rdd.parallelListingThreshold \| 2.0.0 \| SPARK-9926 \| 80a4bfa4d1c86398b90b26c34d8dcbc2355f5a6a#diff-eaababfc87ea4949f97860e8b89b7586 \| spark.rdd.limit.scaleUpFactor \| 2.1.0 \| SPARK-16984 \| 806d8a8e980d8ba2f4261bceb393c40bafaa2f73#diff-1d55e54678eff2076263f2fe36150c17 \| spark.serializer \| 0.5.0 \| None \| fd1d255821bde844af28e897fabd59a715659038#diff-b920b65c23bf3a1b3326325b0d6a81b2 \| spark.serializer.objectStreamReset \| 1.0.0 \| SPARK-942 \| 40566e10aae4b21ffc71ea72702b8df118ac5c8e#diff-6a59dfc43d1b31dc1c3072ceafa829f5 \| spark.serializer.extraDebugInfo \| 1.3.0 \| SPARK-5307 \| 636408311deeebd77fb83d2249e0afad1a1ba149#diff-6a59dfc43d1b31dc1c3072ceafa829f5 \| spark.jars \| 0.9.0 \| None \| f1d206c6b4c0a5b2517b05af05fdda6049e2f7c2#diff-364713d7776956cb8b0a771e9b62f82d \| spark.files \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-364713d7776956cb8b0a771e9b62f82d \| spark.submit.deployMode \| 1.5.0 \| SPARK-6797 \| 7f487c8bde14dbdd244a3493ad11a129ef2bb327#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.submit.pyFiles \| 1.0.1 \| SPARK-1549 \| d7ddb26e1fa02e773999cc4a97c48d2cd1723956#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.scheduler.allocation.file \| 0.8.1 \| None \| 976fe60f7609d7b905a34f18743efabd966407f0#diff-9bc0105ee454005379abed710cd20ced \| spark.scheduler.minRegisteredResourcesRatio \| 1.1.1 \| SPARK-2635 \| 3311da2f9efc5ff2c7d01273ac08f719b067d11d#diff-7d99a7c7a051e5e851aaaefb275a44a1 \| spark.scheduler.maxRegisteredResourcesWaitingTime \| 1.1.1 \| SPARK-2635 \| 3311da2f9efc5ff2c7d01273ac08f719b067d11d#diff-7d99a7c7a051e5e851aaaefb275a44a1 \| spark.scheduler.mode \| 0.8.0 \| None \| 98fb69822cf780160bca51abeaab7c82e49fab54#diff-cb7a25b3c9a7341c6d99bcb8e9780c92 \| spark.scheduler.revive.interval \| 0.8.1 \| None \| d0c9d41a061969d409715b86a91937d8de4c29f7#diff-7d99a7c7a051e5e851aaaefb275a44a1 \| spark.speculation \| 0.6.0 \| None \| e72afdb817bcc8388aeb8b8d31628fd5fd67acf1#diff-4e188f32951dc989d97fa7577858bc7c \| spark.speculation.interval \| 0.6.0 \| None \| e72afdb817bcc8388aeb8b8d31628fd5fd67acf1#diff-4e188f32951dc989d97fa7577858bc7c \| spark.speculation.multiplier \| 0.6.0 \| None \| e72afdb817bcc8388aeb8b8d31628fd5fd67acf1#diff-fff59f72dfe6ca4ccb607ad12535da07 \| spark.speculation.quantile \| 0.6.0 \| None \| e72afdb817bcc8388aeb8b8d31628fd5fd67acf1#diff-fff59f72dfe6ca4ccb607ad12535da07 \| spark.speculation.task.duration.threshold \| 3.0.0 \| SPARK-29976 \| ad238a2238a9d0da89be4424574436cbfaee579d#diff-6bdad48cfc34314e89599655442ff210 \| spark.yarn.stagingDir \| 2.0.0 \| SPARK-13063 \| bc36df127d3b9f56b4edaeb5eca7697d4aef761a#diff-14b8ed2ef4e3da985300b8d796a38fa9 \| spark.buffer.pageSize \| 1.5.0 \| SPARK-9411 \| 1b0099fc62d02ff6216a76fbfe17a4ec5b2f3536#diff-1b22e54318c04824a6d53ed3f4d1bb35 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? 'No'. ### How was this patch tested? Exists UT Closes #27931 from beliefer/add-version-to-core-config-part-four. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-23 11:07:43 +09:00
beliefer	f4cd7495f1	[SPARK-31002][CORE][DOC][FOLLOWUP] Add version information to the configuration of Core ### What changes were proposed in this pull request? This PR follows up #27847 and https://github.com/apache/spark/pull/27852. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.metrics.namespace \| 2.1.0 \| SPARK-5847 \| 70f846a313061e4db6174e0dc6c12c8c806ccf78#diff-6bdad48cfc34314e89599655442ff210 \| spark.metrics.conf \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-7ea2624e832b166ca27cd4baca8691d9 \| spark.metrics.executorMetricsSource.enabled \| 3.0.0 \| SPARK-27189 \| 729f43f499f3dd2718c0b28d73f2ca29cc811eac#diff-6bdad48cfc34314e89599655442ff210 \| spark.metrics.staticSources.enabled \| 3.0.0 \| SPARK-30060 \| 60f20e5ea2000ab8f4a593b5e4217fd5637c5e22#diff-6bdad48cfc34314e89599655442ff210 \| spark.pyspark.driver.python \| 2.1.0 \| SPARK-13081 \| 7a9e25c38380e6c62080d62ad38a4830e44fe753#diff-6bdad48cfc34314e89599655442ff210 \| spark.pyspark.python \| 2.1.0 \| SPARK-13081 \| 7a9e25c38380e6c62080d62ad38a4830e44fe753#diff-6bdad48cfc34314e89599655442ff210 \| spark.history.ui.maxApplications \| 2.0.1 \| SPARK-17243 \| 021aa28f439443cda1bc7c5e3eee7c85b40c1a2d#diff-6bdad48cfc34314e89599655442ff210 \| spark.io.encryption.enabled \| 2.1.0 \| SPARK-5682 \| 4b4e329e49f8af28fa6301bd06c48d7097eaf9e6#diff-6bdad48cfc34314e89599655442ff210 \| spark.io.encryption.keygen.algorithm \| 2.1.0 \| SPARK-5682 \| 4b4e329e49f8af28fa6301bd06c48d7097eaf9e6#diff-6bdad48cfc34314e89599655442ff210 \| spark.io.encryption.keySizeBits \| 2.1.0 \| SPARK-5682 \| 4b4e329e49f8af28fa6301bd06c48d7097eaf9e6#diff-6bdad48cfc34314e89599655442ff210 \| spark.io.encryption.commons.config.* \| 2.1.0 \| SPARK-5682 \| `4b4e329e49` \| spark.io.crypto.cipher.transformation \| 2.1.0 \| SPARK-5682 \| 4b4e329e49f8af28fa6301bd06c48d7097eaf9e6#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.host \| 0.7.0 \| None \| 02a6761589c35f15f1a6e3b63a7964ba057d3ba6#diff-eaf125f56ce786d64dcef99cf446a751 \| spark.driver.port \| 0.7.0 \| None \| 02a6761589c35f15f1a6e3b63a7964ba057d3ba6#diff-eaf125f56ce786d64dcef99cf446a751 \| spark.driver.supervise \| 1.3.0 \| SPARK-5388 \| 6ec0cdc14390d4dc45acf31040f21e1efc476fc0#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.driver.bindAddress \| 2.1.0 \| SPARK-4563 \| 2cd1bfa4f0c6625b0ab1dbeba2b9586b9a6a9f42#diff-6bdad48cfc34314e89599655442ff210 \| spark.blockManager.port \| 1.1.0 \| SPARK-2157 \| 31090e43ca91f687b0bc6e25c824dc25bd7027cd#diff-2b643ea78c1add0381754b1f47eec132 \| spark.driver.blockManager.port \| 2.1.0 \| SPARK-4563 \| 2cd1bfa4f0c6625b0ab1dbeba2b9586b9a6a9f42#diff-6bdad48cfc34314e89599655442ff210 \| spark.files.ignoreCorruptFiles \| 2.1.0 \| SPARK-17850 \| 47776e7c0c68590fe446cef910900b1aaead06f9#diff-6bdad48cfc34314e89599655442ff210 \| spark.files.ignoreMissingFiles \| 2.4.0 \| SPARK-22676 \| ed4101d29f50d54fd7846421e4c00e9ecd3599d0#diff-6bdad48cfc34314e89599655442ff210 \| spark.log.callerContext \| 2.2.0 \| SPARK-16759 \| 3af894511be6fcc17731e28b284dba432fe911f5#diff-6bdad48cfc34314e89599655442ff210 \| In branch-2.2 but pom.xml is 2.1.0-SNAPSHOT spark.files.maxPartitionBytes \| 2.1.0 \| SPARK-16575 \| c8879bf1ee2af9ccd5d5656571d931d2fc1da024#diff-6bdad48cfc34314e89599655442ff210 \| spark.files.openCostInBytes \| 2.1.0 \| SPARK-16575 \| c8879bf1ee2af9ccd5d5656571d931d2fc1da024#diff-6bdad48cfc34314e89599655442ff210 \| spark.hadoopRDD.ignoreEmptySplits \| 2.3.0 \| SPARK-22233 \| 0fa10666cf75e3c4929940af49c8a6f6ea874759#diff-6bdad48cfc34314e89599655442ff210 \| spark.redaction.regex \| 2.1.2 \| SPARK-18535 and SPARK-19720 \| 444cca14d7ac8c5ab5d7e9d080b11f4d6babe3bf#diff-6bdad48cfc34314e89599655442ff210 \| spark.redaction.string.regex \| 2.2.0 \| SPARK-20070 \| 91fa80fe8a2480d64c430bd10f97b3d44c007bcc#diff-6bdad48cfc34314e89599655442ff210 \| spark.authenticate.secret \| 1.0.0 \| SPARK-1189 \| 7edbea41b43e0dc11a2de156be220db8b7952d01#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.authenticate.secretBitLength \| 1.6.0 \| SPARK-11073 \| f8d93edec82eedab59d50aec06ca2de7e4cf14f6#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.authenticate \| 1.0.0 \| SPARK-1189 \| 7edbea41b43e0dc11a2de156be220db8b7952d01#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.authenticate.enableSaslEncryption \| 1.4.0 \| SPARK-6229 \| 38d4e9e446b425ca6a8fe8d8080f387b08683842#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.authenticate.secret.file \| 3.0.0 \| SPARK-26239 \| 57d6fbfa8c803ce1791e7be36aba0219a1fcaa63#diff-6bdad48cfc34314e89599655442ff210 \| spark.authenticate.secret.driver.file \| 3.0.0 \| SPARK-26239 \| 57d6fbfa8c803ce1791e7be36aba0219a1fcaa63#diff-6bdad48cfc34314e89599655442ff210 \| spark.authenticate.secret.executor.file \| 3.0.0 \| SPARK-26239 \| 57d6fbfa8c803ce1791e7be36aba0219a1fcaa63#diff-6bdad48cfc34314e89599655442ff210 \| spark.buffer.write.chunkSize \| 2.3.0 \| SPARK-21527 \| 574ef6c987c636210828e96d2f797d8f10aff05e#diff-6bdad48cfc34314e89599655442ff210 \| spark.checkpoint.compress \| 2.2.0 \| SPARK-19525 \| 1405862382185e04b09f84af18f82f2f0295a755#diff-6bdad48cfc34314e89599655442ff210 \| spark.rdd.checkpoint.cachePreferredLocsExpireTime \| 3.0.0 \| SPARK-29182 \| 4ecbdbb6a7bd3908da32c82832e886b4f9f9e596#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.accurateBlockThreshold \| 2.2.1 \| SPARK-20801 \| 81f63c8923416014d5c6bc227dd3c4e2a62bac8e#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.registration.timeout \| 2.3.0 \| SPARK-20640 \| d107b3b910d8f434fb15b663a9db4c2dfe0a9f43#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.registration.maxAttempts \| 2.3.0 \| SPARK-20640 \| d107b3b910d8f434fb15b663a9db4c2dfe0a9f43#diff-6bdad48cfc34314e89599655442ff210 \| spark.reducer.maxBlocksInFlightPerAddress \| 2.2.1 \| SPARK-21243 \| 88dccda393bc79dc6032f71b6acf8eb2b4b152be#diff-6bdad48cfc34314e89599655442ff210 \| spark.network.maxRemoteBlockSizeFetchToMem \| 3.0.0 \| SPARK-26700 \| d8613571bc1847775dd5c1945757279234cb388c#diff-6bdad48cfc34314e89599655442ff210 \| spark.taskMetrics.trackUpdatedBlockStatuses \| 2.3.0 \| SPARK-20923 \| 5b5a69bea9de806e2c39b04b248ee82a7b664d7b#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.sort.io.plugin.class \| 3.0.0 \| SPARK-28209 \| abef84a868e9e15f346eea315bbab0ec8ac8e389#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.file.buffer \| 1.4.0 \| SPARK-7081 \| c53ebea9db418099df50f9adc1a18cee7849cd97#diff-ecdafc46b901740134261d2cab24ccd9 \| spark.shuffle.unsafe.file.output.buffer \| 2.3.0 \| SPARK-20950 \| 565e7a8d4ae7879ee704fb94ae9b3da31e202d7e#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.spill.diskWriteBufferSize \| 2.3.0 \| SPARK-20950 \| 565e7a8d4ae7879ee704fb94ae9b3da31e202d7e#diff-6bdad48cfc34314e89599655442ff210 \| spark.storage.unrollMemoryCheckPeriod \| 2.3.0 \| SPARK-21923 \| a11db942aaf4c470a85f8a1b180f034f7a584254#diff-6bdad48cfc34314e89599655442ff210 \| spark.storage.unrollMemoryGrowthFactor \| 2.3.0 \| SPARK-21923 \| a11db942aaf4c470a85f8a1b180f034f7a584254#diff-6bdad48cfc34314e89599655442ff210 \| spark.yarn.dist.forceDownloadSchemes \| 2.3.0 \| SPARK-21917 \| 8319432af60b8e1dc00f08d794f7d80591e24d0c#diff-6bdad48cfc34314e89599655442ff210 \| spark.extraListeners \| 1.3.0 \| SPARK-5411 \| 47e4d579eb4a9aab8e0dd9c1400394d80c8d0388#diff-364713d7776956cb8b0a771e9b62f82d \| spark.shuffle.spill.numElementsForceSpillThreshold \| 1.6.0 \| SPARK-10708 \| f6d06adf05afa9c5386dc2396c94e7a98730289f#diff-3eedc75de4787b842477138d8cc7f150 \| spark.shuffle.mapOutput.parallelAggregationThreshold \| 2.3.0 \| SPARK-22537 \| efd0036ec88bdc385f5a9ea568d2e2bbfcda2912#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.maxResultSize \| 1.2.0 \| SPARK-3466 \| 6181577e9935f46b646ba3925b873d031aa3d6ba#diff-d239aee594001f8391676e1047a0381e \| spark.security.credentials.renewalRatio \| 2.4.0 \| SPARK-23361 \| 5fa438471110afbf4e2174df449ac79e292501f8#diff-6bdad48cfc34314e89599655442ff210 \| spark.security.credentials.retryWait \| 2.4.0 \| SPARK-23361 \| 5fa438471110afbf4e2174df449ac79e292501f8#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.sort.initialBufferSize \| 2.1.0 \| SPARK-15958 \| bf665a958631125a1670504ef5966ef1a0e14798#diff-a1d00506391c1c4b2209f9bbff590c5b \| On branch-2.1, but in pom.xml it is 2.0.0-SNAPSHOT spark.shuffle.compress \| 0.6.0 \| None \| efc5423210d1aadeaea78273a4a8f10425753079#diff-76170a9c8f67b542bc58240a0a12fe08 \| spark.shuffle.spill.compress \| 0.9.0 \| None \| c3816de5040e3c48e58ed4762d2f4eb606812938#diff-2b643ea78c1add0381754b1f47eec132 \| spark.shuffle.mapStatus.compression.codec \| 3.0.0 \| SPARK-29939 \| 456cfe6e4693efd26d64f089d53c4e01bf8150a2#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.spill.initialMemoryThreshold \| 1.1.1 \| SPARK-4480 \| 16bf5f3d17624db2a96c921fe8a1e153cdafb06c#diff-31417c461d8901d8e08167b0cbc344c1 \| spark.shuffle.spill.batchSize \| 0.9.0 \| None \| c3816de5040e3c48e58ed4762d2f4eb606812938#diff-a470b9812a5ac8c37d732da7d9fbe39a \| spark.shuffle.sort.bypassMergeThreshold \| 1.1.1 \| SPARK-2787 \| 0f2274f8ed6131ad17326e3fff7f7e093863b72d#diff-31417c461d8901d8e08167b0cbc344c1 \| spark.shuffle.manager \| 1.1.0 \| SPARK-2044 \| 508fd371d6dbb826fd8a00787d347235b549e189#diff-60df49b5d3c59f2c4540fa16a90033a1 \| spark.shuffle.reduceLocality.enabled \| 1.5.0 \| SPARK-2774 \| 96a7c888d806adfdb2c722025a1079ed7eaa2052#diff-6a9ff7fb74fd490a50462d45db2d5e11 \| spark.shuffle.mapOutput.minSizeForBroadcast \| 2.0.0 \| SPARK-1239 \| d98dd72e7baeb59eacec4fefd66397513a607b2f#diff-609c3f8c26150ca96a94cd27146a809b \| spark.shuffle.mapOutput.dispatcher.numThreads \| 2.0.0 \| SPARK-1239 \| d98dd72e7baeb59eacec4fefd66397513a607b2f#diff-609c3f8c26150ca96a94cd27146a809b \| spark.shuffle.detectCorrupt \| 2.2.0 \| SPARK-4105 \| cf33a86285629abe72c1acf235b8bfa6057220a8#diff-eb30a71e0d04150b8e0b64929852e38b \| spark.shuffle.detectCorrupt.useExtraMemory \| 3.0.0 \| SPARK-26089 \| 688b0c01fac0db80f6473181673a89f1ce1be65b#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.sync \| 0.8.0 \| None \| 31da065b1d08c1fad5283e4bcf8e0ed01818c03e#diff-ad46ed23fcc3fa87f30d05204917b917 \| spark.shuffle.unsafe.fastMergeEnabled \| 1.4.0 \| SPARK-7081 \| c53ebea9db418099df50f9adc1a18cee7849cd97#diff-642ce9f439435408382c3ac3b5c5e0a0 \| spark.shuffle.sort.useRadixSort \| 2.0.0 \| SPARK-14724 \| e2b5647ab92eb478b3f7b36a0ce6faf83e24c0e5#diff-3eedc75de4787b842477138d8cc7f150 \| spark.shuffle.minNumPartitionsToHighlyCompress \| 2.4.0 \| SPARK-24519 \| 39dfaf2fd167cafc84ec9cc637c114ed54a331e3#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.useOldFetchProtocol \| 3.0.0 \| SPARK-25341 \| f725d472f51fb80c6ce1882ec283ff69bafb0de4#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.readHostLocalDisk \| 3.0.0 \| SPARK-30812 \| 68d7edf9497bea2f73707d32ab55dd8e53088e7c#diff-6bdad48cfc34314e89599655442ff210 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? 'No'. ### How was this patch tested? Exists UT Closes #27913 from beliefer/add-version-to-core-config-part-three. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-16 10:08:07 +09:00
beliefer	bd2b3f9132	[SPARK-30911][CORE][DOC] Add version information to the configuration of Status ### What changes were proposed in this pull request? 1.Add version information to the configuration of `Status`. 2.Update the docs of `Status`. 3.By the way supplementary documentation about https://github.com/apache/spark/pull/27847 I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.appStateStore.asyncTracking.enable \| 2.3.0 \| SPARK-20653 \| 772e4648d95bda3353723337723543c741ea8476#diff-9ab674b7af7b2097f7d28cb6f5fd1e8c \| spark.ui.liveUpdate.period \| 2.3.0 \| SPARK-20644 \| c7f38e5adb88d43ef60662c5d6ff4e7a95bff580#diff-9ab674b7af7b2097f7d28cb6f5fd1e8c \| spark.ui.liveUpdate.minFlushPeriod \| 2.4.2 \| SPARK-27394 \| a8a2ba11ac10051423e58920062b50f328b06421#diff-9ab674b7af7b2097f7d28cb6f5fd1e8c \| spark.ui.retainedJobs \| 1.2.0 \| SPARK-2321 \| 9530316887612dca060a128fca34dd5a6ab2a9a9#diff-1f32bcb61f51133bd0959a4177a066a5 \| spark.ui.retainedStages \| 0.9.0 \| None \| 112c0a1776bbc866a1026a9579c6f72f293414c4#diff-1f32bcb61f51133bd0959a4177a066a5 \| 0.9.0-incubating-SNAPSHOT spark.ui.retainedTasks \| 2.0.1 \| SPARK-15083 \| 55db26245d69bb02b7d7d5f25029b1a1cd571644#diff-6bdad48cfc34314e89599655442ff210 \| spark.ui.retainedDeadExecutors \| 2.0.0 \| SPARK-7729 \| 9f4263392e492b5bc0acecec2712438ff9a257b7#diff-a0ba36f9b1f9829bf3c4689b05ab6cf2 \| spark.ui.dagGraph.retainedRootRDDs \| 2.1.0 \| SPARK-17171 \| cc87280fcd065b01667ca7a59a1a32c7ab757355#diff-3f492c527ea26679d4307041b28455b8 \| spark.metrics.appStatusSource.enabled \| 3.0.0 \| SPARK-30060 \| 60f20e5ea2000ab8f4a593b5e4217fd5637c5e22#diff-9f796ae06b0272c1f0a012652a5b68d0 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27848 from beliefer/add-version-to-status-config. Lead-authored-by: beliefer <beliefer@163.com> Co-authored-by: Jiaan Geng <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-12 11:03:47 +09:00
beliefer	c1b2675f2e	[SPARK-31002][CORE][DOC][FOLLOWUP] Add version information to the configuration of Core ### What changes were proposed in this pull request? This PR follows up https://github.com/apache/spark/pull/27847. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.yarn.isPython \| 1.5.0 \| SPARK-5479 \| 38112905bc3b33f2ae75274afba1c30e116f6e46#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.task.cpus \| 0.5.0 \| None \| e5c4cd8a5e188592f8786a265c0cd073c69ac886#diff-391214d132a0fb4478f4f9c2313d8966 \| spark.dynamicAllocation.enabled \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.dynamicAllocation.testing \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.dynamicAllocation.minExecutors \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.dynamicAllocation.initialExecutors \| 1.3.0 \| SPARK-4585 \| b2047b55c5fc85de6b63276d8ab9610d2496e08b#diff-b096353602813e47074ace09a3890d56 \| spark.dynamicAllocation.maxExecutors \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.dynamicAllocation.executorAllocationRatio \| 2.4.0 \| SPARK-22683 \| 55c4ca88a3b093ee197a8689631be8d1fac1f10f#diff-6bdad48cfc34314e89599655442ff210 \| spark.dynamicAllocation.cachedExecutorIdleTimeout \| 1.4.0 \| SPARK-7955 \| 6faaf15ba311bc3a79aae40a6c9c4befabb6889f#diff-b096353602813e47074ace09a3890d56 \| spark.dynamicAllocation.executorIdleTimeout \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.dynamicAllocation.shuffleTracking.enabled \| 3.0.0 \| SPARK-27963 \| 2ddeff97d7329942a98ef363991eeabc3fa71a76#diff-6bdad48cfc34314e89599655442ff210 \| spark.dynamicAllocation.shuffleTimeout \| 3.0.0 \| SPARK-27963 \| 2ddeff97d7329942a98ef363991eeabc3fa71a76#diff-6bdad48cfc34314e89599655442ff210 \| spark.dynamicAllocation.schedulerBacklogTimeout \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.dynamicAllocation.sustainedSchedulerBacklogTimeout \| 1.2.0 \| SPARK-3795 \| 8d59b37b02eb36f37bcefafb952519d7dca744ad#diff-364713d7776956cb8b0a771e9b62f82d \| spark.locality.wait \| 0.5.0 \| None \| e5c4cd8a5e188592f8786a265c0cd073c69ac886#diff-391214d132a0fb4478f4f9c2313d8966 \| spark.shuffle.service.enabled \| 1.2.0 \| SPARK-3796 \| f55218aeb1e9d638df6229b36a59a15ce5363482#diff-2b643ea78c1add0381754b1f47eec132 \| Constants.SHUFFLE_SERVICE_FETCH_RDD_ENABLED \| 3.0.0 \| SPARK-27677 \| e9f3f62b2c0f521f3cc23fef381fc6754853ad4f#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.service.fetch.rdd.enabled spark.shuffle.service.db.enabled \| 3.0.0 \| SPARK-26288 \| 8b0aa59218c209d39cbba5959302d8668b885cf6#diff-6bdad48cfc34314e89599655442ff210 \| spark.shuffle.service.port \| 1.2.0 \| SPARK-3796 \| f55218aeb1e9d638df6229b36a59a15ce5363482#diff-2b643ea78c1add0381754b1f47eec132 \| spark.kerberos.keytab \| 3.0.0 \| SPARK-25372 \| 51540c2fa677658be954c820bc18ba748e4c8583#diff-6bdad48cfc34314e89599655442ff210 \| spark.kerberos.principal \| 3.0.0 \| SPARK-25372 \| 51540c2fa677658be954c820bc18ba748e4c8583#diff-6bdad48cfc34314e89599655442ff210 \| spark.kerberos.relogin.period \| 3.0.0 \| SPARK-23781 \| 68dde3481ea458b0b8deeec2f99233c2d4c1e056#diff-6bdad48cfc34314e89599655442ff210 \| spark.kerberos.renewal.credentials \| 3.0.0 \| SPARK-26595 \| 2a67dbfbd341af166b1c85904875f26a6dea5ba8#diff-6bdad48cfc34314e89599655442ff210 \| spark.kerberos.access.hadoopFileSystems \| 3.0.0 \| SPARK-26766 \| d0443a74d185ec72b747fa39994fa9a40ce974cf#diff-6bdad48cfc34314e89599655442ff210 \| spark.executor.instances \| 1.0.0 \| SPARK-1126 \| 1617816090e7b20124a512a43860a21232ebf511#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.yarn.dist.pyFiles \| 2.2.1 \| SPARK-21714 \| d10c9dc3f631a26dbbbd8f5c601ca2001a5d7c80#diff-6bdad48cfc34314e89599655442ff210 \| spark.task.maxDirectResultSize \| 2.0.0 \| SPARK-13830 \| 2ef4c5963bff3574fe17e669d703b25ddd064e5d#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.task.maxFailures \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-264da78fe625d594eae59d1adabc8ae9 \| spark.task.reaper.enabled \| 2.0.3 \| SPARK-18761 \| 678d91c1d2283d9965a39656af9d383bad093ba8#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.task.reaper.killTimeout \| 2.0.3 \| SPARK-18761 \| 678d91c1d2283d9965a39656af9d383bad093ba8#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.task.reaper.pollingInterval \| 2.0.3 \| SPARK-18761 \| 678d91c1d2283d9965a39656af9d383bad093ba8#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.task.reaper.threadDump \| 2.0.3 \| SPARK-18761 \| 678d91c1d2283d9965a39656af9d383bad093ba8#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.blacklist.enabled \| 2.1.0 \| SPARK-17675 \| 9ce7d3e542e786c62f047c13f3001e178f76e06a#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.task.maxTaskAttemptsPerExecutor \| 2.1.0 \| SPARK-17675 \| 9ce7d3e542e786c62f047c13f3001e178f76e06a#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.task.maxTaskAttemptsPerNode \| 2.1.0 \| SPARK-17675 \| 9ce7d3e542e786c62f047c13f3001e178f76e06a#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.application.maxFailedTasksPerExecutor \| 2.2.0 \| SPARK-8425 \| 93cdb8a7d0f124b4db069fd8242207c82e263c52#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.stage.maxFailedTasksPerExecutor \| 2.1.0 \| SPARK-17675 \| 9ce7d3e542e786c62f047c13f3001e178f76e06a#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.application.maxFailedExecutorsPerNode \| 2.2.0 \| SPARK-8425 \| 93cdb8a7d0f124b4db069fd8242207c82e263c52#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.stage.maxFailedExecutorsPerNode \| 2.1.0 \| SPARK-17675 \| 9ce7d3e542e786c62f047c13f3001e178f76e06a#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.timeout \| 2.1.0 \| SPARK-17675 \| 9ce7d3e542e786c62f047c13f3001e178f76e06a#diff-6bdad48cfc34314e89599655442ff210 \| spark.blacklist.killBlacklistedExecutors \| 2.2.0 \| SPARK-16554 \| 6287c94f08200d548df5cc0a401b73b84f9968c4#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.executorTaskBlacklistTime \| 1.0.0 \| None \| ab747d39ddc7c8a314ed2fb26548fc5652af0d74#diff-bad3987c83bd22d46416d3dd9d208e76 \| spark.blacklist.application.fetchFailure.enabled \| 2.3.0 \| SPARK-13669 and SPARK-20898 \| 9e50a1d37a4cf0c34e20a7c1a910ceaff41535a2#diff-6bdad48cfc34314e89599655442ff210 \| spark.files.fetchFailure.unRegisterOutputOnHost \| 2.3.0 \| SPARK-19753 \| dccc0aa3cf957c8eceac598ac81ac82f03b52105#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.listenerbus.eventqueue.capacity \| 2.3.0 \| SPARK-20887 \| 629f38e171409da614fd635bd8dd951b7fde17a4#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.listenerbus.metrics.maxListenerClassesTimed \| 2.3.0 \| SPARK-20863 \| 2a23cdd078a7409d0bb92cf27718995766c41b1d#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.listenerbus.logSlowEvent \| 3.0.0 \| SPARK-30812 \| 68d7edf9497bea2f73707d32ab55dd8e53088e7c#diff-6bdad48cfc34314e89599655442ff210 \| spark.scheduler.listenerbus.logSlowEvent.threshold \| 3.0.0 \| SPARK-29001 \| 0346afa8fc348aa1b3f5110df747a64e3b2da388#diff-6bdad48cfc34314e89599655442ff210 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27852 from beliefer/add-version-to-core-config-part-two. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-12 09:52:20 +09:00
beliefer	bc490f383d	[SPARK-31002][CORE][DOC] Add version information to the configuration of Core ### What changes were proposed in this pull request? Add version information to the configuration of `Core`. Note: Because `Core` has a lot of configuration items, I split the items into four PR. Other PR will follows this PR. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.resources.discoveryPlugin \| 3.0.0 \| SPARK-30689 \| 742e35f1d48c2523dda2ce21d73b7ab5ade20582#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.resourcesFile \| 3.0.0 \| SPARK-27835 \| 6748b486a9afe8370786efb64a8c9f3470c62dcf#diff-6bdad48cfc34314e89599655442ff210 \| SparkLauncher.DRIVER_EXTRA_CLASSPATH \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.driver.extraClassPath SparkLauncher.DRIVER_EXTRA_JAVA_OPTIONS \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.driver.extraJavaOptions SparkLauncher.DRIVER_EXTRA_LIBRARY_PATH \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.driver.extraLibraryPath spark.driver.userClassPathFirst \| 1.3.0 \| SPARK-2996 \| 6a1e0f967286945db13d94aeb6ed19f0a347c236#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.driver.cores \| 1.3.0 \| SPARK-1507 \| 2be82b1e66cd188456bbf1e5abb13af04d1629d5#diff-4d2ab44195558d5a9d5f15b8803ef39d \| SparkLauncher.DRIVER_MEMORY \| 1.1.1 \| SPARK-3243 \| c1ffa3e4cdfbd1f84b5c8d8de5d0fb958a19e211#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.driver.memory spark.driver.memoryOverhead \| 2.3.0 \| SPARK-22646 \| 3f4060c340d6bac412e8819c4388ccba226efcf3#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.log.dfsDir \| 3.0.0 \| SPARK-25118 \| 5f11e8c4cb9a5db037ac239b8fcc97f3a746e772#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.log.layout \| 3.0.0 \| SPARK-25118 \| 5f11e8c4cb9a5db037ac239b8fcc97f3a746e772#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.log.persistToDfs.enabled \| 3.0.0 \| SPARK-25118 \| 5f11e8c4cb9a5db037ac239b8fcc97f3a746e772#diff-6bdad48cfc34314e89599655442ff210 \| spark.driver.log.allowErasureCoding \| 3.0.0 \| SPARK-29105 \| 276aaaae8d404975f8701089e9f4dfecd16e0d9f#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.enabled \| 1.0.0 \| SPARK-1132 \| 79d07d66040f206708e14de393ab0b80020ed96a#diff-364713d7776956cb8b0a771e9b62f82d \| spark.eventLog.dir \| 1.0.0 \| SPARK-1132 \| 79d07d66040f206708e14de393ab0b80020ed96a#diff-364713d7776956cb8b0a771e9b62f82d \| spark.eventLog.compress \| 1.0.0 \| SPARK-1132 \| 79d07d66040f206708e14de393ab0b80020ed96a#diff-364713d7776956cb8b0a771e9b62f82d \| spark.eventLog.logBlockUpdates.enabled \| 2.3.0 \| SPARK-22050 \| 1437e344ec0c29a44a19f4513986f5f184c44695#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.erasureCoding.enabled \| 3.0.0 \| SPARK-25855 \| 35506dced739ef16136e9f3d5d48c638899d3cec#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.testing \| 1.0.1 \| None \| d4c8af87994acf3707027e6fab25363f51fd4615#diff-e4a5a68c15eed95d038acfed84b0b66a \| spark.eventLog.buffer.kb \| 1.0.0 \| SPARK-1132 \| 79d07d66040f206708e14de393ab0b80020ed96a#diff-364713d7776956cb8b0a771e9b62f82d \| spark.eventLog.logStageExecutorMetrics \| 3.0.0 \| SPARK-30812 \| 68d7edf9497bea2f73707d32ab55dd8e53088e7c#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.gcMetrics.youngGenerationGarbageCollectors \| 3.0.0 \| SPARK-25865 \| e5c502c596563dce8eb58f86e42c1aea2c51ed17#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.gcMetrics.oldGenerationGarbageCollectors \| 3.0.0 \| SPARK-25865 \| e5c502c596563dce8eb58f86e42c1aea2c51ed17#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.overwrite \| 1.0.0 \| SPARK-1132 \| 79d07d66040f206708e14de393ab0b80020ed96a#diff-364713d7776956cb8b0a771e9b62f82d \| spark.eventLog.longForm.enabled \| 2.4.0 \| SPARK-23820 \| 71f70130f1b2b4ec70595627f0a02a88e2c0e27d#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.rolling.enabled \| 3.0.0 \| SPARK-28869 \| 100fc58da54e026cda87832a10e2d06eaeccdf87#diff-6bdad48cfc34314e89599655442ff210 \| spark.eventLog.rolling.maxFileSize \| 3.0.0 \| SPARK-28869 \| 100fc58da54e026cda87832a10e2d06eaeccdf87#diff-6bdad48cfc34314e89599655442ff210 \| spark.executor.id \| 1.2.0 \| SPARK-3377 \| 79e45c9323455a51f25ed9acd0edd8682b4bbb88#diff-364713d7776956cb8b0a771e9b62f82d \| SparkLauncher.EXECUTOR_EXTRA_CLASSPATH \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.executor.extraClassPath spark.executor.heartbeat.dropZeroAccumulatorUpdates \| 3.0.0 \| SPARK-25449 \| 9362c5cc273fdd09f9b3b512e2f6b64bcefc25ab#diff-6bdad48cfc34314e89599655442ff210 \| spark.executor.heartbeatInterval \| 1.1.0 \| SPARK-2099 \| 8d338f64c4eda45d22ae33f61ef7928011cc2846#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.executor.heartbeat.maxFailures \| 1.6.2 \| SPARK-13522 \| 86bf93e65481b8fe5d7532ca6d4cd29cafc9e9dd#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.executor.processTreeMetrics.enabled \| 3.0.0 \| SPARK-27324 \| 387ce89a0631f1a4c6668b90ff2a7bbcf11919cd#diff-6bdad48cfc34314e89599655442ff210 \| spark.executor.metrics.pollingInterval \| 3.0.0 \| SPARK-26329 \| 80ab19b9fd268adfc419457f12b99a5da7b6d1c7#diff-6bdad48cfc34314e89599655442ff210 \| SparkLauncher.EXECUTOR_EXTRA_JAVA_OPTIONS \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.executor.extraJavaOptions SparkLauncher.EXECUTOR_EXTRA_LIBRARY_PATH \| 1.0.0 \| None \| 29ee101c73bf066bf7f4f8141c475b8d1bd3cf1c#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.executor.extraLibraryPath spark.executor.userClassPathFirst \| 1.3.0 \| SPARK-2996 \| 6a1e0f967286945db13d94aeb6ed19f0a347c236#diff-529fc5c06b9731c1fbda6f3db60b16aa \| SparkLauncher.EXECUTOR_CORES \| 1.0.0 \| SPARK-1126 \| 1617816090e7b20124a512a43860a21232ebf511#diff-4d2ab44195558d5a9d5f15b8803ef39d \| spark.executor.cores SparkLauncher.EXECUTOR_MEMORY \| 0.7.0 \| None \| 696eec32c982ca516c506de33f383a173bcbd131#diff-4f50ad37deb6742ad45472636c9a870b \| spark.executor.memory spark.executor.memoryOverhead \| 2.3.0 \| SPARK-22646 \| 3f4060c340d6bac412e8819c4388ccba226efcf3#diff-6bdad48cfc34314e89599655442ff210 \| spark.cores.max \| 0.6.0 \| None \| 0a472840030e4e7e84fe748f7bfa49f1ece599c5#diff-b6cc54c092b861f645c3cd69ea0f91e2 \| spark.memory.offHeap.enabled \| 1.6.0 \| SPARK-12251 \| 9870e5c7af87190167ca3845ede918671b9420ca#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.memory.offHeap.size \| 1.6.0 \| SPARK-12251 \| 9870e5c7af87190167ca3845ede918671b9420ca#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.memory.storageFraction \| 1.6.0 \| SPARK-10983 \| b3ffac5178795f2d8e7908b3e77e8e89f50b5f6f#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.memory.fraction \| 1.6.0 \| SPARK-10983 \| b3ffac5178795f2d8e7908b3e77e8e89f50b5f6f#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.storage.safetyFraction \| 1.1.0 \| [SPARK-1777 \| ecf30ee7e78ea59c462c54db0fde5328f997466c#diff-2b643ea78c1add0381754b1f47eec132 \| spark.storage.unrollMemoryThreshold \| 1.1.0 \| SPARK-1777 \| ecf30ee7e78ea59c462c54db0fde5328f997466c#diff-692a329b5a7fb4134c55d559457b94e4 \| spark.storage.replication.proactive \| 2.2.0 \| SPARK-15355 \| fa7c582e9442b985a0493fb1dd15b3fb9b6031b4#diff-186864190089a718680accb51de5f0d4 \| spark.storage.memoryMapThreshold \| 0.9.2 \| SPARK-1145 \| 76339495153dd895667ad609815c887b2c8960ea#diff-abd96f2ae793cd6ea6aab5b96a3c1d7a \| spark.storage.replication.policy \| 2.1.0 \| SPARK-15353 \| a26afd52198523dbd51dc94053424494638c7de5#diff-2b643ea78c1add0381754b1f47eec132 \| spark.storage.replication.topologyMapper \| 2.1.0 \| SPARK-15353 \| a26afd52198523dbd51dc94053424494638c7de5#diff-186864190089a718680accb51de5f0d4 \| spark.storage.cachedPeersTtl \| 1.1.1 \| SPARK-3495 and SPARK-3496 \| be0cc9952d6c8b4cfe9ff10a761e0677cba64489#diff-2b643ea78c1add0381754b1f47eec132 \| spark.storage.maxReplicationFailures \| 1.1.1 \| SPARK-3495 and SPARK-3496 \| be0cc9952d6c8b4cfe9ff10a761e0677cba64489#diff-2b643ea78c1add0381754b1f47eec132 \| spark.storage.replication.topologyFile \| 2.1.0 \| SPARK-15353 \| a26afd52198523dbd51dc94053424494638c7de5#diff-e550ce522c12a31d805a7d0f41e802af \| spark.storage.exceptionOnPinLeak \| 1.6.2 \| SPARK-13566 \| ab006523b840b1d2dbf3f5ff0a238558e7665a1e#diff-5a0de266c82b95adb47d9bca714e1f1b \| spark.storage.blockManagerTimeoutIntervalMs \| 0.7.3 \| None \| 9085ebf3750c7d9bb7c6b5f6b4bdc5b807af93c2#diff-76170a9c8f67b542bc58240a0a12fe08 \| spark.storage.blockManagerSlaveTimeoutMs \| 0.7.0 \| None \| 97434f49b8c029e9b78c91ec5f58557cd1b5c943#diff-2ce6374aac24d70c69182b067216e684 \| spark.storage.cleanupFilesAfterExecutorExit \| 2.4.0 \| SPARK-24340 \| 8ef167a5f9ba8a79bb7ca98a9844fe9cfcfea060#diff-916ca56b663f178f302c265b7ef38499 \| spark.diskStore.subDirectories \| 0.6.0 \| None \| 815d6bd69a0c1ba0e94fc0785f5c3619b37f19c5#diff-e8b73c5b81c403a5e5d581f97624c510 \| spark.block.failures.beforeLocationRefresh \| 2.0.0 \| SPARK-13328 \| ff776b2fc1cd4c571fd542dbf807e6fa3373cb34#diff-2b643ea78c1add0381754b1f47eec132 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27847 from beliefer/add-version-to-core-config-part-one. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-08 12:31:57 +09:00
beliefer	e36227e2d9	[SPARK-30914][CORE][DOC] Add version information to the configuration of UI ### What changes were proposed in this pull request? 1.Add version information to the configuration of `UI`. 2.Update the docs of `UI`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.ui.showConsoleProgress \| 1.2.1 \| SPARK-4017 \| 04b1bdbae31c3039125100e703121daf7d9dabf5#diff-364713d7776956cb8b0a771e9b62f82d \| spark.ui.consoleProgress.update.interval \| 2.1.0 \| SPARK-16919 \| e076fb05ac83a3ed6995e29bb03ea07ea05e39db#diff-fbf4e388a66b6a37e984b91cd71a3e2c \| spark.ui.enabled \| 1.1.1 \| SPARK-3490 \| 937de93e80e6d299c4d08be426da2d5bc2d66f98#diff-364713d7776956cb8b0a771e9b62f82d \| spark.ui.port \| 0.7.0 \| None \| f03d9760fd8ac67fd0865cb355ba75d2eff507fe#diff-ed8dbcebe16fda5ecd6df1a981dc6fee \| spark.ui.filters \| 1.0.0 \| SPARK-1189 \| 7edbea41b43e0dc11a2de156be220db8b7952d01#diff-f79a5ead735b3d0b34b6b94486918e1c \| spark.ui.allowFramingFrom \| 1.6.0 \| SPARK-10589 \| 5dbaf3d3911bbfa003bc75459aaad66b4f6e0c67#diff-f79a5ead735b3d0b34b6b94486918e1c \| spark.ui.reverseProxy \| 2.1.0 \| SPARK-15487 \| 92ce8d4849a0341c4636e70821b7be57ad3055b1#diff-364713d7776956cb8b0a771e9b62f82d \| spark.ui.reverseProxyUrl \| 2.1.0 \| SPARK-15487 \| 92ce8d4849a0341c4636e70821b7be57ad3055b1#diff-364713d7776956cb8b0a771e9b62f82d \| spark.ui.killEnabled \| 1.0.0 \| SPARK-1202 \| 211f97447b5f078afcb1619a08d2e2349325f61a#diff-a40023c80383451b6e29ee7a6e0593e9 \| spark.ui.threadDumpsEnabled \| 1.2.0 \| SPARK-611 \| 866c7bbe56f9c7fd96d3f4afe8a76405dc877a6e#diff-5d18fb70c572369a0fff0b97de94f265 \| spark.ui.prometheus.enabled \| 3.0.0 \| SPARK-29064 \| bbfaadb280a80b511a98d18881641c6d9851dd51#diff-f70174ad0759db1fb4cb36a7ff9324a7 \| spark.ui.xXssProtection \| 2.3.0 \| SPARK-22188 \| 5a07aca4d464e96d75ea17bf6768e24b829872ec#diff-6bdad48cfc34314e89599655442ff210 \| spark.ui.xContentTypeOptions.enabled \| 2.3.0 \| SPARK-22188 \| 5a07aca4d464e96d75ea17bf6768e24b829872ec#diff-6bdad48cfc34314e89599655442ff210 \| spark.ui.strictTransportSecurity \| 2.3.0 \| SPARK-22188 \| 5a07aca4d464e96d75ea17bf6768e24b829872ec#diff-6bdad48cfc34314e89599655442ff210 \| spark.ui.requestHeaderSize \| 2.2.3 \| SPARK-26118 \| 9ceee6f188e6c3794d31ce15cc61d29f907bebf7#diff-6bdad48cfc34314e89599655442ff210 \| spark.ui.timeline.tasks.maximum \| 1.4.0 \| SPARK-7296 \| a5f7b3b9c7f05598a1cc8e582e5facee1029cd5e#diff-fa4cfb2cce1b925f55f41f2dfa8c8501 \| spark.acls.enable \| 1.1.0 \| SPARK-1890 and SPARK-1891 \| e3fe6571decfdc406ec6d505fd92f9f2b85a618c#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.ui.view.acls \| 1.0.0 \| SPARK-1189 \| 7edbea41b43e0dc11a2de156be220db8b7952d01#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.ui.view.acls.groups \| 2.0.0 \| SPARK-4224 \| ae79032dcf160796851ca29116cca146c4d86ada#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.admin.acls \| 1.1.0 \| SPARK-1890 and SPARK-1891 \| e3fe6571decfdc406ec6d505fd92f9f2b85a618c#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.admin.acls.groups \| 2.0.0 \| SPARK-4224 \| ae79032dcf160796851ca29116cca146c4d86ada#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.modify.acls \| 1.1.0 \| SPARK-1890 and SPARK-1891 \| e3fe6571decfdc406ec6d505fd92f9f2b85a618c#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.modify.acls.groups \| 2.0.0 \| SPARK-4224 \| ae79032dcf160796851ca29116cca146c4d86ada#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.user.groups.mapping \| 2.0.0 \| SPARK-4224 \| ae79032dcf160796851ca29116cca146c4d86ada#diff-afd88f677ec5ff8b5e96a5cbbe00cd98 \| spark.ui.proxyRedirectUri \| 3.0.0 \| SPARK-30240 \| a9fbd310300e57ed58818d7347f3c3172701c491#diff-f70174ad0759db1fb4cb36a7ff9324a7 \| spark.ui.custom.executor.log.url \| 3.0.0 \| SPARK-26792 \| d5bda2c9e8dde6afc075cc7f65b15fa9aa82231c#diff-f70174ad0759db1fb4cb36a7ff9324a7 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27806 from beliefer/add-version-to-UI-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-06 11:08:57 +09:00
Kent Yao	3edab6cc1d	[MINOR][CORE] Expose the alias -c flag of --conf for spark-submit ### What changes were proposed in this pull request? -c is short for --conf, it was introduced since v1.1.0 but hidden from users until now ### Why are the changes needed? ### Does this PR introduce any user-facing change? no expose hidden feature ### How was this patch tested? Nah Closes #27802 from yaooqinn/conf. Authored-by: Kent Yao <yaooqinn@hotmail.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2020-03-04 20:37:51 -08:00
beliefer	ebcff675e0	[SPARK-30889][SPARK-30913][CORE][DOC] Add version information to the configuration of Tests.scala and Worker ### What changes were proposed in this pull request? 1.Add version information to the configuration of `Tests` and `Worker`. 2.Update the docs of `Worker`. I sorted out some information of `Tests` show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.testing.memory \| 1.6.0 \| SPARK-10983 \| b3ffac5178795f2d8e7908b3e77e8e89f50b5f6f#diff-395d07dcd46359cca610ce74357f0bb4 \| spark.testing.dynamicAllocation.scheduleInterval \| 2.3.0 \| SPARK-22864 \| 4e9e6aee44bb2ddb41b567d659358b22fd824222#diff-b096353602813e47074ace09a3890d56 \| spark.testing \| 1.0.1 \| SPARK-1606 \| ce57624b8232159fe3ec6db228afc622133df591#diff-d239aee594001f8391676e1047a0381e \| spark.test.noStageRetry \| 1.2.0 \| SPARK-3796 \| f55218aeb1e9d638df6229b36a59a15ce5363482#diff-6a9ff7fb74fd490a50462d45db2d5e11 \| spark.testing.reservedMemory \| 1.6.0 \| SPARK-12081 \| 84c44b500b5c90dffbe1a6b0aa86f01699b09b96#diff-395d07dcd46359cca610ce74357f0bb4 \| spark.testing.nHosts \| 3.0.0 \| SPARK-26491 \| 1a641525e60039cc6b10816e946cb6f44b3e2696#diff-8b4ea8f3b0cc1e7ce7e943de1abbb165 \| spark.testing.nExecutorsPerHost \| 3.0.0 \| SPARK-26491 \| 1a641525e60039cc6b10816e946cb6f44b3e2696#diff-8b4ea8f3b0cc1e7ce7e943de1abbb165 \| spark.testing.nCoresPerExecutor \| 3.0.0 \| SPARK-26491 \| 1a641525e60039cc6b10816e946cb6f44b3e2696#diff-8b4ea8f3b0cc1e7ce7e943de1abbb165 \| spark.resources.warnings.testing \| 3.1.0 \| SPARK-29148 \| 496f6ac86001d284cbfb7488a63dd3a168919c0f#diff-8b4ea8f3b0cc1e7ce7e943de1abbb165 \| spark.testing.resourceProfileManager \| 3.1.0 \| SPARK-29148 \| 496f6ac86001d284cbfb7488a63dd3a168919c0f#diff-8b4ea8f3b0cc1e7ce7e943de1abbb165 \| I sorted out some information of `Worker` show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.worker.resourcesFile \| 3.0.0 \| SPARK-27369 \| 7cbe01e8efc3f6cd3a0cac4bcfadea8fcc74a955#diff-b2fc8d6ab7ac5735085e2d6cfacb95da \| spark.worker.timeout \| 0.6.2 \| None \| e395aa295aeec6767df798bf1002b1f30983c1cd#diff-776a630ac2b2ec5fe85c07ca20a58fc0 \| spark.worker.driverTerminateTimeout \| 2.1.2 \| SPARK-20843 \| ebd72f453aa0b4f68760d28b3e93e6dd33856659#diff-829a8674171f92acd61007bedb1bfa4f \| spark.worker.cleanup.enabled \| 1.0.0 \| SPARK-1154 \| 1440154c27ca48b5a75103eccc9057286d3f6ca8#diff-916ca56b663f178f302c265b7ef38499 \| spark.worker.cleanup.interval \| 1.0.0 \| SPARK-1154 \| 1440154c27ca48b5a75103eccc9057286d3f6ca8#diff-916ca56b663f178f302c265b7ef38499 \| spark.worker.cleanup.appDataTtl \| 1.0.0 \| SPARK-1154 \| 1440154c27ca48b5a75103eccc9057286d3f6ca8#diff-916ca56b663f178f302c265b7ef38499 \| spark.worker.preferConfiguredMasterAddress \| 2.2.1 \| SPARK-20529 \| 75e5ea294c15ecfb7366ae15dce196aa92c87ca4#diff-916ca56b663f178f302c265b7ef38499 \| spark.worker.ui.port \| 1.1.0 \| SPARK-2857 \| 12f99cf5f88faf94d9dbfe85cb72d0010a3a25ac#diff-48ca297b6536cb92362bec1487581f05 \| spark.worker.ui.retainedExecutors \| 1.5.0 \| SPARK-9202 \| c0686668ae6a92b6bb4801a55c3b78aedbee816a#diff-916ca56b663f178f302c265b7ef38499 \| spark.worker.ui.retainedDrivers \| 1.5.0 \| SPARK-9202 \| c0686668ae6a92b6bb4801a55c3b78aedbee816a#diff-916ca56b663f178f302c265b7ef38499 \| spark.worker.ui.compressedLogFileLengthCacheSize \| 2.0.2 \| SPARK-17711 \| 26e978a93f029e1a1b5c7524d0b52c8141b70997#diff-d239aee594001f8391676e1047a0381e \| spark.worker.decommission.enabled \| 3.1.0 \| SPARK-20628 \| d273a2bb0fac452a97f5670edd69d3e452e3e57e#diff-b2fc8d6ab7ac5735085e2d6cfacb95da \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27783 from beliefer/add-version-to-tests-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-05 11:58:21 +09:00
yi.wu	b517f991fe	[SPARK-30969][CORE] Remove resource coordination support from Standalone ### What changes were proposed in this pull request? Remove automatically resource coordination support from Standalone. ### Why are the changes needed? Resource coordination is mainly designed for the scenario where multiple workers launched on the same host. However, it's, actually, a non-existed scenario for today's Spark. Because, Spark now can start multiple executors in a single Worker, while it only allow one executor per Worker at very beginning. So, now, it really help nothing for user to launch multiple workers on the same host. Thus, it's not worth for us to bring over complicated implementation and potential high maintain cost for such an impossible scenario. ### Does this PR introduce any user-facing change? No, it's Spark 3.0 feature. ### How was this patch tested? Pass Jenkins. Closes #27722 from Ngone51/abandon_coordination. Authored-by: yi.wu <yi.wu@databricks.com> Signed-off-by: Xingbo Jiang <xingbo.jiang@databricks.com>	2020-03-02 11:23:07 -08:00
beliefer	3beb4f875d	[SPARK-30908][CORE][DOC] Add version information to the configuration of Kryo ### What changes were proposed in this pull request? 1.Add version information to the configuration of `Kryo`. 2.Update the docs of `Kryo`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.kryo.registrationRequired \| 1.1.0 \| SPARK-2102 \| efdaeb111917dd0314f1d00ee8524bed1e2e21ca#diff-1f81c62dad0e2dfc387a974bb08c497c \| spark.kryo.registrator \| 0.5.0 \| None \| 91c07a33d90ab0357e8713507134ecef5c14e28a#diff-792ed56b3398163fa14e8578549d0d98 \| This is not a release version, do we need to record it? spark.kryo.classesToRegister \| 1.2.0 \| SPARK-1813 \| 6bb56faea8d238ea22c2de33db93b1b39f492b3a#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.kryo.unsafe \| 2.1.0 \| SPARK-928 \| bc167a2a53f5a795d089e8a884569b1b3e2cd439#diff-1f81c62dad0e2dfc387a974bb08c497c \| spark.kryo.pool \| 3.0.0 \| SPARK-26466 \| 38f030725c561979ca98b2a6cc7ca6c02a1f80ed#diff-a3c6b992784f9abeb9f3047d3dcf3ed9 \| spark.kryo.referenceTracking \| 0.8.0 \| None \| 0a8cc309211c62f8824d76618705c817edcf2424#diff-1f81c62dad0e2dfc387a974bb08c497c \| spark.kryoserializer.buffer \| 1.4.0 \| SPARK-5932 \| 2d222fb39dd978e5a33cde6ceb59307cbdf7b171#diff-1f81c62dad0e2dfc387a974bb08c497c \| spark.kryoserializer.buffer.max \| 1.4.0 \| SPARK-5932 \| 2d222fb39dd978e5a33cde6ceb59307cbdf7b171#diff-1f81c62dad0e2dfc387a974bb08c497c \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27734 from beliefer/add-version-to-kryo-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-03-02 15:14:47 +09:00
beliefer	325bf56e73	[SPARK-30888][CORE][DOC] Add version information to the configuration of Network ### What changes were proposed in this pull request? 1.Add version information to the configuration of `Network`. 2.Update the docs of `Network`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.network.crypto.saslFallback \| 2.2.0 \| SPARK-19139 \| 8f3f73abc1fe62496722476460c174af0250e3fe#diff-0ac65da2bc6b083fb861fe410c7688c2 \| spark.network.crypto.enabled \| 2.2.0 \| SPARK-19139 \| 8f3f73abc1fe62496722476460c174af0250e3fe#diff-6bdad48cfc34314e89599655442ff210 \| spark.network.remoteReadNioBufferConversion \| 2.4.0 \| SPARK-24307 \| 2c82745686f4456c4d5c84040a431dcb5b6cb60b#diff-2b643ea78c1add0381754b1f47eec132 \| spark.network.timeout \| 1.3.0 \| SPARK-4688 \| d3f07fd23cc26a70f44c52e24445974d4885d58a#diff-1df6b5af3d8f9f16255ff8c7a06f402f \| spark.network.timeoutInterval \| 1.3.2 \| SPARK-5529 \| ec196ab1c7569d7ab0a50c9d7338c2835f2c84d5#diff-47779b72f095f7e7f926898fa1a425ee \| spark.rpc.askTimeout \| 1.4.0 \| SPARK-6490 \| 8136810dfad12008ac300116df7bc8448740f1ae#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.rpc.connect.threads \| 1.6.0 \| SPARK-6028 \| 084e4e126211d74a79e8dbd2d0e604dd3c650822#diff-0c89b4a60c30a7cd2224bb64d93da942 \| spark.rpc.io.numConnectionsPerPeer \| 1.6.0 \| SPARK-10745 \| 34a77679877bc40b58a10ec539a8da00fed7db39#diff-0c89b4a60c30a7cd2224bb64d93da942 \| spark.rpc.io.threads \| 1.6.0 \| SPARK-6028 \| 084e4e126211d74a79e8dbd2d0e604dd3c650822#diff-0c89b4a60c30a7cd2224bb64d93da942 \| spark.rpc.lookupTimeout \| 1.4.0 \| SPARK-6490 \| 8136810dfad12008ac300116df7bc8448740f1ae#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.rpc.message.maxSize \| 2.0.0 \| SPARK-7997 \| bc1babd63da4ee56e6d371eb24805a5d714e8295#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.rpc.netty.dispatcher.numThreads \| 1.6.0 \| SPARK-11079 \| 1797055dbf1d2fd7714d7c65c8d2efde2f15efc1#diff-05133dfc4bfdb6a27aa092d86ce24866 \| spark.rpc.numRetries \| 1.4.0 \| SPARK-6490 \| 8136810dfad12008ac300116df7bc8448740f1ae#diff-529fc5c06b9731c1fbda6f3db60b16aa \| spark.rpc.retry.wait \| 1.4.0 \| SPARK-6490 \| 8136810dfad12008ac300116df7bc8448740f1ae#diff-529fc5c06b9731c1fbda6f3db60b16aa \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27674 from beliefer/add-version-to-network-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-02-27 11:05:11 +09:00
beliefer	c2857501d5	[SPARK-30909][CORE][DOC] Add version information to the configuration of Python ### What changes were proposed in this pull request? 1.Add version information to the configuration of `Python`. 2.Update the docs of `Python`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.python.worker.reuse \| 1.2.0 \| SPARK-3030 \| 2aea0da84c58a179917311290083456dfa043db7#diff-0a67bc4d171abe4df8eb305b0f4123a2 \| spark.python.task.killTimeout \| 2.2.2 \| SPARK-22535 \| be68f86e11d64209d9e325ce807025318f383bea#diff-0a67bc4d171abe4df8eb305b0f4123a2 \| spark.python.use.daemon \| 2.3.0 \| SPARK-22554 \| 57c5514de9dba1c14e296f85fb13fef23ce8c73f#diff-9008ad45db34a7eee2e265a50626841b \| spark.python.daemon.module \| 2.4.0 \| SPARK-22959 \| afae8f2bc82597593595af68d1aa2d802210ea8b#diff-9008ad45db34a7eee2e265a50626841b \| spark.python.worker.module \| 2.4.0 \| SPARK-22959 \| afae8f2bc82597593595af68d1aa2d802210ea8b#diff-9008ad45db34a7eee2e265a50626841b \| spark.executor.pyspark.memory \| 2.4.0 \| SPARK-25004 \| 7ad18ee9f26e75dbe038c6034700f9cd4c0e2baa#diff-6bdad48cfc34314e89599655442ff210 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27704 from beliefer/add-version-to-python-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-02-27 10:57:34 +09:00
beliefer	776e21af40	[SPARK-30910][CORE][DOC] Add version information to the configuration of R ### What changes were proposed in this pull request? 1.Add version information to the configuration of `R`. 2.Update the docs of `R`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.r.backendConnectionTimeout \| 2.1.0 \| SPARK-17919 \| 2881a2d1d1a650a91df2c6a01275eba14a43b42a#diff-025470e1b7094d7cf4a78ea353fb3981 \| spark.r.numRBackendThreads \| 1.4.0 \| SPARK-8282 \| 28e8a6ea65fd08ab9cefc4d179d5c66ffefd3eb4#diff-697f7f2fc89808e0113efc71ed235db2 \| spark.r.heartBeatInterval \| 2.1.0 \| SPARK-17919 \| 2881a2d1d1a650a91df2c6a01275eba14a43b42a#diff-fe903bf14db371aa320b7cc516f2463c \| spark.sparkr.r.command \| 1.5.3 \| SPARK-10971 \| 9695f452e86a88bef3bcbd1f3c0b00ad9e9ac6e1#diff-025470e1b7094d7cf4a78ea353fb3981 \| spark.r.command \| 1.5.3 \| SPARK-10971 \| 9695f452e86a88bef3bcbd1f3c0b00ad9e9ac6e1#diff-025470e1b7094d7cf4a78ea353fb3981 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27708 from beliefer/add-version-to-R-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-02-27 10:56:38 +09:00
yi.wu	e9fd52282e	[SPARK-30689][CORE][FOLLOW-UP] Rename config name of discovery plugin ### What changes were proposed in this pull request? Rename config `spark.resources.discovery.plugin` to `spark.resources.discoveryPlugin`. Also, as a side minor change: labeled `ResourceDiscoveryScriptPlugin` as `DeveloperApi` since it's not for end user. ### Why are the changes needed? Discovery plugin doesn't need to reserve the "discovery" namespace here and it's more consistent with the interface name `ResourceDiscoveryPlugin` if we use `discoveryPlugin` instead. ### Does this PR introduce any user-facing change? No, it's newly added in Spark3.0. ### How was this patch tested? Pass Jenkins. Closes #27689 from Ngone51/spark_30689_followup. Authored-by: yi.wu <yi.wu@databricks.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-02-26 11:55:05 +09:00
beliefer	7911de9d10	[SPARK-30887][CORE][DOC] Add version information to the configuration of Deploy ### What changes were proposed in this pull request? 1.Add version information to the configuration of `Deploy`. 2.Update the docs of `Deploy`. I sorted out some information show below. Item name \| Since version \| JIRA ID \| Commit ID \| Note -- \| -- \| -- \| -- \| -- spark.deploy.recoveryMode \| 0.8.1 \| None \| d66c01f2b6defb3db6c1be99523b734a4d960532#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.deploy.recoveryMode.factory \| 1.2.0 \| SPARK-1830 \| deefd9d7377a8091a1d184b99066febd0e9f6afd#diff-29dffdccd5a7f4c8b496c293e87c8668 \| This configuration appears in branch-1.3, but the version number in the pom.xml file corresponding to the commit is 1.2.0-SNAPSHOT spark.deploy.recoveryDirectory \| 0.8.1 \| None \| d66c01f2b6defb3db6c1be99523b734a4d960532#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.deploy.zookeeper.url \| 0.8.1 \| None \| d66c01f2b6defb3db6c1be99523b734a4d960532#diff-4457313ca662a1cd60197122d924585c \| spark.deploy.zookeeper.dir \| 0.8.1 \| None \| d66c01f2b6defb3db6c1be99523b734a4d960532#diff-a84228cb45c7d5bd93305a1f5bf720b6 \| spark.deploy.retainedApplications \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.deploy.retainedDrivers \| 1.1.0 \| None \| 7446f5ff93142d2dd5c79c63fa947f47a1d4db8b#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.dead.worker.persistence \| 0.8.0 \| None \| 46eecd110a4017ea0c86cbb1010d0ccd6a5eb2ef#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.deploy.maxExecutorRetries \| 1.6.3 \| SPARK-16956 \| ace458f0330f22463ecf7cbee7c0465e10fba8a8#diff-29dffdccd5a7f4c8b496c293e87c8668 \| spark.deploy.spreadOut \| 0.6.1 \| None \| bb2b9ff37cd2503cc6ea82c5dd395187b0910af0#diff-0e7ae91819fc8f7b47b0f97be7116325 \| spark.deploy.defaultCores \| 0.9.0 \| None \| d8bcc8e9a095c1b20dd7a17b6535800d39bff80e#diff-29dffdccd5a7f4c8b496c293e87c8668 \| ### Why are the changes needed? Supplemental configuration version information. ### Does this PR introduce any user-facing change? No ### How was this patch tested? Exists UT Closes #27668 from beliefer/add-version-to-deploy-config. Authored-by: beliefer <beliefer@163.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-02-25 11:39:11 +09:00
Gengliang Wang	2a695e6d15	[SPARK-30907][DOCS] Revise the doc of spark.ui.retainedTasks ### What changes were proposed in this pull request? Revise the documentation of `spark.ui.retainedTasks` to make it clear that the configuration is for one stage. ### Why are the changes needed? There are configurations for the limitation of UI data. `spark.ui.retainedJobs`, `spark.ui.retainedStages` and `spark.worker.ui.retainedExecutors` are the total max number for one application, while the configuration `spark.ui.retainedTasks` is the max number for one stage. ### Does this PR introduce any user-facing change? No ### How was this patch tested? None, just doc. Closes #27660 from gengliangwang/reviseRetainTask. Authored-by: Gengliang Wang <gengliang.wang@databricks.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>	2020-02-21 10:06:45 +09:00

1 2 3 4 5 ...

475 commits