ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
CodeGod	a29df5fa02	[SPARK-27080][SQL] bug fix: mergeWithMetastoreSchema with uniform lower case comparison ## What changes were proposed in this pull request? When reading parquet file with merging metastore schema and file schema, we should compare field names using uniform case. In current implementation, lowercase is used but one omission. And this patch fix it. ## How was this patch tested? Unit test Closes #24001 from codeborui/mergeSchemaBugFix. Authored-by: CodeGod <> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-09 21:28:10 +08:00
Kris Mok	57ae251f75	[SPARK-27097] Avoid embedding platform-dependent offsets literally in whole-stage generated code ## What changes were proposed in this pull request? Spark SQL performs whole-stage code generation to speed up query execution. There are two steps to it: - Java source code is generated from the physical query plan on the driver. A single version of the source code is generated from a query plan, and sent to all executors. - It's compiled to bytecode on the driver to catch compilation errors before sending to executors, but currently only the generated source code gets sent to the executors. The bytecode compilation is for fail-fast only. - Executors receive the generated source code and compile to bytecode, then the query runs like a hand-written Java program. In this model, there's an implicit assumption about the driver and executors being run on similar platforms. Some code paths accidentally embedded platform-dependent object layout information into the generated code, such as: ```java Platform.putLong(buffer, /* offset / 24, / value */ 1); ``` This code expects a field to be at offset +24 of the `buffer` object, and sets a value to that field. But whole-stage code generation generally uses platform-dependent information from the driver. If the object layout is significantly different on the driver and executors, the generated code can be reading/writing to wrong offsets on the executors, causing all kinds of data corruption. One code pattern that leads to such problem is the use of `Platform.XXX` constants in generated code, e.g. `Platform.BYTE_ARRAY_OFFSET`. Bad: ```scala val baseOffset = Platform.BYTE_ARRAY_OFFSET // codegen template: s"Platform.putLong($buffer, $baseOffset, $value);" ``` This will embed the value of `Platform.BYTE_ARRAY_OFFSET` on the driver into the generated code. Good: ```scala val baseOffset = "Platform.BYTE_ARRAY_OFFSET" // codegen template: s"Platform.putLong($buffer, $baseOffset, $value);" ``` This will generate the offset symbolically -- `Platform.putLong(buffer, Platform.BYTE_ARRAY_OFFSET, value)`, which will be able to pick up the correct value on the executors. Caveat: these offset constants are declared as runtime-initialized `static final` in Java, so they're not compile-time constants from the Java language's perspective. It does lead to a slightly increased size of the generated code, but this is necessary for correctness. NOTE: there can be other patterns that generate platform-dependent code on the driver which is invalid on the executors. e.g. if the endianness is different between the driver and the executors, and if some generated code makes strong assumption about endianness, it would also be problematic. ## How was this patch tested? Added a new test suite `WholeStageCodegenSparkSubmitSuite`. This test suite needs to set the driver's extraJavaOptions to force the driver and executor use different Java object layouts, so it's run as an actual SparkSubmit job. Authored-by: Kris Mok <kris.mokdatabricks.com> Closes #24031 from gatorsmile/cherrypickSPARK-27097. Lead-authored-by: Kris Mok <kris.mok@databricks.com> Co-authored-by: gatorsmile <gatorsmile@gmail.com> Signed-off-by: DB Tsai <d_tsai@apple.com>	2019-03-09 01:20:32 +00:00
liuxian	326fc746d3	[SPARK-27056][MESOS] Remove start-shuffle-service.sh ## What changes were proposed in this pull request? `start-shuffle-service.sh` was only used by Mesos before `start-mesos-shuffle-service.sh`. Obviously, `start-mesos-shuffle-service.sh` solves some problems, it is better than `start-shuffle-service.sh`. So now we should delete `start-shuffle-service.sh` in case users use it. ## How was this patch tested? N/A Closes #23975 from 10110346/rmshuffleservice_sh. Authored-by: liuxian <liu.xian3@zte.com.cn> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-08 18:51:38 -06:00
sandeep-katta	14f2286e56	[SPARK-27101][PYTHON] Drop the created database after the test in test_session ## What changes were proposed in this pull request? Cleaning the testcase, drop the database after use ## How was this patch tested? existing UT Closes #24021 from sandeep-katta/cleanPythonTest. Authored-by: sandeep-katta <sandeep.katta2007@gmail.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-09 09:12:33 +09:00
Sunitha Kambhampati	bd2710bd79	[MINOR][SQL] Fix the typo in the spark.sql.extensions conf doc ## What changes were proposed in this pull request? Fix the typo (missing the s) in the class name (SparkSessionExtensions) in the doc for Spark conf spark.sql.extensions. ## How was this patch tested? Verified by checking that the configuration doc shows up correctly in spark-shell using the SET -v Closes #24020 from skambha/fixnametypo. Authored-by: Sunitha Kambhampati <skambha@us.ibm.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-09 08:51:19 +09:00
SongYadong	14b1312727	[SPARK-27103][SQL][MINOR] List SparkSql reserved keywords in alphabet order ## What changes were proposed in this pull request? This PR tries to correct spark-sql reserved keywords' position in list if they are not in alphabetical order. In test suite some repeated words are removed. Also some comments are added for remind. ## How was this patch tested? Existing unit tests. Closes #23985 from SongYadong/sql_reserved_alphabet. Authored-by: SongYadong <song.yadong1@zte.com.cn> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-08 10:51:39 -08:00
wangguangxin.cn	d3d9c7bb0a	[SPARK-27079][MINOR][SQL] Fix typo & Remove useless imports & Add missing `override` annotation ## What changes were proposed in this pull request? 1. Fix two typos 2. Remove useless imports in `CSVExprUtils.scala` 3. Add missing `override` annotation ## How was this patch tested? test by existing uts Closes #24000 from WangGuangxin/SPARK-27079. Authored-by: wangguangxin.cn <wangguangxin.cn@gmail.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-08 12:14:04 -06:00
Ryan Blue	6170e40c15	[SPARK-24252][SQL] Add v2 catalog plugin system ## What changes were proposed in this pull request? This adds a v2 API for adding new catalog plugins to Spark. * Catalog implementations extend `CatalogPlugin` and are loaded via reflection, similar to data sources * `Catalogs` loads and initializes catalogs using configuration from a `SQLConf` * `CaseInsensitiveStringMap` is used to pass configuration to `CatalogPlugin` via `initialize` Catalogs are configured by adding config properties starting with `spark.sql.catalog.(name)`. The name property must specify a class that implements `CatalogPlugin`. Other properties under the namespace (`spark.sql.catalog.(name).(prop)`) are passed to the provider during initialization along with the catalog name. This replaces #21306, which will be implemented in two multiple parts: the catalog plugin system (this commit) and specific catalog APIs, like `TableCatalog`. ## How was this patch tested? Added test suites for `CaseInsensitiveStringMap` and for catalog loading. Closes #23915 from rdblue/SPARK-24252-add-v2-catalog-plugins. Authored-by: Ryan Blue <blue@apache.org> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-08 19:31:49 +08:00
Yuming Wang	2036074b99	[SPARK-26004][SQL] InMemoryTable support StartsWith predicate push down ## What changes were proposed in this pull request? [SPARK-24638](https://issues.apache.org/jira/browse/SPARK-24638) adds support for Parquet file `StartsWith` predicate push down. `InMemoryTable` can also support this feature. This is an example to explain how it works, Imagine that the `id` column stored as below: Partition ID \| lowerBound \| upperBound -- \| -- \| -- p1 \| '1' \| '9' p2 \| '10' \| '19' p3 \| '20' \| '29' p4 \| '30' \| '39' p5 \| '40' \| '49' A filter ```df.filter($"id".startsWith("2"))``` or ```id like '2%'``` then we substr lowerBound and upperBound: Partition ID \| lowerBound.substr(0, Length("2")) \| upperBound.substr(0, Length("2")) -- \| -- \| -- p1 \| '1' \| '9' p2 \| '1' \| '1' p3 \| '2' \| '2' p4 \| '3' \| '3' p5 \| '4' \| '4' We can see that we only need to read `p1` and `p3`. ## How was this patch tested? unit tests and benchmark tests benchmark test result: ``` ================================================================================================ Pushdown benchmark for StringStartsWith ================================================================================================ Java HotSpot(TM) 64-Bit Server VM 1.8.0_191-b12 on Mac OS X 10.12.6 Intel(R) Core(TM) i7-7820HQ CPU 2.90GHz StringStartsWith filter: (value like '10%'): Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------ InMemoryTable Vectorized 12068 / 14198 1.3 767.3 1.0X InMemoryTable Vectorized (Pushdown) 5457 / 8662 2.9 347.0 2.2X Java HotSpot(TM) 64-Bit Server VM 1.8.0_191-b12 on Mac OS X 10.12.6 Intel(R) Core(TM) i7-7820HQ CPU 2.90GHz StringStartsWith filter: (value like '1000%'): Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------ InMemoryTable Vectorized 5246 / 5355 3.0 333.5 1.0X InMemoryTable Vectorized (Pushdown) 2185 / 2346 7.2 138.9 2.4X Java HotSpot(TM) 64-Bit Server VM 1.8.0_191-b12 on Mac OS X 10.12.6 Intel(R) Core(TM) i7-7820HQ CPU 2.90GHz StringStartsWith filter: (value like '786432%'): Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------ InMemoryTable Vectorized 5112 / 5312 3.1 325.0 1.0X InMemoryTable Vectorized (Pushdown) 2292 / 2522 6.9 145.7 2.2X ``` Closes #23004 from wangyum/SPARK-26004. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-08 19:18:32 +08:00
Sean Owen	5ebb4b5723	[SPARK-24783][SQL] spark.sql.shuffle.partitions=0 should throw exception ## What changes were proposed in this pull request? Throw an exception if spark.sql.shuffle.partitions=0 This takes over https://github.com/apache/spark/pull/23835 ## How was this patch tested? Existing tests. Closes #24008 from srowen/SPARK-24783.2. Lead-authored-by: Sean Owen <sean.owen@databricks.com> Co-authored-by: WindCanDie <491237260@qq.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-08 14:09:53 +09:00
Jungtaek Lim (HeartSaVioR)	d8f77e11a4	[SPARK-27001][SQL][FOLLOWUP] Address primitive array type for serializer ## What changes were proposed in this pull request? This is follow-up PR which addresses review comment in PR for SPARK-27001: https://github.com/apache/spark/pull/23908#discussion_r261511454 This patch proposes addressing primitive array type for serializer - instead of handling it to generic one, Spark now handles it efficiently as primitive array. ## How was this patch tested? UT modified to include primitive array. Closes #24015 from HeartSaVioR/SPARK-27001-FOLLOW-UP-java-primitive-array. Authored-by: Jungtaek Lim (HeartSaVioR) <kabhwan@gmail.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-08 11:54:04 +08:00
Yuming Wang	43dcb91a4c	[SPARK-19678][FOLLOW-UP][SQL] Add behavior change test when table statistics are incorrect ## What changes were proposed in this pull request? Since Spark 2.2.0 ([SPARK-19678](https://issues.apache.org/jira/browse/SPARK-19678)), the below SQL changed from `broadcast join` to `sort merge join`: ```sql -- small external table with incorrect statistics CREATE EXTERNAL TABLE t1(c1 int) ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' WITH SERDEPROPERTIES ( 'serialization.format' = '1' ) STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat' LOCATION 'file:///tmp/t1' TBLPROPERTIES ( 'rawDataSize'='-1', 'numFiles'='0', 'totalSize'='0', 'COLUMN_STATS_ACCURATE'='false', 'numRows'='-1' ); -- big table CREATE TABLE t2 (c1 int) LOCATION 'file:///tmp/t2' TBLPROPERTIES ( 'rawDataSize'='23437737', 'numFiles'='12222', 'totalSize'='333442230', 'COLUMN_STATS_ACCURATE'='false', 'numRows'='443442223' ); explain SELECT t1.c1 FROM t1 INNER JOIN t2 ON t1.c1 = t2.c1; ``` This pr add a test case for this behavior change. ## How was this patch tested? unit tests Closes #24003 from wangyum/SPARK-19678. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-08 11:47:49 +08:00
Yuming Wang	d70b6a39e1	[MINOR][BUILD] Add 2 maven properties(hive.classifier and hive.parquet.group) ## What changes were proposed in this pull request? This pr adds 2 maven properties to help us upgrade the built-in Hive. \| Property Name \| Default \| In future \| \| ------ \| ------ \| ------ \| \| hive.classifier \| (none) \| core \| \| hive.parquet.group \| com.twitter \| org.apache.parquet \| ## How was this patch tested? existing tests Closes #23996 from wangyum/add_2_maven_properties. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-07 16:46:07 -06:00
Shahid	713646ddc2	[SPARK-27075] Remove duplicate execution tag parameters from the url, when accessing the execution table in the SQL page ## What changes were proposed in this pull request? When we sort any columns in the execution table of the SQL page in the WEBUI, it throws IllegalArgumentException. The root cause is that, in the url, we are duplicating the execution tag parameters in the 'parameterPath'. Actually we should filter out the executionTag related entries while getting the 'parameterOtherTable' `e9e8bb33ef/sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala (L161-L163)` `e9e8bb33ef/sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala (L241)` `e9e8bb33ef/sql/core/src/main/scala/org/apache/spark/sql/execution/ui/AllExecutionsPage.scala (L263-L266)` ## How was this patch tested? Manually tested Test steps: Sort any column in the sql page execution table Before fix: ![screenshot from 2019-03-07 01-38-17](https://user-images.githubusercontent.com/23054875/53913261-f0b69580-4080-11e9-88ea-f238b47a21d5.png) After fix: ![screenshot from 2019-03-07 02-01-40](https://user-images.githubusercontent.com/23054875/53913285-01670b80-4081-11e9-81b6-78cdbf5a0817.png) Closes #23994 from shahidki31/SPARK-27075. Authored-by: Shahid <shahidki31@gmail.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-07 12:52:46 -08:00
Gabor Somogyi	98a8725e66	[SPARK-27022][DSTREAMS] Add kafka delegation token support. ## What changes were proposed in this pull request? It adds Kafka delegation token support for DStreams. Please be aware as Kafka native sink is not available for DStreams this PR contains delegation token usage only on consumer side. What this PR contains: * Usage of token through dynamic JAAS configuration * `KafkaConfigUpdater` moved to `kafka-0-10-token-provider` * `KafkaSecurityHelper` functionality moved into `KafkaTokenUtil` * Documentation ## How was this patch tested? Existing unit tests + on cluster. Long running Kafka to file tests on 4 node cluster with randomly thrown artificial exceptions. Test scenario: * 4 node cluster * Yarn * Kafka broker version 2.1.0 * security.protocol = SASL_SSL * sasl.mechanism = SCRAM-SHA-512 Kafka broker settings: * delegation.token.expiry.time.ms=600000 (10 min) * delegation.token.max.lifetime.ms=1200000 (20 min) * delegation.token.expiry.check.interval.ms=300000 (5 min) After each 7.5 minutes new delegation token obtained from Kafka broker (10 min * 0.75). When token expired after 10 minutes (Spark obtains new one and doesn't renew the old), the brokers expiring thread comes after each 5 minutes (invalidates expired tokens) and artificial exception has been thrown inside the Spark application (such case Spark closes connection), then the latest delegation token picked up correctly. cd docs/ SKIP_API=1 jekyll build Manual webpage check. Closes #23929 from gaborgsomogyi/SPARK-27022. Authored-by: Gabor Somogyi <gabor.g.somogyi@gmail.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-07 11:36:37 -08:00
Bryan Cutler	ddc2052ebd	[SPARK-23836][PYTHON] Add support for StructType return in Scalar Pandas UDF ## What changes were proposed in this pull request? This change adds support for returning StructType from a scalar Pandas UDF, where the return value of the function is a pandas.DataFrame. Nested structs are not supported and an error will be raised, child types can be any other type currently supported. ## How was this patch tested? Added additional unit tests to `test_pandas_udf_scalar` Closes #23900 from BryanCutler/pyspark-support-scalar_udf-StructType-SPARK-23836. Authored-by: Bryan Cutler <cutlerb@gmail.com> Signed-off-by: Bryan Cutler <cutlerb@gmail.com>	2019-03-07 08:52:24 -08:00
Takeshi Yamamuro	315c95c399	[SPARK-25863][SPARK-21871][SQL] Check if code size statistics is empty or not in updateAndGetCompilationStats ## What changes were proposed in this pull request? `CodeGenerator.updateAndGetCompilationStats` throws an unsupported exception for empty code size statistics. This pr added code to check if it is empty or not. ## How was this patch tested? Pass Jenkins. Closes #23947 from maropu/SPARK-21871-FOLLOWUP. Authored-by: Takeshi Yamamuro <yamamuro@apache.org> Signed-off-by: Takeshi Yamamuro <yamamuro@apache.org>	2019-03-07 17:25:22 +09:00
Dilip Biswal	a0e26cffc5	[MINOR][SQL][TEST] Include usage example for generating output for single test in SQLQueryTestSuite ## What changes were proposed in this pull request? This is a very minor pr to include the usage example to generate output for single test in SQLQueryTestSuite. I tried to deduce it from the existing example and ran into a scenario where sbt is simply looping to run the same test over and over again. Here is the example of running a single test. ``` build/sbt "~sql/test-only SQLQueryTestSuite -- -z inline-table.sql" ``` I tried to generate the output for a single test by prepending `SPARK_GENERATE_GOLDEN_FILES=1` like following ``` SPARK_GENERATE_GOLDEN_FILES=1 build/sbt "~sql/test-only SQLQueryTestSuite -- -z describe.sql" ``` In this case i found that sbt is looping trying to run describe.sql over and over again as we are running the test in on continuous mode (because of `~` prefix ) where it detects a change in the generated result file which in turn triggers a build and test. I have included an example where we don't run it in continuous mode when generating the output. Hopefully it saves other developers some time. ## How was this patch tested? Verified manually in my dev setup. Closes #23995 from dilipbiswal/dkb_sqlquerytest_usage. Authored-by: Dilip Biswal <dbiswal@us.ibm.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-07 13:06:23 +09:00
Gengliang Wang	a543f917e0	[SPARK-27049][SQL] Create util class to support handling partition values in file source V2 ## What changes were proposed in this pull request? While I am migrating other data sources, I find that we should abstract the logic that: 1. converting safe `InternalRow`s into `UnsafeRow`s 2. appending partition values to the end of the result row if existed This PR proposes to support handling partition values in file source v2 abstraction by adding a util class `PartitionReaderWithPartitionValues`. ## How was this patch tested? Existing unit tests Closes #23987 from gengliangwang/SPARK-27049. Authored-by: Gengliang Wang <gengliang.wang@databricks.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-07 11:24:15 +08:00
Brooke Wenig	340c8b8387	[MINOR][DOC] Updated PySpark Binarizer docstring to match Scala's. ## What changes were proposed in this pull request? PySpark's Binarizer docstring had two issues: 1) The values did not need to be in the range [0, 1]. 2) It can be used for binary classification prediction. This change corrects both of these issues by making it consistent with Scala's docstring for Binarizer. ## How was this patch tested? Not applicable because I only changed the docstring. But if I need to do any testing, let me know and I'll do it. Please review http://spark.apache.org/contributing.html before opening a pull request. Closes #23934 from brookewenig/binarizer-docs-fix. Authored-by: Brooke Wenig <brookewenig@gmail.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-06 19:42:41 -06:00
Yuming Wang	32848eecc5	[SPARK-27078][SQL] Fix NoSuchFieldError when read Hive materialized views ## What changes were proposed in this pull request? This pr fix `NoSuchFieldError` when reading Hive materialized views from Hive 2.3.4. How to reproduce: Hive side: ```sql CREATE TABLE materialized_view_tbl (key INT); CREATE MATERIALIZED VIEW view_1 DISABLE REWRITE AS SELECT * FROM materialized_view_tbl; ``` Spark side: ```java bin/spark-sql --conf spark.sql.hive.metastore.version=2.3.4 --conf spark.sql.hive.metastore.jars=maven spark-sql> select * from view_1; 19/03/05 19:55:37 ERROR SparkSQLDriver: Failed in [select * from view_1] java.lang.NoSuchFieldError: INDEX_TABLE at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$getTableOption$3(HiveClientImpl.scala:438) at scala.Option.map(Option.scala:163) at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$getTableOption$1(HiveClientImpl.scala:370) at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$withHiveState$1(HiveClientImpl.scala:277) at org.apache.spark.sql.hive.client.HiveClientImpl.liftedTree1$1(HiveClientImpl.scala:215) at org.apache.spark.sql.hive.client.HiveClientImpl.retryLocked(HiveClientImpl.scala:214) at org.apache.spark.sql.hive.client.HiveClientImpl.withHiveState(HiveClientImpl.scala:260) at org.apache.spark.sql.hive.client.HiveClientImpl.getTableOption(HiveClientImpl.scala:368) ``` ## How was this patch tested? unit tests Closes #23984 from wangyum/SPARK-24360. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-06 16:56:32 -08:00
Maxim Gekk	9513d82edd	[SPARK-27057][SQL] Common trait for limit exec operators ## What changes were proposed in this pull request? I would like to refactor `limit.scala` slightly and introduce common trait `LimitExec` for `CollectLimitExec` and `BaseLimitExec` (`LocalLimitExec` and `GlobalLimitExec`). This will allow to distinguish those operators from others, and to get the `limit` value without casting to concrete class. ## How was this patch tested? by existing test suites. Closes #23976 from MaxGekk/limit-exec. Authored-by: Maxim Gekk <maxim.gekk@databricks.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-07 08:47:52 +08:00
Shahid	62fd133f74	[SPARK-27019][SQL][WEBUI] onJobStart happens after onExecutionEnd shouldn't overwrite kvstore ## What changes were proposed in this pull request? Currently, when the event reordering happens, especially onJobStart event come after onExecutionEnd event, SQL page in the UI displays weirdly.(for eg:test mentioned in JIRA and also this issue randomly occurs when the TPCDS query fails due to broadcast timeout etc.) The reason is that, In the SQLAppstatusListener, we remove the liveExecutions entry once the execution ends. So, if a jobStart event come after that, then we create a new liveExecution entry corresponding to the execId. Eventually this will overwrite the kvstore and UI displays confusing entries. ## How was this patch tested? Added UT, Also manually tested with the eventLog, provided in the jira, of the failed query. Before fix: ![screenshot from 2019-03-03 03-05-52](https://user-images.githubusercontent.com/23054875/53687929-53e2b800-3d61-11e9-9dca-620fa41e605c.png) After fix: ![screenshot from 2019-03-03 02-40-18](https://user-images.githubusercontent.com/23054875/53687928-4f1e0400-3d61-11e9-86aa-584646ac68f9.png) Closes #23939 from shahidki31/SPARK-27019. Authored-by: Shahid <shahidki31@gmail.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-06 14:02:30 -08:00
DB Tsai	b6375097bc	[SPARK-27026][BUILD] Upgrade Docker image for release build to Ubuntu 18.04 LTS ## What changes were proposed in this pull request? Upgrade Docker image for release build to Ubuntu 18.04LTS ## How was this patch tested? Manually tested. Closes #23932 from dbtsai/ubuntu18.04. Authored-by: DB Tsai <d_tsai@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-06 13:58:21 -08:00
Onur Satici	e9e8bb33ef	[SPARK-27023][K8S] Make k8s client timeouts configurable ## What changes were proposed in this pull request? Make k8s client timeouts configurable. No test suite exists for the client factory class, happy to add one if needed Closes #23928 from onursatici/os/k8s-client-timeouts. Lead-authored-by: Onur Satici <osatici@palantir.com> Co-authored-by: Onur Satici <onursatici@gmail.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-06 11:14:39 -08:00
Wenchen Fan	cb20fbc43e	[SPARK-27065][CORE] avoid more than one active task set managers for a stage ## What changes were proposed in this pull request? This is another attempt to fix the more-than-one-active-task-set-managers bug. https://github.com/apache/spark/pull/17208 is the first attempt. It marks the TSM as zombie before sending a task completion event to DAGScheduler. This is necessary, because when the DAGScheduler gets the task completion event, and it's for the last partition, then the stage is finished. However, if it's a shuffle stage and it has missing map outputs, DAGScheduler will resubmit it(see the [code](https://github.com/apache/spark/blob/v2.4.0/core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala#L1416-L1422)) and create a new TSM for this stage. This leads to more than one active TSM of a stage and fail. This fix has a hole: Let's say a stage has 10 partitions and 2 task set managers: TSM1(zombie) and TSM2(active). TSM1 has a running task for partition 10 and it completes. TSM2 finishes tasks for partitions 1-9, and thinks he is still active because he hasn't finished partition 10 yet. However, DAGScheduler gets task completion events for all the 10 partitions and thinks the stage is finished. Then the same problem occurs: DAGScheduler may resubmit the stage and cause more than one actice TSM error. https://github.com/apache/spark/pull/21131 fixed this hole by notifying all the task set managers when a task finishes. For the above case, TSM2 will know that partition 10 is already completed, so he can mark himself as zombie after partitions 1-9 are completed. However, #21131 still has a hole: TSM2 may be created after the task from TSM1 is completed. Then TSM2 can't get notified about the task completion, and leads to the more than one active TSM error. #22806 and #23871 are created to fix this hole. However the fix is complicated and there are still ongoing discussions. This PR proposes a simple fix, which can be easy to backport: mark all existing task set managers as zombie when trying to create a new task set manager. After this PR, #21131 is still necessary, to avoid launching unnecessary tasks and fix [SPARK-25250](https://issues.apache.org/jira/browse/SPARK-25250 ). #22806 and #23871 are its followups to fix the hole. ## How was this patch tested? existing tests. Closes #23927 from cloud-fan/scheduler. Authored-by: Wenchen Fan <wenchen@databricks.com> Signed-off-by: Imran Rashid <irashid@cloudera.com>	2019-03-06 12:00:33 -06:00
wuyi	e5c61436a5	[SPARK-23433][SPARK-25250][CORE] Later created TaskSet should learn about the finished partitions ## What changes were proposed in this pull request? This is an optional solution for #22806 . #21131 firstly implement that a previous successful completed task from zombie TaskSetManager could also succeed the active TaskSetManager, which based on an assumption that an active TaskSetManager always exists for that stage when this happen. But that's not always true as an active TaskSetManager may haven't been created when a previous task succeed, and this is the reason why #22806 hit the issue. This pr extends #21131 's behavior by adding `stageIdToFinishedPartitions` into TaskSchedulerImpl, which recording the finished partition whenever a task(from zombie or active) succeed. Thus, a later created active TaskSetManager could also learn about the finished partition by looking into `stageIdToFinishedPartitions ` and won't launch any duplicate tasks. ## How was this patch tested? Add. Closes #23871 from Ngone51/dev-23433-25250. Lead-authored-by: wuyi <ngone_5451@163.com> Co-authored-by: Ngone51 <ngone_5451@163.com> Signed-off-by: Imran Rashid <irashid@cloudera.com>	2019-03-06 11:53:07 -06:00
Udbhav30	9bddf7180e	[SPARK-24669][SQL] Invalidate tables in case of DROP DATABASE CASCADE ## What changes were proposed in this pull request? Before dropping database refresh the tables of that database, so as to refresh all cached entries associated with those tables. We follow the same when dropping a table. ## How was this patch tested? UT is added Closes #23905 from Udbhav30/SPARK-24669. Authored-by: Udbhav30 <u.agrawal30@gmail.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-06 09:06:10 -08:00
Ajith	190a3a4ad8	[SPARK-27047] Document stop-slave.sh in spark-standalone ## What changes were proposed in this pull request? spark-standalone documentation do not mention about stop-slave.sh script ## How was this patch tested? Manually tested the changes Closes #23960 from ajithme/slavedoc. Authored-by: Ajith <ajith2489@gmail.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-06 09:12:24 -06:00
Maxim Gekk	6001258398	[SPARK-27035][SQL] Get more precise current time ## What changes were proposed in this pull request? In the PR, I propose to replace `System.currentTimeMillis()` by `Instant.now()` in the `CurrentTimestamp` expression. `Instant.now()` uses the best available clock in the system to take current time. See [JDK-8068730](https://bugs.openjdk.java.net/browse/JDK-8068730) for more details. In JDK8, `Instant.now()` provides results with millisecond resolution but starting from JDK9 resolution of results is increased up to microseconds. ## How was this patch tested? The changes were tested by `DateTimeUtilsSuite` and by `DateFunctionsSuite`. Closes #23945 from MaxGekk/current-time. Authored-by: Maxim Gekk <max.gekk@gmail.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-06 08:32:16 -06:00
masa3141	5fa4ba0cfb	[SPARK-26981][MLLIB] Add 'Recall_at_k' metric to RankingMetrics ## What changes were proposed in this pull request? Add 'Recall_at_k' metric to RankingMetrics ## How was this patch tested? Add test to RankingMetricsSuite. Closes #23881 from masa3141/SPARK-26981. Authored-by: masa3141 <masahiro@kazama.tv> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-06 08:28:53 -06:00
Maxim Gekk	9b55722161	[SPARK-27031][SQL] Avoid double formatting in timestampToString ## What changes were proposed in this pull request? Removed unnecessary conversion of microseconds in `DateTimeUtils.timestampToString` to `java.sql.Timestamp` which aims to output fraction of seconds by casting it to string. This was replaced by special `TimestampFormatter` which appends the fraction formatter to `DateTimeFormatterBuilder`: `appendFraction(ChronoField.NANO_OF_SECOND, 0, 9, true)`. The former one means trailing zeros in second's fraction should be truncated while formatting. ## How was this patch tested? By existing test suites like `CastSuite`, `DateTimeUtilsSuite`, `JDBCSuite`, and by new test in `TimestampFormatterSuite`. Closes #23936 from MaxGekk/timestamp-to-string. Lead-authored-by: Maxim Gekk <max.gekk@gmail.com> Co-authored-by: Maxim Gekk <maxim.gekk@databricks.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-06 08:26:59 -06:00
moqimoqidea	3fcbc7fb9f	[MINOR] Spelling mistake: forword -> forward ## What changes were proposed in this pull request? Spelling mistake: forword -> forward ## How was this patch tested? This is a private function, there is no place to call this function outside of this file. Closes #23978 from moqimoqidea/master. Authored-by: moqimoqidea <39821951+moqimoqidea@users.noreply.github.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-06 16:29:07 +09:00
Hyukjin Kwon	543cd572c6	[SPARK-26922][R] Set socket timeout consistently in Arrow optimization ## What changes were proposed in this pull request? This PR sets socket timeout consistently across Arrow optimization by `SPARKR_BACKEND_CONNECTION_TIMEOUT`. There looks only one place left. ## How was this patch tested? Existing tests should cover. Closes #23971 from HyukjinKwon/SPARK-26922. Authored-by: Hyukjin Kwon <gurwls223@apache.org> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-06 08:58:49 +09:00
mwlon	0ba19543d2	[SPARK-27015][MESOS] properly escape mesos scheduler arguments ## What changes were proposed in this pull request? Escape arguments for submissions sent to a Mesos dispatcher; analogous change to https://issues.apache.org/jira/browse/SPARK-24380 for confs. Since this changes behavior than some users are undoubtedly already working around, probably best to only only merge into master. ## How was this patch tested? Added a new unit test, covering some existing behavior as well. Closes #23967 from mwlon/SPARK-27015. Authored-by: mwlon <mloncaric@hmc.edu> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-05 13:05:37 -08:00
“attilapiros”	5668c42edf	[SPARK-27021][CORE] Cleanup of Netty event loop group for shuffle chunk fetch requests ## What changes were proposed in this pull request? Creating an Netty `EventLoopGroup` leads to creating a new Thread pool for handling the events. For stopping the threads of the pool the event loop group should be shut down which is properly done for transport servers and clients by calling for example the `shutdownGracefully()` method (for details see the `close()` method of `TransportClientFactory` and `TransportServer`). But there is a separate event loop group for shuffle chunk fetch requests which is in pipeline for handling fetch request (shared between the client and server) and owned by the `TransportContext` and this was never shut down. ## How was this patch tested? With existing unittest. This leak is in the production system too but its effect is spiking in the unittest. Checking the core unittest logs before the PR: ``` $ grep "LEAK IN SUITE" unit-tests.log \| grep -o shuffle-chunk-fetch-handler \| wc -l 381 ``` And after the PR without whitelisting in thread audit and with an extra `await` after the ` chunkFetchWorkers.shutdownGracefully()`: ``` $ grep "LEAK IN SUITE" unit-tests.log \| grep -o shuffle-chunk-fetch-handler \| wc -l 0 ``` Closes #23930 from attilapiros/SPARK-27021. Authored-by: “attilapiros” <piros.attila.zsolt@gmail.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-05 12:31:06 -08:00
Bo Hai	c27caead43	[SPARK-26932][DOC] Add a warning for Hive 2.1.1 ORC reader issue Hive 2.1.1 cannot read ORC table created by Spark 2.4.0 in default, and I add the information into sql-migration-guide-upgrade.md. for details to see: [SPARK-26932](https://issues.apache.org/jira/browse/SPARK-26932) doc build Closes #23944 from haiboself/SPARK-26932. Authored-by: Bo Hai <haibo-self@163.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-05 12:07:15 -08:00
Liang-Chi Hsieh	83857496e5	[SPARK-27043][SQL] Add ORC nested schema pruning benchmarks ## What changes were proposed in this pull request? We have benchmark of nested schema pruning, but only for Parquet. This adds similar benchmark for ORC. This is used with nested schema pruning of ORC. ## How was this patch tested? Added test. Closes #23955 from viirya/orc-nested-schema-pruning-benchmark. Authored-by: Liang-Chi Hsieh <viirya@gmail.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-05 11:12:57 -08:00
Luca Canali	25d2850665	[SPARK-26928][CORE] Add driver CPU Time to the metrics system ## What changes were proposed in this pull request? This proposes to add instrumentation for the driver's JVM CPU time via the Spark Dropwizard/Codahale metrics system. It follows directly from previous work SPARK-25228 and shares similar motivations: it is intended as an improvement to be used for Spark performance dashboards and monitoring tools/instrumentation. Implementation details: this PR takes the code introduced in SPARK-25228 and moves it to a new separate Source JVMCPUSource, which is then used to register the jvmCpuTime gauge metric for both executor and driver. The registration of the jvmCpuTime metric for the driver is conditional, a new configuration parameter `spark.metrics.cpu.time.driver.enabled` (proposed default: false) is introduced for this purpose. ## How was this patch tested? Manually tested, using local mode and using YARN. Closes #23838 from LucaCanali/addCPUTimeMetricDriver. Authored-by: Luca Canali <luca.canali@cern.ch> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-05 10:47:39 -08:00
Ajith	6207360b00	[SPARK-27012][CORE] Storage tab shows rdd details even after executor ended ## What changes were proposed in this pull request? After we cache a table, we can see its details in Storage Tab of spark UI. If the executor has shutdown ( graceful shutdown/ Dynamic executor scenario) UI still shows the rdd as cached and when we click the link it throws error. This is because on executor remove event, we fail to adjust rdd partition details org.apache.spark.status.AppStatusListener#onExecutorRemoved ## How was this patch tested? Have tested this fix in UI manually Edit: Added UT Closes #23920 from ajithme/cachestorage. Authored-by: Ajith <ajith2489@gmail.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-05 10:40:38 -08:00
Gabor Somogyi	b99610e9ed	[SPARK-26592][SS][DOC] Add Kafka proxy user caveat to documentation ## What changes were proposed in this pull request? Since this caveat added to the DStreams documentation it would be good to add to Structured Streaming as well. ## How was this patch tested? cd docs/ SKIP_API=1 jekyll build Manual webpage check. Closes #23974 from gaborgsomogyi/SPARK-26592_. Authored-by: Gabor Somogyi <gabor.g.somogyi@gmail.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-05 09:58:51 -08:00
Takeshi Yamamuro	4490fd0ff0	[SPARK-27001][SQL][FOLLOW-UP] Drop Serializable in WalkedTypePath ## What changes were proposed in this pull request? This pr tried to drop `Serializable` in `WalkedTypePath`. ## How was this patch tested? Pass Jenkins. Closes #23973 from maropu/SPARK-27001-FOLLOWUP. Authored-by: Takeshi Yamamuro <yamamuro@apache.org> Signed-off-by: Wenchen Fan <wenchen@databricks.com>	2019-03-05 23:05:50 +08:00
Sean Owen	3909223681	[MINOR][DOCS] Clarify that Spark apps should mark Spark as a 'provided' dependency, not package it ## What changes were proposed in this pull request? Spark apps do not need to package Spark. In fact it can cause problems in some cases. Our examples should show depending on Spark as a 'provided' dependency. Packaging Spark makes the app much bigger by tens of megabytes. It can also bring in conflicting dependencies that wouldn't otherwise be a problem. https://issues.apache.org/jira/browse/SPARK-26146 was what reminded me of this. ## How was this patch tested? Doc build Closes #23938 from srowen/Provided. Authored-by: Sean Owen <sean.owen@databricks.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-05 08:26:30 -06:00
Yuming Wang	940626b724	[SPARK-15095][FOLLOW-UP][SQL] Remove HiveSessionHook related code from ThriftServer ## What changes were proposed in this pull request? https://github.com/apache/spark/pull/12881 removed `HiveSessionHook`. But there are still some code related to `HiveSessionHook`. This PR removes all `HiveSessionHook` related code. ## How was this patch tested? manual tests Closes #23957 from wangyum/SPARK-15095. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>	2019-03-05 07:42:25 -06:00
Yanbo Liang	7857c6d633	[SPARK-27051][CORE] Bump Jackson version to 2.9.8 ## What changes were proposed in this pull request? Fasterxml Jackson version before 2.9.8 is affected by multiple [CVEs](https://github.com/FasterXML/jackson-databind/issues/2186), we need to fix bump the dependent Jackson to 2.9.8. ## How was this patch tested? Existing tests and offline benchmark. I have run ```SPARK_GENERATE_BENCHMARK_FILES=1 build/sbt "sql/test:runMain org.apache.spark.sql.execution.datasources.json.JSONBenchmark"``` to check there is no performance degradation for this upgrade. Closes #23965 from yanboliang/SPARK-27051. Authored-by: Yanbo Liang <ybliang8@gmail.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-05 11:46:51 +09:00
Anton Okolnychyi	0c23a39384	[SPARK-26205][SQL] Optimize InSet Expression for bytes, shorts, ints, dates ## What changes were proposed in this pull request? This PR optimizes `InSet` expressions for byte, short, integer, date types. It is a follow-up on PR #21442 from dbtsai. `In` expressions are compiled into a sequence of if-else statements, which results in O$n$ time complexity. `InSet` is an optimized version of `In`, which is supposed to improve the performance if all values are literals and the number of elements is big enough. However, `InSet` actually worsens the performance in many cases due to various reasons. The main idea of this PR is to use Java `switch` statements to significantly improve the performance of `InSet` expressions for bytes, shorts, ints, dates. All `switch` statements are compiled into `tableswitch` and `lookupswitch` bytecode instructions. We will have O$1$ time complexity if our case values are compact and `tableswitch` can be used. Otherwise, `lookupswitch` will give us O$log n$. Locally, I tried Spark `OpenHashSet` and primitive collections from `fastutils` in order to solve the boxing issue in `InSet`. Both options significantly decreased the memory consumption and `fastutils` improved the time compared to `HashSet` from Scala. However, the switch-based approach was still more than two times faster even on 500+ non-compact elements. I also noticed that applying the switch-based approach on less than 10 elements gives a relatively minor improvement compared to the if-else approach. Therefore, I placed the switch-based logic into `InSet` and added a new config to track when it is applied. Even if we migrate to primitive collections at some point, the switch logic will be still faster unless the number of elements is really big. Another option is to have a separate `InSwitch` expression. However, this would mean we need to modify other places (e.g., `DataSourceStrategy`). See [here](https://docs.oracle.com/javase/specs/jvms/se7/html/jvms-3.html#jvms-3.10) and [here](https://stackoverflow.com/questions/10287700/difference-between-jvms-lookupswitch-and-tableswitch) for more information. This PR does not cover long values as Java `switch` statements cannot be used on them. However, we can have a follow-up PR with an approach similar to binary search. ## How was this patch tested? There are new tests that verify the logic of the proposed optimization. The performance was evaluated using existing benchmarks. This PR was also tested on an EC2 instance (OpenJDK 64-Bit Server VM 1.8.0_191-b12 on Linux 4.14.77-70.59.amzn1.x86_64, Intel(R) Xeon(R) CPU E5-2686 v4 2.30GHz). ## Notes - [This link](http://hg.openjdk.java.net/jdk8/jdk8/langtools/file/30db5e0aaf83/src/share/classes/com/sun/tools/javac/jvm/Gen.java#l1153) contains source code that decides between `tableswitch` and `lookupswitch`. The logic was re-used in the benchmarks. See the `isLookupSwitch` method. Closes #23171 from aokolnychyi/spark-26205. Lead-authored-by: Anton Okolnychyi <aokolnychyi@apple.com> Co-authored-by: Dongjoon Hyun <dhyun@apple.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>	2019-03-04 15:40:04 -08:00
Sean Owen	0deebd3820	[SPARK-26016][DOCS] Clarify that text DataSource read/write, and RDD methods that read text, always use UTF-8 ## What changes were proposed in this pull request? Clarify that text DataSource read/write, and RDD methods that read text, always use UTF-8 as they use Hadoop's implementation underneath. I think these are all the places that this needs a mention in the user-facing docs. ## How was this patch tested? Doc tests. Closes #23962 from srowen/SPARK-26016. Authored-by: Sean Owen <sean.owen@databricks.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>	2019-03-05 08:03:39 +09:00
Yuming Wang	827d371877	[SPARK-25689][FOLLOW-UP][CORE] Get proxy user's delegation tokens ## What changes were proposed in this pull request? This pr makes it get proxy user's delegation token, otherwise throws `AccessControlException`([full log](https://issues.apache.org/jira/browse/SPARK-25689?focusedCommentId=16780609&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-16780609)): ```java org.apache.hadoop.security.AccessControlException: Client cannot authenticate via:[TOKEN, KERBEROS] ... at org.apache.spark.scheduler.cluster.YarnClientSchedulerBackend.waitForApplication(YarnClientSchedulerBackend.scala:95) at org.apache.spark.scheduler.cluster.YarnClientSchedulerBackend.start(YarnClientSchedulerBackend.scala:62) at org.apache.spark.scheduler.TaskSchedulerImpl.start(TaskSchedulerImpl.scala:185) ``` How to reproduce this issue: ```shell $ ssh user_admspark-getaway-host1 $ export HADOOP_PROXY_USER=user_a $ spark-sql --master yarn ``` ## How was this patch tested? Test on our production environment. Closes #23922 from wangyum/SPARK-25689. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>	2019-03-04 13:21:24 -08:00
LantaoJin	e5c502c596	[SPARK-25865][CORE] Add GC information to ExecutorMetrics ## What changes were proposed in this pull request? Only memory usage without GC information could not help us to determinate the proper settings of memory. We need the GC metrics about frequency of major & minor GC. For example, two cases, their configured memory for executor are all 10GB and their usages are all near 10GB. So should we increase or decrease the configured memory for them? This metrics may be helpful. We can increase configured memory for the first one if it has very frequency major GC and decrease the second one if only some minor GC and none major GC. GC metrics are only useful in entire lifetime of executors instead of separated stages. ## How was this patch tested? Adding UT. Closes #22874 from LantaoJin/SPARK-25865. Authored-by: LantaoJin <jinlantao@gmail.com> Signed-off-by: Imran Rashid <irashid@cloudera.com>	2019-03-04 14:26:02 -06:00
“attilapiros”	caceaec932	[SPARK-26688][YARN] Provide configuration of initially blacklisted YARN nodes ## What changes were proposed in this pull request? Introducing new config for initially blacklisted YARN nodes. ## How was this patch tested? With existing and a new unit test. Closes #23616 from attilapiros/SPARK-26688. Lead-authored-by: “attilapiros” <piros.attila.zsolt@gmail.com> Co-authored-by: Attila Zsolt Piros <2017933+attilapiros@users.noreply.github.com> Signed-off-by: Imran Rashid <irashid@cloudera.com>	2019-03-04 14:14:20 -06:00

... 2 3 4 5 6 ...

24037 commits