54c5087a3a
### What changes were proposed in this pull request?

When implementing a ScanBuilder, we require the implementor to provide the schema of the data and the number of partitions. However, when someone is implementing a WriteBuilder we only pass them the schema, not the number of partitions. This is an asymmetrical developer experience.

This PR adds a PhysicalWriteInfo interface that is passed to createBatchWriterFactory and createStreamingWriterFactory and that carries the number of partitions of the data that is going to be written.

### Why are the changes needed?

Passing the number of partitions to the WriteBuilder would enable data sources to provision their write targets before starting to write. For example:

- it could be used to provision a Kafka topic with a specific number of partitions
- it could be used to scale a microservice prior to sending the data to it
- it could be used to create a DSv2 that sends the data to another Spark cluster (currently not possible since the reader wouldn't be able to know the number of partitions)

### Does this PR introduce any user-facing change?

No

### How was this patch tested?

Tests passed

Closes #26591 from edrevo/temp.

Authored-by: Ximo Guanter <joaquin.guantergonzalbez@telefonica.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
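The shape of the proposed API can be sketched as follows. This is a minimal standalone sketch, not the actual Spark source: only the names `PhysicalWriteInfo`, `numPartitions`, and `createBatchWriterFactory` come from the PR description; the simplified `DataWriterFactory` and `BatchWrite` stand-ins and the `ProvisioningWrite` example are illustrative assumptions.

```java
// Minimal standalone sketch of the proposed API shape (not the actual
// Spark source). PhysicalWriteInfo carries the number of input partitions
// so a data source can provision its write target before writing begins.
interface PhysicalWriteInfo {
    int numPartitions();
}

// Stand-in for the real writer-factory interface.
interface DataWriterFactory {}

interface BatchWrite {
    // The factory method now receives PhysicalWriteInfo instead of
    // only being given the schema earlier in the builder chain.
    DataWriterFactory createBatchWriterFactory(PhysicalWriteInfo info);
}

// Hypothetical example: a sink that provisions a Kafka topic with one
// topic partition per Spark partition before the write starts.
class ProvisioningWrite implements BatchWrite {
    @Override
    public DataWriterFactory createBatchWriterFactory(PhysicalWriteInfo info) {
        System.out.println("provisioning topic with "
                + info.numPartitions() + " partitions");
        return new DataWriterFactory() {};
    }
}
```

Because the partition count is available before any writer task runs, the provisioning step happens exactly once on the driver side, rather than being inferred (or impossible to perform) inside individual writer tasks.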