spark-instrumented-optimizer

History

Adrian Ionescu 703c42c398 [SPARK-20194] Add support for partition pruning to in-memory catalog ## What changes were proposed in this pull request? This patch implements `listPartitionsByFilter()` for `InMemoryCatalog` and thus resolves an outstanding TODO causing the `PruneFileSourcePartitions` optimizer rule not to apply when "spark.sql.catalogImplementation" is set to "in-memory" (which is the default). The change is straightforward: it extracts the code for further filtering of the list of partitions returned by the metastore's `getPartitionsByFilter()` out from `HiveExternalCatalog` into `ExternalCatalogUtils` and calls this new function from `InMemoryCatalog` on the whole list of partitions. Now that this method is implemented we can always pass the `CatalogTable` to the `DataSource` in `FindDataSourceTable`, so that the latter is resolved to a relation with a `CatalogFileIndex`, which is what the `PruneFileSourcePartitions` rule matches for. ## How was this patch tested? Ran existing tests and added new test for `listPartitionsByFilter` in `ExternalCatalogSuite`, which is subclassed by both `InMemoryCatalogSuite` and `HiveExternalCatalogSuite`. Author: Adrian Ionescu <adrian@databricks.com> Closes #17510 from adrian-ionescu/InMemoryCatalog.		2017-04-03 08:48:49 -07:00
..
compatibility/src/test/scala/org/apache/spark/sql/hive/execution	[SPARK-20126][SQL] Remove HiveSessionState	2017-03-28 23:14:31 +08:00
src	[SPARK-20194] Add support for partition pruning to in-memory catalog	2017-04-03 08:48:49 -07:00
pom.xml	[SPARK-19550][BUILD][CORE][WIP] Remove Java 7 support	2017-02-16 12:32:45 +00:00