spark-instrumented-optimizer

History

Max Gekk 514172aae7 [SPARK-34074][SQL][TESTS][FOLLOWUP] Fix table size parsing from statistics	2021-01-19 09:51:20 +09:00

### What changes were proposed in this pull request?
Fix table size parsing from the `Statistics` field, which is formed at: `c3d81fbe79/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala (L573)`. Before the fix, `getTableSize()` returned only the last digit of the size. This worked for the Hive table in the tests because its size is < 10 bytes, and it accidentally worked for the V1 In-Memory catalog table in the tests.

### Why are the changes needed?
This makes the tests more reliable. For example, the parsing cannot work in `AlterTableDropPartitionSuite` when the table size before partition dropping is:
```
+---------+
|data_type|
+---------+
|878 bytes|
+---------+
```
After:
```
+---------+
|data_type|
+---------+
|439 bytes|
+---------+
```
at:
```scala
val onePartSize = getTableSize(t)
assert(0 < onePartSize && onePartSize < twoPartSize)
```

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
By existing test suites:
```
$ build/sbt -Phive-2.3 -Phive-thriftserver "test:testOnly .AlterTableAddPartitionSuite"
$ build/sbt -Phive-2.3 -Phive-thriftserver "test:testOnly .AlterTableDropPartitionSuite"
```

Closes #31237 from MaxGekk/optimize-updateTableStats-followup.

Authored-by: Max Gekk <max.gekk@gmail.com>
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
benchmarks	[SPARK-33523][SQL][TEST][FOLLOWUP] Fix benchmark case name in SubExprEliminationBenchmark	2020-11-25 15:22:47 -08:00
src	[SPARK-34074][SQL][TESTS][FOLLOWUP] Fix table size parsing from statistics	2021-01-19 09:51:20 +09:00
pom.xml	[SPARK-33662][BUILD] Setting version to 3.2.0-SNAPSHOT	2020-12-04 14:10:42 -08:00
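
The "last digit only" failure described in the latest commit can be reproduced in isolation. The following is a minimal sketch, not the actual Spark test helper: the object name `TableSizeParsing`, the functions `lastDigitSize` and `wholeSize`, and both regular expressions are hypothetical, chosen only to show how a greedy prefix in front of a single-digit capture group yields the symptom the commit message reports, and how capturing the full run of digits avoids it.

```scala
object TableSizeParsing {
  // Hypothetical pre-fix pattern (an assumption, not copied from Spark):
  // the greedy `.*` consumes every digit except the last one.
  private val lastDigitOnly = """.*(\d) bytes.*""".r

  // Hypothetical post-fix pattern (also an assumption): capture all
  // leading digits, then tolerate any trailing text such as a row count.
  private val wholeNumber = """^(\d+) bytes.*$""".r

  /** Mimics the reported bug: yields only the last digit of the size. */
  def lastDigitSize(stats: String): Option[Int] = stats match {
    case lastDigitOnly(d) => Some(d.toInt)
    case _                => None
  }

  /** Parses the complete size in bytes from a statistics string. */
  def wholeSize(stats: String): Option[Int] = stats match {
    case wholeNumber(n) => Some(n.toInt)
    case _              => None
  }
}
```

With the sizes quoted in the commit message, `lastDigitSize("878 bytes")` gives `Some(8)` and `lastDigitSize("439 bytes")` gives `Some(9)`, so a check of the form `onePartSize < twoPartSize` fails even though 439 < 878. `wholeSize` returns `Some(878)` and `Some(439)`, which preserves the expected ordering.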