spark-instrumented-optimizer

Wang Gengliang 96ba217a06 [SPARK-23005][CORE] Improve RDD.take on small number of partitions	2018-01-10 10:15:27 +08:00

## What changes were proposed in this pull request?

In the current implementation of RDD.take, we overestimate the number of partitions we need to try by 50%:
`(1.5 * num * partsScanned / buf.size).toInt`
However, when the number is small, the result of `.toInt` is not what we want. E.g., 2.9 will become 2, when it should be 3. Use Math.ceil to fix the problem.

Also clean up the code in RDD.scala.

## How was this patch tested?

Unit test

Author: Wang Gengliang <ltnwgl@gmail.com>

Closes #20200 from gengliangwang/Take.
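The rounding problem described in the commit message can be reproduced in isolation. The following is a minimal sketch, not the actual code in RDD.scala: the object name `TakeEstimate`, the method names, and the sample inputs are hypothetical, and `bufSize` stands in for `buf.size` and is assumed to be greater than zero.

```scala
object TakeEstimate {
  // Estimate as quoted in the commit message: .toInt truncates toward zero.
  def truncated(num: Int, partsScanned: Int, bufSize: Int): Int =
    (1.5 * num * partsScanned / bufSize).toInt

  // Same expression rounded up with Math.ceil, as the commit proposes.
  def ceiled(num: Int, partsScanned: Int, bufSize: Int): Int =
    Math.ceil(1.5 * num * partsScanned / bufSize).toInt

  def main(args: Array[String]): Unit = {
    // Hypothetical inputs chosen so the raw estimate is about 2.9.
    val (num, partsScanned, bufSize) = (29, 1, 15)
    println(truncated(num, partsScanned, bufSize)) // 2
    println(ceiled(num, partsScanned, bufSize))    // 3
  }
}
```

With these inputs the raw estimate is roughly 2.9, so truncation asks for 2 partitions while rounding up asks for 3, which matches the example given in the commit message.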
benchmarks	[SPARK-17335][SQL] Fix ArrayType and MapType CatalogString.	2016-09-03 19:02:20 +02:00
src	[SPARK-23005][CORE] Improve RDD.take on small number of partitions	2018-01-10 10:15:27 +08:00
pom.xml	[SPARK-22516][SQL] Bump up Univocity version to 2.5.9	2017-12-06 13:22:08 -08:00