spark-instrumented-optimizer/project
Colin Patrick Mccabe 6e27cb630d SPARK-1767: Prefer HDFS-cached replicas when scheduling data-local tasks
This change reorders the replicas returned by
HadoopRDD#getPreferredLocations so that replicas cached by HDFS are at
the start of the list.  This requires Hadoop 2.5 or higher; previous
versions of Hadoop do not expose the information needed to determine
whether a replica is cached.

Author: Colin Patrick Mccabe <cmccabe@cloudera.com>

Closes #1486 from cmccabe/SPARK-1767 and squashes the following commits:

338d4f8 [Colin Patrick Mccabe] SPARK-1767: Prefer HDFS-cached replicas when scheduling data-local tasks
2014-10-02 00:29:31 -07:00
..
project [SPARK-1776] Have Spark's SBT build read dependencies from Maven. 2014-07-10 11:03:37 -07:00
spark-style/src/main/scala/org/apache/spark/scalastyle [SPARK-2182] Scalastyle rule blocking non ascii characters. 2014-09-16 09:21:03 -07:00
build.properties [SPARK-1776] Have Spark's SBT build read dependencies from Maven. 2014-07-10 11:03:37 -07:00
MimaBuild.scala [SPARK-3433][BUILD] Fix for Mima false-positives with @DeveloperAPI and @Experimental annotations. 2014-09-15 21:14:00 -07:00
MimaExcludes.scala SPARK-1767: Prefer HDFS-cached replicas when scheduling data-local tasks 2014-10-02 00:29:31 -07:00
plugins.sbt [SPARK-2848] Shade Guava in uber-jars. 2014-08-20 16:23:10 -07:00
SparkBuild.scala SPARK-3699: SQL and Hive console tasks now clean up appropriately 2014-09-28 01:01:27 -07:00