spark-instrumented-optimizer/core/src/main/scala/spark/Partitioner.scala

package spark

/**
 * An object that defines how the elements in a key-value pair RDD are partitioned by key.
 * Maps each key to a partition ID, from 0 to `numPartitions - 1`.
 */
abstract class Partitioner extends Serializable {
  def numPartitions: Int
  def getPartition(key: Any): Int
}

object Partitioner {
  /**
   * Choose a partitioner to use for a cogroup-like operation between a number of RDDs.
   *
   * If any of the RDDs already has a partitioner, choose that one; if several do, prefer
   * the partitioner of the RDD with the most partitions.
   *
   * Otherwise, we use a default HashPartitioner. For the number of partitions, if
   * spark.default.parallelism is set, then we'll use the value from SparkContext
   * defaultParallelism, otherwise we'll use the max number of upstream partitions.
   *
   * Unless spark.default.parallelism is set, the number of partitions will be the
   * same as the number of partitions in the largest upstream RDD, as this should
   * be least likely to cause out-of-memory errors.
   *
   * We use two method parameters (rdd, others) to enforce callers passing at least 1 RDD.
   */
  def defaultPartitioner(rdd: RDD[_], others: RDD[_]*): Partitioner = {
    val bySize = (Seq(rdd) ++ others).sortBy(_.partitions.size).reverse
    // Prefer the partitioner of the largest RDD that already has one.
    bySize.find(_.partitioner.isDefined).map(_.partitioner.get).getOrElse {
      // Read the property on every call, in case the user changes it between uses.
      if (System.getProperty("spark.default.parallelism") != null) {
        new HashPartitioner(rdd.context.defaultParallelism)
      } else {
        new HashPartitioner(bySize.head.partitions.size)
      }
    }
  }
}

/**
 * A [[spark.Partitioner]] that implements hash-based partitioning using Java's `Object.hashCode`.
 *
 * Java arrays have hashCodes that are based on the arrays' identities rather than their contents,
 * so attempting to partition an RDD[Array[_]] or RDD[(Array[_], _)] using a HashPartitioner will
 * produce an unexpected or incorrect result.
 */
class HashPartitioner(partitions: Int) extends Partitioner {
  def numPartitions = partitions

  def getPartition(key: Any): Int = {
    if (key == null) {
      0
    } else {
      val mod = key.hashCode % partitions
      if (mod < 0) {
        mod + partitions // Guard against negative hash codes
      } else {
        mod
      }
    }
  }
  
  override def equals(other: Any): Boolean = other match {
    case h: HashPartitioner =>
      h.numPartitions == numPartitions
    case _ =>
      false
  }
}

/**
 * A [[spark.Partitioner]] that partitions sortable records by range into roughly equal ranges.
 * Determines the ranges by sampling the RDD passed in.
 */
class RangePartitioner[K <% Ordered[K]: ClassManifest, V](
    partitions: Int,
    @transient rdd: RDD[(K, V)],
    private val ascending: Boolean = true)
  extends Partitioner {

  // An array of upper bounds for the first (partitions - 1) partitions
  private val rangeBounds: Array[K] = {
    if (partitions == 1) {
      Array()
    } else {
      val rddSize = rdd.count()
      val maxSampleSize = partitions * 20.0
      val frac = math.min(maxSampleSize / math.max(rddSize, 1), 1.0)
      val rddSample = rdd.sample(false, frac, 1).map(_._1).collect().sortWith(_ < _)
      if (rddSample.length == 0) {
        Array()
      } else {
        val bounds = new Array[K](partitions - 1)
        for (i <- 0 until partitions - 1) {
          val index = (rddSample.length - 1) * (i + 1) / partitions
          bounds(i) = rddSample(index)
        }
        bounds
      }
    }
  }

  def numPartitions = partitions

  def getPartition(key: Any): Int = {
    // TODO: Use a binary search here if number of partitions is large
    val k = key.asInstanceOf[K]
    var partition = 0
    while (partition < rangeBounds.length && k > rangeBounds(partition)) {
      partition += 1
    }
    if (ascending) {
      partition
    } else {
      rangeBounds.length - partition
    }
  }

  override def equals(other: Any): Boolean = other match {
    case r: RangePartitioner[_,_] =>
      r.rangeBounds.sameElements(rangeBounds) && r.ascending == ascending
    case _ =>
      false
  }
}
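
The hash-based key-to-partition mapping in HashPartitioner above can be tried without a Spark installation. The following is a minimal sketch, not the library's code: `HashPartitionSketch` and `partitionFor` are hypothetical names introduced only for illustration. It shows why the negative-remainder guard is needed and why array keys behave unexpectedly.

```scala
object HashPartitionSketch {
  // Hypothetical stand-in for HashPartitioner.getPartition; not part of Spark.
  def partitionFor(key: Any, numPartitions: Int): Int = {
    if (key == null) {
      0
    } else {
      val mod = key.hashCode % numPartitions
      // On the JVM, % keeps the sign of the dividend, so a negative hash code
      // gives a negative remainder that must be shifted into [0, numPartitions).
      if (mod < 0) mod + numPartitions else mod
    }
  }

  def main(args: Array[String]): Unit = {
    // The hash code of the Int -7 is -7, and -7 % 4 == -3, so the guard yields 1.
    println(partitionFor(-7, 4))   // prints 1
    println(partitionFor(null, 4)) // prints 0

    // Arrays hash by identity, so two arrays with equal contents will almost
    // always report different hash codes and may land in different partitions.
    val a = Array(1, 2, 3)
    val b = Array(1, 2, 3)
    println(a.hashCode == b.hashCode)

    // A Seq hashes by content, so converting the key first gives a stable result.
    println(a.toSeq.hashCode == b.toSeq.hashCode) // prints true
  }
}
```

Converting array keys to a content-hashed collection such as a Seq before partitioning is one way to avoid the problem described in the HashPartitioner documentation; whether the copy is acceptable depends on the workload.
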
More work on new RDD design 2011-02-27 22:15:52 -05:00			`package spark`

More doc updates, and moved Serializer to a subpackage. 2012-10-12 21:19:21 -04:00			`/**`
			`* An object that defines how the elements in a key-value pair RDD are partitioned by key.`
			* Maps each key to a partition ID, from 0 to `numPartitions - 1`.
			`*/`
Fix issue #65: Change @serializable to extends Serializable in 2.9 branch Note that we use scala.Serializable introduced in Scala 2.9 instead of java.io.Serializable. Also, case classes inherit from scala.Serializable by default. 2011-08-02 05:16:33 -04:00			`abstract class Partitioner extends Serializable {`
More work on new RDD design 2011-02-27 22:15:52 -05:00			`def numPartitions: Int`
Finished cogroup stuff 2011-03-07 02:38:16 -05:00			`def getPartition(key: Any): Int`
More work on new RDD design 2011-02-27 22:15:52 -05:00			`}`

Update default.parallelism docs, have StandaloneSchedulerBackend use it. Only brand new RDDs (e.g. parallelize and makeRDD) now use default parallelism, everything else uses their largest parent's partitioner or partition size. 2013-02-16 01:29:11 -05:00			`object Partitioner {`
			`/**`
Use default parallelism if its set. 2013-02-25 00:54:03 -05:00			`* Choose a partitioner to use for a cogroup-like operation between a number of RDDs.`
			`*`
			`* If any of the RDDs already has a partitioner, choose that one.`
Update default.parallelism docs, have StandaloneSchedulerBackend use it. Only brand new RDDs (e.g. parallelize and makeRDD) now use default parallelism, everything else uses their largest parent's partitioner or partition size. 2013-02-16 01:29:11 -05:00			`*`
Use default parallelism if its set. 2013-02-25 00:54:03 -05:00			`* Otherwise, we use a default HashPartitioner. For the number of partitions, if`
			`* spark.default.parallelism is set, then we'll use the value from SparkContext`
			`* defaultParallelism, otherwise we'll use the max number of upstream partitions.`
			`*`
			`* Unless spark.default.parallelism is set, the number of partitions will be the`
			`* same as the number of partitions in the largest upstream RDD, as this should`
			`* be least likely to cause out-of-memory errors.`
Update default.parallelism docs, have StandaloneSchedulerBackend use it. Only brand new RDDs (e.g. parallelize and makeRDD) now use default parallelism, everything else uses their largest parent's partitioner or partition size. 2013-02-16 01:29:11 -05:00			`*`
			`* We use two method parameters (rdd, others) to enforce callers passing at least 1 RDD.`
			`*/`
			`def defaultPartitioner(rdd: RDD[_], others: RDD[_]*): Partitioner = {`
Merge branch 'master' into bettersplits Conflicts: core/src/main/scala/spark/RDD.scala core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala core/src/test/scala/spark/ShuffleSuite.scala 2013-02-24 23:08:14 -05:00			`val bySize = (Seq(rdd) ++ others).sortBy(_.partitions.size).reverse`
Update default.parallelism docs, have StandaloneSchedulerBackend use it. Only brand new RDDs (e.g. parallelize and makeRDD) now use default parallelism, everything else uses their largest parent's partitioner or partition size. 2013-02-16 01:29:11 -05:00			`for (r <- bySize if r.partitioner != None) {`
			`return r.partitioner.get`
			`}`
Get spark.default.paralellism on each call to defaultPartitioner, instead of only once, in case the user changes it across Spark uses 2013-02-25 13:28:08 -05:00			`if (System.getProperty("spark.default.parallelism") != null) {`
Use default parallelism if its set. 2013-02-25 00:54:03 -05:00			`return new HashPartitioner(rdd.context.defaultParallelism)`
			`} else {`
			`return new HashPartitioner(bySize.head.partitions.size)`
			`}`
Update default.parallelism docs, have StandaloneSchedulerBackend use it. Only brand new RDDs (e.g. parallelize and makeRDD) now use default parallelism, everything else uses their largest parent's partitioner or partition size. 2013-02-16 01:29:11 -05:00			`}`
			`}`

More doc updates, and moved Serializer to a subpackage. 2012-10-12 21:19:21 -04:00			`/**`
			* A [[spark.Partitioner]] that implements hash-based partitioning using Java's `Object.hashCode`.
Raise exception when hashing Java arrays (SPARK-597) 2012-12-30 15:43:06 -05:00			`*`
			`* Java arrays have hashCodes that are based on the arrays' identities rather than their contents,`
			`* so attempting to partition an RDD[Array[_]] or RDD[(Array[_], _)] using a HashPartitioner will`
			`* produce an unexpected or incorrect result.`
More doc updates, and moved Serializer to a subpackage. 2012-10-12 21:19:21 -04:00			`*/`
Finished cogroup stuff 2011-03-07 02:38:16 -05:00			`class HashPartitioner(partitions: Int) extends Partitioner {`
More work on new RDD design 2011-02-27 22:15:52 -05:00			`def numPartitions = partitions`

Allow null keys in Spark's reduce and group by 2012-07-12 21:36:02 -04:00			`def getPartition(key: Any): Int = {`
			`if (key == null) {`
			`return 0`
Code format. 2012-02-10 11:19:53 -05:00			`} else {`
Allow null keys in Spark's reduce and group by 2012-07-12 21:36:02 -04:00			`val mod = key.hashCode % partitions`
			`if (mod < 0) {`
			`mod + partitions`
			`} else {`
			`mod // Guard against negative hash codes`
			`}`
Code format. 2012-02-10 11:19:53 -05:00			`}`
More work on new RDD design 2011-02-27 22:15:52 -05:00			`}`

			`override def equals(other: Any): Boolean = other match {`
Finished cogroup stuff 2011-03-07 02:38:16 -05:00			`case h: HashPartitioner =>`
More work on new RDD design 2011-02-27 22:15:52 -05:00			`h.numPartitions == numPartitions`
Code format. 2012-02-10 11:19:53 -05:00			`case _ =>`
			`false`
More work on new RDD design 2011-02-27 22:15:52 -05:00			`}`
Added sorting by key for pair RDDs 2012-02-11 03:56:28 -05:00			`}`

More doc updates, and moved Serializer to a subpackage. 2012-10-12 21:19:21 -04:00			`/**`
			`* A [[spark.Partitioner]] that partitions sortable records by range into roughly equal ranges.`
			`* Determines the ranges by sampling the RDD passed in.`
			`*/`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`class RangePartitioner[K <% Ordered[K]: ClassManifest, V](`
Performance improvements to shuffle operations: in particular, preserve RDD partitioning in more cases where it's possible, and use iterators instead of materializing collections when doing joins. 2012-06-09 17:44:18 -04:00			`partitions: Int,`
			`@transient rdd: RDD[(K,V)],`
			`private val ascending: Boolean = true)`
Added sorting by key for pair RDDs 2012-02-11 03:56:28 -05:00			`extends Partitioner {`

Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`// An array of upper bounds for the first (partitions - 1) partitions`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`private val rangeBounds: Array[K] = {`
Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`if (partitions == 1) {`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`Array()`
			`} else {`
Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`val rddSize = rdd.count()`
Fixed a test that was getting extremely lucky before, and increased the number of samples used for sorting 2012-09-26 03:25:34 -04:00			`val maxSampleSize = partitions * 20.0`
Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`val frac = math.min(maxSampleSize / math.max(rddSize, 1), 1.0)`
Fixed a test that was getting extremely lucky before, and increased the number of samples used for sorting 2012-09-26 03:25:34 -04:00			`val rddSample = rdd.sample(false, frac, 1).map(_._1).collect().sortWith(_ < _)`
Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`if (rddSample.length == 0) {`
			`Array()`
			`} else {`
			`val bounds = new Array[K](partitions - 1)`
			`for (i <- 0 until partitions - 1) {`
			`val index = (rddSample.length - 1) * (i + 1) / partitions`
			`bounds(i) = rddSample(index)`
			`}`
			`bounds`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`}`
			`}`
			`}`

Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`def numPartitions = partitions`
Added sorting by key for pair RDDs 2012-02-11 03:56:28 -05:00
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`def getPartition(key: Any): Int = {`
			`// TODO: Use a binary search here if number of partitions is large`
Added fixes to sorting 2012-02-13 03:07:39 -05:00			`val k = key.asInstanceOf[K]`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`var partition = 0`
Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`while (partition < rangeBounds.length && k > rangeBounds(partition)) {`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`partition += 1`
			`}`
			`if (ascending) {`
			`partition`
			`} else {`
Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries. 2012-08-03 16:37:35 -04:00			`rangeBounds.length - partition`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`}`
Added sorting by key for pair RDDs 2012-02-11 03:56:28 -05:00			`}`

			`override def equals(other: Any): Boolean = other match {`
Added fixes to sorting 2012-02-13 03:07:39 -05:00			`case r: RangePartitioner[_,_] =>`
Performance improvements to shuffle operations: in particular, preserve RDD partitioning in more cases where it's possible, and use iterators instead of materializing collections when doing joins. 2012-06-09 17:44:18 -04:00			`r.rangeBounds.sameElements(rangeBounds) && r.ascending == ascending`
Some fixes to sorting for when the RDD has fewer elements than the number of partitions we ask to partition it into. Also, removed a test that was taking way too long to run. 2012-03-17 16:08:36 -04:00			`case _ =>`
			`false`
Added sorting by key for pair RDDs 2012-02-11 03:56:28 -05:00			`}`
			`}`
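
As an illustration of how RangePartitioner picks its boundaries and looks up a key, here is a minimal sketch that works on a plain sorted array of Int keys instead of an RDD sample. `RangeBoundsSketch`, `bounds`, and `partitionFor` are hypothetical names used only for this example; the index arithmetic and the linear scan follow the rangeBounds and getPartition code in the file.

```scala
object RangeBoundsSketch {
  // Choose (partitions - 1) upper bounds from an already sorted sample,
  // using the same index formula as rangeBounds.
  def bounds(sortedSample: Array[Int], partitions: Int): Array[Int] = {
    if (partitions == 1 || sortedSample.isEmpty) {
      Array.empty[Int]
    } else {
      Array.tabulate(partitions - 1) { i =>
        sortedSample((sortedSample.length - 1) * (i + 1) / partitions)
      }
    }
  }

  // Linear scan over the bounds, mirroring getPartition. For descending
  // order the partition index is reversed.
  def partitionFor(key: Int, bounds: Array[Int], ascending: Boolean = true): Int = {
    var partition = 0
    while (partition < bounds.length && key > bounds(partition)) {
      partition += 1
    }
    if (ascending) partition else bounds.length - partition
  }

  def main(args: Array[String]): Unit = {
    val sample = (0 to 9).toArray
    val b = bounds(sample, 3)
    println(b.mkString(", "))                          // prints 3, 6
    println(partitionFor(3, b))                        // prints 0 (a key equal to a bound stays below it)
    println(partitionFor(4, b))                        // prints 1
    println(partitionFor(9, b))                        // prints 2
    println(partitionFor(9, b, ascending = false))     // prints 0
  }
}
```

Because each bound is an inclusive upper limit, a key equal to a bound is assigned to the lower partition. The linear scan costs time proportional to the number of partitions, which is the cost the TODO about binary search refers to.
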