spark-instrumented-optimizer/yarn
Andrew Or 83e0424d87 [SPARK-1774] Respect SparkSubmit --jars on YARN (client)
SparkSubmit ignores `--jars` for YARN client. This is a bug.

This PR also automatically adds the application jar to `spark.jar`. Previously, when running as yarn-client, you must specify the jar additionally through `--files` (because `--jars` didn't work). Now you don't have to explicitly specify it through either.

Tested on a YARN cluster.

Author: Andrew Or <andrewor14@gmail.com>

Closes #710 from andrewor14/yarn-jars and squashes the following commits:

35d1928 [Andrew Or] Merge branch 'master' of github.com:apache/spark into yarn-jars
c27bf6c [Andrew Or] For yarn-cluster and python, do not add primaryResource to spark.jar
c92c5bf [Andrew Or] Minor cleanups
269f9f3 [Andrew Or] Fix format
013d840 [Andrew Or] Fix tests
1407474 [Andrew Or] Merge branch 'master' of github.com:apache/spark into yarn-jars
3bb75e8 [Andrew Or] Allow SparkSubmit --jars to take effect in yarn-client mode
2014-05-10 20:58:02 -07:00
..
alpha [WIP] SPARK-1676: Cache Hadoop UGIs by default to prevent FileSystem leak 2014-05-03 10:59:05 -07:00
common/src [SPARK-1774] Respect SparkSubmit --jars on YARN (client) 2014-05-10 20:58:02 -07:00
stable [WIP] SPARK-1676: Cache Hadoop UGIs by default to prevent FileSystem leak 2014-05-03 10:59:05 -07:00
pom.xml SPARK-1474: Spark on yarn assembly doesn't include AmIpFilter 2014-05-06 12:00:09 -07:00
README.md fix docs for yarn 2014-01-03 14:14:35 +08:00

YARN DIRECTORY LAYOUT

Hadoop Yarn related codes are organized in separate directories to minimize duplicated code.

  • common : Common codes that do not depending on specific version of Hadoop.

  • alpha / stable : Codes that involve specific version of Hadoop YARN API.

alpha represents 0.23 and 2.0.x stable represents 2.2 and later, until the API changes again.

alpha / stable will build together with common dir into a single jar