ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Patrick Wendell	e549874c33	Preparing development version 1.4.0-SNAPSHOT	2015-05-29 13:07:07 -07:00
Patrick Wendell	dd109a8746	Preparing Spark release v1.4.0-rc3	2015-05-29 13:06:59 -07:00
Patrick Wendell	c68abaa34e	Preparing development version 1.4.0-SNAPSHOT	2015-05-29 12:15:18 -07:00
Patrick Wendell	fb60503ff2	Preparing Spark release v1.4.0-rc3	2015-05-29 12:15:13 -07:00
Patrick Wendell	6bf5a42084	Preparing development version 1.4.0-SNAPSHOT	2015-05-28 23:40:27 -07:00
Patrick Wendell	f2796816be	Preparing Spark release v1.4.0-rc3	2015-05-28 23:40:22 -07:00
Patrick Wendell	119c93af9c	Preparing development version 1.4.0-SNAPSHOT	2015-05-28 22:57:31 -07:00
Patrick Wendell	2d97d7a0aa	Preparing Spark release v1.4.0-rc3	2015-05-28 22:57:26 -07:00
Patrick Wendell	7c342bdd93	Preparing development version 1.4.0-SNAPSHOT	2015-05-27 22:36:30 -07:00
Patrick Wendell	4983dfc878	Preparing Spark release v1.4.0-rc3	2015-05-27 22:36:23 -07:00
Patrick Wendell	947d700ec8	Preparing development version 1.4.0-SNAPSHOT	2015-05-23 20:13:05 -07:00
Patrick Wendell	03fb26a3e5	Preparing Spark release v1.4.0-rc2	2015-05-23 20:13:00 -07:00
Patrick Wendell	f2f74b9b1a	Preparing development version 1.4.1-SNAPSHOT	2015-05-23 14:59:37 -07:00
Patrick Wendell	0da7396990	Preparing Spark release v1.4.0-rc2-test	2015-05-23 14:59:31 -07:00
Patrick Wendell	8da8caab17	Preparing development version 1.4.1-SNAPSHOT	2015-05-23 14:46:27 -07:00
Patrick Wendell	8f50218f38	Preparing Spark release 1.4.0-rc2-test	2015-05-23 14:46:23 -07:00
Patrick Wendell	9b37e32c55	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 17:29:00 -07:00
Patrick Wendell	1e458e3553	Preparing Spark release rc-test	2015-05-20 17:28:55 -07:00
pwendell	8d66849862	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 17:26:15 -07:00
pwendell	ae29aeaf8e	Preparing Spark release rc-test	2015-05-20 17:26:10 -07:00
jenkins	534c787b9f	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 16:49:59 -07:00
jenkins	5f4d87f608	Preparing Spark release rc-test	2015-05-20 16:49:54 -07:00
Patrick Wendell	205ed15f29	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 16:30:01 -07:00
Patrick Wendell	09a1c6231e	Preparing Spark release rc-test	2015-05-20 16:29:52 -07:00
Patrick Wendell	ac3197e1b9	Preparing development version 1.4.1-SNAPSHOT	2015-05-19 09:35:12 +00:00
Patrick Wendell	777a08166f	Preparing Spark release v1.4.0-rc1	2015-05-19 09:35:12 +00:00
Patrick Wendell	586ede6b32	Revert "Preparing Spark release v1.4.0-rc1" This reverts commit `79fb01a3be`.	2015-05-19 02:27:14 -07:00
Patrick Wendell	e7309ec729	Revert "Preparing development version 1.4.1-SNAPSHOT" This reverts commit `a1d896b85b`.	2015-05-19 02:27:07 -07:00
Patrick Wendell	a1d896b85b	Preparing development version 1.4.1-SNAPSHOT	2015-05-19 07:13:24 +00:00
Patrick Wendell	79fb01a3be	Preparing Spark release v1.4.0-rc1	2015-05-19 07:13:24 +00:00
Patrick Wendell	b0c63d2413	Revert "Preparing Spark release v1.4.0-rc1" This reverts commit `38ccef36c1`.	2015-05-19 00:10:39 -07:00
Patrick Wendell	198a186ad3	Revert "Preparing development version 1.4.1-SNAPSHOT" This reverts commit `40190ce226`.	2015-05-19 00:10:37 -07:00
Patrick Wendell	40190ce226	Preparing development version 1.4.1-SNAPSHOT	2015-05-19 06:06:41 +00:00
Patrick Wendell	38ccef36c1	Preparing Spark release v1.4.0-rc1	2015-05-19 06:06:40 +00:00
Patrick Wendell	152b0291c0	Revert "Preparing Spark release v1.4.0-rc1" This reverts commit `e8e97e3a63`.	2015-05-18 23:06:15 -07:00
Patrick Wendell	4d098bc049	Revert "Preparing development version 1.4.1-SNAPSHOT" This reverts commit `758ca74bab`.	2015-05-18 23:06:13 -07:00
Patrick Wendell	758ca74bab	Preparing development version 1.4.1-SNAPSHOT	2015-05-19 05:01:11 +00:00
Patrick Wendell	e8e97e3a63	Preparing Spark release v1.4.0-rc1	2015-05-19 05:01:11 +00:00
Marcelo Vanzin	afe54b76a6	[SPARK-7485] [BUILD] Remove pyspark files from assembly. The sbt part of the build is hacky; it basically tricks sbt into generating the zip by using a generator, but returns an empty list for the generated files so that nothing is actually added to the assembly. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #6022 from vanzin/SPARK-7485 and squashes the following commits: 22c1e04 [Marcelo Vanzin] Remove unneeded code. 4893622 [Marcelo Vanzin] [SPARK-7485] [build] Remove pyspark files from assembly. (cherry picked from commit `82e890fb19`) Signed-off-by: Andrew Or <andrew@databricks.com>	2015-05-12 01:39:28 -07:00
Steve Loughran	ee11be2582	SPARK-6433 hive tests to import spark-sql test JAR for QueryTest access 1. Test JARs are built & published 1. log4j.resources is explicitly excluded. Without this, downstream test run logging depends on the order the JARs are listed/loaded 1. sql/hive pulls in spark-sql &...spark-catalyst for its test runs 1. The copied in test classes were rm'd, and a test edited to remove its now duplicate assert method 1. Spark streaming is now build with the same plugin/phase as the rest, but its shade plugin declaration is kept in (so different from the rest of the test plugins). Due to (#2), this means the test JAR no longer includes its log4j file. Outstanding issues: * should the JARs be shaded? `spark-streaming-test.jar` does, but given these are test jars for developers only, especially in the same spark source tree, it's hard to justify. * `maven-jar-plugin` v 2.6 was explicitly selected; without this the apache-1.4 parent template JAR version (2.4) chosen. * Are there any other resources to exclude? Author: Steve Loughran <stevel@hortonworks.com> Closes #5119 from steveloughran/stevel/patches/SPARK-6433-test-jars and squashes the following commits: 81ceb01 [Steve Loughran] SPARK-6433 add a clearer comment explaining what the plugin is doing & why a6dca33 [Steve Loughran] SPARK-6433 : pull configuration section form archive plugin c2b5f89 [Steve Loughran] SPARK-6433 omit "jar" goal from jar plugin fdac51b [Steve Loughran] SPARK-6433 -002; indentation & delegate plugin version to parent 650f442 [Steve Loughran] SPARK-6433 patch 001: test JARs are built; sql/hive pulls in spark-sql & spark-catalyst for its test runs	2015-04-01 16:26:54 +01:00
Marcelo Vanzin	a74564591f	[SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #5056 from vanzin/SPARK-6371 and squashes the following commits: 63220df [Marcelo Vanzin] Merge branch 'master' into SPARK-6371 6506f75 [Marcelo Vanzin] Use more fine-grained exclusion. 178ba71 [Marcelo Vanzin] Oops. 75b2375 [Marcelo Vanzin] Exclude VertexRDD in MiMA. a45a62c [Marcelo Vanzin] Work around MIMA warning. 1d8a670 [Marcelo Vanzin] Re-group jetty exclusion. 0e8e909 [Marcelo Vanzin] Ignore ml, don't ignore graphx. cef4603 [Marcelo Vanzin] Indentation. 296cf82 [Marcelo Vanzin] [SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT.	2015-03-20 18:43:57 +00:00
lisurprise	f149b8b5e5	[SPARK-6077] Remove streaming tab while stopping StreamingContext Currently we would create a new streaming tab for each streamingContext even if there's already one on the same sparkContext which would cause duplicate StreamingTab created and none of them is taking effect. snapshot: https://www.dropbox.com/s/t4gd6hqyqo0nivz/bad%20multiple%20streamings.png?dl=0 How to reproduce: 1) import org.apache.spark.SparkConf import org.apache.spark.streaming. {Seconds, StreamingContext} import org.apache.spark.storage.StorageLevel val ssc = new StreamingContext(sc, Seconds(1)) val lines = ssc.socketTextStream("localhost", 9999, StorageLevel.MEMORY_AND_DISK_SER) val words = lines.flatMap(_.split(" ")) val wordCounts = words.map(x => (x, 1)).reduceByKey(_ + _) wordCounts.print() ssc.start() ..... 2) ssc.stop(false) val ssc = new StreamingContext(sc, Seconds(1)) val lines = ssc.socketTextStream("localhost", 9999, StorageLevel.MEMORY_AND_DISK_SER) val words = lines.flatMap(_.split(" ")) val wordCounts = words.map(x => (x, 1)).reduceByKey(_ + _) wordCounts.print() ssc.start() Author: lisurprise <zhichao.li@intel.com> Closes #4828 from zhichao-li/master and squashes the following commits: c329806 [lisurprise] add test for attaching/detaching streaming tab 51e6c7f [lisurprise] move detach method into StreamingTab 31a44fa [lisurprise] add unit test for attaching and detaching new tab db25ed2 [lisurprise] clean code 8281bcb [lisurprise] clean code 193c542 [lisurprise] remove streaming tab while closing streaming context	2015-03-16 13:10:32 -07:00
Sean Owen	c9cfba0ceb	SPARK-6182 [BUILD] spark-parent pom needs to be published for both 2.10 and 2.11 Option 1 of 2: Convert spark-parent module name to spark-parent_2.10 / spark-parent_2.11 Author: Sean Owen <sowen@cloudera.com> Closes #4912 from srowen/SPARK-6182.1 and squashes the following commits: eff60de [Sean Owen] Convert spark-parent module name to spark-parent_2.10 / spark-parent_2.11	2015-03-05 11:31:48 -08:00
Patrick Wendell	7930d2bef0	SPARK-3996: Add jetty servlet and continuations. These are needed transitively from the other Jetty libraries we include. It was not picked up by unit tests because we disable the UI. Author: Patrick Wendell <patrick@databricks.com> Closes #4323 from pwendell/jetty and squashes the following commits: d8669da [Patrick Wendell] SPARK-3996: Add jetty servlet and continuations.	2015-02-02 21:01:36 -08:00
Patrick Wendell	a15f6e31fc	[SPARK-3996]: Shade Jetty in Spark deliverables (v2 of this patch with a fix that was only relevant for the maven build). This patch piggy-back's on vanzin's work to simplify the Guava shading, and adds Jetty as a shaded library in Spark. Other than adding Jetty, it consilidates the <artifactSet>'s into the root pom. I found it was a bit easier to follow that way, since you don't need to look into child pom's to find out specific artifact sets included in shading. Author: Patrick Wendell <patrick@databricks.com> Closes #4285 from pwendell/jetty and squashes the following commits: d3e7f4e [Patrick Wendell] Fix for shaded deps causing compile errors 19f0710 [Patrick Wendell] More code review feedback 961452d [Patrick Wendell] Responding to feedback from Marcello 6df25ca [Patrick Wendell] [WIP] [SPARK-3996]: Shade Jetty in Spark deliverables	2015-02-01 21:13:57 -08:00
Marcelo Vanzin	f9e569452e	[SPARK-5466] Add explicit guava dependencies where needed. One side-effect of shading guava is that it disappears as a transitive dependency. For Hadoop 2.x, this was masked by the fact that Hadoop itself depends on guava. But certain versions of Hadoop 1.x also shade guava, leaving either no guava or some random version pulled by another dependency on the classpath. So be explicit about the dependency in modules that use guava directly, which is the right thing to do anyway. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #4272 from vanzin/SPARK-5466 and squashes the following commits: e3f30e5 [Marcelo Vanzin] Dependency for catalyst is not needed. d3b2c84 [Marcelo Vanzin] [SPARK-5466] Add explicit guava dependencies where needed.	2015-01-29 13:00:45 -08:00
Marcelo Vanzin	37a5e272f8	[SPARK-4809] Rework Guava library shading. The current way of shading Guava is a little problematic. Code that depends on "spark-core" does not see the transitive dependency, yet classes in "spark-core" actually depend on Guava. So it's a little tricky to run unit tests that use spark-core classes, since you need a compatible version of Guava in your dependencies when running the tests. This can become a little tricky, and is kind of a bad user experience. This change modifies the way Guava is shaded so that it's applied uniformly across the Spark build. This means Guava is shaded inside spark-core itself, so that the dependency issues above are solved. Aside from that, all Spark sub-modules have their Guava references relocated, so that they refer to the relocated classes now packaged inside spark-core. Before, this was only done by the time the assembly was built, so projects that did not end up inside the assembly (such as streaming backends) could still reference the original location of Guava classes. The Guava classes are added to the "first" artifact Spark generates (network-common), so that all downstream modules have the needed classes available. Since "network-common" is a dependency of spark-core, all Spark apps should get the relocated classes automatically. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #3658 from vanzin/SPARK-4809 and squashes the following commits: 3c93e42 [Marcelo Vanzin] Shade Guava in the network-common artifact. 5d69ec9 [Marcelo Vanzin] Merge branch 'master' into SPARK-4809 b3104fc [Marcelo Vanzin] Add comment. 941848f [Marcelo Vanzin] Merge branch 'master' into SPARK-4809 f78c48a [Marcelo Vanzin] Merge branch 'master' into SPARK-4809 8053dd4 [Marcelo Vanzin] Merge branch 'master' into SPARK-4809 107d7da [Marcelo Vanzin] Add fix for SPARK-5052 (PR #3874). 40b8723 [Marcelo Vanzin] Merge branch 'master' into SPARK-4809 4a4ed42 [Marcelo Vanzin] [SPARK-4809] Rework Guava library shading.	2015-01-28 00:29:29 -08:00
Davies Liu	bad6c57211	[SPARK-5275] [Streaming] include python source code Include the python source code into assembly jar. cc mengxr pwendell Author: Davies Liu <davies@databricks.com> Closes #4128 from davies/build_streaming2 and squashes the following commits: 546af4c [Davies Liu] fix indent 48859b2 [Davies Liu] include python source code	2015-01-20 22:44:58 -08:00
Marcelo Vanzin	48cecf673c	[SPARK-4048] Enhance and extend hadoop-provided profile. This change does a few things to make the hadoop-provided profile more useful: - Create new profiles for other libraries / services that might be provided by the infrastructure - Simplify and fix the poms so that the profiles are only activated while building assemblies. - Fix tests so that they're able to run when the profiles are activated - Add a new env variable to be used by distributions that use these profiles to provide the runtime classpath for Spark jobs and daemons. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #2982 from vanzin/SPARK-4048 and squashes the following commits: 82eb688 [Marcelo Vanzin] Add a comment. eb228c0 [Marcelo Vanzin] Fix borked merge. 4e38f4e [Marcelo Vanzin] Merge branch 'master' into SPARK-4048 9ef79a3 [Marcelo Vanzin] Alternative way to propagate test classpath to child processes. 371ebee [Marcelo Vanzin] Review feedback. 52f366d [Marcelo Vanzin] Merge branch 'master' into SPARK-4048 83099fc [Marcelo Vanzin] Merge branch 'master' into SPARK-4048 7377e7b [Marcelo Vanzin] Merge branch 'master' into SPARK-4048 322f882 [Marcelo Vanzin] Fix merge fail. f24e9e7 [Marcelo Vanzin] Merge branch 'master' into SPARK-4048 8b00b6a [Marcelo Vanzin] Merge branch 'master' into SPARK-4048 9640503 [Marcelo Vanzin] Cleanup child process log message. 115fde5 [Marcelo Vanzin] Simplify a comment (and make it consistent with another pom). e3ab2da [Marcelo Vanzin] Fix hive-thriftserver profile. 7820d58 [Marcelo Vanzin] Fix CliSuite with provided profiles. 1be73d4 [Marcelo Vanzin] Restore flume-provided profile. d1399ed [Marcelo Vanzin] Restore jetty dependency. 82a54b9 [Marcelo Vanzin] Remove unused profile. 5c54a25 [Marcelo Vanzin] Fix HiveThriftServer2Suite with *-provided profiles. 1fc4d0b [Marcelo Vanzin] Update dependencies for hive-thriftserver. f7b3bbe [Marcelo Vanzin] Add snappy to hadoop-provided list. 9e4e001 [Marcelo Vanzin] Remove duplicate hive profile. d928d62 [Marcelo Vanzin] Redirect child stderr to parent's log. 4d67469 [Marcelo Vanzin] Propagate SPARK_DIST_CLASSPATH on Yarn. 417d90e [Marcelo Vanzin] Introduce "SPARK_DIST_CLASSPATH". 2f95f0d [Marcelo Vanzin] Propagate classpath to child processes during testing. 1adf91c [Marcelo Vanzin] Re-enable maven-install-plugin for a few projects. 284dda6 [Marcelo Vanzin] Rework the "hadoop-provided" profile, add new ones.	2015-01-08 17:15:13 -08:00
Sean Owen	4cba6eb420	SPARK-4159 [CORE] Maven build doesn't run JUnit test suites This PR: - Reenables `surefire`, and copies config from `scalatest` (which is itself an old fork of `surefire`, so similar) - Tells `surefire` to test only Java tests - Enables `surefire` and `scalatest` for all children, and in turn eliminates some duplication. For me this causes the Scala and Java tests to be run once each, it seems, as desired. It doesn't affect the SBT build but works for Maven. I still need to verify that all of the Scala tests and Java tests are being run. Author: Sean Owen <sowen@cloudera.com> Closes #3651 from srowen/SPARK-4159 and squashes the following commits: 2e8a0af [Sean Owen] Remove specialized SPARK_HOME setting for REPL, YARN tests as it appears to be obsolete 12e4558 [Sean Owen] Append to unit-test.log instead of overwriting, so that both surefire and scalatest output is preserved. Also standardize/correct comments a bit. e6f8601 [Sean Owen] Reenable Java tests by reenabling surefire with config cloned from scalatest; centralize test config in the parent	2015-01-06 12:02:08 -08:00

1 2 3

114 commits