ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Reynold Xin	47011e6854	Use a tigher bound in logistic regression unit test's prediction validation.	2013-07-30 13:58:23 -07:00
Reynold Xin	e35966ae9a	Renamed Classification.scala to ClassificationModel.scala and Regression.scala to RegressionModel.scala	2013-07-30 13:28:31 -07:00
Karen Feng	26144c400f	Fixed wrap style	2013-07-30 12:40:41 -07:00
Karen Feng	218d7c4ed8	Fixed style, lowered height of progress bars	2013-07-30 12:39:17 -07:00
Karen Feng	f1cab31b73	Removed intermediate set for activeTasks, removed progress bar margin	2013-07-30 11:06:47 -07:00
Dmitriy Lyubimov	1bca91633e	+ bug fixes; test added Conflicts: core/src/test/scala/spark/KryoSerializerSuite.scala	2013-07-30 11:04:11 -07:00
shivaram	8aee118216	Merge pull request #748 from atalwalkar/master made SimpleUpdater consistent with other updaters	2013-07-30 10:27:54 -07:00
Benjamin Hindman	f6f46455eb	Added property 'spark.executor.uri' for launching on Mesos without requiring Spark to be installed. Using 'make_distribution.sh' a user can put a Spark distribution at a URI supported by Mesos (e.g., 'hdfs://...') and then set that when launching their job. Also added SPARK_EXECUTOR_URI for the REPL.	2013-07-29 23:32:52 -07:00
Ameet Talwalkar	e4387ddf5d	made SimpleUpdater consistent with other updaters	2013-07-29 22:21:50 -07:00
Josh Rosen	49be084ed3	Use File.pathSeparator instead of hardcoding ':'.	2013-07-29 22:08:57 -07:00
Josh Rosen	b95732632b	Do not inherit master's PYTHONPATH on workers. This fixes SPARK-832, an issue where PySpark would not work when the master and workers used different SPARK_HOME paths. This change may potentially break code that relied on the master's PYTHONPATH being used on workers. To have custom PYTHONPATH additions used on the workers, users should set a custom PYTHONPATH in spark-env.sh rather than setting it in the shell.	2013-07-29 22:08:57 -07:00
Andrew xia	5406013997	refactor codes less than 100 character per line	2013-07-30 11:41:38 +08:00
Andrew xia	614ee16cc4	refactor job ui with pool information	2013-07-30 10:57:26 +08:00
Matei Zaharia	468a36c005	Merge pull request #746 from rxin/cleanup Internal cleanup	2013-07-29 19:44:33 -07:00
atalwalkar	1e1ffb192a	Merge pull request #745 from shivaram/loss-update-fix Remove duplicate loss history in Gradient Descent	2013-07-29 19:26:19 -07:00
Matei Zaharia	01f94931d5	Update the Python logistic regression example to read from a file and batch input records for more efficient NumPy computations	2013-07-29 19:23:41 -07:00
Dmitriy Lyubimov	8e5cd041bb	initial externalization of ParallelCollectionRDD's split	2013-07-29 19:02:53 -07:00
Shivaram Venkataraman	3ca9faa341	Clarify how regVal is computed in Updater docs	2013-07-29 18:37:28 -07:00
Reynold Xin	81720e13fc	Moved all StandaloneClusterMessage's into StandaloneClusterMessages object.	2013-07-29 17:53:01 -07:00
Reynold Xin	23b5da14ed	Moved block manager messages into BlockManagerMessages object.	2013-07-29 17:42:05 -07:00
Reynold Xin	105f4d22e9	Removed Cache and SoftReferenceCache since they are no longer used.	2013-07-29 17:30:38 -07:00
Matei Zaharia	207548b67b	Open up Job UI ports (33000-33010) on EC2 clusters	2013-07-29 17:19:33 -07:00
Reynold Xin	17e62113d4	Moved DeployMessage's into its own DeployMessages object. Also renamed MasterState to MasterStateResponse and WorkerState to WorkerStateResponse for clarity.	2013-07-29 17:14:44 -07:00
Karen Feng	87b821dc39	Fixed continuity of executorToTasksActive, changed color of progress bars	2013-07-29 16:50:51 -07:00
Karen Feng	c7b2788948	Merge branch 'master' of https://github.com/mesos/spark into bootstrap-update Conflicts: core/src/main/scala/spark/ui/jobs/IndexPage.scala	2013-07-29 16:36:07 -07:00
Patrick Wendell	c99b674405	Merge pull request #735 from karenfeng/ui-807 Totals for shuffle data and CPU time	2013-07-29 16:32:55 -07:00
Shivaram Venkataraman	07da72b451	Remove duplicate loss history and clarify why. Also some minor style fixes.	2013-07-29 16:25:17 -07:00
Karen Feng	2d6da9195a	Alphabetized imports	2013-07-29 15:50:52 -07:00
Karen Feng	478a2886d9	Added started tasks to progress bar	2013-07-29 14:51:07 -07:00
Karen Feng	e04a37a332	Merge branch 'master' of https://github.com/mesos/spark into bootstrap-update cially if it merges an updated upstream into a topic branch.	2013-07-29 14:32:48 -07:00
Reynold Xin	fe7298b587	Merge pull request #741 from pwendell/usability Fix two small usability issues	2013-07-29 14:01:00 -07:00
Karen Feng	43a2cc15c0	Use Bootstrap progress bars in web UI	2013-07-29 13:37:24 -07:00
shivaram	c34c0f6a7c	Merge pull request #731 from pxinghao/master Adding SVM and Lasso	2013-07-29 13:18:10 -07:00
Xinghao	2b2630ba3c	Style fix Lines shortened to < 100 characters	2013-07-29 09:22:49 -07:00
Xinghao	07f17439a5	Fix validatePrediction functions for Classification models Classifiers return categorical (Int) values that should be compared directly	2013-07-29 09:22:31 -07:00
Xinghao	3a8d07df8c	Deleting extra LogisticRegressionGenerator and RidgeRegressionGenerator	2013-07-29 09:20:26 -07:00
Xinghao	75f3757300	Fix rounding error in LogisticRegression.scala	2013-07-29 09:19:56 -07:00
Matei Zaharia	d8158ced12	Merge branch 'master' of github.com:mesos/spark	2013-07-29 02:52:02 -04:00
Matei Zaharia	497f55755f	Add docs about ipython	2013-07-29 02:51:43 -04:00
Matei Zaharia	feba7ee540	SPARK-815. Python parallelize() should split lists before batching One unfortunate consequence of this fix is that we materialize any collections that are given to us as generators, but this seems necessary to get reasonable behavior on small collections. We could add a batchSize parameter later to bypass auto-computation of batch size if this becomes a problem (e.g. if users really want to parallelize big generators nicely)	2013-07-29 02:51:43 -04:00
Matei Zaharia	d75c308695	Use None instead of empty string as it's slightly smaller/faster	2013-07-29 02:51:43 -04:00
Matei Zaharia	96b50e82dc	Allow python/run-tests to run from any directory	2013-07-29 02:51:43 -04:00
Matei Zaharia	b5ec355622	Optimize Python foreach() to not return as many objects	2013-07-29 02:51:43 -04:00
Matei Zaharia	b9d6783f36	Optimize Python take() to not compute entire first partition	2013-07-29 02:51:43 -04:00
Xinghao	c823ee1e2b	Replace map-reduce with dot operator using DoubleMatrix	2013-07-28 22:17:53 -07:00
Xinghao	96e04f4cb7	Fixed SVM and LR train functions to take Int instead of Double for Classification	2013-07-28 22:12:39 -07:00
Xinghao	9398dced03	Changed Classification to return Int instead of Double Also minor changes to formatting and comments	2013-07-28 21:39:19 -07:00
Xinghao	67de051bbb	SVMSuite and LassoSuite rewritten to follow closely with LogisticRegressionSuite	2013-07-28 21:09:56 -07:00
Xinghao	29e042940a	Move data generators to util	2013-07-28 20:39:52 -07:00
Matei Zaharia	72ff62a37c	Two fixes to IPython support: - Don't attempt to run worker processes with ipython (that can cause some crashes as ipython prints things to standard out) - Allow passing some IPYTHON_OPTS to launch things like the notebook	2013-07-28 22:23:13 -04:00

... 4 5 6 7 8 ...

3670 commits