ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Matei Zaharia	3c8478e1fb	Merge pull request #747 from mateiz/improved-lr Update the Python logistic regression example	2013-08-06 23:25:03 -07:00
Matei Zaharia	6b043a6f11	Merge pull request #724 from dlyubimov/SPARK-826 SPARK-826: fold(), reduce(), collect() always attempt to use java serialization	2013-08-06 22:31:02 -07:00
Shivaram Venkataraman	338b7a7455	Merge branch 'master' of git://github.com/mesos/spark into sgd-cleanup Conflicts: mllib/src/main/scala/spark/mllib/util/MLUtils.scala	2013-08-06 21:21:55 -07:00
Shivaram Venkataraman	7db69d56f2	Refactor GLM algorithms and add Java tests This change adds Java examples and unit tests for all GLM algorithms to make sure the MLLib interface works from Java. Changes include - Introduce LabeledPoint and avoid using Doubles in train arguments - Rename train to run in class methods - Make the optimizer a member variable of GLM to make sure the builder pattern works	2013-08-06 17:23:22 -07:00
Matei Zaharia	7c4b7a53b1	Merge remote-tracking branch 'origin/pr/781' Conflicts: core/src/main/resources/spark/ui/static/webui.css	2013-08-06 17:19:49 -07:00
Matei Zaharia	de6c4c995a	Merge pull request #787 from ash211/master Update spark-standalone.md	2013-08-06 17:09:50 -07:00
Karen Feng	908032e79b	Used saturated colors for progress bars	2013-08-06 16:52:21 -07:00
Andrew Ash	afc2c80fdb	Update spark-standalone.md	2013-08-07 00:44:43 +01:00
Shivaram Venkataraman	6caec3f441	Add a test case for random initialization. Also workaround a bug where double[][] class cast fails	2013-08-06 16:35:47 -07:00
Karen Feng	8bc497fa10	Lightened color of progress bars	2013-08-06 16:33:05 -07:00
Karen Feng	ca1903ea63	Overlays progress text on top of bar	2013-08-06 15:45:42 -07:00
Matei Zaharia	df4d10d630	Merge pull request #779 from adatao/adatao-global-SparkEnv [HOTFIX] Extend thread safety for SparkEnv.get()	2013-08-06 15:44:05 -07:00
Shivaram Venkataraman	471fbadd0c	Java examples, tests for KMeans and ALS - Changes ALS to accept RDD[Rating] instead of (Int, Int, Double) making it easier to call from Java - Renames class methods from `train` to `run` to enable static methods to be called from Java. - Add unit tests which check if both static / class methods can be called. - Also add examples which port the main() function in ALS, KMeans to the examples project. Couple of minor changes to existing code: - Add a toJavaRDD method in RDD to convert scala RDD to java RDD easily - Workaround a bug where using double[] from Java leads to class cast exception in KMeans init	2013-08-06 15:43:46 -07:00
anfeng	dda2ac8b5d	reformat registerFileSystemStat()	2013-08-06 15:22:25 -07:00
Karen Feng	099528b6c4	Pre-sorts stage/env tables, changes text/link of stage summaries	2013-08-06 14:52:12 -07:00
Matei Zaharia	d2b0f0c23d	Merge pull request #770 from stayhf/SPARK-760-Java Simple PageRank algorithm implementation in Java for SPARK-760	2013-08-06 14:49:39 -07:00
stayhf	882baee489	Got rid of unnecessary map function	2013-08-06 21:34:39 +00:00
Karen Feng	254a930730	Reverse sorts StageTable by submitted time	2013-08-06 14:18:38 -07:00
stayhf	326a7a82e0	changes as reviewer requested	2013-08-06 21:03:24 +00:00
Karen Feng	5ed5b73026	Sorts first column of env tables	2013-08-06 13:59:53 -07:00
anfeng	0748c60817	expose HDFS file system stats via Executor metrics	2013-08-06 11:47:06 -07:00
Reynold Xin	d031f73679	Merge pull request #782 from WANdisco/master SHARK-94 Log the files computed by HadoopRDD and NewHadoopRDD	2013-08-05 22:33:00 -07:00
Matei Zaharia	1b63dea816	Merge pull request #769 from markhamstra/NegativeCores SPARK-847 + SPARK-845: Zombie workers and negative cores	2013-08-05 22:21:26 -07:00
Alexander Pivovarov	a30866438b	SHARK-94 Log the files computed by HadoopRDD and NewHadoopRDD	2013-08-05 21:48:43 -07:00
Matei Zaharia	828aff744d	Merge pull request #776 from gingsmith/master adding matrix factorization data generator	2013-08-05 21:37:33 -07:00
Ginger Smith	bf7033f3eb	fixing formatting, style, and input	2013-08-05 21:26:24 -07:00
Matei Zaharia	8b277892c9	Merge pull request #774 from pwendell/job-description Show user-defined job name in UI	2013-08-05 19:14:52 -07:00
Christopher Nguyen	b1bbbe699c	[HOTFIX] Mark lastSetSparkEnv @volatile in case it gets HotSpot-cached On branch adatao-global-SparkEnv Changes to be committed: modified: core/src/main/scala/spark/SparkEnv.scala	2013-08-05 17:22:27 -07:00
Mark Hamstra	35d8f5ee52	Moved handling of timed out workers within the Master actor	2013-08-05 13:13:56 -07:00
Mark Hamstra	37ccf9301a	milliseconds -> seconds in timeOutDeadWorkers logging	2013-08-05 13:13:56 -07:00
Mark Hamstra	cdd1af562e	Timeout zombie workers	2013-08-05 13:13:56 -07:00
Mikhail Bautin	e8bec8365f	Only reduce the number of cores once when removing an executor	2013-08-05 13:13:56 -07:00
Karen Feng	95025afdec	Made most small fixes for SPARK-849 except for table sort, task progress overlay	2013-08-05 13:04:56 -07:00
Patrick Wendell	550b0cf48a	Merge pull request #780 from cybermaster/master SPARK-850	2013-08-05 12:10:32 -07:00
Bill Zhao	33b9b155de	JBoss repository working now	2013-08-05 12:02:36 -07:00
Bill Zhao	9df66cd831	Merge branch 'master' of github.com:cybermaster/spark	2013-08-05 11:56:30 -07:00
Bill Zhao	87134b3648	SPARK-850: give better console message	2013-08-05 11:55:35 -07:00
Ginger Smith	8c8947e2b6	fixing formatting	2013-08-05 11:22:18 -07:00
Bill Zhao	d93d5fcaac	SPARK-850: Give better error message on the console	2013-08-05 10:09:03 -07:00
Christopher Nguyen	39e4fda76f	[HOTFIX] Extend thread safety for SparkEnv.get() A ThreadLocal SparkEnv.env is facing various situations leading to NullPointerExceptions, where SparkEnv.env set in one thread is not gettable in another thread, but often assumed to be available. See, e.g., https://groups.google.com/forum/#!topic/spark-developers/GLx8yunSj0A This hotfixes SparkEnv.env to return either (a) the ThreadLocal value if non-null, or (b) the previously set value in any thread. This approach preserves SparkEnv.set() thread safety needed by RDD.compute() and possibly other places. A refactoring that parameterizes SparkEnv should be addressed subsequently. On branch adatao-global-SparkEnv Changes to be committed: modified: core/src/main/scala/spark/SparkEnv.scala	2013-08-05 02:09:54 -07:00
stayhf	98fd62605d	Updated code with reviewer's suggestions	2013-08-05 00:30:28 +00:00
Patrick Wendell	f3660d5ab8	Make output formatting consistent between bash/scala	2013-08-03 21:30:15 -07:00
Shivaram Venkataraman	7388e27668	Move implicit arg to constructor for Java access.	2013-08-03 18:08:43 -07:00
Patrick Wendell	ad94fbb322	Log the launch command for Spark executors	2013-08-03 09:19:46 -07:00
stayhf	a682637301	Simple PageRank algorithm implementation in Java for SPARK-760	2013-08-03 06:01:16 +00:00
Ginger Smith	4ab4df5edb	adding matrix factorization data generator	2013-08-02 22:22:36 -07:00
Shivaram Venkataraman	00339cc032	Refactor optimizers and create GLMs This change refactors the structure of GLMs to use mixins which maintain a similar interface to other ML lib algorithms. This change also creates an Optimizer trait which allows GLMs to be extended to use other optimization techniques.	2013-08-02 19:15:34 -07:00
Patrick Wendell	b4905c383b	Log the launch command for Spark daemons For debugging and analysis purposes, it's nice to have the exact command used to launch Spark contained within the logs. This adds the necessary hooks to make that possible.	2013-08-02 16:58:19 -07:00
Matei Zaharia	22abbc10d6	Merge pull request #772 from karenfeng/ui-843 Show app duration	2013-08-02 16:37:59 -07:00
Matei Zaharia	abfa9e6f70	Increase Kryo buffer size in ALS since some arrays become big	2013-08-02 16:17:32 -07:00

1 2 3 4 5 ...

3613 commits