spark-instrumented-optimizer

Michael Giannakopoulos c281189222 [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods.

Related to issue: [SPARK-2550](https://issues.apache.org/jira/browse/SPARK-2550?jql=project%20%3D%20SPARK%20AND%20resolution%20%3D%20Unresolved%20AND%20priority%20%3D%20Major%20ORDER%20BY%20key%20DESC).

Author: Michael Giannakopoulos <miccagiann@gmail.com>

Closes #1624 from miccagiann/new-branch and squashes the following commits:

c02e5f5 [Michael Giannakopoulos] Merge cleanly with upstream/master.
8dcb888 [Michael Giannakopoulos] Putting the if/else if statements in brackets.
fed8eaa [Michael Giannakopoulos] Adding a space in the message related to the IllegalArgumentException.
44e6ff0 [Michael Giannakopoulos] Adding a blank line before python class LinearRegressionWithSGD.
8eba9c5 [Michael Giannakopoulos] Change function signatures. Exception is thrown from the scala component and not from the python one.
638be47 [Michael Giannakopoulos] Modified code to comply with code standards.
ec50ee9 [Michael Giannakopoulos] Shorten the if-elif-else statement in regression.py file
b962744 [Michael Giannakopoulos] Replaced the enum classes, with strings-keywords for defining the values of 'regType' parameter.
78853ec [Michael Giannakopoulos] Providing intercept and regualizer functionallity for linear methods in only one function.
3ac8874 [Michael Giannakopoulos] Added support for regularizer and intercection parameters for linear regression method.

2014-08-01 21:00:31 -07:00
mllib	[SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods.	2014-08-01 21:00:31 -07:00
__init__.py	[SPARK-2724] Python version of RandomRDDGenerators	2014-07-31 20:32:57 -07:00
accumulators.py	SPARK-2282: Reuse Socket for sending accumulator updates to Pyspark	2014-07-31 15:31:53 -07:00
broadcast.py	Fix some Python docs and make sure to unset SPARK_TESTING in Python	2013-12-29 20:15:07 -05:00
cloudpickle.py	[SPARK-791] [PySpark] fix pickle itemgetter with cloudpickle	2014-07-29 01:02:18 -07:00
conf.py	[SPARK-2014] Make PySpark store RDDs in MEMORY_ONLY_SER with compression by default	2014-07-24 18:15:37 -07:00
context.py	[SPARK-2024] Add saveAsSequenceFile to PySpark	2014-07-30 13:19:05 -07:00
daemon.py	[SPARK-2764] Simplify daemon.py process structure	2014-08-01 19:38:21 -07:00
files.py	Initial work to rename package to org.apache.spark	2013-09-01 14:13:13 -07:00
java_gateway.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
join.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
rdd.py	[SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD	2014-08-01 18:47:41 -07:00
rddsampler.py	[SPARK-2656] Python version of stratified sampling	2014-07-24 23:42:08 -07:00
resultiterable.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
serializers.py	[SPARK-2538] [PySpark] Hash based disk spilling aggregation	2014-07-24 22:53:47 -07:00
shell.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
shuffle.py	[SPARK-2538] [PySpark] Hash based disk spilling aggregation	2014-07-24 22:53:47 -07:00
sql.py	[SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD	2014-08-01 18:47:41 -07:00
statcounter.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
storagelevel.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
tests.py	[SPARK-2024] Add saveAsSequenceFile to PySpark	2014-07-30 13:19:05 -07:00
worker.py	[SPARK-2580] [PySpark] keep silent in worker if JVM close the socket	2014-07-29 00:15:45 -07:00