spark-instrumented-optimizer/python/pyspark
Michael Giannakopoulos c281189222 [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods.
Related to issue: [SPARK-2550](https://issues.apache.org/jira/browse/SPARK-2550?jql=project%20%3D%20SPARK%20AND%20resolution%20%3D%20Unresolved%20AND%20priority%20%3D%20Major%20ORDER%20BY%20key%20DESC).

Author: Michael Giannakopoulos <miccagiann@gmail.com>

Closes #1624 from miccagiann/new-branch and squashes the following commits:

c02e5f5 [Michael Giannakopoulos] Merge cleanly with upstream/master.
8dcb888 [Michael Giannakopoulos] Putting the if/else if statements in brackets.
fed8eaa [Michael Giannakopoulos] Adding a space in the message related to the IllegalArgumentException.
44e6ff0 [Michael Giannakopoulos] Adding a blank line before python class LinearRegressionWithSGD.
8eba9c5 [Michael Giannakopoulos] Change function signatures. Exception is thrown from the scala component and not from the python one.
638be47 [Michael Giannakopoulos] Modified code to comply with code standards.
ec50ee9 [Michael Giannakopoulos] Shorten the if-elif-else statement in regression.py file
b962744 [Michael Giannakopoulos] Replaced the enum classes, with strings-keywords for defining the values of 'regType' parameter.
78853ec [Michael Giannakopoulos] Providing intercept and regualizer functionallity for linear methods in only one function.
3ac8874 [Michael Giannakopoulos] Added support for regularizer and intercection parameters for linear regression method.
2014-08-01 21:00:31 -07:00
..
mllib [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pyspark's linear methods. 2014-08-01 21:00:31 -07:00
__init__.py [SPARK-2724] Python version of RandomRDDGenerators 2014-07-31 20:32:57 -07:00
accumulators.py SPARK-2282: Reuse Socket for sending accumulator updates to Pyspark 2014-07-31 15:31:53 -07:00
broadcast.py Fix some Python docs and make sure to unset SPARK_TESTING in Python 2013-12-29 20:15:07 -05:00
cloudpickle.py [SPARK-791] [PySpark] fix pickle itemgetter with cloudpickle 2014-07-29 01:02:18 -07:00
conf.py [SPARK-2014] Make PySpark store RDDs in MEMORY_ONLY_SER with compression by default 2014-07-24 18:15:37 -07:00
context.py [SPARK-2024] Add saveAsSequenceFile to PySpark 2014-07-30 13:19:05 -07:00
daemon.py [SPARK-2764] Simplify daemon.py process structure 2014-08-01 19:38:21 -07:00
files.py Initial work to rename package to org.apache.spark 2013-09-01 14:13:13 -07:00
java_gateway.py [SPARK-2470] PEP8 fixes to PySpark 2014-07-21 22:30:53 -07:00
join.py [SPARK-2470] PEP8 fixes to PySpark 2014-07-21 22:30:53 -07:00
rdd.py [SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD 2014-08-01 18:47:41 -07:00
rddsampler.py [SPARK-2656] Python version of stratified sampling 2014-07-24 23:42:08 -07:00
resultiterable.py [SPARK-2470] PEP8 fixes to PySpark 2014-07-21 22:30:53 -07:00
serializers.py [SPARK-2538] [PySpark] Hash based disk spilling aggregation 2014-07-24 22:53:47 -07:00
shell.py [SPARK-2470] PEP8 fixes to PySpark 2014-07-21 22:30:53 -07:00
shuffle.py [SPARK-2538] [PySpark] Hash based disk spilling aggregation 2014-07-24 22:53:47 -07:00
sql.py [SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD 2014-08-01 18:47:41 -07:00
statcounter.py [SPARK-2470] PEP8 fixes to PySpark 2014-07-21 22:30:53 -07:00
storagelevel.py [SPARK-2470] PEP8 fixes to PySpark 2014-07-21 22:30:53 -07:00
tests.py [SPARK-2024] Add saveAsSequenceFile to PySpark 2014-07-30 13:19:05 -07:00
worker.py [SPARK-2580] [PySpark] keep silent in worker if JVM close the socket 2014-07-29 00:15:45 -07:00