2013-08-31 17:21:10 -04:00
|
|
|
---
|
|
|
|
layout: global
|
2016-07-15 16:38:23 -04:00
|
|
|
title: "MLlib: RDD-based API"
|
|
|
|
displayTitle: "MLlib: RDD-based API"
|
2019-03-30 20:49:45 -04:00
|
|
|
license: |
|
|
|
|
Licensed to the Apache Software Foundation (ASF) under one or more
|
|
|
|
contributor license agreements. See the NOTICE file distributed with
|
|
|
|
this work for additional information regarding copyright ownership.
|
|
|
|
The ASF licenses this file to You under the Apache License, Version 2.0
|
|
|
|
(the "License"); you may not use this file except in compliance with
|
|
|
|
the License. You may obtain a copy of the License at
|
|
|
|
|
|
|
|
http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
|
|
|
|
Unless required by applicable law or agreed to in writing, software
|
|
|
|
distributed under the License is distributed on an "AS IS" BASIS,
|
|
|
|
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
|
|
|
See the License for the specific language governing permissions and
|
|
|
|
limitations under the License.
|
2013-08-31 17:21:10 -04:00
|
|
|
---
|
|
|
|
|
2016-07-15 16:38:23 -04:00
|
|
|
This page documents sections of the MLlib guide for the RDD-based API (the `spark.mllib` package).
|
|
|
|
Please see the [MLlib Main Guide](ml-guide.html) for the DataFrame-based API (the `spark.ml` package),
|
|
|
|
which is now the primary API for MLlib.
|
2014-01-03 19:38:33 -05:00
|
|
|
|
2014-08-27 04:19:48 -04:00
|
|
|
* [Data types](mllib-data-types.html)
|
|
|
|
* [Basic statistics](mllib-statistics.html)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [summary statistics](mllib-statistics.html#summary-statistics)
|
|
|
|
* [correlations](mllib-statistics.html#correlations)
|
|
|
|
* [stratified sampling](mllib-statistics.html#stratified-sampling)
|
|
|
|
* [hypothesis testing](mllib-statistics.html#hypothesis-testing)
|
2015-11-30 18:38:44 -05:00
|
|
|
* [streaming significance testing](mllib-statistics.html#streaming-significance-testing)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [random data generation](mllib-statistics.html#random-data-generation)
|
2014-08-12 20:15:21 -04:00
|
|
|
* [Classification and regression](mllib-classification-regression.html)
|
|
|
|
* [linear models (SVMs, logistic regression, linear regression)](mllib-linear-methods.html)
|
2014-04-22 14:20:47 -04:00
|
|
|
* [naive Bayes](mllib-naive-bayes.html)
|
2014-12-03 20:57:50 -05:00
|
|
|
* [decision trees](mllib-decision-tree.html)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [ensembles of trees (Random Forests and Gradient-Boosted Trees)](mllib-ensembles.html)
|
2015-02-15 12:10:03 -05:00
|
|
|
* [isotonic regression](mllib-isotonic-regression.html)
|
2014-04-22 14:20:47 -04:00
|
|
|
* [Collaborative filtering](mllib-collaborative-filtering.html)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [alternating least squares (ALS)](mllib-collaborative-filtering.html#collaborative-filtering)
|
2014-04-22 14:20:47 -04:00
|
|
|
* [Clustering](mllib-clustering.html)
|
2015-02-13 18:09:27 -05:00
|
|
|
* [k-means](mllib-clustering.html#k-means)
|
|
|
|
* [Gaussian mixture](mllib-clustering.html#gaussian-mixture)
|
|
|
|
* [power iteration clustering (PIC)](mllib-clustering.html#power-iteration-clustering-pic)
|
|
|
|
* [latent Dirichlet allocation (LDA)](mllib-clustering.html#latent-dirichlet-allocation-lda)
|
2015-12-16 13:55:42 -05:00
|
|
|
* [bisecting k-means](mllib-clustering.html#bisecting-kmeans)
|
2015-02-13 18:09:27 -05:00
|
|
|
* [streaming k-means](mllib-clustering.html#streaming-k-means)
|
2014-04-22 14:20:47 -04:00
|
|
|
* [Dimensionality reduction](mllib-dimensionality-reduction.html)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [singular value decomposition (SVD)](mllib-dimensionality-reduction.html#singular-value-decomposition-svd)
|
|
|
|
* [principal component analysis (PCA)](mllib-dimensionality-reduction.html#principal-component-analysis-pca)
|
2014-08-12 20:15:21 -04:00
|
|
|
* [Feature extraction and transformation](mllib-feature-extraction.html)
|
2015-02-18 13:09:56 -05:00
|
|
|
* [Frequent pattern mining](mllib-frequent-pattern-mining.html)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [FP-growth](mllib-frequent-pattern-mining.html#fp-growth)
|
2015-08-18 15:53:57 -04:00
|
|
|
* [association rules](mllib-frequent-pattern-mining.html#association-rules)
|
2015-08-17 20:53:24 -04:00
|
|
|
* [PrefixSpan](mllib-frequent-pattern-mining.html#prefix-span)
|
2015-08-30 02:26:23 -04:00
|
|
|
* [Evaluation metrics](mllib-evaluation-metrics.html)
|
|
|
|
* [PMML model export](mllib-pmml-model-export.html)
|
2014-08-12 20:15:21 -04:00
|
|
|
* [Optimization (developer)](mllib-optimization.html)
|
2015-08-17 18:42:14 -04:00
|
|
|
* [stochastic gradient descent](mllib-optimization.html#stochastic-gradient-descent-sgd)
|
|
|
|
* [limited-memory BFGS (L-BFGS)](mllib-optimization.html#limited-memory-bfgs-l-bfgs)
|
2014-04-22 14:20:47 -04:00
|
|
|
|