spark-instrumented-optimizer


fwang1 f4344582ba [SPARK-14497][ML] Use top instead of sortBy() to get top N frequent words as dict in CountVectorizer

## What changes were proposed in this pull request?

Replace sortBy() with top() to calculate the top N frequent words as the dictionary.

## How was this patch tested?

Existing unit tests. Terms with the same TF would be sorted in descending order. The test would fail if the terms with the same TF were hardcoded in the dictionary, like "c", "d"...

Author: fwang1 <desperado.wf@gmail.com>

Closes #12265 from lionelfeng/master.

2016-04-10 01:13:25 -07:00

src	[SPARK-14497][ML] Use top instead of sortBy() to get top N frequent words as dict in CountVectorizer	2016-04-10 01:13:25 -07:00
pom.xml	Revert "[SPARK-14462][ML][MLLIB] add the mllib-local build to maven pom"	2016-04-09 14:03:03 -07:00
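
The change described in the commit message, replacing a full sort with a bounded top-N selection, can be illustrated outside Spark. The following is a minimal plain-Python sketch, not the Spark code: `top_n_by_sort`, `top_n_by_heap`, and the sample documents are hypothetical names and data chosen for illustration, and `heapq.nlargest` only stands in for the role that `top()` plays on an RDD.

```python
import heapq
from collections import Counter


def top_n_by_sort(counts, n):
    """Sort every (term, count) pair by count, then keep the first n."""
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    return ranked[:n]


def top_n_by_heap(counts, n):
    """Select the n pairs with the largest counts without sorting all of them."""
    return heapq.nlargest(n, counts.items(), key=lambda kv: kv[1])


# Hypothetical tokenized documents, for illustration only.
docs = [["a", "b", "c"], ["a", "b"], ["a"], ["d"]]

# Term frequency across all documents: a=3, b=2, c=1, d=1.
counts = Counter(term for doc in docs for term in doc)

# Both approaches agree on the two most frequent terms.
vocab_sorted = top_n_by_sort(counts, 2)
vocab_heap = top_n_by_heap(counts, 2)
```

For V distinct terms, the full sort costs O(V log V), while the bounded selection tracks at most n candidates at a time, which matters when n is much smaller than V. The sketch compares by count only and does not model how Spark orders terms with equal frequency.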
