spark-instrumented-optimizer/common
Sanket Chintapalli 1662e93119 [SPARK-21501] Change CacheLoader to limit entries based on memory footprint
Right now the spark shuffle service has a cache for index files. It is based on a # of files cached (spark.shuffle.service.index.cache.entries). This can cause issues if people have a lot of reducers because the size of each entry can fluctuate based on the # of reducers.
We saw an issues with a job that had 170000 reducers and it caused NM with spark shuffle service to use 700-800MB or memory in NM by itself.
We should change this cache to be memory based and only allow a certain memory size used. When I say memory based I mean the cache should have a limit of say 100MB.

https://issues.apache.org/jira/browse/SPARK-21501

Manual Testing with 170000 reducers has been performed with cache loaded up to max 100MB default limit, with each shuffle index file of size 1.3MB. Eviction takes place as soon as the total cache size reaches the 100MB limit and the objects will be ready for garbage collection there by avoiding NM to crash. No notable difference in runtime has been observed.

Author: Sanket Chintapalli <schintap@yahoo-inc.com>

Closes #18940 from redsanket/SPARK-21501.
2017-08-23 11:51:11 -05:00
..
kvstore [SPARK-21671][CORE] Move kvstore to "util" sub-package, add private annotation. 2017-08-08 14:33:27 -07:00
network-common [SPARK-21501] Change CacheLoader to limit entries based on memory footprint 2017-08-23 11:51:11 -05:00
network-shuffle [SPARK-21501] Change CacheLoader to limit entries based on memory footprint 2017-08-23 11:51:11 -05:00
network-yarn [SPARK-21494][NETWORK] Use correct app id when authenticating to external service. 2017-07-25 17:57:26 -07:00
sketch [SPARK-21592][BUILD] Skip maven-compiler-plugin main and test compilations in Maven build 2017-08-01 19:03:34 +01:00
tags [SPARK-20453] Bump master branch version to 2.3.0-SNAPSHOT 2017-04-24 21:48:04 -07:00
unsafe [SPARK-21592][BUILD] Skip maven-compiler-plugin main and test compilations in Maven build 2017-08-01 19:03:34 +01:00