eb6fd7eab7
Currently, when running applications on yarn mode, the app staging directory of is controlled by `spark.yarn.stagingDir` config if specified, and this directory cannot separate different users, sometimes, it's inconvenient for file and quota management for users. Sometimes, there might be an unexpected increasing of the staging files, two possible reasons are: 1. The `spark.yarn.preserve.staging.files` provided can be misused by users 2. cron task constantly starting new applications on non-existent yarn queue(wrong configuration). But now, we are not easy to find out the which user obtains the most HDFS files or spaces. what's more, even we want set HDFS name quota or space quota for each user to limit the increase is impossible. So I propose to add user sub directories under this app staging directory which is more clear. existing UT Closes #23786 from liupc/Support-user-level-app-staging-dir. Authored-by: Liupengcheng <liupengcheng@xiaomi.com> Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com> |
||
---|---|---|
.. | ||
src | ||
pom.xml |