[SPARK-25144][SQL][TEST] Free aggregate map when task ends
## What changes were proposed in this pull request? [SPARK-25144](https://issues.apache.org/jira/browse/SPARK-25144) reports memory leaks on Apache Spark 2.0.2 ~ 2.3.2-RC5. The bug is already fixed via #21738 as a part of SPARK-21743. This PR only adds a test case to prevent any future regression. ```scala scala> case class Foo(bar: Option[String]) scala> val ds = List(Foo(Some("bar"))).toDS scala> val result = ds.flatMap(_.bar).distinct scala> result.rdd.isEmpty 18/08/19 23:01:54 WARN Executor: Managed memory leak detected; size = 8650752 bytes, TID = 125 res0: Boolean = false ``` ## How was this patch tested? Pass the Jenkins with a new added test case. Closes #22155 from dongjoon-hyun/SPARK-25144-2. Authored-by: Dongjoon Hyun <dongjoon@apache.org> Signed-off-by: hyukjinkwon <gurwls223@apache.org>
This commit is contained in:
parent
219ed7b487
commit
883f3aff67
|
@ -2852,4 +2852,12 @@ class SQLQuerySuite extends QueryTest with SharedSQLContext {
|
|||
spark.sql(s"select * from spark_25084 distribute by ($distributeExprs)").count === count)
|
||||
}
|
||||
}
|
||||
|
||||
test("SPARK-25144 'distinct' causes memory leak") {
|
||||
val ds = List(Foo(Some("bar"))).toDS
|
||||
val result = ds.flatMap(_.bar).distinct
|
||||
result.rdd.isEmpty
|
||||
}
|
||||
}
|
||||
|
||||
case class Foo(bar: Option[String])
|
||||
|
|
Loading…
Reference in a new issue