ec28925236
## What changes were proposed in this pull request? For TransportFactory, the requests sent to the same address share a clientPool. Specially, when the io.numConnectionPerPeer is 1, these requests would share a same client. When this address is unreachable, the createClient operation would be still timeout. And these requests would block each other during createClient, because there is a lock for this shared client. It would cost connectionNum \* connectionTimeOut \* maxRetry to retry, and then fail the task. It fact, it is expected that this task could fail in connectionTimeOut * maxRetry. In this PR, I set a fastFail time window for the clientPool, if the last connection failed in this time window, the new connection would fast fail. ## Why are the changes needed? It can save time for some cases. ## Does this PR introduce any user-facing change? No. ## How was this patch tested? Existing UT. Closes #27943 from turboFei/SPARK-31179-fast-fail-connection. Authored-by: turbofei <fwang12@ebay.com> Signed-off-by: Thomas Graves <tgraves@apache.org> |
||
---|---|---|
.. | ||
kvstore | ||
network-common | ||
network-shuffle | ||
network-yarn | ||
sketch | ||
tags | ||
unsafe |