You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched in the issues and found no similar issues.
What would you like to be improved?
After running insert overwrite on mixed-hive table, stages are completed and cost much times to commit. The thread dump shows that stack blocked on List.contain() method, it could be improved by using Set.contain()
zhoujinsong
changed the title
[Improvement][Spark]: Spark commit hive partition is too slow
[Improvement][Spark]: Speed up commit process for Mixed Hive format tables
Aug 2, 2023
Search before asking
What would you like to be improved?
After running
insert overwrite
on mixed-hive table, stages are completed and cost much times to commit. The thread dump shows that stack blocked on List.contain() method, it could be improved by using Set.contain()How should we improve?
replace this contain() method to Set.contain()
Are you willing to submit PR?
Subtasks
No response
Code of Conduct
The text was updated successfully, but these errors were encountered: