[Ansor][FLAKY] Bug fix for compute at mutation error#6557
[Ansor][FLAKY] Bug fix for compute at mutation error#6557comaniac merged 1 commit intoapache:masterfrom
Conversation
|
Do you try to build debug version of TVM and use |
The problem is this is not always reproduceable. The only sure thing is that the bug is caused by |
One way you could do is remove |
comaniac
left a comment
There was a problem hiding this comment.
I'll merge this PR first to remove the CI flaky. We should definitely continue diving into InitChangeComputeLocation() to find the root cause.
|
Thanks @jcf94 @FrozenGene |
Thanks. |
|
This kind of general exception catch is not good for future maintenance. We should dig deeper to find out the underlying cause. #6512 does not change any logic, it just moves the location of some functions. Can you confirm this is caused by #6512 or #6529? |
Bug fix for #6548.
From the error log:
we can find that the error of the test was caused by the inferbound error. @merrymercy
Seems this bug was further intruded by #6512, I'm not sure which part of this mutation rule gets a wrong result now.
But it's strange that this bug is not always reproduceable, it occurs with a very small possibility(Which may caused by the multithread?). There may still be some leak conditions in our random generator design.
cc @tqchen @comaniac @FrozenGene