[ROCM] Fixes compiling on ROCM 5 and accuracy on dense op #13847
junrushao merged 1 commit into apache:main
Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.
Generated by tvm-bot
I checked this on a Radeon VII (gfx906), and it also works on that platform.
I tested this on gfx908, but it does not work on that platform. See #13666 for details.
@masahi @mvermeulen I used masahi:rocm-t-fix to build and run the code, but I get an error as follows.

onnx_rocm.py
```python
from pyexpat import model
dtype = "float32"
import logging
import warnings

def build(target: str, mod: tvm.IRModule, params: dict, input_name: str, input_data: np.ndarray, input: tuple, output: tuple) -> np.ndarray:
    ...

def main():
    ...

main()
```
Fixes apache#13666

* Some bitcode files need to be updated.
* There is a strange, device-dependent accuracy issue when using the default topi `dense` op schedule on AMDGPU (see the issue above). I confirmed that the other schedules work fine.
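For anyone trying to reproduce the `dense` accuracy issue on their own device, one way to check a schedule's output is to compare it against a float64 NumPy reference. The snippet below is only a minimal sketch of that comparison: it does not call TVM, the helper names `dense_reference` and `check_dense` are hypothetical, the `good` and `bad` arrays are stand-ins for real device output, and the tolerances are arbitrary. It assumes the weight is laid out as `(out_features, in_features)`, so that dense computes `x @ w.T`.

```python
import numpy as np


def dense_reference(x, w):
    """Reference dense op, y = x @ w.T, accumulated in float64."""
    return x.astype(np.float64) @ w.astype(np.float64).T


def check_dense(candidate, x, w, rtol=1e-4, atol=1e-4):
    """Compare a candidate output to the reference.

    Returns (ok, max_abs_err). Tolerances are illustrative assumptions.
    """
    ref = dense_reference(x, w)
    err = float(np.max(np.abs(candidate.astype(np.float64) - ref)))
    ok = bool(np.allclose(candidate, ref, rtol=rtol, atol=atol))
    return ok, err


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 16)).astype(np.float32)   # (batch, in_features)
w = rng.standard_normal((8, 16)).astype(np.float32)   # (out_features, in_features)

good = x @ w.T          # stand-in for output from a correct schedule
bad = good.copy()
bad[0, 0] += 1.0        # stand-in for a wrong result from a faulty schedule

print(check_dense(good, x, w))
print(check_dense(bad, x, w))
```

In a real check, `good` or `bad` would be replaced by the array copied back from the ROCm device after running the compiled module, once per schedule under test.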