[ROCm] Adding the ROCM backend implementation for GPU dialect + the mlir-rocm-runner utility#169
deven-amd wants to merge 5 commits into tensorflow:master from deven-amd:deven-gpu-rocm-backend
Conversation
…32 to i64. Also making corresponding changes in the unit test for the same
River707 left a comment
Hi,
This is a huge commit. Is it possible for you to split it into multiple, more bite-sized changes? That would help the review, and also ensure that each individual component is well tested and gets the proper review it deserves.
Thanks!
Just noticed that you have nice split commits, whoops.
… blob associated with the GPU kernels
@River707 I have pushed out a new commit that addresses all but one of the code review comments; please re-review. Thanks!
+1: it seems to me that adding the runner is independent of adding the lowering passes.
joker-eph left a comment
(I only reviewed the ConvertKernelFuncToHSACO part, waiting for the PR to be split.)
I was not sure whether this is ready for review or whether I should wait for the push. Is this ready now and should I review?
I will break this PR into one PR per commit, as per your request. The PR for the first commit (converting the return type from i32 to i64) can stand on its own (filed as PR #171). The PRs for the other 4 commits (in this PR) will need to be filed sequentially, since the later commits assume/require the presence of the earlier ones. I will file the PR for the first of those four commits (ConvertKernelFuncToHSACO) after I have addressed all your code review feedback. Closing out this PR.
If you have the 4 commits, then you can file the 4 PRs at once: push the first commit to a branch, the 2nd to another branch (rebased on the first one, of course), and so on.
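The stacked-branch workflow described above can be sketched as follows. This is a self-contained demo in a throwaway repository, not the contributor's actual commands; the branch names (pr-1 through pr-4), file names, and commit messages are all hypothetical placeholders.

```shell
#!/bin/sh
# Demo: split a 4-commit series into one branch per commit,
# where each branch is based on the one before it.
set -e
dir=$(mktemp -d)
cd "$dir"
git init -q demo
cd demo
git config user.email "demo@example.com"
git config user.name "Demo User"

# Create a linear series of 4 commits (stand-ins for the PR's commits).
for i in 1 2 3 4; do
  echo "change $i" > "file$i.txt"
  git add "file$i.txt"
  git commit -qm "commit $i"
done

# Point a branch at each commit in the series: pr-1 holds only the
# first commit, pr-2 the first two, and so on. Each pr-N branch is
# therefore "rebased on" the previous one, ready to file as its own PR.
for i in 1 2 3 4; do
  git branch "pr-$i" "HEAD~$((4 - i))"
done

git branch --list 'pr-*'
```

Each `pr-N` branch can then be pushed and opened as a separate PR, with `pr-2` targeted at `pr-1`, and so on, so reviewers see only one commit's worth of diff at a time.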
This PR adds the ROCm backend implementation for the GPU dialect, plus the mlir-rocm-runner utility.
The ROCm backend support for AMD GPUs is similar in concept and implementation to the CUDA backend support for NVIDIA GPUs.
@joker-eph @whchung