[TensorRT] Add transpose_a/b for TensorRT batch_matmul #8607
trevor-m merged 3 commits into apache:main from
Conversation
```diff
 y = relay.var("y", shape=(y_shape), dtype="float32")
-out = relay.nn.batch_matmul(x, y)
+out = relay.nn.batch_matmul(
+    relay.transpose(x, [0, 2, 1]) if transa else x,
```
I don't think you need these relay.transpose on the inputs to test the functionality of the transa/transb args.
Good point, I've changed to using x/y_shape instead.
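(A minimal sketch of the shape-based approach; the helper below is hypothetical and only illustrates exercising the transpose_a/b attributes directly instead of inserting relay.transpose ops:)

```python
from tvm import relay

def make_batch_matmul(x_shape, y_shape, transa=False, transb=False):
    # Swap the last two dims of the shapes themselves rather than inserting
    # relay.transpose ops, so the transpose_a/b attributes of nn.batch_matmul
    # are what actually get exercised.
    if transa:
        x_shape = (x_shape[0], x_shape[2], x_shape[1])
    if transb:
        y_shape = (y_shape[0], y_shape[2], y_shape[1])
    x = relay.var("x", shape=x_shape, dtype="float32")
    y = relay.var("y", shape=y_shape, dtype="float32")
    out = relay.nn.batch_matmul(x, y, transpose_a=transa, transpose_b=transb)
    return relay.Function([x, y], out)
```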
```diff
-# Transpose matrix dimensions of b.
-b = _op.transpose(b, [0, 2, 1])
 # Perform a batch matmul.
-output = _op.nn.batch_matmul(a, b)
+output = _op.nn.batch_matmul(a, b, transpose_b=False)
```
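(For reference: relay's nn.batch_matmul defaults to the NT convention, i.e. transpose_b=True, so the old pre-transpose path and the new transpose_b=False path build numerically equivalent graphs. A minimal sketch, with illustrative shapes:)

```python
from tvm import relay

a = relay.var("a", shape=(1, 4, 8), dtype="float32")
b = relay.var("b", shape=(1, 8, 16), dtype="float32")

# NT path: pre-transpose b, then rely on the default transpose_b=True,
# which computes a x transpose(b_t) == a x b.
nt = relay.nn.batch_matmul(a, relay.transpose(b, [0, 2, 1]))

# NN path: pass b as-is and disable the implicit transpose.
nn = relay.nn.batch_matmul(a, b, transpose_b=False)

# Both expressions produce an output of shape (1, 4, 16).
```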
Thanks! @ymwangg
Just a small concern about changing the default behavior of a framework frontend: currently the default topi schedule support for the NN format is not as strong as for the original NT one.
This may cause confusion for those who have used the onnx frontend before or are using it now.
To give an example, I've added an extra config to the TensorFlow frontend which uses the NT format by default but provides an option to use the normal (NN) format. I think that would be better until we have prepared strong enough topi support.
p.s.: Note that I've also kept the default layout for nn.batch_matmul as the original NT.
tvm/python/tvm/relay/frontend/tensorflow_ops.py, lines 1191 to 1199 (at 7653972)
@jcf94 Thanks for the pointer. I will refactor to make NN optional.
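(The refactor could look roughly like the following sketch, mirroring the TensorFlow-frontend pattern described above. The config key name "use_nt_batch_matmul" and the helper convert_matmul are assumptions based on this thread, not necessarily the merged spelling:)

```python
from tvm.relay import op as _op

# Sketch: a frontend-level config gating the matmul layout. The key name
# "use_nt_batch_matmul" is an assumption based on this discussion.
ONNX_DEFAULT_CONFIGS = {
    # Default to the original NT format so existing ONNX-frontend users
    # see no behavior change; the NN format becomes opt-in.
    "use_nt_batch_matmul": True,
}

def convert_matmul(a, b, config=ONNX_DEFAULT_CONFIGS):
    if config["use_nt_batch_matmul"]:
        # NT path: pre-transpose b, then use the default transpose_b=True.
        b = _op.transpose(b, [0, 2, 1])
        return _op.nn.batch_matmul(a, b)
    # NN path: feed b as-is and disable the implicit transpose.
    return _op.nn.batch_matmul(a, b, transpose_b=False)
```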
* Add transpose support for tensorrt batch_matmul
* Address PR comment
* Refactor to add ONNX_DEFAULT_CONFIGS
This PR adds transpose_a/b support for the TensorRT batch_matmul converter and fixes a warning and a compilation error with TensorRT 8. It also removes the redundant transpose op in the ONNX matmul converter. Tested with both TensorRT 7 and TensorRT 8.
cc @trevor-m @comaniac
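(For context, the converter change amounts to mapping relay's transpose_a/b attributes onto TensorRT's matrix-multiply operation flags rather than inserting explicit transpose layers. A minimal sketch using TensorRT's Python API; the actual converter lives in TVM's C++ codegen, and the helper name below is hypothetical:)

```python
import tensorrt as trt

def add_batch_matmul(network, a, b, transpose_a=False, transpose_b=True):
    # Map relay's transpose_a/b attrs onto TensorRT MatrixOperation flags
    # instead of materializing shuffle/transpose layers.
    op_a = trt.MatrixOperation.TRANSPOSE if transpose_a else trt.MatrixOperation.NONE
    op_b = trt.MatrixOperation.TRANSPOSE if transpose_b else trt.MatrixOperation.NONE
    layer = network.add_matrix_multiply(a, op_a, b, op_b)
    return layer.get_output(0)
```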