
Broadcast the two input shapes for transposed matmul #1413

Closed

Conversation


@FelixXidddd commented Oct 19, 2022

Description

There is a transposed matmul layer (from a meta_benchmark case) with two inputs, src0 and src1 (a constant).
Error message:

src[0] (Unnamed Layer* 1286) [Shuffle]_output,nbExtraDims 1, [2048,752,200], op TRANSPOSE; src[1] (Unnamed Layer* 1287) [Constant]_output, [752,60], op NONE; 
[10/19/2022-01:00:22] [TRT] [E] 4: [optimizer/shapeof/graphShapeAnalyzer.cpp::analyzeShapes::1877] Error Code 4: Miscellaneous (IMatrixMultiplyLayer [MATRIX_MULTIPLY]-[acc_ops.trt_transposed_matmul]-[trt_transposed_matmul]: broadcast dimensions must be conformable)
getDimensions failed xx: (Unnamed Layer* 1288) [Matrix Multiply]_output

As the error message shows, src0 is [2048,752,200] and src1 is [752,60], which are not broadcast conformable. In eager mode, torch unsqueezes src1 from [752,60] to [1,752,60] before the matmul, and the converter needs to do the same.
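A minimal eager-mode check of these shapes (sizes taken from the error message above; the variable names are ours) shows that PyTorch performs this broadcast implicitly:

```python
import torch

# Shapes from the error message: src0 is [2048, 752, 200] before the
# transpose, src1 is a rank-2 constant [752, 60].
src0 = torch.randn(2048, 752, 200).transpose(1, 2)  # -> [2048, 200, 752]
src1 = torch.randn(752, 60)

# torch.matmul treats the rank-2 src1 as if it were [1, 752, 60],
# so the eager op succeeds even though the ranks differ.
out = torch.matmul(src0, src1)
print(out.shape)  # torch.Size([2048, 200, 60])
```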

Fixes # (issue)
So we need to broadcast the inputs of the transposed matmul layer, as sketched below.
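A minimal sketch of the rank-alignment step, assuming the TensorRT Python API and static shapes; `prepend_ones` and `broadcast_matmul_inputs` are hypothetical helper names, not necessarily what acc_ops_converters.py actually uses:

```python
import tensorrt as trt

def prepend_ones(network: trt.INetworkDefinition, tensor: trt.ITensor,
                 num: int, name: str) -> trt.ITensor:
    """Unsqueeze `tensor` by prepending `num` size-1 dimensions
    (assumes static shapes)."""
    layer = network.add_shuffle(tensor)
    layer.reshape_dims = (1,) * num + tuple(tensor.shape)
    layer.name = name
    return layer.get_output(0)

def broadcast_matmul_inputs(network, lhs, rhs, name):
    """Pad the lower-rank operand so both inputs have equal rank,
    e.g. [752, 60] -> [1, 752, 60] next to a [2048, 752, 200] operand."""
    diff = len(lhs.shape) - len(rhs.shape)
    if diff > 0:
        rhs = prepend_ones(network, rhs, diff, f"{name}_broadcast_rhs")
    elif diff < 0:
        lhs = prepend_ones(network, lhs, -diff, f"{name}_broadcast_lhs")
    return lhs, rhs
```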

Type of change

Please delete options that are not relevant and/or add your own.

  • Bug fix (non-breaking change which fixes an issue)

Checklist:

  • My code follows the style guidelines of this project (You can use the linters)
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas and hacks
  • I have made corresponding changes to the documentation
  • I have added tests to verify my fix or my feature
  • New and existing unit tests pass locally with my changes
  • I have added the relevant labels to my PR so that relevant reviewers are notified

@nvpohanh (Contributor)

@ncomly-nvidia Could you let us know what we need to do to have this reviewed and merged? Thanks

@ncomly-nvidia (Contributor)

@narendasan @frank-wei can you please review or assign?

@yinghai left a comment

Thank you for the contribution. Could you add a unit test?

@FelixXidddd (Author)

@yinghai Could you point out where I should add the unit test? That would save me a lot of time.

@facebook-github-bot (Contributor)

Hi @FelixXidddd!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g. your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@meta.com. Thanks!

@FelixXidddd (Author)

The corresponding test has been added. Please help review.
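For context, here is a sketch of the kind of module such a test could exercise (shapes taken from the error message; the class name is hypothetical and this is not necessarily the case added to test_fuse_permute_matmul_trt.py):

```python
import torch

class TransposeMatmulBroadcast(torch.nn.Module):
    """Transposed matmul against a rank-2 constant, forcing the converter
    to broadcast [752, 60] up to [1, 752, 60]."""

    def __init__(self):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.randn(752, 60))

    def forward(self, x):
        # x: [batch, 752, seq] -> transpose -> [batch, seq, 752]
        return torch.matmul(x.transpose(1, 2), self.weight)
```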

@FelixXidddd (Author)

@narendasan Should I sign up at https://code.facebook.com/cla? Shouldn't we (NVIDIA employees) have signed already?

@frank-wei (Contributor)

LGTM! Please sign up for it, since every contributor needs to do so.

@ncomly-nvidia (Contributor)

> @narendasan Should I sign up at https://code.facebook.com/cla? Shouldn't we (NVIDIA employees) have signed already?

@FelixXidddd you need to sign the corporate one. @narendasan can you please send the info over?

Files changed:

	modified:   py/torch_tensorrt/fx/converters/acc_ops_converters.py
	modified:   py/torch_tensorrt/fx/test/passes/test_fuse_permute_matmul_trt.py
@nvpohanh (Contributor)

Let's use #1457 instead and close this. Thanks
