
Conversation

@zjgarvey
Collaborator

The newer direct lowering for backward convolution accumulates directly in lower-precision types like bf16. This patch adds a check for the default accumulator type; if that type doesn't match the result types for the op, the lowering also introduces a downcasting elementwise op (post-convolution and pre-collapsing for groups).

Signed-off-by: zjgarvey <zjgarvey@gmail.com>
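
To illustrate the idea, here is a minimal sketch (not the exact IR produced by the lowering) of the post-convolution downcast, assuming an f32 accumulator for the bf16 grouped backward-weight result used in the test below; shapes, names, and op choices are illustrative only:

// Assumed sketch: the grouped backward-weight convolution has already been
// accumulated into %conv_f32; truncate to bf16, then collapse the group dim.
#map = affine_map<(d0, d1, d2, d3, d4) -> (d0, d1, d2, d3, d4)>
func.func @downcast_after_grouped_conv(%conv_f32: tensor<4x4x32x2x2xf32>) -> tensor<16x32x2x2xbf16> {
  %init = tensor.empty() : tensor<4x4x32x2x2xbf16>
  // Elementwise downcast from the f32 accumulator to the bf16 result type.
  %down = linalg.generic
      {indexing_maps = [#map, #map],
       iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel"]}
      ins(%conv_f32 : tensor<4x4x32x2x2xf32>)
      outs(%init : tensor<4x4x32x2x2xbf16>) {
  ^bb0(%in: f32, %out: bf16):
    %t = arith.truncf %in : f32 to bf16
    linalg.yield %t : bf16
  } -> tensor<4x4x32x2x2xbf16>
  // Collapse the (groups, out_channels/groups) dims back to the weight shape.
  %weight_grad = tensor.collapse_shape %down [[0, 1], [2], [3], [4]]
      : tensor<4x4x32x2x2xbf16> into tensor<16x32x2x2xbf16>
  return %weight_grad : tensor<16x32x2x2xbf16>
}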
@zjgarvey requested review from IanWood1 and a-sidorova and removed request for a-sidorova on December 15, 2025 21:31
// CHECK-LABEL: func.func @convolution_backward_weights_2x2s_2x2p_2x2d_4g_bf16(
// CHECK-SAME: %[[VAL_0:.*]]: !torch.vtensor<[2,16,33,33],bf16>, %[[VAL_1:.*]]: !torch.vtensor<[2,128,64,64],bf16>,
// CHECK-SAME: %[[VAL_2:.*]]: !torch.vtensor<[16,32,2,2],bf16>) -> (!torch.vtensor<[16,32,2,2],bf16>, !torch.vtensor<[16],bf16>) {
func.func @convolution_backward_weights_2x2s_2x2p_2x2d_4g_bf16(%arg0: !torch.vtensor<[2,16,33,33],bf16>, %arg1: !torch.vtensor<[2,128,64,64],bf16>, %arg2: !torch.vtensor<[16,32,2,2],bf16>) -> (!torch.vtensor<[16,32,2,2],bf16>, !torch.vtensor<[16],bf16>) {
Contributor

Does this need an e2e test? Will one of the tests from 3cebce2 error out if changed to bf16?

Collaborator Author

The current e2e tests use the decomposition for this op, so they won't encounter this logic anyway. Maybe there is a case where the decomposition fails, in which case we would be able to test this e2e. In any case, I'll try it out locally and see.

Collaborator Author

Yeah, I tried locally, and we don't even support bfloat16 in the e2e tests.

Hacking a few things through to enable testing bf16, all of the tests report numerics mismatches against PyTorch for this dtype, whether the decomposition is enabled or not; but I at least get one fewer mismatched individual tensor with these changes than without them (when locally disabling the decomposition).

I wouldn't expect much else, honestly. Testing lower-precision dtypes through our ref-backend against PyTorch's CPU implementation seems a bit hyper-specific. IIRC PyTorch CPU often accumulates in float64 instead of float32, but I'd have to double-check that. If the device ends up being important for choosing the accumulator dtype, we can try to push a device-info attribute through to inform accumulator type selection in the future.

Signed-off-by: zjgarvey <zjgarvey@gmail.com>
Collaborator

@a-sidorova left a comment

@zjgarvey thank you for the quick fix!

Comment on lines +480 to +481
%3 = torch.prim.ListConstruct %false, %true, %true : (!torch.bool, !torch.bool, !torch.bool) -> !torch.list<bool>
%result0, %result1, %result2 = torch.aten.convolution_backward %arg0, %arg1, %arg2, %0, %1, %1, %1, %false, %2, %int4, %3 : !torch.vtensor<[2,16,33,33],bf16>, !torch.vtensor<[2,128,64,64],bf16>, !torch.vtensor<[16,32,2,2],bf16>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int, !torch.list<bool> -> !torch.none, !torch.vtensor<[16,32,2,2],bf16>, !torch.vtensor<[16],bf16>
Collaborator

nit: for better test coverage we could set [true, true, true] here to validate the calculation of the input gradient as well. But I don't insist.
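
For reference, a minimal sketch of what that variant of the test might look like; the grad-input result type below is an assumption (inferred from the input shape), not something taken from the patch:

%3 = torch.prim.ListConstruct %true, %true, %true : (!torch.bool, !torch.bool, !torch.bool) -> !torch.list<bool>
%result0, %result1, %result2 = torch.aten.convolution_backward %arg0, %arg1, %arg2, %0, %1, %1, %1, %false, %2, %int4, %3 : !torch.vtensor<[2,16,33,33],bf16>, !torch.vtensor<[2,128,64,64],bf16>, !torch.vtensor<[16,32,2,2],bf16>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int, !torch.list<bool> -> !torch.vtensor<[2,128,64,64],bf16>, !torch.vtensor<[16,32,2,2],bf16>, !torch.vtensor<[16],bf16>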
