[TORCH] Transformer encoder decomposition #4381
base: main
Conversation
2e95de5 to 0de7090
- Add a dedicated DecomposeTransformerEncoder pass to expand encoder ops into primitive Torch patterns.
- Extend shared lowering helpers (ReduceOpVariants.cpp, Utils.h) so the new pass can reuse reduction utilities during decomposition.
- Register the pass in the Torch Transform pipeline so it runs as part of the decomposition flow.
- Expand e2e coverage with new transformer encoder tests to validate the lowering path.

Signed-off-by: Cathal Corbett <cathal.corbett@arm.com>
Change-Id: I6bcda53569cf7b06df4cb97c624bbf512d8fecb7
0de7090 to e446682
Lallapallooza left a comment
Thanks for the patch! I left comments on a few correctness/pipeline-contract issues and some nits/cleanup.
    RewritePatternSet &patterns, const llvm::StringSet<> &legalOpsSet) {
  MLIRContext *context = patterns.getContext();
  DecomposeAtenTransformerEncoderLayerFwd pattern(context);
  auto opName = pattern.getRootKind();
populateTransformerEncoderPatterns(patterns, legalOpsSet) must respect the same "legal ops" contract as the other patterns in this pass. As-is, the transformer rewrite is a torch.operator pattern (the semantic op name lives in an attribute), so legality gating must be done by inspecting the operator name attribute, not by getRootKind().
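A rough sketch of that gating, assuming the legal-ops entries are spelled without the torch. prefix and that the intended contract is "skip the pattern when the op is listed as legal" (the operator-name string below is illustrative, not taken from the patch):

void populateTransformerEncoderPatterns(RewritePatternSet &patterns,
                                        const llvm::StringSet<> &legalOpsSet) {
  MLIRContext *context = patterns.getContext();
  // The rewrite targets torch.operator; the semantic name lives in the op's
  // "name" attribute, so gate on that string rather than on getRootKind().
  constexpr llvm::StringLiteral kEncoderOpName =
      "aten._transformer_encoder_layer_fwd"; // assumed spelling
  if (legalOpsSet.contains(kEncoderOpName))
    return; // the op was requested to stay legal, so don't decompose it
  patterns.add<DecomposeAtenTransformerEncoderLayerFwd>(context);
}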
#pragma once is missing
namespace Torch {

inline bool isTransformerEncoderOperatorName(llvm::StringRef name) {
  if (!name.consume_front("torch."))
Could we use kTorchOpPrefix here?
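Roughly what that would look like, assuming kTorchOpPrefix expands to the "torch." string the code currently spells out (the suffix matched below is illustrative):

inline bool isTransformerEncoderOperatorName(llvm::StringRef name) {
  if (!name.consume_front(kTorchOpPrefix)) // shared prefix constant
    return false;
  return name == "aten._transformer_encoder_layer_fwd"; // assumed suffix
}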
}

bool isSpecializedOperation(Torch::OperatorOp op) { return true; }
bool isSpecializedOperation(Torch::OperatorOp op) {
Previously, isSpecializedOperation effectively treated torch.operator as illegal unless explicitly handled. Now it returns true only for flash-attn and false for almost everything else, which risks letting unexpected torch.operator ops leak further into the pipeline.
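One possible reading of that concern, sketched only: keep the old conservative default and carve out just the operators this patch explicitly decomposes, reusing the isTransformerEncoderOperatorName helper quoted above and the generic "name" attribute accessor.

bool isSpecializedOperation(Torch::OperatorOp op) {
  llvm::StringRef name =
      op->getAttrOfType<StringAttr>("name").getValue();
  // Only operators this patch knows how to decompose opt out of being
  // treated as specialized.
  if (isTransformerEncoderOperatorName(name))
    return false;
  // Everything else keeps the previous answer (true), so unknown
  // torch.operator ops are still caught instead of leaking down the pipeline.
  return true;
}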
Utils.h is an extremely generic filename and easy to confuse with torch-mlir/Dialect/Torch/Utils/Utils.h. Please rename it to something domain-specific (e.g. TransformerEncoderUtils.h).
| "StdCorrectionLargeInputModule_basic", | ||
| "TupleModule_basic", | ||
| "ThresholdStaticModule_basic", | ||
| "TransformerEncoderModule_basic", |
Could we add a reason why it fails?
| "TrilIndicesNegativeOffsetModule_basic", | ||
| "TriuIndicesAllZerosModule_basic", | ||
| "TriuIndicesModule_basic", | ||
| "TransformerEncoderModule_basic", |
Could we add a reason why it fails?
void populateRestructureNonConstantAxesPattern(RewritePatternSet &patterns,
                                               MLIRContext *context);

void populateTransformerEncoderPatterns(RewritePatternSet &patterns,
populateTransformerEncoderPatterns is being added to the public passes header, which is consistent with the other populate*Pattern(s) declarations there, but please confirm this is intended API surface (vs. a private helper), since it's tightly coupled to DecomposeComplexOps.
This standalone Python script feels redundant and adds extra maintenance. I'd prefer deleting this file and relying on the lit test plus the pt1 e2e transformer coverage.
                          Value bias) -> FailureOr<Value> {
  auto inputTensorType = cast<ValueTensorType>(input.getType());
  Value normalizedShape = createIntList(rewriter, loc, {embedDim});
  Value cudnnEnable = createBoolConstant(rewriter, loc, true);
buildLayerNorm always sets cudnn_enable=true. Please confirm this matches the semantics of _transformer_encoder_layer_fwd for the CPU path; if the fused op can produce cudnn_enable=false, this rewrite could diverge.
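A minimal sketch of the alternative, parameterizing the flag instead of hard-coding it (createIntList/createBoolConstant come from the snippet above; createFloatConstant, layerNormEps, and the extra enableCudnn parameter are hypothetical):

auto buildLayerNorm = [&](Value input, Value weight, Value bias,
                          bool enableCudnn) -> FailureOr<Value> {
  auto inputTensorType = cast<ValueTensorType>(input.getType());
  Value normalizedShape = createIntList(rewriter, loc, {embedDim});
  Value cudnnEnable = createBoolConstant(rewriter, loc, enableCudnn);
  Value eps = createFloatConstant(rewriter, loc, layerNormEps);
  // aten.layer_norm takes cudnn_enable as its last operand, so the caller can
  // now mirror whatever the fused op's CPU-path semantics require.
  return rewriter
      .create<AtenLayerNormOp>(loc, inputTensorType, input, normalizedShape,
                               weight, bias, eps, cudnnEnable)
      .getResult();
};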