torch-mlir

Commit Graph

Author	SHA1	Message	Date
Aart Bik	1f73895f93	[torch-mlir] bump to llvm/llvm-project@9b78ddf3b2 (#3491 ) This bump triggered an upstream assert. Includes a WAR for #3506. Also includes several things I needed to do to repro: * When TORCH_MLIR_TEST_CONCURRENCY=1, test runs will be printed. * Added TORCH_MLIR_TEST_VERBOSE=1 handling to enable verbose mode (useful on CI). --------- Co-authored-by: Stella Laurenzo <stellaraccident@gmail.com>	2024-06-27 19:28:02 -07:00
zjgarvey	d2bc70f188	[TorchToLinalg][ONNX] Add Basic Determinant Support (#3481 ) This adds support for a few ops: - torch.linalg_det - torch._linalg_det (if the LU and pivot returns are unused) - onnx.Det An scf loop is used, since the row reduction algorithm applied here has some loop-carried dependencies. The current support being added here is very basic, and only works if no permutations are required during row reduction, and assumes the matrices are non-singular.	2024-06-25 13:34:19 -05:00
zjgarvey	368fabf0c1	[ONNX] Basic Support for DeformConv (#3469 ) This adds a torchvision op to torch-mlir and a path from onnx.DeformConv to torchvision.deform_conv2d. I'm not implementing the torch->linalg lowering for the torchvision op yet, but posting this PR to get feedback on some of the choices being made here and to flesh out the onnx frontend a bit.	2024-06-25 12:16:51 -05:00
zjgarvey	e346c911f7	[ONNX] Add basic support for RoiAlign (#3493 ) This adds an onnx->torch conversion for onnx.RoiAlign into torchvision.roi_align or torchvision.roi_pool, and adds those two torchvision ops to torch-mlir.	2024-06-25 11:02:45 -05:00
Vinayak Dev	02340408b7	[torch] Add OnnxToTorch lowering for Onnx.STFT op (#3492 ) Adds OnnxToTorch lowering for `Onnx.STFT` op.	2024-06-25 19:00:45 +05:30
Branko Trifkovic	98c6971a01	Implement lowering of torch.aten.triu_indices (#3451 ) Closes [nod-ai/SHARK-Turbine/issues/709](https://github.com/nod-ai/SHARK-Turbine/issues/709) --------- Co-authored-by: Branko Trifkovic <branko.trifkovic@syrmia.com>	2024-06-21 16:16:38 -07:00
Matthias Gehre	acd57a3520	Support fake_quantize_per_tensor_affine_cachemask (#3477 ) Add a new op with shape/dtypes and decompose into `fake_quantize_per_tensor_affine` when the second result is unused. The xfail_set change is on ONNX because torch cannot export this op to ONNX.	2024-06-21 07:15:31 +00:00
zjgarvey	694210f429	[TorchToLinalg] Fix Quantized Convolution Accumulator Type (#3459 ) 1. truncates zero-points to i32 2. modifies the default accumulator type for i8 from i64 to i32. 3. now uses the input dtype to infer accumulator dtype.	2024-06-20 13:54:20 -07:00
Xinyu Yang	c7d52f63b4	[stablehlo] add aten::_int_mm lowering (#3474 ) as title	2024-06-20 16:10:31 +08:00
Branko Trifkovic	676fa8cc09	Implement lowering of torch.aten.renorm (#3388 ) Closes [nod-ai/SHARK-Turbine/issues/689](https://github.com/nod-ai/SHARK-Turbine/issues/689) --------- Co-authored-by: Branko Trifkovic <branko.trifkovic@syrmia.com>	2024-06-17 10:40:57 -07:00
ptrifunovic98	4555629246	Implement lowering of torch.aten.kthvalue (#3360 ) Closes [nod-ai/SHARK-Turbine#620](https://github.com/nod-ai/SHARK-Turbine/issues/620)	2024-06-15 11:18:39 +05:30
Arham Khan	09c988046c	[ONNX] Add OnnxToTorch lowering for Onnx.NegativeLogLikelihoodLoss Op (#3380 ) This implements the Onnx.NegativeLogLikelihoodLoss op using the signature provided [here](https://onnx.ai/onnx/operators/onnx__NegativeLogLikelihoodLoss.html) by replacing it with a `NLLLossForward` op. Additionally, I included a helper function `get_loss_reduction_enum` to convert from a string `reduction` parameter to the corresponding intended integer value since this is an operation that will be reused for any loss function module. This differs from `get_reduction_enum` in `TorchUpstream.cpp` which handles the `reduce` parameter from `scatter_reduce` type operations.	2024-06-14 22:01:11 +05:30
Xinyu Yang	6f94c7b0aa	[Torch] Add support for Meshgrid (#3462 )	2024-06-14 23:59:08 +08:00
Vinayak Dev	39d882f7c9	[torch] Add OnnxToTorch lowering for the Col2Im op (#3424 ) Adds OnnxToTorch lowering for the `onnx.Col2Im` op.	2024-06-13 08:42:06 +00:00
Lei Zhang	77d7f64472	Update to llvm/llvm-proect@27ac46e6be (2024-6-12) (#3454 ) This would require to bump stablehlo at the same time.	2024-06-12 19:34:01 -07:00
zjgarvey	de28c8540b	[ONNX] add int16 quantization support (#3446 ) There is currently no int16 quantization support in torch. This patch adds a new mlir type to correspond to the missing "torch.qint16" type, and enables lowering of quantization-related onnx ops using int16 types. In follow-up patches, custom quantization logic for ops like aten.matmul/aten.mm/aten.convolution may need to be revisited to allow support for qint16. The passes in FuseQuantizedOps.cpp may also need slight modifications.	2024-06-12 10:37:22 +05:30
Yuanqiang Liu	689efc8917	[Torch] fix toBuiltinTensor() (#3415 ) * Let `toBuiltinTensor()` reflects the original dtype of `!torch.vtensor`. * Backend handles dtype conversion themselves.	2024-06-08 09:36:32 +08:00
Rob Suderman	75af64fc12	[torch] Add support for f8 types for linalg conversion (#3436 ) Linalg conversion requires mapping for f8 types	2024-06-07 13:59:38 -07:00
Sambhav Jain	d0a818a03e	Representing Symbolic Shape Expressions in Torch Dialect (#3372 ) Torch Dialect with symbolic shape expressions: ```ll module { func.func @main(%arg0: !torch.vtensor<[?,?,3],f32>, %arg1: !torch.vtensor<[?,?,3],f32>) -> !torch.vtensor<[?,?,3],f32> { %0 = torch.symbolic_int "s0" {min_val = 5, max_val = 10} : !torch.int %1 = torch.symbolic_int "s1" {min_val = 0, max_val = 100} : !torch.int %2 = torch.symbolic_int "s3" {min_val = 0, max_val = 50} : !torch.int torch.bind_symbolic_shape %arg0, [%0, %1], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %arg1, [%0, %2], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> %3 = torch.aten.tanh %arg0 : !torch.vtensor<[?,?,3],f32> -> !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %3, [%0, %1], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> %4 = torch.aten.sigmoid %arg1 : !torch.vtensor<[?,?,3],f32> -> !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %4, [%0, %2], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> %5 = torch.prim.ListConstruct %3, %3, %4 : (!torch.vtensor<[?,?,3],f32>, !torch.vtensor<[?,?,3],f32>, !torch.vtensor<[?,?,3],f32>) -> !torch.list<vtensor> %int1 = torch.constant.int 1 %6 = torch.aten.cat %5, %int1 : !torch.list<vtensor>, !torch.int -> !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %6, [%0, %1, %2], #affine_map<()[s0, s1, s2] -> (s0, s1 * 2 + s2, 3)> : !torch.vtensor<[?,?,3],f32> return %6 : !torch.vtensor<[?,?,3],f32> } } ``` For reference, this is the TorchDynamo exported program with symbolic shape expressions that the above Torch dialect program is imported from: ```py ExportedProgram: class GraphModule(torch.nn.Module): def forward(self, x: "f32[s0, s1, 3]", y: "f32[s0, s3, 3]"): # File: /home/sambhav.jain/workspaces/cruise/src/3p/torch-mlir/test/python/fx_importer/symbolic_shape_expr_test.py:31 in forward, code: a = torch.tanh(x) tanh: "f32[s0, s1, 3]" = torch.ops.aten.tanh.default(x); x = None # File: /home/sambhav.jain/workspaces/cruise/src/3p/torch-mlir/test/python/fx_importer/symbolic_shape_expr_test.py:32 in forward, code: b = torch.sigmoid(y) sigmoid: "f32[s0, s3, 3]" = torch.ops.aten.sigmoid.default(y); y = None # File: /home/sambhav.jain/workspaces/cruise/src/3p/torch-mlir/test/python/fx_importer/symbolic_shape_expr_test.py:33 in forward, code: return torch.cat((a, a, b), dim=1) cat: "f32[s0, 2*s1 + s3, 3]" = torch.ops.aten.cat.default([tanh, tanh, sigmoid], 1); tanh = sigmoid = None return (cat,) Graph signature: ExportGraphSignature(input_specs=[InputSpec(kind=<InputKind.USER_INPUT: 1>, arg=TensorArgument(name='x'), target=None, persistent=None), InputSpec(kind=<InputKind.USER_INPUT: 1>, arg=TensorArgument(name='y'), target=None, persistent=None)], output_specs=[OutputSpec(kind=<OutputKind.USER_OUTPUT: 1>, arg=TensorArgument(name='cat'), target=None)]) Range constraints: {s0: ValueRanges(lower=5, upper=10, is_bool=False), s1: ValueRanges(lower=0, upper=100, is_bool=False), s3: ValueRanges(lower=0, upper=50, is_bool=False)} ``` Huge credit to @stellaraccident for the inputs that helped evaluate the various design options and arrive at the representation of choice. - [x] Op definitions for symbolic_int and bind_symbolic_shape ops - [x] fx_importer updates to import range constraints + create symbolic_int ops - [x] fx_importer changes for AffineMapAttr building + adding bind_symbolic_shape ops - [x] custom printer/parser for inlined AffineMap expressions in mlir assembly - [x] Dialect lit test - [x] fx_importer python lit tests - [ ] Cleanup pass to remove these ops (can add in a follow-on)	2024-06-07 04:04:03 -07:00
Vivek Khandelwal	72837fbb3d	build: manually update PyTorch version (#3340 ) Set PyTorch and TorchVision version to nightly release 2024-05-14. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-06 22:23:40 +05:30
Vivek Khandelwal	661be2d5b0	[MLIR][Torch] Add TorchToLinalg lowering for AtenAvgPool3dOp (#3030 ) This commit also fixes the average pool op' test failing for OnnxToLinalg lowering. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-04 22:12:34 +05:30
zjgarvey	8995c90879	[TorchToLinalg] add support for quantized group conv (#3341 ) This addresses 7 of the model failures I'm seeing in the test suite. See [Shark-Turbine issue #566](https://github.com/nod-ai/SHARK-Turbine/issues/566). Need the op ```linalg.conv_2d_ngchw_gfchw_q``` to be added upstream before merging this. See [llvm-project PR #92136 ](https://github.com/llvm/llvm-project/pull/92136). A small additional expansion to operand quantization is included in this patch to address a model failure that occurs when unblocking the quantized group convolutions in one of these onnx models.	2024-06-03 21:57:44 +05:30
Vivek Khandelwal	6382dbbcc0	[ONNX] Add OnnxToTorch lowering for SpaceToDepth op (#3393 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-03 20:29:39 +05:30
Xinyu Yang	285b087a5d	[Torch] Emit rrelu and decompose it (#3250 ) as title	2024-06-03 19:25:52 +08:00
Xinyu Yang	267052df2a	[Torch] decompose AtenLerpTensorOp (#3251 ) as title	2024-06-03 15:25:09 +08:00
Xinyu Yang	23b53050de	[Torch]Support conv_transpose1d and conv_transpose3d (#3286 ) 1. Support conv_transpose1d and conv_transpose3d 2. Fix bugs of convertTransposedConv func in lib/Conversion/TorchToStablehlo/Linear.cpp	2024-06-03 15:11:12 +08:00
Rob Suderman	afca88a058	[NFC] Change to cast instead of .cast variants (#3405 ) Member casts have been deprecated. Changing over a bunch of the member cast calls to the global templated variants to remove deprecation warnings.	2024-05-30 23:45:13 -07:00
Yuanqiang Liu	4e05e2cd1e	[Torch] support recompose of aten.split.with_sizes and aten.tensor_sp… (#3401 ) …lit.sections * support recompose to aten.split.with_sizes and aten.tensor_split.sections * fix recompose of aten.chunk	2024-05-31 09:56:47 +08:00
zjgarvey	074098d20c	Modifies onnx resize lowering to fix numerical issues (#3381 ) Updates: - some unsupported modes are now going to report a match failure for unsupported coordinate transformation modes. - fixes a bug that was introduced in the last patch for resize (my bad...) - uses actual x and y coordinates for computing weights in bilinear interpolation (rather than eps modified values) - slightly simplifies the bilinear interpolation payload for readability and performance - passes coordinate transformation mode information from an onnx.Resize op to the mode string for the aten._interpolate op. This allows us to perform custom logic in the torch->linalg lowering to support onnx.Resize options without losing the default behaviors of the interpolate op.	2024-05-30 20:34:37 -04:00
penguin_wwy	1f544c37d0	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
Yuanqiang Liu	e0a5adb1db	[Torch] fix aten.linear's decomposition (#3391 ) * support aten.linear with more rank.	2024-05-27 15:49:50 +08:00
Yuanqiang Liu	5bb1a65ec9	[Stablehlo] refactor reduction lowering and support aten.amin (#3383 ) * implement detailed lowering template pattern `ConvertAtenReduceAllDimsOp` and `ConvertAtenReduceKeepDimOp` * support `aten.amin`'s lowering.	2024-05-23 20:40:20 +08:00
Angel Zhang	2e194e13d6	[Torch] Fix bugs for `Torch::AtenOneHotOp` (#3350 ) This PR fixes the bugs for `Torch::AtenOneHotOp` by: 1) Using `Torch::kUnknownSize` as the default value for `numClasses` in the pattern matching stage in `DecomposeAtenOneHotOp` 2) Adding `AtenIntScalarOp` to the patterns in `TorchToArith` 3) Handling both `int` and `float` types for `off` and `on` values in `TorchOnnxToTorch` conversion It also includes: 1) A new test in `TorchToArith/basic.mlir`, for `torch.aten.Int.Scalar`, and 2) A new test in `decompose-complex-ops.mlir`, for `torch.aten.one_hot` Dependencies This PR is dependent on #3334.	2024-05-22 17:19:08 +00:00
Xinyu Yang	4d7cdba4bf	[Torch] eliminate "getWithLeastStaticInformation" in DecomposeAtenTriuOp (#3330 ) I am trying to eliminate 'getWithLeastStaticInformation' in DecomposeAtenTriuOp. Could you provide me with some suggestions? @qingyunqu @zjgarvey See issue https://github.com/llvm/torch-mlir/issues/3312	2024-05-22 23:16:57 +08:00
Sambhav Jain	6e485574e5	[Pipeline] Use dedicated simplification pipeline for TorchDynamo frontend (#3376 ) Discord Thread: https://discord.com/channels/636084430946959380/1238330633328005243 ## Context: [This](https://github.com/llvm/torch-mlir/blob/main/python/torch_mlir/fx.py#L61) was updated to support e2e tests for the TorchDynamo frontend in Torch-MLIR, where we run FX decompositions and import the FX IR to generate Torch dialect, followed by `torch-function-to-torch-backend-pipeline`, skipping only the shape/type refinement for now. However, we should be able to skip many of the torch simplification passes, as depicted in the [frontend roadmap](https://github.com/llvm/torch-mlir/blob/main/docs/images/roadmap_frontend.png). Based on IREE's TorchDynamo [pipeline](https://github.com/iree-org/iree/blob/main/compiler/plugins/input/Torch/InputConversion/Passes.cpp#L29), the only two passes we seem to require are: `ReduceOpVariantsPass` and `DecomposeComplexOpsPass`. This is inline with our findings as well based on initial exploration. This PR creates a dedicated frontend simplification pipeline for TorchDynamo / FX Importer which calls only `ReduceOpVariantsPass` and `DecomposeComplexOpsPass`. We rely on the e2e fx_importer tests to ensure we're not regressing by removing many of the passes that were historically needed for TorchScript. One notable change here is that we do not call the `LowerToBackendContractPass` anymore, which used to call `TorchSimplificationPipeline` iteratively until VerifyBackendContract was clean. Some of this was required for the shape/type refinement to converge, which seems a non-issue for Dynamo frontend. Do we anticipate this (the iterative invocation of TorchSimplificationPipeline followed by VerifyBackendContract) to be worth retaining in the Dynamo frontend pipeline? If so, I can make those changes, PLMK.	2024-05-22 05:23:18 -07:00
Yuanqiang Liu	8814d0ae64	[Torch] emit aten.dot and canonicalize it to aten.matmul (#3361 ) * canonicalize `aten.dot` to `aten.matmul`	2024-05-18 22:45:14 +08:00
Xinyu Yang	7faba75696	[Torch] Decompose AtenMaskedScatterOp (#3353 ) Co-authored-by: Yuanqiang Liu <liuyuanqiang.yqliu@bytedance.com>	2024-05-16 15:27:25 +08:00
Xinyu Yang	a9edefb3cf	[Torch] Fix AtenSliceTensorOp::fold (#3345 )	2024-05-16 11:42:43 +08:00
Peiming Liu	ccb772cd0f	[sparse] propagate sparsity properly when decompose torch operations. (#3318 )	2024-05-15 10:09:27 -07:00
Aaron St George	ba32b9cee7	Don't fold `aten.clone` if result isn't same type as input (#3347 ) Similar to https://github.com/llvm/torch-mlir/pull/2824, we were seeing some assertion failures after the addition checks around folders were tightened up in LLVM: https://github.com/llvm/llvm-project/pull/75887 . This PR essentially moves the logic that used to be applied at the LLVM level into the folder, which seems to be the suggested fix.	2024-05-16 00:07:45 +08:00
Xinyu Yang	6b95dd461d	[Torch] Fix PrimNumToTensorScalarOp::fold (#3339 ) In constant folding progress, a new constant op will be created according to the origin op's result type. See the code in TorchDialect.cpp. ```cpp Operation TorchDialect::materializeConstant(OpBuilder &builder, Attribute value, Type type, Location loc) { if (auto integerType = dyn_cast<Torch::IntType>(type)) return builder.create<Torch::ConstantIntOp>(loc, cast<IntegerAttr>(value)); if (auto floatType = dyn_cast<Torch::FloatType>(type)) return builder.create<Torch::ConstantFloatOp>(loc, cast<FloatAttr>(value)); if (auto numberType = dyn_cast<Torch::NumberType>(type)) { if (auto floatValue = dyn_cast<mlir::FloatAttr>(value)) { return builder.create<Torch::ConstantNumberOp>(loc, floatValue); } else if (auto intValue = dyn_cast<mlir::IntegerAttr>(value)) { return builder.create<Torch::ConstantNumberOp>(loc, intValue); } } if (isa<Torch::BoolType>(type)) { return builder.create<Torch::ConstantBoolOp>(loc, cast<IntegerAttr>(value)); } if (isa<Torch::NoneType>(type)) return builder.create<ConstantNoneOp>(loc); if (auto stringAttr = dyn_cast<StringAttr>(value)) return builder.create<ConstantStrOp>(loc, stringAttr); if (auto elementsAttr = dyn_cast<ElementsAttr>(value)) { // Only !torch.vtensor can be constant folded. !torch.tensor has // non-trivial aliasing semantics which prevent deduplicating it. assert(isa<ValueTensorType>(type) && "should be a vtensor type!"); return builder.create<ValueTensorLiteralOp>(loc, elementsAttr); } return nullptr; } ``` So when the op has a tensor result type, it must be "ValueTensorType" due to the assert* statement. However, many fold methods in TorchOps.cpp only have a judgment of "BaseTensorType".	2024-05-15 20:54:19 +08:00
zjgarvey	911e723581	Expands Q Commuting Ops (#3332 ) After running the model tests in SHARK-TestSuite, I noticed a few model failures due to half-fusion. Notably, RDN_pytorch_vaiq_int8 had a depth=5 convolution chain with multiple AtenViewOp's.	2024-05-13 11:01:53 -07:00
zjgarvey	75d1d72059	Generalize Operand Quantization in FuseQuantizeOps (#3327 ) This change enables more customization with operand quantization, and generalizes the patterns QuantizeOperands and QuantizeTransposeOperands to QuantizeOperandsPastCommutingOps. This allows for passing quantization through operations which are functionally unaffected by quantization, such as view-like ops. The purpose of this change is to address a myriad of quantization issues seen in quantized onnx models that have some reshape-like operations sandwiched in between a dequant and something like a matmul (whose other operand is immediately quantizable).	2024-05-12 20:49:59 -07:00
NeverRaR	1d4859699b	MaxPool1d lowering to linalg (#3295 ) Co-authored-by: root <root@i32b01216.sqa.eu95>	2024-05-10 22:05:26 +05:30
penguin_wwy	64b59c7fc3	[FxImporter] Eliminate the dependency on the refinement pass (#3309 )	2024-05-10 02:44:36 +08:00
aldesilv	ec6d7aa5d2	OnnxToTorch lowering resize op (#3013 ) https://github.com/nod-ai/SHARK-Turbine/issues/358 adds a lowering from onnx to linalg for bilinear and nearest resize with support for using scales or sizes to get resize shape. uses coordinate transform half pixel for bilinear mode and asymmetrical for nearest mode. See https://github.com/onnx/onnx/blob/main/docs/Operators.md#Resize. Added two passes -- one for bilinear and the other for nearest.	2024-05-08 21:35:03 +00:00
Jiawei Wu	346a536c9f	[Torch Dialect] decompose all index_put-like op to aten.index_put.hacked_twin for stricter semantics (#3071 ) This PR decomposes all index_put-like op to aten.index_put.hacked_twin for stricter semantics, i.e., no None index in indices argument.	2024-05-08 22:44:57 +08:00
Xinyu Yang	abef114c0c	[torch] emit aten.Softshrink and aten.Hardshrink (#3248 ) as title	2024-05-08 15:20:45 +08:00
Vivek Khandelwal	e60160d793	Revert "Decompose AtenNonzeroOp" (#3289 ) Reverts llvm/torch-mlir#3281	2024-05-06 09:52:04 -07:00
Xida Ren (Cedar)	1af00e6040	Decompose AtenNonzeroOp (#3281 ) This fixes some onnx lit tests not lowering to linalg in https://github.com/nod-ai/SHARK-Turbine/issues/450	2024-05-05 21:59:25 +08:00
Ze Zhang	11cd7cd9e7	Folder and Canonicalizer for PrimsConvertElementTypeOp and AtenMaxPool2dWithIndicesOp (#3272 ) While playing with TorchDynamo on ResNet18. I notice following issues: - `prims.convert_element_type` can’t be canonicalized even if the input and the output share the same type - `aten.max_pool2d_with_indices` is always used instead of `aten.max_pool2d`, even if the second returned output (indices) has no user This PR fixes above issues by adding a folder to the PrimsConvertElementTypeOp and a canonicalizer to the AtenMaxPool2dWithIndicesOp Lit test: `cmake --build build --target check-torch-mlir-all` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-05-02 00:03:41 -07:00
Xida Ren (Cedar)	315dc6c3e3	[torch] `aten.eye` should use dynamic dims when no static dims are available (#3202 ) Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-04-30 17:41:03 +00:00
Xinyu Yang	f32ada993d	[Stablehlo] Improve the lowering of pool op in stablehlo (#3259 ) 1. Handle case stride == None 2. add avgpool3d maxpool1d maxpool3d lowering	2024-05-01 00:06:13 +08:00
Vivek Khandelwal	b1e2241479	[ONNX] Fix Onnx.Selu lowering and canonicalizer for IntImplicit op (#3221 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-29 04:00:01 +00:00
Yuanqiang Liu	aed2cf3351	[Torch] emit aten.__contains__.str_list and add folder (#3249 )	2024-04-29 10:51:17 +08:00
Xinyu Yang	5684dc0441	[Torch] emit aten.celu and decompose it (#3247 ) CELU(x)=max(0,x)+min(0,α∗(exp(x/α)−1))	2024-04-28 17:23:40 +08:00
Yuanqiang Liu	46c0f3cad0	[Torch] emit aten.log_sigmoid and decompose it to log(sigmoid) (#3246 )	2024-04-28 11:47:43 +08:00
Stella Laurenzo	5d4b803914	[NFC reformat] Run pre-commit on all files and format misc. This is part 1 of ~3, formatting all miscellaneous text files and CPP files matched by a first run of pre-commit. These tend to be low change-traffic and are likely not disruptive. Subsequent patches will format Python files and remaining CPP files.	2024-04-27 14:08:09 -07:00
penguin_wwy	6679728c56	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 ) Like #3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated	2024-04-27 14:00:56 -07:00
Yuanqiang Liu	f173a06fa7	[Torch] emit aten.ne.str and add folder (#3242 )	2024-04-28 00:58:50 +08:00
Yuanqiang Liu	634a796933	[Torch] fold aten.log (#3223 )	2024-04-26 10:10:02 +08:00
Yuanqiang Liu	b0ba3def93	[Torch] support AtenScalarImplicitOp canonicalize with float (#3231 )	2024-04-26 02:36:13 +08:00
Yuanqiang Liu	fab2696489	[Torch] support aten.trunc (#3219 ) decompose `trunc(x)` to `sign(x) * floor(abs(x))`	2024-04-24 14:32:33 +08:00
Xinyu Yang	4da3d714cc	[Torch] Support AtenProdOp on linalg and stablehlo (#3215 )	2024-04-24 11:14:04 +08:00
zjgarvey	a8ba865fca	[torch] Adds Quantization Support for `aten.relu` (#3177 ) A choice was made to quantize the return type of Relu with a scale and zero point copied from the input's quantization scheme. With this choice, the torch-to-linalg conversion of quantized Relu essentially computes max(input, zeroPoint) in the elementwise payload.	2024-04-23 11:01:36 -07:00
Yuanqiang Liu	db3842f2e8	[Stablehlo] support lowering sinh & cosh to stablehlo (#3213 )	2024-04-23 19:54:58 +08:00
penguin_wwy	e5bdd71baf	[Torch] Emit and decompose prims.iota op (#3132 )	2024-04-21 19:45:01 -07:00
Xinyu Yang	790a697245	[Torch] Add folder for AtenIntOp, AtenFloatOp (#3189 ) See unit test below: ``` // CHECK-LABEL: func.func @torch.aten.tensor.float( // CHECK-NEXT: torch.vtensor.literal(dense<1.000000e+01> : tensor<f32>) : !torch.vtensor<[],f32> func.func @torch.aten.tensor.float() -> !torch.vtensor<[],f32> { %none = torch.constant.none %false = torch.constant.bool false %float1.000000e01 = torch.constant.float 1.000000e+01 %67 = torch.aten.tensor.float %float1.000000e01, %none, %none, %false : !torch.float, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],f32> return %67 : !torch.vtensor<[],f32> } // CHECK-LABEL: func.func @torch.aten.tensor.int( // CHECK-NEXT: torch.vtensor.literal(dense<45> : tensor<si32>) : !torch.vtensor<[],si32> func.func @torch.aten.tensor.int() -> !torch.vtensor<[],si32> { %none = torch.constant.none %false = torch.constant.bool false %int45 = torch.constant.int 45 %67 = torch.aten.tensor.int %int45, %none, %none, %false : !torch.int, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],si32> return %67 : !torch.vtensor<[],si32> } ```	2024-04-19 22:17:06 +08:00
Xinyu Yang	d4313eed4a	[Torch] Add decomposition of RepeatInterleaveSelfInt Op (#3075 ) Decomposition RepeatInterleaveSelfInt with following ops: ```python def my_repeat_interleave(input, repeats, dim=None): if dim is None: # Flatten the input and then repeat return input.flatten().unsqueeze(-1).tile((1, repeats)).flatten() else: # Calculate the shape after repeat expanded_shape = list(input.shape) expanded_shape[dim] = repeats # Repeat the tensor along the specified dimension repeat_shape = [1] (input.dim() + 1) repeat_shape[dim + 1] = repeats input = input.unsqueeze(-1) # Tile and then reshape tiled = torch.tile(input, repeat_shape) # Rearrange and reshape repeated = tiled.reshape(expanded_shape) return repeated ``` I passed the tests of stablehlo and linalg. When testing onnx, strange things happened. In torch-mlir's CI torch_nightly* and my own environment(torch==2.4.0.dev20240318+cpu), it can pass the pass. In torch-mlir's CI torch_stable, it failed. The test case is `RepeatInterleaveSelfIntNoDimModule_basic`, the result shape should be [120]. ```python class RepeatInterleaveSelfIntNoDimModule(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([3, 4, 5], torch.float32, True), ]) def forward(self, x): return x.repeat_interleave(2) @register_test_case(module_factory=lambda: RepeatInterleaveSelfIntNoDimModule()) def RepeatInterleaveSelfIntNoDimModule_basic(module, tu: TestUtils): module.forward(tu.rand(3, 4, 5)) ``` The error log is as follows: ``` Unexpected outcome summary: (onnx) ****** Failed tests - 1 tests FAIL - "RepeatInterleaveSelfIntNoDimModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([6, 4, 5])) is not equal to golden shape (torch.Size([120])) ``` @rsuderman Would you please help me check what's wrong with my PR? Thanks a lot.	2024-04-18 06:27:51 +08:00
Xinyu Yang	d2ba956e69	[Torch] Support Aten_CastLongOp. (#3160 ) By canonicalize Aten_CastLongOp into AtenToDtypeOp	2024-04-17 21:58:32 +08:00
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
IanWood1	5708ee7ec9	Added 2 Ops: Floor divide scalar and Floor divide scalar mode (#3156 ) - Added linalg lowering for `AtenFloorDivideScalarOp` - Needed `AtenDivScalarModeOp` for the decomp. - Added linalg lowering for `AtenDivScalarModeOp` - Moved linalg payload logic to `createDivModePayload()` since the logic was nearly identical for both `AtenDivScalarModeOp` and `AtenDivTensorModeOp`. Just a template function - Added `AtenDivScalarModeOp` lowering for stablehlo Pytorch's [`torch.floor_divide()`](https://pytorch.org/docs/stable/generated/torch.floor_divide.html) in a previous version (for a reason unknown to me) preformed a truncation instead of "floor". The already implemented op `AtenFloorDivideTensorOp` was done before this change. However, this wasn't caught because our testcases only tested positive floor division. I changed this to floor as well as adding a few test cases.	2024-04-15 13:45:10 -07:00
zjgarvey	197ef4224b	Avoid Type Mismatch in Slice Folder (#3154 ) Fixes issue #3153	2024-04-12 11:43:45 -07:00
penguin_wwy	d4a30b7e67	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3130 ) We should prefer functional style as the method style is deprecated https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated (https://mlir.llvm.org/deprecation/)	2024-04-11 06:47:35 -07:00
Xinyu Yang	308c45e61a	[Torch] Fix PrimListUnpackOp::getCanonicalizationPatterns (#3140 ) Fix the case PrimListUnpackOp's result num is not equal to PrimList length. See the following example: ```python def forward(self, x): if len(x.shape) == 5: b0, t, c0, h0, w0 = x.shape b, c, h, w = torch.mul(b0, t), c0, h0, w0 else: b1, c1, h1, w1 = x.shape b, c, h, w = b1, c1, h1, w1 res = torch.reshape(x, [b, c, h, w]) return res ``` Without this fix, the following error message will occur： ``` /root/torch-mlir/externals/llvm-project/mlir/lib/IR/PatternMatch.cpp:118: virtual void mlir::RewriterBase::replaceOp(mlir::Operation *, mlir::ValueRange): Assertion `op->getNumResults() == newValues.size() && "incorrect # of replacement values"' failed. ```	2024-04-11 19:48:49 +08:00
Xinyu Yang	6524838bcb	[Torch] Add general AdaptiveAvgPool2dOp decompose support (#3111 ) Previously, it could only handle the situations where outputsize == (1, 1) or outputsize == (input_H, input_W). Now it supports all situations where input_H % output_H== 0 && input_W % output_W == 0	2024-04-11 17:02:59 +08:00
Xinyu Yang	5eb0cf9104	[Torch] Add decompose of AtenToPrimDeviceOp (#3131 ) As device information isn't relevant to torch-mlir	2024-04-10 22:26:48 +08:00
Xinyu Yang	42a16fa912	[Torch] Support Aten_CastFloatOp. (#3115 ) By canonicalize Aten_CastFloatOp into AtenToDtypeOp	2024-04-09 11:06:53 +08:00
Xinyu Yang	84c24e5771	[Torch] Support Aten__And__ScalarOp (#3114 )	2024-04-08 20:24:17 +08:00
Yuanqiang Liu	2c56ef9252	[Torch Dialect] canonicalize aten.sign to aten.sgn (#3112 ) * `aten.sign` is a sub-set of `aten.sgn` (`aten.sgn` support complex type).	2024-04-08 20:05:42 +08:00
Vivek Khandelwal	7e778e2179	build: manually update PyTorch version (#3094 ) Set PyTorch and TorchVision version to nightly release 2024-04-01. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 10:48:37 +05:30
Rob Suderman	f97cd4893f	[torch] Improve shape inference for dynamic shapes (#3091 ) Shapes can be processed as tensors to represent the set of dimensions. As reshapes take a list of scalars this can result in a single dynamic dimension blocking the adjacent static dimensions. This pass attempts to de-couple tensor computations related to shapes and propagate values to better support lowering scalar tensor computations.	2024-04-02 16:19:57 -07:00
zjgarvey	40e762ca42	Adds result types to a prelu decomp (#3098 ) This adds explicit result types instead of relying on shape/dtype computations. Solves a regression issue with IREE: #3092	2024-04-02 11:41:56 -07:00
zjgarvey	532d297c46	[ONNX] Preliminary Work Towards Supporting QuantizedMLP_basic onnx e2e test (#3089 ) See the related issues here: [SHARK-Turbine#556](https://github.com/nod-ai/SHARK-Turbine/issues/556) 1. Adds uint8 casting to onnx.Cast op 2. Fixes an issue with onnx.DequantizeLinear when the scale comes with shape [1]. 3. Adds support for unsigned types in an AtenItemOp folder 4. Adds a simpler quantized model for easier debugging 5. Adds a fusion pass to convert [quant -> dequant -> transpose -> mm] patterns to [transpose -> quant -> mm]. 6. Moved some xfails that are still not passing, but for different reasons than onnx.cast failures.	2024-04-01 16:21:05 -07:00
Xinyu Yang	da88efad89	[Torch] Fix bug of DecomposeAtenSelectIntOp (#3087 ) Fix bug of DecomposeAtenSelectIntOp. Because it may use resultTy when resultTy has not been inferred. ``` auto resultTy = op.getType().cast<BaseTensorType>(); if (sliceTy.getSizes().size() == resultTy.getSizes().size()) { rewriter.replaceOp(op, slice); return success(); } ``` So I add restriction.	2024-04-01 21:25:02 +08:00
Xinyu Yang	40008b025a	[Torch] Support prelu decomposition (#3069 )	2024-03-29 08:05:00 +08:00
Xinyu Yang	e6e7689a24	[Torch] support decompose aten.einsum with ellipsis slicing (#3056 )	2024-03-27 12:42:10 -07:00
Yuanqiang Liu	0a581a97a7	[Torch Dialect] enhance aten.int.tensor's canonicalize (#3058 ) support fold with literal vtensor. change it to canonicalize because this pattern will create new op.	2024-03-27 09:51:58 +08:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
schnkmwt	1fcbfa87ec	Implement linalg lowering of diag_embed torch op (#2885 ) This PR adds lowering of diag_embed to linalg dilect. Tracked in https://github.com/nod-ai/SHARK-Turbine/issues/288 --------- Co-authored-by: sachink <sachink@xilinx.com>	2024-03-22 16:32:50 -07:00
zjgarvey	99b3a5f117	Converts all Adaptive Pooling Ops to Linalg (#2808 ) The previous conversions for AtenAdaptiveAvgPool1dOp and AtenAdaptiveMaxPool2dOp are refactored into a general templated conversion that works for all of the AtenAdaptive...PoolNdOp's. New support is added for the following ops: 1. AtenAdaptiveMaxPool1d 2. AtenAdaptiveMaxPool3d 3. AtenAdaptiveAvgPool3d Support is also provided for passing inputs without batch dimensions. For example, applying adaptive_avg_pool2d to an input tensor of rank 3. After [pytorch #118162](https://github.com/pytorch/pytorch/pull/118162) gets down to torch-mlir, I'll add a test for AdaptiveMaxPool1d with return_indices (which will pass with that upstream fix). --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-03-22 11:05:20 -07:00
Yuanqiang Liu	4282eb9e76	[Torch Dialect] support aten.fake_quantize_per_tensor_affine (#3014 )	2024-03-15 08:53:29 +08:00
Nithin Meganathan	798bfd7dff	Adds accumulator types in TorchToLinalg for `AtenMmOp` and `AtenConvolutionOp` (#3027 )	2024-03-14 16:40:40 -07:00
Yuanqiang Liu	870e63bc3c	[Torch Dialect] support decomposition of aten.linspace (#3006 )	2024-03-14 08:28:33 +08:00
Yuanqiang Liu	43c6996a31	[Torch Dialect] add folder for aten.ceil and unify patterns of ceil, … (#3010 ) …floor, round	2024-03-14 07:41:58 +08:00
ptrifunovic98	524ff99216	Implement lowering of torch.aten.linalg_cross (#2986 ) Closes [nod-ai/SHARK-Turbine#497](https://github.com/nod-ai/SHARK-Turbine/issues/497)	2024-03-13 12:17:22 -07:00
Nithin Meganathan	5ecc1d5c0d	Align softmax accumulation types with Torch's CUDA implementation (#2996 )	2024-03-12 15:07:45 -07:00
Rob Suderman	e78c99e74e	[torch] Update folders for splat operators (#3012 ) Splat operators required the output is 1-D. This was not a required restriction and was loosened to 2d.	2024-03-11 16:45:49 -04:00
Yuanqiang Liu	229ca3a9e1	[Torch Dialect] emit aten::mul and add folder (#3007 )	2024-03-11 19:59:34 +08:00
Rob Suderman	0723584936	[torch] Add folder for torch.aten.*.Scalar comparisons (#3000 ) This folds small version of the tensor-scalar comparison operators as they are commonly used for shape computations. This includes le, lt, ge, gt, eq, and ne.	2024-03-08 13:44:00 -08:00

1 2 3 4 5 ...

822 Commits (bb69014a960e67d07a98faa2faa5bdbb350264b8)