torch-mlir

Commit Graph

Author	SHA1	Message	Date
Sambhav Jain	2176176fef	[FX] Add broadcast test with dynamic dim (#3123 ) This scenario was uncovered in a downstream test that failed with a previous snapshot of torch-mlir. See https://github.com/cruise-automation/mlir-tcp/actions/runs/8605480116/job/23581829102?pr=65. ``` File "/home/runner/.cache/bazel/_bazel_runner/ce288f117ee4ca92dc028a6a28476a3d/sandbox/processwrapper-sandbox/2380/execroot/mlir-tcp/bazel-out/k8-opt-exec-2B5CBBC6/bin/test/AotCompile/broadcast_unit_dim_to_dynamic_with_unchanged_dim_dynamic_torch_exporter.runfiles/pip_deps_torch_mlir/site-packages/torch_mlir/extras/fx_importer.py", line 969, in value_info_to_type raise NotImplementedError( NotImplementedError: Could not deduce type from value info: tensor_meta=None, val=s1, sparsity=None ``` It seems to have resolved on current HEAD. Adding this test to ensure coverage in the future.	2024-04-29 09:21:12 -07:00
Vivek Khandelwal	b1e2241479	[ONNX] Fix Onnx.Selu lowering and canonicalizer for IntImplicit op (#3221 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-29 04:00:01 +00:00
Yuanqiang Liu	aed2cf3351	[Torch] emit aten.__contains__.str_list and add folder (#3249 )	2024-04-29 10:51:17 +08:00
Stella Laurenzo	6877302504	[NFC reformat] Applies pre-commit formatting to Python files. (#3244 ) This is a large change because prior to this point, Python files in the project were not consistently formatted. This reformats them all with black defaults. Based on experience with prior projects, if you have a dev/long-term branch with Python patches, you can minimize merge conflicts prior to rebasing to include this commit by running `black` on your modified Python files, squashing, and then rebasing/merging.	2024-04-27 14:16:31 -07:00
Stella Laurenzo	5d4b803914	[NFC reformat] Run pre-commit on all files and format misc. This is part 1 of ~3, formatting all miscellaneous text files and CPP files matched by a first run of pre-commit. These tend to be low change-traffic and are likely not disruptive. Subsequent patches will format Python files and remaining CPP files.	2024-04-27 14:08:09 -07:00
Yuanqiang Liu	f173a06fa7	[Torch] emit aten.ne.str and add folder (#3242 )	2024-04-28 00:58:50 +08:00
Yuanqiang Liu	634a796933	[Torch] fold aten.log (#3223 )	2024-04-26 10:10:02 +08:00
penguin_wwy	122eb69a98	[stablehlo] add aten left/right shift op conversion support (#3234 )	2024-04-26 09:20:49 +08:00
Archana Ramalingam	ac11ec796d	[MLIR][ONNX] Add OnnxToTorch support for ReduceLogSum Op (#3229 ) This commit adds the OnnxToTorch support for ReduceLogSum op	2024-04-25 19:37:57 -04:00
Yuanqiang Liu	b0ba3def93	[Torch] support AtenScalarImplicitOp canonicalize with float (#3231 )	2024-04-26 02:36:13 +08:00
Aart Bik	4361178caa	[torch-mlir][sparse] recognize sparse tensor conversion (#3226 ) Sparse tensor conversions are represented by special aten operators. This PR ensures the conversions are recognized (instead of failing the full torch aten lowering to linalg).	2024-04-26 02:32:07 +08:00
Yuanqiang Liu	fab2696489	[Torch] support aten.trunc (#3219 ) decompose `trunc(x)` to `sign(x) * floor(abs(x))`	2024-04-24 14:32:33 +08:00
jinchen	09d42044b4	Support select_last_index attribute of onnx argmin op (#3212 ) The tests listed in https://github.com/nod-ai/SHARK-Turbine/issues/648 all compiled, and the values of results match, but having runtime issue of dtype mismatch of i/si.	2024-04-23 10:43:38 -07:00
jinchen	61e6312c87	Support select_last_index attribute of onnx argmax op (#3192 ) The tests listed in https://github.com/nod-ai/SHARK-Turbine/issues/635 all compiled, but having run issue of dtype mismatch of i/si.	2024-04-23 10:16:08 -07:00
jinchen	ddb29c2c02	[onnx] Add OnnxToTorch support for `onnx.ConvInteger` (#3179 ) All e2e iree tests compiled, but they have the run issue of mismatch of dtype like the following ``` expected: 1x1x2x2xsi32=[[[12 16][24 28]]] actual: 1x1x2x2xi32=[[[12 16][24 28]]] ```	2024-04-23 09:42:02 -07:00
Vinayak Dev	cff2f084d4	[torch] Add OnnxToTorch lowering for `onnx.ReduceL2` (#3175 ) Adds OnnxToTorch lowering for the ReduceL2 op.	2024-04-23 02:03:05 -04:00
Vivek Khandelwal	3c252cdd44	[onnx] Add `onnx-to-torch` lowering for random ops (#3193 ) This commit adds the OnnxToTorch lowering for Onnx's RandomNormal, RandomNormalLike, RandomUniform, and RandomUniformLike op.	2024-04-22 22:28:07 +05:30
Vivek Khandelwal	6abc7371c8	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for Squeeze and Unsqueeze op (#2991 ) This commit also cleans up the OnnxToTorch lowering for the Squeeze and Unsqueeze op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-22 08:52:42 +00:00
Rob Suderman	733cace1df	[onnx] Fix `onnx.split` by directly handling slicing (#3194 ) Previous implementation erroneously mixed up num_outputs with slice_size. New version correctly computs the slice size and directly performs slicing rather than leveraging `aten.split.tensor`. This is due to `onnx` supporting a fixed number of splits making the size computation more easily computeable when lowering to `aten` rather than deferring to `aten.split.tensor`. --------- Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:31:56 -04:00
Xinyu Yang	790a697245	[Torch] Add folder for AtenIntOp, AtenFloatOp (#3189 ) See unit test below: ``` // CHECK-LABEL: func.func @torch.aten.tensor.float( // CHECK-NEXT: torch.vtensor.literal(dense<1.000000e+01> : tensor<f32>) : !torch.vtensor<[],f32> func.func @torch.aten.tensor.float() -> !torch.vtensor<[],f32> { %none = torch.constant.none %false = torch.constant.bool false %float1.000000e01 = torch.constant.float 1.000000e+01 %67 = torch.aten.tensor.float %float1.000000e01, %none, %none, %false : !torch.float, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],f32> return %67 : !torch.vtensor<[],f32> } // CHECK-LABEL: func.func @torch.aten.tensor.int( // CHECK-NEXT: torch.vtensor.literal(dense<45> : tensor<si32>) : !torch.vtensor<[],si32> func.func @torch.aten.tensor.int() -> !torch.vtensor<[],si32> { %none = torch.constant.none %false = torch.constant.bool false %int45 = torch.constant.int 45 %67 = torch.aten.tensor.int %int45, %none, %none, %false : !torch.int, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],si32> return %67 : !torch.vtensor<[],si32> } ```	2024-04-19 22:17:06 +08:00
Rob Suderman	0e77de996a	[torch] Add support for `torch.view` with dynamic shapes (#3164 ) We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.	2024-04-18 11:47:19 -07:00
Aart Bik	491f4820f5	[torch-mlir][sparse] pre-pend named buffers to parameter list (#3178 ) weights and biases and other model parameters appear as a separate data structure to the traced graph, but are needed when running the MLIR compiled code; this PR implements that extended functionality	2024-04-17 14:44:05 -07:00
Andreas Falkenberg	b66eabd492	[onnx][torch][linalg] Implementing align-corner modes for gridsampler (#3171 ) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-04-17 13:38:19 -07:00
Vinayak Dev	a0232e9ebd	[MLIR][TORCH] Add OnnxToTorch lowering for ReduceL1 Op (#3146 ) Adds OnnxToTorch Lowering for the ReduceL1 op.	2024-04-16 12:24:46 +05:30
penguin_wwy	af5509c5d9	[FxImporter] Type conversion to resolve the mismatch between Py type and schema type (#3163 )	2024-04-15 23:14:19 -07:00
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
jinchen	83cba8c696	[onnx] Support for `onnx.EyeLike` via torch lowering (#2994 )	2024-04-15 09:23:26 -07:00
jinchen	859f5d280f	Generalize getting index for onnx compress op (#3150 )	2024-04-12 15:18:22 -07:00
Aart Bik	307f49f566	[torch-mlir][sparse] support sparse tensor output (#3152 ) Sparse inputs and outputs are now fully supported! They always consist of their constituents buffers, passed as numpy arrays. Sparse on!	2024-04-12 09:56:32 -07:00
Xinyu Yang	6524838bcb	[Torch] Add general AdaptiveAvgPool2dOp decompose support (#3111 ) Previously, it could only handle the situations where outputsize == (1, 1) or outputsize == (input_H, input_W). Now it supports all situations where input_H % output_H== 0 && input_W % output_W == 0	2024-04-11 17:02:59 +08:00
Aart Bik	184d8c13f4	[torch-mlir][sparse] add ID-net example (#3127 ) first sparse-in/sparse-out example, will be used to make actual sparse output work!	2024-04-09 11:21:30 -07:00
Yuanqiang Liu	8d5e2578b0	[Stablehlo] lowering aten.view to shape.num_elements + stablehlo.comp… (#3125 ) …ute_reshape_shape as that `aten.view` support at most one `-1` in dim list. The original calculation of `numel` is wrong when there is a `-1` in dim list.	2024-04-09 14:54:57 +08:00
Aart Bik	5797d3aa57	[torch-mlir][sparse] add a COO test for 3-dim (#3119 ) This tests COO for more than 2-dim. Note that sparsity should really propagate into the relu activation and the output, but such cleverness needs to wait for the pending work in the PyTorch tree.	2024-04-08 16:46:51 -07:00
Xida Ren (Cedar)	dd967eb199	[ONNX] Support onnx.LSTM (#2969 ) This PR only performs a lit test. In lieu of an e2e test, https://github.com/nod-ai/SHARK-TestSuite/pull/142 makede sure that the lowering works & the numbers check out. Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-04-08 12:23:33 -07:00
Vivek Khandelwal	1d6e4c3d77	[MLIR][TORCH] Add OnnxToTorch lowering for Einsum op (#3117 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-08 22:38:01 +05:30
Vivek Khandelwal	af54d27820	[MLIR][TORCH] Fix Onnx.TopK lowering (#3103 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 22:12:48 +05:30
Vivek Khandelwal	ce7d4f1660	[MLIR][TORCH] Fix Onnx.ReduceSum lowering for failing e2e tests (#3095 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 09:57:19 +05:30
Stella Laurenzo	ffaaf08c31	[fx] Fix type inference for scalar/int types. (#3099 ) This was discovered in a downstream test suite and was due to a control flow nesting merge issue. In-tree test added and fixed.	2024-04-02 13:56:43 -07:00
Vivek Khandelwal	d1f770c620	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for ReduceMean op (#3008 ) This commit also cleans up the OnnxToTorch lowering for the ReduceMean op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal vivekkhandelwal1424@gmail.com	2024-04-02 16:54:04 +05:30
Thomas Dietert	d2432bbe5a	[MLIR][Torch] Do not convert bias tensor to element type if NoneType (#3072 ) The `convertTensorToElementType` function expects it's argument to have a valid tensor type that is not `Torch::NoneType`. This PR checks that the bias tensor is not of type `Torch::NoneType` before calling `convertTensorToElementType` on the bias tensor argument in the `matchAndRewrite` member function of the `ConvertAtenConvolutionOp` class.	2024-04-02 14:19:26 +05:30
Rob Suderman	ec4cb8be44	Bump LLVM to llvm/llvm-project@0030fc4ac7 (#3079 ) Co-authored-by: Peiming Liu <peiming@google.com>	2024-04-01 16:34:59 -07:00
Thomas Dietert	3c33dbd987	[MLIR][Torch] Canonicalize torch.from_i1 and torch.to_i1 (#3067 ) When lowering `torch.aten.convolution`, it is expected that the 'transposed' argument is a torch.constant operation. In some cases, the argument was a `from_i1` operation converting an `arith.constant` operation into a torch.bool. This is not wrong semantically, but instead of generalizing the legality of the `torch.aten.convolution` op, we canonicalize `arith.constant` ops followed by `from_i1` ops to `torch.bool` ops. For example: ``` //===-------------------------------------------===// Legalizing operation : 'torch.aten.convolution'(0x124705b90) { %33 = "torch.aten.convolution"(%arg0, %20, %21, %31, %29, %30, %19, %32, %0) : (!torch.vtensor<[1,1,28,28],f32>, !torch.vtensor<[10,1,5,5],f32>, !torch.vtensor<[10],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int) -> !torch.vtensor<[1,10,24,24],f32> * Fold { } -> FAILURE : unable to fold * Pattern : 'torch.aten.convolution -> ()' { ** Failure : unimplemented: only constant transposed supported. <-- Resolved by this PR } -> FAILURE : pattern failed to match * Pattern : 'torch.aten.convolution -> ()' { ** Failure : not a supported Scalar to Tensor like op } -> FAILURE : pattern failed to match * Pattern : 'torch.aten.convolution -> ()' { ** Failure : not a supported elementwise op } -> FAILURE : pattern failed to match * Pattern : 'torch.aten.convolution -> ()' { ** Failure : not a supported reduce op } -> FAILURE : pattern failed to match } -> FAILURE : no matched legalization pattern //===-------------------------------------------===// <stdin>:21:11: error: failed to legalize operation 'torch.aten.convolution' that was explicitly marked illegal %17 = torch.operator "onnx.Conv"(%arg0, %0, %1) {torch.onnx.dilations = [1 : si64, 1 : si64], torch.onnx.group = 1 : si64, torch.onnx.kernel_shape = [5 : si64, 5 : si64], torch.onnx.pads = [0 : si64, 0 : si64, 0 : si64, 0 : si64], torch.onnx.strides = [1 : si64, 1 : si64]} : (!torch.vtensor<[1,1,28,28],f32>, !torch.vtensor<[10,1,5,5],f32>, !torch.vtensor<[10],f32>) -> !torch.vtensor<[1,10,24,24],f32> ^ <stdin>:21:11: note: see current operation: %33 = "torch.aten.convolution"(%arg0, %20, %21, %31, %29, %30, %19, %32, %0) : (!torch.vtensor<[1,1,28,28],f32>, !torch.vtensor<[10,1,5,5],f32>, !torch.vtensor<[10],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int) -> !torch.vtensor<[1,10,24,24],f32> ``` Additionally, we require the canonicalization of `to_i1` operating on a torch.constant bool to an `arith.constant ... : i1` for the e2e tests to pass successfully.	2024-04-01 14:25:51 -07:00
Stella Laurenzo	826786bdd0	[fx] Support ExportedProgram buffer mutation. (#3080 ) In the prior state when I supported mutation of user inputs by treating them as mutable-tensor SSA values, I had left the case of buffer mutation only vaguely implemented until a concrete use emerged. This patch reworks this buffer mutation support by assuming that buffers must be resolved via the hooks symbolically and treated with load/store semantics. This is implied in the structure since we have no SSA value that represents a buffer and we already assume that reading parameters happens via such a mechanism.	2024-04-01 14:18:12 -07:00
Xinan Jiang(姜曦楠)	1cdae6bc68	[MLIR][TORCH]Add support lowing aten.Int.bool to arith (#3083 ) Now there no lowing for `aten.Int.bool` in `convert-torch-to-arith` pass. this PR add this support. Below is the UT. ``` func.func @torch.aten.Int.bool(%arg0: !torch.bool) -> !torch.int { %0 = torch.aten.Int.bool %arg0 : !torch.bool -> !torch.int return %0 : !torch.int } ```	2024-04-01 10:05:08 -07:00
Stella Laurenzo	282e9b0e64	[fx] Fix type determination for multi-return ops and static `None` returns. (#3081 ) In practice, this was caught by the way that AOT autograd traces `convolution_backward`. For the unit test, we just repro it with a custom op.	2024-04-01 09:39:38 -07:00
Gaurav Shukla	129a79417a	[MLIR][ONNX] Fix onnx.gather_nd implementation (#3070 ) The indices should be expanded before the torch.gather operation. Signed-off-by: Gaurav Shukla <gaurav@amd.com>	2024-04-01 20:17:09 +05:30
Xida Ren (Cedar)	5f325749f9	add lowerings for AtenLtIntOp and AtenLeIntOp (#3061 ) Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-03-27 10:06:43 -07:00
Yuanqiang Liu	0a581a97a7	[Torch Dialect] enhance aten.int.tensor's canonicalize (#3058 ) support fold with literal vtensor. change it to canonicalize because this pattern will create new op.	2024-03-27 09:51:58 +08:00
Stella Laurenzo	e2343cf4ce	[fx] Implement auto_functionalized higher order op. (#3063 ) * Also adds the basic scaffolding for handling more of these, which will be needed for cond, while, etc. * Refactors some of the support in the generic OpOverload emitter so it can be shared with these other special forms. This has been on my list for a while, but it just so happens that as part of upgrading to PyTorch 2.3 and a pure upstream flow in Turbine, we were using a feature that required integration with auto_functionalized. This is perhaps the "weirdest" of the higher-order ops and a poor place to start, but needs must. We have testing for this in Turbine. Full support in Turbine has an entire custom ops facility. I've reduced this down to a unit test in torch-mlir.	2024-03-26 17:06:05 -07:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00

1 2 3 4 5 ...

751 Commits (fix_lt)