torch-mlir

Commit Graph

Author	SHA1	Message	Date
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
jinchen	83cba8c696	[onnx] Support for `onnx.EyeLike` via torch lowering (#2994 )	2024-04-15 09:23:26 -07:00
jinchen	859f5d280f	Generalize getting index for onnx compress op (#3150 )	2024-04-12 15:18:22 -07:00
Aart Bik	307f49f566	[torch-mlir][sparse] support sparse tensor output (#3152 ) Sparse inputs and outputs are now fully supported! They always consist of their constituents buffers, passed as numpy arrays. Sparse on!	2024-04-12 09:56:32 -07:00
Xinyu Yang	6524838bcb	[Torch] Add general AdaptiveAvgPool2dOp decompose support (#3111 ) Previously, it could only handle the situations where outputsize == (1, 1) or outputsize == (input_H, input_W). Now it supports all situations where input_H % output_H== 0 && input_W % output_W == 0	2024-04-11 17:02:59 +08:00
Aart Bik	184d8c13f4	[torch-mlir][sparse] add ID-net example (#3127 ) first sparse-in/sparse-out example, will be used to make actual sparse output work!	2024-04-09 11:21:30 -07:00
Yuanqiang Liu	8d5e2578b0	[Stablehlo] lowering aten.view to shape.num_elements + stablehlo.comp… (#3125 ) …ute_reshape_shape as that `aten.view` support at most one `-1` in dim list. The original calculation of `numel` is wrong when there is a `-1` in dim list.	2024-04-09 14:54:57 +08:00
Aart Bik	5797d3aa57	[torch-mlir][sparse] add a COO test for 3-dim (#3119 ) This tests COO for more than 2-dim. Note that sparsity should really propagate into the relu activation and the output, but such cleverness needs to wait for the pending work in the PyTorch tree.	2024-04-08 16:46:51 -07:00
Xida Ren (Cedar)	dd967eb199	[ONNX] Support onnx.LSTM (#2969 ) This PR only performs a lit test. In lieu of an e2e test, https://github.com/nod-ai/SHARK-TestSuite/pull/142 makede sure that the lowering works & the numbers check out. Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-04-08 12:23:33 -07:00
Vivek Khandelwal	1d6e4c3d77	[MLIR][TORCH] Add OnnxToTorch lowering for Einsum op (#3117 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-08 22:38:01 +05:30
Vivek Khandelwal	af54d27820	[MLIR][TORCH] Fix Onnx.TopK lowering (#3103 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 22:12:48 +05:30
Vivek Khandelwal	ce7d4f1660	[MLIR][TORCH] Fix Onnx.ReduceSum lowering for failing e2e tests (#3095 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 09:57:19 +05:30
Stella Laurenzo	ffaaf08c31	[fx] Fix type inference for scalar/int types. (#3099 ) This was discovered in a downstream test suite and was due to a control flow nesting merge issue. In-tree test added and fixed.	2024-04-02 13:56:43 -07:00
Vivek Khandelwal	d1f770c620	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for ReduceMean op (#3008 ) This commit also cleans up the OnnxToTorch lowering for the ReduceMean op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal vivekkhandelwal1424@gmail.com	2024-04-02 16:54:04 +05:30
Thomas Dietert	d2432bbe5a	[MLIR][Torch] Do not convert bias tensor to element type if NoneType (#3072 ) The `convertTensorToElementType` function expects it's argument to have a valid tensor type that is not `Torch::NoneType`. This PR checks that the bias tensor is not of type `Torch::NoneType` before calling `convertTensorToElementType` on the bias tensor argument in the `matchAndRewrite` member function of the `ConvertAtenConvolutionOp` class.	2024-04-02 14:19:26 +05:30
Rob Suderman	ec4cb8be44	Bump LLVM to llvm/llvm-project@0030fc4ac7 (#3079 ) Co-authored-by: Peiming Liu <peiming@google.com>	2024-04-01 16:34:59 -07:00
Thomas Dietert	3c33dbd987	[MLIR][Torch] Canonicalize torch.from_i1 and torch.to_i1 (#3067 ) When lowering `torch.aten.convolution`, it is expected that the 'transposed' argument is a torch.constant operation. In some cases, the argument was a `from_i1` operation converting an `arith.constant` operation into a torch.bool. This is not wrong semantically, but instead of generalizing the legality of the `torch.aten.convolution` op, we canonicalize `arith.constant` ops followed by `from_i1` ops to `torch.bool` ops. For example: ``` //===-------------------------------------------===// Legalizing operation : 'torch.aten.convolution'(0x124705b90) { %33 = "torch.aten.convolution"(%arg0, %20, %21, %31, %29, %30, %19, %32, %0) : (!torch.vtensor<[1,1,28,28],f32>, !torch.vtensor<[10,1,5,5],f32>, !torch.vtensor<[10],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int) -> !torch.vtensor<[1,10,24,24],f32> * Fold { } -> FAILURE : unable to fold * Pattern : 'torch.aten.convolution -> ()' { ** Failure : unimplemented: only constant transposed supported. <-- Resolved by this PR } -> FAILURE : pattern failed to match * Pattern : 'torch.aten.convolution -> ()' { ** Failure : not a supported Scalar to Tensor like op } -> FAILURE : pattern failed to match * Pattern : 'torch.aten.convolution -> ()' { ** Failure : not a supported elementwise op } -> FAILURE : pattern failed to match * Pattern : 'torch.aten.convolution -> ()' { ** Failure : not a supported reduce op } -> FAILURE : pattern failed to match } -> FAILURE : no matched legalization pattern //===-------------------------------------------===// <stdin>:21:11: error: failed to legalize operation 'torch.aten.convolution' that was explicitly marked illegal %17 = torch.operator "onnx.Conv"(%arg0, %0, %1) {torch.onnx.dilations = [1 : si64, 1 : si64], torch.onnx.group = 1 : si64, torch.onnx.kernel_shape = [5 : si64, 5 : si64], torch.onnx.pads = [0 : si64, 0 : si64, 0 : si64, 0 : si64], torch.onnx.strides = [1 : si64, 1 : si64]} : (!torch.vtensor<[1,1,28,28],f32>, !torch.vtensor<[10,1,5,5],f32>, !torch.vtensor<[10],f32>) -> !torch.vtensor<[1,10,24,24],f32> ^ <stdin>:21:11: note: see current operation: %33 = "torch.aten.convolution"(%arg0, %20, %21, %31, %29, %30, %19, %32, %0) : (!torch.vtensor<[1,1,28,28],f32>, !torch.vtensor<[10,1,5,5],f32>, !torch.vtensor<[10],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int) -> !torch.vtensor<[1,10,24,24],f32> ``` Additionally, we require the canonicalization of `to_i1` operating on a torch.constant bool to an `arith.constant ... : i1` for the e2e tests to pass successfully.	2024-04-01 14:25:51 -07:00
Stella Laurenzo	826786bdd0	[fx] Support ExportedProgram buffer mutation. (#3080 ) In the prior state when I supported mutation of user inputs by treating them as mutable-tensor SSA values, I had left the case of buffer mutation only vaguely implemented until a concrete use emerged. This patch reworks this buffer mutation support by assuming that buffers must be resolved via the hooks symbolically and treated with load/store semantics. This is implied in the structure since we have no SSA value that represents a buffer and we already assume that reading parameters happens via such a mechanism.	2024-04-01 14:18:12 -07:00
Xinan Jiang(姜曦楠)	1cdae6bc68	[MLIR][TORCH]Add support lowing aten.Int.bool to arith (#3083 ) Now there no lowing for `aten.Int.bool` in `convert-torch-to-arith` pass. this PR add this support. Below is the UT. ``` func.func @torch.aten.Int.bool(%arg0: !torch.bool) -> !torch.int { %0 = torch.aten.Int.bool %arg0 : !torch.bool -> !torch.int return %0 : !torch.int } ```	2024-04-01 10:05:08 -07:00
Stella Laurenzo	282e9b0e64	[fx] Fix type determination for multi-return ops and static `None` returns. (#3081 ) In practice, this was caught by the way that AOT autograd traces `convolution_backward`. For the unit test, we just repro it with a custom op.	2024-04-01 09:39:38 -07:00
Gaurav Shukla	129a79417a	[MLIR][ONNX] Fix onnx.gather_nd implementation (#3070 ) The indices should be expanded before the torch.gather operation. Signed-off-by: Gaurav Shukla <gaurav@amd.com>	2024-04-01 20:17:09 +05:30
Xida Ren (Cedar)	5f325749f9	add lowerings for AtenLtIntOp and AtenLeIntOp (#3061 ) Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-03-27 10:06:43 -07:00
Yuanqiang Liu	0a581a97a7	[Torch Dialect] enhance aten.int.tensor's canonicalize (#3058 ) support fold with literal vtensor. change it to canonicalize because this pattern will create new op.	2024-03-27 09:51:58 +08:00
Stella Laurenzo	e2343cf4ce	[fx] Implement auto_functionalized higher order op. (#3063 ) * Also adds the basic scaffolding for handling more of these, which will be needed for cond, while, etc. * Refactors some of the support in the generic OpOverload emitter so it can be shared with these other special forms. This has been on my list for a while, but it just so happens that as part of upgrading to PyTorch 2.3 and a pure upstream flow in Turbine, we were using a feature that required integration with auto_functionalized. This is perhaps the "weirdest" of the higher-order ops and a poor place to start, but needs must. We have testing for this in Turbine. Full support in Turbine has an entire custom ops facility. I've reduced this down to a unit test in torch-mlir.	2024-03-26 17:06:05 -07:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
Vivek Khandelwal	9ae33e482e	[MLIR][TORCH] Add OnnxToTorch lowering for ops (#3049 ) This commit adds the OnnxToTorch lowering for the Mish, Softplus, HardSwish, Trilu, ThresholdedRelu op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-25 20:29:07 +05:30
zjgarvey	6aa481c204	[ONNX] LogSoftmax to Torch (#3024 ) This PR adds support for onnx.LogSoftmax both for old versions (<13, with axis >=0), and new versions (13).	2024-03-22 11:01:39 -07:00
Gaurav Shukla	50635dd509	[ONNX][MLIR] Add support for onnx.gather_nd (#2988 ) Signed-off-by: Gaurav Shukla <gaurav@amd.com>	2024-03-22 21:38:39 +05:30
Stella Laurenzo	6ea857c644	[fx] Make the lift_fresh_copy -> clone special form use kwargs. (#3045 ) At some point, this op became kwarg-only instead of arg/kwarg. Discovered when upgrading to PyTorch 2.3. Also adds a test as this was untested in-tree (was caught out of tree).	2024-03-21 15:34:40 -07:00
penguin_wwy	7616d637fd	Add stateless fx graph import (#3036 )	2024-03-21 14:44:54 -07:00
Xida Ren (Cedar)	cb5cb506df	Fix SCF Forloop fails to convert to linalg when a tensor argument is supplied to the loop block (#3040 ) Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-03-20 11:04:02 -07:00
zjgarvey	6ff71b40c8	[ONNX] onnx.DynamicQuantizeLinear to Torch (#3009 ) This adds support for converting DynamicQuantizeLinear from torch-onnx to torch. I could not get an e2e test to pass, since there seems to be some issues with uint8 casting somewhere lower in the pipeline. For example compiling with IREE for llvm-cpu, I would get either the correct zero point (if zp < 128) or the correct zero-point minus 256 (if zp >= 128). The output tensor seems to always return a tensor of zeros, which also occurs when running uint8 examples through QuantizeLinear. Edit: the first problem can be resolved by casting the output back to uint8 on output, the second problem is resolved with PR #3018	2024-03-20 10:58:25 -07:00
jinchen	9cf6c45a39	Add OnnxToTorch support for Compress op (#3025 )	2024-03-20 17:12:08 +00:00
Pavani Chowdary	c51e2130f2	[onnx] support for lowering mod op from onnx to torch (#2859 ) nod-ai/Shark-Turbine#267 --------- Authored-by: boddu.pavani@research.iiit.ac.in Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-18 17:54:37 +05:30
penguin_wwy	f34c187ac4	Normalize type hints to be compatible with multiple Python versions (#3028 ) Although we provide a wheel package for Python 3.8, it may actually throw the following exception: `TypeError: 'type' object is not subscriptable`	2024-03-15 08:29:48 -07:00
Sambhav Jain	0b2f9c89a2	Bring back `dynamic_shapes` constraints in fx importer API (#3026 ) https://github.com/llvm/torch-mlir/pull/2992 dropped `constraints` from the fx importer API, [breaking](https://github.com/cruise-automation/mlir-tcp/actions/runs/8284385380/job/22669774071) downstream AOT compile tests in `mlir-tcp` that use it. This knob has been soft-deprecated for a while now, replaced by `dynamic_shapes` - a more ergonomic interface. This PR brings back dynamic_shapes constraints in the new supported form. Also added a python lit test with dynamic shaped annotations.	2024-03-14 10:26:34 -07:00
aldesilv	6fa21bd8b1	OnnxToTorch lower celu op (#2920 )	2024-03-13 20:34:10 +05:30
Rob Suderman	8fb28661f9	[onnx] Fix onnx.ReduceMean lowering (#3002 ) Reduce mean lowerings did not succesfully lower to `linalg` via torched. There were two separate paths that could be consolidated to a single simpler pass. This resulted in a significant improvement in test coverage.	2024-03-11 11:32:53 -07:00
Rob Suderman	bd7f1baa42	[onnx] Fix expand operation for dynamic shape max (#3001 ) If the broadcast shape is length-1 at a dim while `?` in the input dim then we need to broadcast to the dynamic dim. This is equivalent to taking a max of two dimensions.	2024-03-08 16:23:07 -08:00
Rob Suderman	0723584936	[torch] Add folder for torch.aten.*.Scalar comparisons (#3000 ) This folds small version of the tensor-scalar comparison operators as they are commonly used for shape computations. This includes le, lt, ge, gt, eq, and ne.	2024-03-08 13:44:00 -08:00
Andreas Falkenberg	551a4e45f3	[onnx] Add support for `onnx.Gemm` with no bias (#2993 ) Previous gemm version required a bias vector. This provides an alternate path to `Torch::AtenMm` with no bias operation.	2024-03-07 15:58:38 -08:00
Rob Suderman	1964208d19	[onnx] Fix constant pad for dynamic shape (#2989 ) The current padding operation was not functional for dynamic shapes. Updated and enabled tests so that onnx.pad tests pass. Work TBD for reflection padding.	2024-03-07 13:29:50 -08:00
Scott Todd	7b18646def	[onnx] Handle optional arguments in Clip op pattern. (#2976 ) Spec: https://onnx.ai/onnx/operators/onnx__Clip.html	2024-03-07 17:25:14 +00:00
Vivek Khandelwal	6e84752c39	build: manually update PyTorch version (#2992 ) Set PyTorch and TorchVision version to nightly release 2024-03-07. This commit also removes the deprecated constraints API: `342e7929b8` Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-07 21:42:38 +05:30
Rob Suderman	c15f1a2bd2	[onnx] Adding lowering for `onnx.Size` operation (#2985 ) We can support `onnx.Size` by requesing the size of each dimensions and taking the product of the results, then packing it into a tensor. --------- Co-authored-by: Scott Todd <scott.todd0@gmail.com>	2024-03-06 17:01:05 -08:00
Rob Suderman	a78659742a	[onnx] Migrate `onnx.ReduceMax` to match `onnx.ReduceMin` (#2981 ) This mostly copy-pastes the reduce minimum implementation to reduce max to improve test coverage. We also improve the aten lowering for min/max dim for unsigned types.	2024-03-06 16:48:21 -08:00
Andreas Falkenberg	ea76dd12ba	[onnx][torch] Gridsampler E2E test and corrections of gridsampler (#2987 ) The addition of an e2e test is actually provided in the Shark-Testsuite. This adds 2 test cases for the gridsampler e2e test. Also as intended there were some items found which needed correction, so the Gridsampler op has also a change.	2024-03-06 10:56:58 -08:00
Rob Suderman	933db87a07	[onnx] Add support for constants of `i1`s (#2978 ) `getRawBuffer` expects a densely packed vector of `i1` values however `onnx` does not densely pack the values. Include code to handle the packing / unpacking.	2024-03-05 13:55:13 -08:00
Rob Suderman	a86e89ecb5	[torch] Additional folders for shape computations (#2972 ) A handful of operations are commonly used in shape calculations (slice, concat, broadcast). Added these additional folders to better propagate simple shape computations.	2024-03-04 11:46:49 -08:00
Chi_Liu	09875fabd1	[MLIR][ONNX] Add ONNX ReduceProd support (#2943 ) Alternatives to https://github.com/llvm/torch-mlir/pull/2908 Fix https://github.com/nod-ai/SHARK-Turbine/issues/353	2024-03-04 11:07:03 -08:00

1 2 3 4 5 ...

726 Commits (ae4724763acafaea50f41badc57a070e798bc376)