torch-mlir

Commit Graph

Author	SHA1	Message	Date
Rob Suderman	8222637159	[onnx] Extend op version number of `onnx.ScatterElements` (#3195 ) Version number was set too high. Lowered to support more cases allows more tests to pass. Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:32:18 -04:00
Rob Suderman	733cace1df	[onnx] Fix `onnx.split` by directly handling slicing (#3194 ) Previous implementation erroneously mixed up num_outputs with slice_size. New version correctly computs the slice size and directly performs slicing rather than leveraging `aten.split.tensor`. This is due to `onnx` supporting a fixed number of splits making the size computation more easily computeable when lowering to `aten` rather than deferring to `aten.split.tensor`. --------- Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:31:56 -04:00
penguin_wwy	b6b01602d3	[stablehlo] add aten.fmod.Tensor op conversion support (#3198 )	2024-04-21 08:39:36 +08:00
penguin_wwy	ea0ecb67be	[stablehlo] add aten.remainder.Tensor op conversion support (#3197 )	2024-04-21 00:03:37 +08:00
Rob Suderman	b01245c0e8	[onnx] Fix `onnx.Not` for non-bool inputs (#3187 ) Need to perform a bool cast to support `onnx.Not` on non-bool inputs.	2024-04-19 11:32:24 -07:00
penguin_wwy	5a98c72c7f	[StableHLO] Fix aten.clamp.Tensor in FxImporter2StableHLO (#3190 ) The FX importer will pass static shapes to the Torch dialect, so it needs to generate a StableHLO that satisfies shape inference.	2024-04-19 17:08:29 +08:00
penguin_wwy	6c4f7deebb	[stablehlo] add aten.clamp.Tensor op conversion support (#3185 )	2024-04-19 10:55:27 +08:00
Rob Suderman	0e77de996a	[torch] Add support for `torch.view` with dynamic shapes (#3164 ) We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.	2024-04-18 11:47:19 -07:00
Rob Suderman	4c21e20caa	[torch] Support rank-0 index for torch index select (#3182 ) Need to perform an expand in the case where the indices is rank-0.	2024-04-18 11:32:31 -07:00
Xinyu Yang	d4313eed4a	[Torch] Add decomposition of RepeatInterleaveSelfInt Op (#3075 ) Decomposition RepeatInterleaveSelfInt with following ops: ```python def my_repeat_interleave(input, repeats, dim=None): if dim is None: # Flatten the input and then repeat return input.flatten().unsqueeze(-1).tile((1, repeats)).flatten() else: # Calculate the shape after repeat expanded_shape = list(input.shape) expanded_shape[dim] = repeats # Repeat the tensor along the specified dimension repeat_shape = [1] (input.dim() + 1) repeat_shape[dim + 1] = repeats input = input.unsqueeze(-1) # Tile and then reshape tiled = torch.tile(input, repeat_shape) # Rearrange and reshape repeated = tiled.reshape(expanded_shape) return repeated ``` I passed the tests of stablehlo and linalg. When testing onnx, strange things happened. In torch-mlir's CI torch_nightly* and my own environment(torch==2.4.0.dev20240318+cpu), it can pass the pass. In torch-mlir's CI torch_stable, it failed. The test case is `RepeatInterleaveSelfIntNoDimModule_basic`, the result shape should be [120]. ```python class RepeatInterleaveSelfIntNoDimModule(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([3, 4, 5], torch.float32, True), ]) def forward(self, x): return x.repeat_interleave(2) @register_test_case(module_factory=lambda: RepeatInterleaveSelfIntNoDimModule()) def RepeatInterleaveSelfIntNoDimModule_basic(module, tu: TestUtils): module.forward(tu.rand(3, 4, 5)) ``` The error log is as follows: ``` Unexpected outcome summary: (onnx) ****** Failed tests - 1 tests FAIL - "RepeatInterleaveSelfIntNoDimModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([6, 4, 5])) is not equal to golden shape (torch.Size([120])) ``` @rsuderman Would you please help me check what's wrong with my PR? Thanks a lot.	2024-04-18 06:27:51 +08:00
Andreas Falkenberg	b66eabd492	[onnx][torch][linalg] Implementing align-corner modes for gridsampler (#3171 ) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-04-17 13:38:19 -07:00
zjgarvey	7a1ad0d7c0	[TorchToLinalg] Adds Support for Remaining Quantized Matmul Cases (#3167 ) The new cases added for quantized matmuls are: 1. vec-vec 2. vec-mat 3. mat-vec each of which are now lowered to expand(s), quantized_matmul, and collapse.	2024-04-16 09:28:28 -07:00
Vinayak Dev	a0232e9ebd	[MLIR][TORCH] Add OnnxToTorch lowering for ReduceL1 Op (#3146 ) Adds OnnxToTorch Lowering for the ReduceL1 op.	2024-04-16 12:24:46 +05:30
Xinyu Yang	ae4724763a	[Stablehlo] Enhance broadcast pattern in matmul Ops (#3161 ) To pass test "MatmulStaticBroadcast_basic" in stablehlo: ```python class MatmulStaticBroadcast(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([4, 1, 6, 7], torch.float32, True), ([8, 1, 5, 7, 6], torch.float32, True), ]) def forward(self, lhs, rhs): return torch.matmul(lhs, rhs) @register_test_case(module_factory=lambda: MatmulStaticBroadcast()) def MatmulStaticBroadcast_basic(module, tu: TestUtils): module.forward(tu.rand(4, 1, 6, 7), tu.rand(8, 1, 5, 7, 6)) ```	2024-04-16 10:10:36 +08:00
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
IanWood1	5708ee7ec9	Added 2 Ops: Floor divide scalar and Floor divide scalar mode (#3156 ) - Added linalg lowering for `AtenFloorDivideScalarOp` - Needed `AtenDivScalarModeOp` for the decomp. - Added linalg lowering for `AtenDivScalarModeOp` - Moved linalg payload logic to `createDivModePayload()` since the logic was nearly identical for both `AtenDivScalarModeOp` and `AtenDivTensorModeOp`. Just a template function - Added `AtenDivScalarModeOp` lowering for stablehlo Pytorch's [`torch.floor_divide()`](https://pytorch.org/docs/stable/generated/torch.floor_divide.html) in a previous version (for a reason unknown to me) preformed a truncation instead of "floor". The already implemented op `AtenFloorDivideTensorOp` was done before this change. However, this wasn't caught because our testcases only tested positive floor division. I changed this to floor as well as adding a few test cases.	2024-04-15 13:45:10 -07:00
jinchen	83cba8c696	[onnx] Support for `onnx.EyeLike` via torch lowering (#2994 )	2024-04-15 09:23:26 -07:00
jinchen	859f5d280f	Generalize getting index for onnx compress op (#3150 )	2024-04-12 15:18:22 -07:00
Xinan Jiang(姜曦楠)	71d90788d3	[MLIR][TORCH] Support parallel dimemsions expand/collapse (#3051 ) This PR support `aten.view` with unique unknown dimension both in input shape and output shape while the pass convert-torch-to-linalg that lowing `aten.view` to `tensor.collapse_shape` or `tensor.expand_shape`. Below is an example ``` func.func @test_reshape(%arg0: !torch.vtensor<[1,?,50,16],f32>) -> !torch.vtensor<[1,?,16],f32> attributes {torch.assume_strict_symbolic_shapes, torch.onnx_meta.ir_version = 9 : si64, torch.onnx_meta.opset_version = 19 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} { %int1 = torch.constant.int 1 %int-1 = torch.constant.int -1 %int16 = torch.constant.int 16 %0 = torch.prim.ListConstruct %int1, %int-1, %int16 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int> %1 = torch.aten.view %arg0, %0 : !torch.vtensor<[1,?,50,16],f32>, !torch.list<int> -> !torch.vtensor<[1,?,16],f32> return %1 : !torch.vtensor<[1,?,16],f32> } ```	2024-04-11 10:43:03 -07:00
Rob Suderman	a1fe307a76	[torch] Support implicit batch for index_put (#3128 ) If there is only a single value scattered there can be an implicit batch dimension. This includes a check for the implicit batch dimension when reshaping the update tensor. It includes an e2e test to verify correctness.	2024-04-11 10:18:03 -07:00
penguin_wwy	d4a30b7e67	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3130 ) We should prefer functional style as the method style is deprecated https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated (https://mlir.llvm.org/deprecation/)	2024-04-11 06:47:35 -07:00
Yuanqiang Liu	88533b1968	[Stablehlo] fix aten.arange's lowering to stablehlo (#3138 ) * promote to f64 to do division, avoid division on i64 (floor div) * refactor torch-to-stablehlo-pipeline	2024-04-11 15:55:56 +08:00
zjgarvey	aa5e150313	Adds Some uint8 Quantization Fixes (#3122 ) 1. Changes the linalg lowering for dequantization ops to always sign cast to float to prevent misrepresenting uint32 overflow on subtraction with zero point. 2. Adds a basic quantized model test which only quantizes and dequantizes and now passes with these changes in linalg and onnx configs. 3. Changes the aten.mm lowering to allow mismatched quantized types. 4. If a quantized matmul arg is uint8, we shift by 128 to faithfully represent the quantization as a signed i8 quantization. This worked fine in the AtenMmOp lowering, but I'd be happy to move it to a rewrite in FuseQuantizedOps.cpp instead if that seems more appropriate. With the changes 3 and 4, the QuantizedMLP_basic and QuantizedSingleLayer_basic e2e tests now passes with the onnx config.	2024-04-10 12:36:58 -07:00
Yuanqiang Liu	8d5e2578b0	[Stablehlo] lowering aten.view to shape.num_elements + stablehlo.comp… (#3125 ) …ute_reshape_shape as that `aten.view` support at most one `-1` in dim list. The original calculation of `numel` is wrong when there is a `-1` in dim list.	2024-04-09 14:54:57 +08:00
Xida Ren (Cedar)	dd967eb199	[ONNX] Support onnx.LSTM (#2969 ) This PR only performs a lit test. In lieu of an e2e test, https://github.com/nod-ai/SHARK-TestSuite/pull/142 makede sure that the lowering works & the numbers check out. Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-04-08 12:23:33 -07:00
Vivek Khandelwal	1d6e4c3d77	[MLIR][TORCH] Add OnnxToTorch lowering for Einsum op (#3117 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-08 22:38:01 +05:30
Xinyu Yang	84c24e5771	[Torch] Support Aten__And__ScalarOp (#3114 )	2024-04-08 20:24:17 +08:00
Yuanqiang Liu	498ab997cd	[Stablehlo] lowering aten.log1p to stablehlo.log_plus_one (#3110 )	2024-04-07 17:01:58 +08:00
Rob Suderman	9d9a05366e	[torch] Fix aten.squeeze lowering to use result shape (#3106 ) Squeezes can be ambiguous without the output shape information. For instance (1, 1, 256) squeezed can be either (1, 256) or (256). We need to check the resulting shape to know what the shape should look like.	2024-04-04 09:43:12 -07:00
Vivek Khandelwal	af54d27820	[MLIR][TORCH] Fix Onnx.TopK lowering (#3103 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 22:12:48 +05:30
Vivek Khandelwal	ce7d4f1660	[MLIR][TORCH] Fix Onnx.ReduceSum lowering for failing e2e tests (#3095 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 09:57:19 +05:30
Rob Suderman	f97cd4893f	[torch] Improve shape inference for dynamic shapes (#3091 ) Shapes can be processed as tensors to represent the set of dimensions. As reshapes take a list of scalars this can result in a single dynamic dimension blocking the adjacent static dimensions. This pass attempts to de-couple tensor computations related to shapes and propagate values to better support lowering scalar tensor computations.	2024-04-02 16:19:57 -07:00
Vivek Khandelwal	d1f770c620	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for ReduceMean op (#3008 ) This commit also cleans up the OnnxToTorch lowering for the ReduceMean op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal vivekkhandelwal1424@gmail.com	2024-04-02 16:54:04 +05:30
Xinyu Yang	ac1cd3d78a	[Torch] Support AtenDivTensorModeOp with static int input for linalg and stablehlo backend (#3088 )	2024-04-02 17:28:53 +08:00
Thomas Dietert	d2432bbe5a	[MLIR][Torch] Do not convert bias tensor to element type if NoneType (#3072 ) The `convertTensorToElementType` function expects it's argument to have a valid tensor type that is not `Torch::NoneType`. This PR checks that the bias tensor is not of type `Torch::NoneType` before calling `convertTensorToElementType` on the bias tensor argument in the `matchAndRewrite` member function of the `ConvertAtenConvolutionOp` class.	2024-04-02 14:19:26 +05:30
ptrifunovic98	1c8c47d483	Add complex support for aten.norm and similar operations (#3052 ) Add support for complex-type input tensors for norm, vector norm, and Frobenius norm operations.	2024-04-02 14:03:30 +05:30
zjgarvey	532d297c46	[ONNX] Preliminary Work Towards Supporting QuantizedMLP_basic onnx e2e test (#3089 ) See the related issues here: [SHARK-Turbine#556](https://github.com/nod-ai/SHARK-Turbine/issues/556) 1. Adds uint8 casting to onnx.Cast op 2. Fixes an issue with onnx.DequantizeLinear when the scale comes with shape [1]. 3. Adds support for unsigned types in an AtenItemOp folder 4. Adds a simpler quantized model for easier debugging 5. Adds a fusion pass to convert [quant -> dequant -> transpose -> mm] patterns to [transpose -> quant -> mm]. 6. Moved some xfails that are still not passing, but for different reasons than onnx.cast failures.	2024-04-01 16:21:05 -07:00
penguin_wwy	b98f7f75dc	[stablehlo] Reduce unnecessary template specialization code (#3047 )	2024-04-01 14:18:49 -07:00
Xinan Jiang(姜曦楠)	1cdae6bc68	[MLIR][TORCH]Add support lowing aten.Int.bool to arith (#3083 ) Now there no lowing for `aten.Int.bool` in `convert-torch-to-arith` pass. this PR add this support. Below is the UT. ``` func.func @torch.aten.Int.bool(%arg0: !torch.bool) -> !torch.int { %0 = torch.aten.Int.bool %arg0 : !torch.bool -> !torch.int return %0 : !torch.int } ```	2024-04-01 10:05:08 -07:00
Vivek Khandelwal	6844c84702	[MLIR][Torch] Fix OnnxToLinalg lowering for AvgPool op (#3076 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-01 22:14:14 +05:30
Gaurav Shukla	129a79417a	[MLIR][ONNX] Fix onnx.gather_nd implementation (#3070 ) The indices should be expanded before the torch.gather operation. Signed-off-by: Gaurav Shukla <gaurav@amd.com>	2024-04-01 20:17:09 +05:30
Jiawei Wu	76080936d4	[stablehlo] add aten.index_put and aten.scatter_add op conversion support (#3086 )	2024-04-01 19:39:49 +08:00
zjgarvey	c19fc9ba47	[ONNX] Fixes Issue with Dynamic Dims in GlobalAveragePool -> Torch Conversion (#3053 ) Two e2e tests (AdaptiveAveragePool1/2dUnitOutputSizeDynamic) were failing due to numerics. This was as a result of passing -1 as the kernel size in the lowering for the corresponding onnx op GlobalAveragePool.	2024-03-28 09:43:09 -07:00
Xida Ren (Cedar)	5f325749f9	add lowerings for AtenLtIntOp and AtenLeIntOp (#3061 ) Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-03-27 10:06:43 -07:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
Vivek Khandelwal	9ae33e482e	[MLIR][TORCH] Add OnnxToTorch lowering for ops (#3049 ) This commit adds the OnnxToTorch lowering for the Mish, Softplus, HardSwish, Trilu, ThresholdedRelu op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-25 20:29:07 +05:30
schnkmwt	1fcbfa87ec	Implement linalg lowering of diag_embed torch op (#2885 ) This PR adds lowering of diag_embed to linalg dilect. Tracked in https://github.com/nod-ai/SHARK-Turbine/issues/288 --------- Co-authored-by: sachink <sachink@xilinx.com>	2024-03-22 16:32:50 -07:00
zjgarvey	99b3a5f117	Converts all Adaptive Pooling Ops to Linalg (#2808 ) The previous conversions for AtenAdaptiveAvgPool1dOp and AtenAdaptiveMaxPool2dOp are refactored into a general templated conversion that works for all of the AtenAdaptive...PoolNdOp's. New support is added for the following ops: 1. AtenAdaptiveMaxPool1d 2. AtenAdaptiveMaxPool3d 3. AtenAdaptiveAvgPool3d Support is also provided for passing inputs without batch dimensions. For example, applying adaptive_avg_pool2d to an input tensor of rank 3. After [pytorch #118162](https://github.com/pytorch/pytorch/pull/118162) gets down to torch-mlir, I'll add a test for AdaptiveMaxPool1d with return_indices (which will pass with that upstream fix). --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-03-22 11:05:20 -07:00
zjgarvey	6aa481c204	[ONNX] LogSoftmax to Torch (#3024 ) This PR adds support for onnx.LogSoftmax both for old versions (<13, with axis >=0), and new versions (13).	2024-03-22 11:01:39 -07:00
Gaurav Shukla	50635dd509	[ONNX][MLIR] Add support for onnx.gather_nd (#2988 ) Signed-off-by: Gaurav Shukla <gaurav@amd.com>	2024-03-22 21:38:39 +05:30
Rob Suderman	3a56714bff	[torch] Fix clamp ranges on quantize_per_tensor on unsigned (#3018 ) SExtValue was used for `int` and `uint` clamp values. This caused the result to always be outputed as `zero`.	2024-03-20 13:37:47 -07:00
Xida Ren (Cedar)	cb5cb506df	Fix SCF Forloop fails to convert to linalg when a tensor argument is supplied to the loop block (#3040 ) Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-03-20 11:04:02 -07:00
zjgarvey	6ff71b40c8	[ONNX] onnx.DynamicQuantizeLinear to Torch (#3009 ) This adds support for converting DynamicQuantizeLinear from torch-onnx to torch. I could not get an e2e test to pass, since there seems to be some issues with uint8 casting somewhere lower in the pipeline. For example compiling with IREE for llvm-cpu, I would get either the correct zero point (if zp < 128) or the correct zero-point minus 256 (if zp >= 128). The output tensor seems to always return a tensor of zeros, which also occurs when running uint8 examples through QuantizeLinear. Edit: the first problem can be resolved by casting the output back to uint8 on output, the second problem is resolved with PR #3018	2024-03-20 10:58:25 -07:00
jinchen	9cf6c45a39	Add OnnxToTorch support for Compress op (#3025 )	2024-03-20 17:12:08 +00:00
Abhishek-TyRnT	df02692726	Dynamic size support for flatten (#3005 ) Added support for dynamic shapes in `flattenusingints` op in tosa dialect. Due to this some Argmax tests pass This PR fixes this issue https://github.com/llvm/torch-mlir/issues/3004 The following tests pass after this PR ``` 1. "ArgmaxIntModule_basic" 2. "ArgmaxIntModule_multiple_maxs" 3. "ArgmaxModule_basic" ```	2024-03-19 15:19:29 -07:00
zjgarvey	7a9608bb69	[ONNX] Reduces onnx.Div sinceVersion to 7 (#3041 ) The only difference between version 7 and newer versions is support for different data types. We should allow this pattern to match as early as 7. Earlier versions have a more manual broadcast specification through attributes, so I did not include those versions. See: [onnx.Div docs](https://onnx.ai/onnx/operators/onnx__Div.html#l-onnx-doc-divl)	2024-03-19 13:35:05 -07:00
Pavani Chowdary	c51e2130f2	[onnx] support for lowering mod op from onnx to torch (#2859 ) nod-ai/Shark-Turbine#267 --------- Authored-by: boddu.pavani@research.iiit.ac.in Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-18 17:54:37 +05:30
Xinan Jiang(姜曦楠)	d8a52e82c2	[onnx] Fix onnx.cast cases between int32 and int64 (#2982 ) 2 modifications: 1. torch.int64 is enum 4 in TORCH_DTYPE_TO_INT 2. add int32 support	2024-03-15 17:14:09 +00:00
Nithin Meganathan	798bfd7dff	Adds accumulator types in TorchToLinalg for `AtenMmOp` and `AtenConvolutionOp` (#3027 )	2024-03-14 16:40:40 -07:00
aldesilv	6fa21bd8b1	OnnxToTorch lower celu op (#2920 )	2024-03-13 20:34:10 +05:30
Yuanqiang Liu	ad6159c7cb	[Stablehlo] lowering aten.round to stablehlo.round_nearest_even (#3011 )	2024-03-12 08:58:20 +08:00
Rob Suderman	8fb28661f9	[onnx] Fix onnx.ReduceMean lowering (#3002 ) Reduce mean lowerings did not succesfully lower to `linalg` via torched. There were two separate paths that could be consolidated to a single simpler pass. This resulted in a significant improvement in test coverage.	2024-03-11 11:32:53 -07:00
Rob Suderman	bd7f1baa42	[onnx] Fix expand operation for dynamic shape max (#3001 ) If the broadcast shape is length-1 at a dim while `?` in the input dim then we need to broadcast to the dynamic dim. This is equivalent to taking a max of two dimensions.	2024-03-08 16:23:07 -08:00
Rob Suderman	0723584936	[torch] Add folder for torch.aten.*.Scalar comparisons (#3000 ) This folds small version of the tensor-scalar comparison operators as they are commonly used for shape computations. This includes le, lt, ge, gt, eq, and ne.	2024-03-08 13:44:00 -08:00
Andreas Falkenberg	551a4e45f3	[onnx] Add support for `onnx.Gemm` with no bias (#2993 ) Previous gemm version required a bias vector. This provides an alternate path to `Torch::AtenMm` with no bias operation.	2024-03-07 15:58:38 -08:00
Rob Suderman	1964208d19	[onnx] Fix constant pad for dynamic shape (#2989 ) The current padding operation was not functional for dynamic shapes. Updated and enabled tests so that onnx.pad tests pass. Work TBD for reflection padding.	2024-03-07 13:29:50 -08:00
Scott Todd	7b18646def	[onnx] Handle optional arguments in Clip op pattern. (#2976 ) Spec: https://onnx.ai/onnx/operators/onnx__Clip.html	2024-03-07 17:25:14 +00:00
Rob Suderman	c15f1a2bd2	[onnx] Adding lowering for `onnx.Size` operation (#2985 ) We can support `onnx.Size` by requesing the size of each dimensions and taking the product of the results, then packing it into a tensor. --------- Co-authored-by: Scott Todd <scott.todd0@gmail.com>	2024-03-06 17:01:05 -08:00
Rob Suderman	a78659742a	[onnx] Migrate `onnx.ReduceMax` to match `onnx.ReduceMin` (#2981 ) This mostly copy-pastes the reduce minimum implementation to reduce max to improve test coverage. We also improve the aten lowering for min/max dim for unsigned types.	2024-03-06 16:48:21 -08:00
Andreas Falkenberg	ea76dd12ba	[onnx][torch] Gridsampler E2E test and corrections of gridsampler (#2987 ) The addition of an e2e test is actually provided in the Shark-Testsuite. This adds 2 test cases for the gridsampler e2e test. Also as intended there were some items found which needed correction, so the Gridsampler op has also a change.	2024-03-06 10:56:58 -08:00
Rob Suderman	933db87a07	[onnx] Add support for constants of `i1`s (#2978 ) `getRawBuffer` expects a densely packed vector of `i1` values however `onnx` does not densely pack the values. Include code to handle the packing / unpacking.	2024-03-05 13:55:13 -08:00
Chi_Liu	09875fabd1	[MLIR][ONNX] Add ONNX ReduceProd support (#2943 ) Alternatives to https://github.com/llvm/torch-mlir/pull/2908 Fix https://github.com/nod-ai/SHARK-Turbine/issues/353	2024-03-04 11:07:03 -08:00
Rob Suderman	19d4888278	[torch] Make torch.aten.unflatten lower directly to linalg (#2971 ) Existing lowering via aten.view does not work as well for dynamic shapes as the lowering to tensor.expand must re-infer dynamic shape matching. Better to directly lower.	2024-03-04 10:17:42 -08:00
Rob Suderman	d51e80b648	[onnx] Fix onnx.gather lowering for rank-0 indices (#2973 ) We assumed rank was atleast 1 however it can be rank-0, generating an illegal pair of flatten / unflatten operations. Corrected this.	2024-03-04 08:25:19 -08:00
Yuanqiang Liu	916554f270	[Stablehlo] add torch_to_stablehlo::getBackendTypeForScalarType (#2975 )	2024-03-04 23:31:54 +08:00
Rob Suderman	d030bffc62	[torch] Support `aten.view` rank-0 collapse (#2965 ) Collapsing to a rank-0 tensor using `aten.view` was currently bailing out. Added the special case.	2024-03-01 12:31:07 -08:00
Vivek Khandelwal	579ac8b666	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for sub and sum op (#2954 ) This commit adds the support for scalar conversion to byte. This commit also fixes the OnnxToLinalg lowering issue for Onnx.Sub and Onnx.Sum op. Fixes https://github.com/nod-ai/SHARK-Turbine/issues/466 Fixes https://github.com/nod-ai/SHARK-Turbine/issues/467 Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-29 21:48:46 +05:30
mmakevic	76b81e0ccd	Implement lowering of torch.aten.fmod.Tensor (#2767 ) Closing https://github.com/nod-ai/SHARK-Turbine/issues/351	2024-02-29 11:22:03 +05:30
Andreas Falkenberg	5437f32193	[onnx][torch] Lower `onnx.grid_sampler` to the `torch` equivalents (#2952 ) This is the lowering of gridsampler from onnx to torch using our prior implementation of AtenGridSamplerOp. Here are several checks for cornercases implemented. We may decide to have part of these checks in AtenGridSamplerOp instead of the onnx lowering portion.	2024-02-28 13:52:15 -08:00
Rob Suderman	e48fe45886	[onnx] Import `onnx` import to pass remaining tests (#2951 ) Finish supporting importing the vast majority of `onnx` operations. This includes: - region support - region value inherentance - `torch.string` support - `torch.list` support - `torch.optional` support	2024-02-28 12:18:02 -08:00
Rob Suderman	6f3d62ab04	[torch] Fix folders and `cat` and `view` torch lowerings (#2963 ) A bunch of small fixes are interlinked and trigger crashes if not addressed as a group. This includes: - aten view when expand from a rank-0 tensor - slice folder with negative indices - `aten._shape_as_tensor` folder on a rank-0 tensor - `aten.cat` of a tensor with a length-0 tensor	2024-02-28 12:04:52 -08:00
Rob Suderman	dd673cfa8d	[torch] Add edgecase for aten.shape_to_tensor for rank-0 input (#2962 ) Currently lowering uses `tensor.from_elements` which does not allow zero inputs. In this case we return a `tensor.empty` operation.	2024-02-28 09:47:06 -08:00
Rob Suderman	08bc013fcd	[tosa] Fix TOSA batch matmul lowering to correct transpose ordering (#2959 ) The corrective transpose at the end is computed incorrectly. Is it actually computin the inverse transpose. Inverting the permutations fixes the issue.	2024-02-28 09:46:58 -08:00
Rob Suderman	4a7a7d76f8	[onnx] Fix ReduceMean lowering to torch (#2956 ) Torch lowering only supported the most recent version. Refactored the lowering so more easily handle default values and optional operands / attributes.	2024-02-27 22:48:07 -08:00
Abhishek-TyRnT	d541779f37	Add support for torch arange float module (#2749 ) Added Support for float dtype in in torch.arange in TOSA Dialect This resolves the following issue :- https://github.com/llvm/torch-mlir/issues/2762 The following test cases are passing after this change 1. ArangeDtypeIntModule_basic 2. ArangeFloatModule_basic 3. ArangeNegativeStartFloatModule_basic 4. ArangeStartFloatModule_basic 5. ArangeStartNegativeStepFloatModule_basic 6. ArangeStartOutDtypeModule_basic 7. ArangeStartStepFloatModule_basic --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-02-27 13:40:55 -08:00
Rob Suderman	e30a083aff	[torch] Rework lowering to tm_tensor.scatter to stop serialization (#2940 ) We collapsed and broadcasted scatter indices to a single element version. We should instead upport `tm_tensor.scatter`s support for multiple indices and the implicitly broadcasted behavior. This avoids the serialization and materializing a needlessly large indices tensor.	2024-02-27 11:46:57 -08:00
Vivek Khandelwal	d628b5fd06	[MLIR][TORCH] Add support for tanh approximation for Gelu op (#2941 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/461 Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-27 19:26:01 +05:30
Vivek Khandelwal	d81747eadb	[MLIR][TORCH] Extend support for OnnxToLinalg lowering for Dropout and Div op (#2938 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/451, https://github.com/nod-ai/SHARK-Turbine/issues/452	2024-02-27 11:02:05 +05:30
ptrifunovic98	c5a1da1910	Implement lowering of torch.aten.norm.Scalar (#2899 ) Closes [nod-ai/SHARK-Turbine#365](https://github.com/nod-ai/SHARK-Turbine/issues/365)	2024-02-26 08:46:56 -08:00
Andreas Falkenberg	55dc8deb92	[torch] GridSample TorchToLinalg lowering (#2883 ) Lowers `torch.grid_sample` to the equilvalent `linalg` representation.	2024-02-23 09:14:38 -08:00
Rob Suderman	53f6d06ab8	[onnx] Drop `ConstantOfShape` logic form importer, fix torch lowering (#2930 ) There is no reason to treat `ConstantOfShape` as a specialized import any as there exists a onnx-to-torch equivalent. Dropping the import coding and adding support for resource conversion substantially increases test coverage for dynamically shaped tests.	2024-02-21 21:34:43 -08:00
Rob Suderman	df2aa1a369	[torch] Fixed edge conditions for strided slicing (#2929 ) Strided slicing can occur with a negative stride. In these cases we need to bound end differently. This included removing a function that was generating bad limits.	2024-02-21 21:28:44 -08:00
Srinath Avadhanula	0f80e75c2e	allow tosa.cast to convert from f32 to f16 (#2934 ) According to the [official TOSA spec](https://www.mlplatform.org/tosa/tosa_spec.html#_cast), `tosa.cast` allows a cast from `fp32` to `fp16`. We were not previously accounting for this in the `TorchToTosa` lowering. Also did a tiny bit of cleanup in the code to make it easier to spot which conversions are currently allowed. --------- Co-authored-by: Srinath Avadhanula <srinath.avadhanula@getcruise.com>	2024-02-20 14:22:38 -08:00
Rob Suderman	135c81a416	[torch] Add folder for `prim.NumToTensor.Scalar` (#2921 ) Useful for `slice` lowerings that depend on tensors made form scalars.	2024-02-19 11:55:54 -08:00
Rob Suderman	cea51897a5	[onnx] Simplify onnx.slice lowering (#2919 ) Onnx slice lowering used arange needlessly instead of directly constructing the constant dimension values. This makes lowerings to linalg struggle as multiple folders are required to get what is a constant index value.	2024-02-19 10:26:29 -08:00
Rob Suderman	fd08578bdb	[torch] Support dynamic step size for `torch.slice` (#2922 ) For some reason we did not directly use the step size dynamically despite its constructed using the dynamic value.	2024-02-19 10:26:21 -08:00
aldesilv	d29157b33f	OnnxToTorch support for onnx.InstanceNormalization op (#2710 ) https://github.com/nod-ai/SHARK-Turbine/issues/327	2024-02-19 19:53:48 +05:30
Rob Suderman	d65925a8b4	[onnx] Fix `onnx.sigmoid` for integer inputs/outputs (#2914 ) Sample compilation crashes due to sigmoid with integer inputs/outputs. This fix avoids crashing but still experiences an error.	2024-02-16 13:35:25 -08:00
Rob Suderman	7a0d0e954b	[onnx] Fix onnx.gather lowering to use torch.aten.index_select (#2913 ) Onnx's gather maps directly to `torch.aten.index_select`. We should just use that path.	2024-02-16 16:05:44 -05:00
Rob Suderman	468c533942	[onnx] Fix crash when negative transpose values exist (#2915 ) We are crashing due to indexing into a negative shape. Updated the lowering to avoid the crash.	2024-02-16 16:04:47 -05:00
Rob Suderman	074f112d6a	[onnx] Add testing using the `onnx` compilation using torch tests (#2795 ) We can route the torch tests via `onnx` using the `torch.onnx.export` tooling. We can then reimport, lower to torch, and compile to linalg to validate the onnx path is working correctly. The current implementation exposes some failures in the `onnx` path so we cannot enable the onnx test suite yet due to segmentation faults.	2024-02-15 10:17:13 -08:00
Yuanqiang Liu	f3e8199a6d	[Stablehlo] add refbackend (#2712 )	2024-02-16 01:08:48 +08:00
Vivek Khandelwal	d6d1a173dc	[MLIR][Torch] Add OnnxToTorch and TorchToLinalg support for trig ops (#2903 ) This commit adds the OnnxToTorch lowering for cosh, acosh, asin, asinh, and atanh op. This commit also adds the TorchToLinalg lowering for acosh, asin, asinh, and atanh op. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-14 11:58:09 +05:30
Rob Suderman	e9cdd6cbc5	[torch] Fix tm_tensor.attention for end-to-end (#2907 ) Some operations include a backend matcher for specialized operations. We map these back to generics so they appropriately match to the high performance versions. This is done for the attention operation.	2024-02-13 21:18:01 -08:00
saienduri	9b967f6b5a	[MLIR][ONNX] Add OnnxToTorch support for Mean, IsInf, IsNaN, PRelu op (#2801 ) This commit adds the OnnxToTorch support for Mean, IsInf, IsNaN, and PRelu ops. All high priority ops were taken so went with these. The non trivial ones are Mean and IsInf which might require extra review --------- Co-authored-by: MaheshRavishankar <mravisha@amd.com>	2024-02-13 12:38:21 +05:30
Xida Ren (Cedar)	bfb93cb99f	Fix test_add_uint8 failure to lower to linalg (#2893 ) By updating convertScalarToDtype invocation pass original source and destination datatypes for the add op. Also fixes a potential problem with the sub op. --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-12 09:19:39 -08:00
Rob Suderman	d83b576c6e	Bump LLVM to llvm/llvm-project@bb180856ec (#2895 ) Includes some minor first for `AffineMap::inferFromExprList`	2024-02-09 14:07:49 -08:00
Avinash Sharma	9659a436d1	Add lowering support for math::AbsIOp (#2875 ) There is no lowering support for math::AbsIOp, so if the operand is an integer type, it will fail to lower to math::AbsFOp since the op operand #0 must be floating-point-like.	2024-02-08 14:53:40 -08:00
Ashay Rane	21f070e95f	onnx: fix checks in TorchOnnxToTorch pass to match the ONNX spec (#2848 ) This PR contains three commits to update the validation checks in the ONNX -> Torch conversion pass for the AveragePool, Pad, and Slice operators: > onnx: fix preconditions for lowering AveragePool ops > > The `pads` attribute of the AveragePool operator specifies the value to > pad at both the beginning as well as the end of the axis (see > https://onnx.ai/onnx/operators/onnx__AveragePool.html#attributes), so > the size of this attribute should be twice the rank of the input tensor. > However, our TorchOnnxToTorch bails out early since it incorrectly > compares the pads attribute with the rank (not twice the rank) of the > input tensor. > > This patch fixes the code to match the spec and adds a lit test. > onnx: allow optional constant value for Pad operator > > The `constant_value` input of the onnx.Pad operator is optional (see > https://onnx.ai/onnx/operators/onnx__Pad.html#inputs), but the existing > logic for lowering the operator into the Torch dialect assumes that it > is mandatory. > > This patch makes the attribute optional and constructs a default value > (a list of zeros the size of the input tensor) if the attribute was not > specified. > onnx: fix checks for axes and steps inputs of Slice operator > > The ONNX Spec for the Slice operator allows the `starts` and `ends` > inputs to have fewer indices that the dimensions of the `data` tensor > (see https://onnx.ai/onnx/operators/onnx__Slice.html), but our code > expects these inputs to be as many as the `data` tensor's dimensions. > > More precisely, the spec requires that the `starts` and `ends` inputs > are only as long as the `axes` input, but since the `axes` input is > optional, the default type for the `axes` input has to match the type > for the `starts` and `ends` inputs. Moreover, the number of indices in > the `steps` input also has to match those in the `axes` inputs (instad > of matching the dimensions of the `data` input). > > This patch fixes the checks in the TorchOnnxToTorch conversion so that > they match the ONNX spec.	2024-02-07 21:19:27 -08:00
Vivek Khandelwal	4df96616db	[MLIR][TORCH] Modify Onnx.Reshape lowering for static shape cases (#2852 ) This commit modifies the OnnxToTorch lowering of Onnx.Reshape op by creating the result shape list for the aten.reshape using the result shape values inferred from the op's result shape. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-07 17:44:07 -08:00
mmakevic	32dbf99ce2	Implement lowering of torch.aten.all.dim (#2873 ) Lowering of torch.aten.all.dim to linalg. Per PyTorch documentation: > This function matches the behaviour of NumPy in returning output of dtype bool for all supported dtypes except uint8. For uint8 the dtype of output is uint8 itself. Since there is no support for ui8 in torch-mlir currently (https://github.com/llvm/torch-mlir/pull/1384#issuecomment-1260011334) implementation returns failure for that case.	2024-02-07 12:34:52 -08:00
Rob Suderman	041a54ae0c	[torch] Supporting `torch.aten.mul.float` lowering to `arith` (#2833 ) Simple missing scalar operation for multiply floats was missing.	2024-02-05 16:23:04 -08:00
Rob Suderman	e3faef5224	[onnx] Convert `onnx.QLinearConv` to `torch` (#2851 ) Leaning on the QDQ functionality in torch we can support the QLinearConv operation by piggybacking through `torch.Convolution`. This includes some changes such as allowing the `onnx` rewriter to run recursively. Doing so allows `QLinearConv` to decopmose to `onnx.Convolution` which is then lowered to `torch`.	2024-02-05 16:09:41 -08:00
Rob Suderman	cb52c4b3cc	[onnx] Fix `onnx-to-torch` lowering for flatten shape (#2834 ) The existing `flatten` lowering did not define what the intermediate shape was. This could result in failures to lower further to linalg as the intermediate shape was unknown. Added a shape refinement section.	2024-02-05 14:23:46 -08:00
Gaurav Shukla	f4562a8eaa	[ONNX] Fix the lowering of onnx.expand op (#2861 ) Signed-off-by: Gaurav Shukla <gauravshukla789@gmail.com>	2024-02-05 23:46:58 +05:30
Xida Ren (Cedar)	24b8c8672a	[torch] Add folders for `torch.fill`, `torch.ones`, `torch.zeros` and `aten.getItem` (#2849 ) So that the CumSum Op in OPT can get the constant that it requires to be lowered to TMTensor --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-02 10:46:33 -08:00
Ben Vanik	962d514308	Fixing implicit double->float conversion warning. (#2850 ) `[build] D:\Dev\iree\third_party\torch-mlir\lib\Conversion\TorchOnnxToTorch\DefaultDomainGtoP.cpp(734): warning C4305: 'argument': truncation from 'double' to 'float'`	2024-02-01 22:02:44 -08:00
Rob Suderman	29baa813bd	[onnx] Fix `pool` lowering for non-symmetric padding (#2837 ) `torch` requires that padding be symmetric for pooling operations. To support non-symmetric pad we need to separately materialize out the padding operation. --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-02-01 14:35:21 -08:00
Rob Suderman	34f6948533	[torch] Support `!countIncludePad` when unpadded for average pool (#2836 ) We do not support average pool when `countIncludePad is set to false. However if the input is unpadded then the setting of the boolean is unneeded. Extended use by checking if padding is zero before rejecting the lowering.	2024-01-31 15:09:36 -08:00
Rob Suderman	0114a570e3	[torch] Support lowering `torch.item` to `tensor.extract` (#2835 ) Extracting scalar values from tensors can be implemented via a lowering to tensor.extract.	2024-01-31 15:09:12 -08:00
Sambhav Jain	8a17c98b74	Bump stablehlo to openxla/stablehlo@fd52182f76 (#2821 ) With the recent LLVM integrate and changes from https://github.com/llvm/llvm-project/pull/78260, we hit this build error in Stablehlo (which is quite old). ``` external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1020:14: error: no member named 'startRootUpdate' in 'mlir::PatternRewriter' rewriter.startRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1026:16: error: no member named 'finalizeRootUpdate' in 'mlir::PatternRewriter' rewriter.finalizeRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1029:16: error: no member named 'cancelRootUpdate' in 'mlir::PatternRewriter' rewriter.cancelRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1108:14: error: no member named 'updateRootInPlace' in 'mlir::PatternRewriter' rewriter.updateRootInPlace(op->getParentOp(), [&]() { return; }); ~~~~~~~~ ^ 4 errors generated. Target @torch-mlir//:torch-mlir-opt failed to build ``` I'm still puzzled as to how this didn't fail with the CMake merge gating CI (do we not test Stablehlo builds/tests?). In any case, bumping our submodule to https://github.com/openxla/stablehlo/pull/1918 fixes it. It exposes a new failing lit test in TorchToStablehlo though, that I have looped stablehlo developers into ([here](https://discord.com/channels/999073994483433573/999074539138990131/1201235845391331419)). ``` bazel run @torch-mlir//test/Conversion:TorchToStablehlo/scatter.mlir.test ...external/torch-mlir/test/Conversion/TorchToStablehlo/scatter.mlir within split at <stdin>:1 offset :33:8: error: unexpected error: Expects non-empty reduction block for type inference %0 = torch.aten.scatter.src %arg0, %int0, %arg1, %arg2 : !torch.vtensor<[?,?],si64>, !torch.int, !torch.vtensor<[?,?],si64>, !torch.vtensor<[?,?],si64> -> !torch.vtensor<[?,?],si64> ^ LLVM ERROR: Failed to infer result type(s). ``` Bazel CI: https://github.com/sjain-stanford/torch-mlir/actions/runs/7732673480/job/21083102228	2024-01-31 14:21:17 -08:00
Rob Suderman	3500523f75	[onnx] Convert resources to denseattr for `onnx.constant` to `torch` (#2830 ) `onnx` explicitly specifies that `raw_data` is stored in `little-endian` layout. While converting to `torch` we need to convert from a known endian format to an internal format of consistent layout. This means endianness must be correct during the import of `onnx.Constant`. --------- Co-authored-by: Xida Ren (Cedar) <cedar.ren@gmail.com>	2024-01-31 11:40:53 -08:00
Stella Laurenzo	7301aa80fd	Enable -Werror in lib/ and LTC. (#2841 ) Required some massaging of LTC to make it warning clean, and I had to manually disable some warnings on the generated source files (which we don't control). The project is warning clean now. The `-Werror` flag is disabled by default as we can't control everywhere people will try to build/install. The CI enables it via -DTORCH_MLIR_ENABLE_WERROR_FLAG=ON.	2024-01-30 23:33:21 -08:00
Stella Laurenzo	26c0ecd09c	[nfc] Remove unused var causing error downstream	2024-01-30 22:18:13 -08:00
Yuanqiang Liu	d778950f45	[Torch Dialect] add fold pattern for aten.clone (#2804 )	2024-01-31 09:43:21 +08:00
Rob Suderman	25a5a22cbd	[torch] Support `torch.convolution` quantized lowering to `linalg` (#2811 ) Linalg has quantized specific operations. We can lower to these operations when there is a known zeropoint and scale operations. This allows the `convolution` to occur with lower bitwidth's, improving the overall performance.	2024-01-30 13:46:47 -08:00
aldesilv	eff325abc3	OnnxToTorch ReduceMax lowering (#2768 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/352	2024-01-30 11:44:48 +05:30
Quinn Dawkins	494089d53d	Clang format refresh (#2812 ) After noticing a number of commits with unrelated formatting changes, I think something was changed with clang-format at one point and we're seeing a number of unrelated changes. Doing a refresh can help avoid this. The changes made here came from ``` find lib -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find include -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find projects -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm ```	2024-01-29 12:59:33 -05:00
Rob Suderman	d3fd754b93	[onnx] `onnx.MatMulInteger` lowering to `torch.mm` and `quint*` types (#2761 ) Torch does not have an equivalent matmul operation for integers. Instead it sidechannels the information via its quantized types. For this lowering we setup these sidechannels then invoke `torch.mm`.	2024-01-29 09:40:21 -08:00
Aart Bik	46a25d7241	[torch-mlir][sparse] preserve sparsity during lowering torch to linalg (#2809 ) This preserves sparsity at the most obvious places of lowering TORCH tensors to MLIR RankedTensorType tensors. Other places are marked for audit. With some initial lowering tests.	2024-01-26 10:54:59 -08:00
Vivek Khandelwal	da7c6d2c16	[MLIR][TORCH] Add support for dynamic shape for Onnx.Transpose op (#2803 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-26 09:46:54 -08:00
Phaneesh Barwaria	4964977e85	[ONNX][MLIR] support constantOfShape op (#2747 )	2024-01-26 09:36:39 -08:00
Rob Suderman	2ef228328f	[torch] `torch.dequantize` for per channel tensors to` linalg` (#2769 ) Support a lowering for dequantization for per channel tensors from `torch` dialect to a linalg decomposition. Tested via a numerical `torch` test.	2024-01-25 16:40:21 -08:00
lonely eagle	e581b33f96	[Stablehlo]fix CumsumInputDtypeInt32Module_basic on stablehlo backend. (#2797 ) Code used for testing.For the location of CumsumInputDtypeInt32Module in the repo you can see [here](`311b6b0286/projects/pt1/python/torch_mlir_e2e_test/test_suite/basic.py (L4148)`). ```python import torch import torch_mlir class CumsumInputDtypeInt32Module(torch.nn.Module): def __init__(self): super().__init__() def forward(self, val): return torch.ops.aten.cumsum(val, 1) module = torch_mlir.compile(CumsumInputDtypeInt32Module(), [torch.randn(2, 7, 4).to(torch.int32)], output_type="stablehlo") print(module.operation.get_asm()) ``` After fixing the bugs. ``` module attributes {torch.debug_module_name = "CumsumInputDtypeInt32Module"} { func.func @forward(%arg0: tensor<2x7x4xi32>) -> tensor<2x7x4xi64> { %0 = stablehlo.constant dense<0> : tensor<i64> %1 = stablehlo.convert %arg0 : (tensor<2x7x4xi32>) -> tensor<2x7x4xi64> %2 = "stablehlo.reduce_window"(%1, %0) ({ ^bb0(%arg1: tensor<i64>, %arg2: tensor<i64>): %3 = stablehlo.add %arg1, %arg2 : tensor<i64> stablehlo.return %3 : tensor<i64> }) {padding = dense<[[0, 0], [6, 0], [0, 0]]> : tensor<3x2xi64>, window_dilations = dense<1> : tensor<3xi64>, window_dimensions = dense<[1, 7, 1]> : tensor<3xi64>, window_strides = dense<1> : tensor<3xi64>} : (tensor<2x7x4xi64>, tensor<i64>) -> tensor<2x7x4xi64> return %2 : tensor<2x7x4xi64> } } ```	2024-01-25 10:44:08 +08:00
Rob Suderman	f6f890520b	[torch][quant] Quantized `torch.mm` for linalg with end-to-end test (#2750 ) This includes custom op matching for decomposed operations and fusing dequantization into dense operations. As a validation we compare to the dequant+mm torch implementation.	2024-01-24 14:02:50 -08:00
Rob Suderman	60bf6c25af	[onnx] Lower `onnx.QLinearMatMul` lowering to `torch` operators (#2776 ) We can plumb the linear matmul into pytorch using its quantized types with side channel information. To handle the final int8 operation we dequantize and requantize.	2024-01-24 12:28:48 -08:00
Vivek Khandelwal	894805dd5e	[MLIR][TORCH] Support for `onnx.LayerNormalization` (#2789 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-24 11:08:20 -08:00
Gaurav Shukla	12f123eff8	[ONNX][MLIR] Add support for pad op in the onnx pipeline (#2738 ) This commit adds mapping from `onnx.pad` op to `torch.pad` op. Currently it does not support `axes` parameter of `onnx.pad` op. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-01-25 00:33:37 +05:30
Phaneesh Barwaria	ac8975ea12	[MLIR] [ONNX] lowering for onnx tile op and sign op (#2725 )	2024-01-24 22:56:21 +05:30
zjgarvey	c531f5495b	AtenAdaptiveMaxPool2d Conversion to Linalg (#2779 ) The logic here is very similar to the conversion for AdaptiveAvgPool1d #2661 with a few modifications: 1. buffVal = -inf instead of 0 2. the main linalg generic op accumulates a max, instead of a sum, to the first output tensor 3. avg pooling requires dividing the sum pool by the kernel width, which we stored as an auxilliary tensor (kSizeTensor). Here, the auxiliary tensor will be recording the indices. Strangely enough, the only signature available for this function is to return indices, and it appears that they must be computed whether the user desires them or not. See [pytorch/torch/nn/functional.py](https://github.com/pytorch/pytorch/blob/main/torch/nn/functional.py#L1174). Before writing other adaptive pooling conversions, the logic of this decomposition should be rolled into a helper function that will work for both max and avg pooling ops. Even the auxiliary tensor should likely be automated. This code was written in a slightly more tedious way than strictly necessary (often using loops to fill SmallVectors up to rank-2, which is only two in this case), in order to more easily facilitate the transition to a helper function.	2024-01-24 09:09:56 -08:00
Xida Ren (Cedar)	ccaac85788	implement aten.conv1d, aten.conv3d, and aten.conv_tbc (#2757 ) convolution with [time,batch,channel] ordering, as opposed to the default [batch, channel, time]. Currently implementing by transposing the input and output, but may need to get its own implementation in the future because this is supposed to be an op that gives a speedup. This is used by fairseq (https://github.com/facebookresearch/fairseq/issues/172). (in case you were wondering like me, this is different from transposed convolution. Transposed convolution has fractional strides). --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com> Co-authored-by: Frederik Harwath <frederik.harwath@amd.com>	2024-01-23 21:30:03 -08:00
Chi_Liu	77ae56337d	[ONNX][MLIR] Add support for onnx.Exp op (#2792 ) https://github.com/nod-ai/SHARK-Turbine/issues/312	2024-01-23 13:45:00 -08:00
James Newling	dc056e58e6	[MLIR][TORCH] Add onnx.cast cases used by OPT-1.25M (#2787 )	2024-01-23 21:06:25 +05:30
Gaurav Shukla	b7a0329676	[ONNX][MLIR] Fix padding size constraint for onnx.maxpool op (#2782 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2024-01-23 19:23:01 +05:30
Chi_Liu	cad98e8113	[ONNX][TORCH-MLIR] Add TopK support (#2774 ) https://github.com/nod-ai/SHARK-Turbine/issues/331	2024-01-22 12:56:39 -08:00
Ramiro Leal-Cavazos	5883ef0f21	Fix unused variable warnings (#2775 )	2024-01-22 11:05:55 -08:00
Srinath Avadhanula	73b30604da	Do not try to legalize transposed convolution (#2721 ) Currently transposed convolution is not handled correctly by `TorchToTosa`. This PR allows transposed convolutions to pass through the conversion so that they can be handled by other conversion passes later in a pipeline. An example input which produces a compilation error is: ``` func.func @forward(%input: !torch.vtensor<[1,64,1,100],f32>) -> !torch.vtensor<[1,64,2,200],f32> { %true = torch.constant.bool true %int1 = torch.constant.int 1 %int2 = torch.constant.int 2 %weight = torch.vtensor.literal(dense<0.0> : tensor<64x64x3x3xf32>) : !torch.vtensor<[64,64,3,3],f32> %bias = torch.vtensor.literal(dense<0.0> : tensor<64xf32>) : !torch.vtensor<[64],f32> %stride = torch.prim.ListConstruct %int2, %int2 : (!torch.int, !torch.int) -> !torch.list<int> %int1x1 = torch.prim.ListConstruct %int1, %int1 : (!torch.int, !torch.int) -> !torch.list<int> %output = torch.aten.convolution %input, %weight, %bias, %stride, %int1x1, %int1x1, %true, %int1x1, %int1 : !torch.vtensor<[1,64,1,100],f32>, !torch.vtensor<[64,64,3,3],f32>, !torch.vtensor<[64],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int -> !torch.vtensor<[1,64,2,200],f32> return %output : !torch.vtensor<[1,64,2,200],f32> } ``` This MLIR produces an error about a cast operation with a size mismatch when passed through `torch-to-tosa`: ``` error: 'tensor.cast' op operand type 'tensor<1x64x1x50xf32>' and result type 'tensor<1x64x2x200xf32>' are cast incompatible ``` --------- Co-authored-by: Srinath Avadhanula <srinath.avadhanula@getcruise.com>	2024-01-22 10:57:56 -08:00
Franz Haniel	b9806cfa38	[TorchToLinalg] Add lowering for torch.aten.diagonal (#2632 )	2024-01-22 12:47:13 -05:00
James Newling	50ac3b1912	g++ build fix (#2778 ) Introduced in `704cfdaf08` of @wu-s-john g++ compiler error: Pooling.cpp:177:13: error: explicit specialization in non-namespace scope ‘class Design looks good, g++ is just freaking out for no good reason. Un-nesting the template classes fixes the error. We don't have g++ CI. This hopefully happens infrequently enough that we can just fix manually. My service to those folks who really like building with g++... :)	2024-01-19 19:12:29 -08:00
Dave Liddell	2f4924015d	[onnx] Added flatten (#2760 ) [https://github.com/nod-ai/SHARK-Turbine/issues/328](url) --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-01-19 16:18:16 -08:00
Gaurav Shukla	3b85c70748	[ONNX][MLIR] Add support for onnx.gather op (#2726 ) This commit adds support for gather op in the onnx pipeline. https://github.com/nod-ai/SHARK-Turbine/issues/242 Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-01-19 21:58:29 +05:30
John Wu	704cfdaf08	Add aten.pool_max3d support to torch-to-linalg (#2735 ) Added verification logic to the abstract_interpreter_lib_gen.py Also made some unit tests Initially, I thought we can use `linalg::pooling_ndhwc_max` to help implement this problem. However, on a 5-dimensional matrix it does the pooling on dimensions (2, 3, 4) which is not what we want. We want pooling on dimensions (3, 4, 5). To achieve this, we would need to lower our code using the `linalg` dialect. Turns out the pooling code in `linalg` looks like this. ``` func @max_pooling_ncdhw(%I: memref<?x?x?x?x?xf32>, %K: memref<3xindex>, %O: memref<?x?x?x?x?xf32>, %strides: memref<3xindex>, %dilations: memref<3xindex>) { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %N = memref.dim %I, %c0 : memref<?x?x?x?x?xf32> %C = memref.dim %I, %c1 : memref<?x?x?x?x?xf32> %D = memref.dim %I, 2 : memref<?x?x?x?x?xf32> %H = memref.dim %I, 3 : memref<?x?x?x?x?xf32> %W = memref.dim %I, 4 : memref<?x?x?x?x?xf32> %kernel_d = memref.load %K[%c0] : memref<3xindex> %kernel_h = memref.load %K[%c1] : memref<3xindex> %kernel_w = memref.load %K[2] : memref<3xindex> %stride_d = memref.load %strides[%c0] : memref<3xindex> %stride_h = memref.load %strides[%c1] : memref<3xindex> %stride_w = memref.load %strides[2] : memref<3xindex> %dilation_d = memref.load %dilations[%c0] : memref<3xindex> %dilation_h = memref.load %dilations[%c1] : memref<3xindex> %dilation_w = memref.load %dilations[2] : memref<3xindex> linalg.generic { indexing_maps = [ affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d * %stride_d + kd * %dilation_d, h * %stride_h + kh * %dilation_h, w * %stride_w + kw * %dilation_w)>, // Map for input tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (kd, kh, kw)>, // Map for kernel tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d, h, w)> // Map for output tensor ], iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"], doc = "3D Max Pooling NCDHW with Strides, Dilations, and Kernel Size" } ins(%I, %K : memref<?x?x?x?x?xf32>, memref<3xindex>) outs(%O : memref<?x?x?x?x?xf32>) { ^bb0(%input_elem: f32, %kernel_elem: index, %output_elem: f32): %max_val = arith.maxf %input_elem, %output_elem : f32 linalg.yield %max_val : f32 } return } ``` This was implemented based on it's source code with the adjustments mentioned above: `4ca1b5e094/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml (L5647)` Issues related to this can be found here https://github.com/nod-ai/SHARK-Turbine/issues/324	2024-01-19 21:09:46 +05:30
Ilija Kalinić	faa4517e83	Implement lowering of torch.aten.remainder.Tensor (#2763 ) Closes nod-ai/SHARK-Turbine#349	2024-01-19 18:09:08 +05:30
Andreas Falkenberg	4de4d38b87	Initial commit of NonZero op (#2766 )	2024-01-18 15:23:13 -10:00
Rob Suderman	b5387c0f29	[onnx] Lowering `onnx.dequantize_linear` to `torch` (#2759 ) We can make the per-tensor version of the operation to the dequantize operation via marking with the make quantized tensor component. This introductions the `qint` and `quint` tensor type that can be lowered to teh appropriate dequantization behavior during the torch-to-linalg conversion.	2024-01-18 16:47:21 -08:00
Rob Suderman	bd11877f6f	[onnx] Support lowering quantize linear to `torch` (#2751 ) We can map the per_tensor case to the `torch.aten.quantize_per_linear` operation. In this case we extract the `scale` and `zeropoint` values and directly invoke the quantization, then return the integer representation value.	2024-01-18 16:33:10 -08:00
Ze Zhang	77a03f2069	torch-to-tosa lowering support for AtenLinalgVectorNormOp (#2734 ) This PR add torch-to-tosa lowering support for AtenLinalgVectorNormOp e2e test: python -m e2e_testing.main --config=tosa LIT tests: cmake --build build --target tools/torch-mlir/all --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-01-18 12:32:23 -08:00
Phaneesh Barwaria	eed144bfbc	[ONNX][MLIR] add Identity op support (#2754 )	2024-01-16 19:06:54 +05:30
kumardeepakamd	87389f0762	[ONNXToTorch] Add conversion for Onnx range (#2752 ) Implemented ONNX.Range. The spec says the data type for start, limit, delta are 0-D can be double, float, int16, int32, int64, All int types mapped to !torch.int and all float types mapped to !torch.float --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-15 14:26:46 -05:00
lisaliu1	09421b1cf3	[TorchToLinalg] Add lowering for aten.replication_pad2d (#2715 ) Co-authored-by: Lisa Liu <lingl@xilinx.com>	2024-01-15 14:02:27 -05:00
Rob Suderman	197b3b475c	[onnx] Convert `onnx.constant` to `torch` literal tensor (#2748 ) Handles the multiple cases of `onnx` constant values and converts them to `torch` literal tensors. This can include splats with a single integer or floating point value, a set of explicit integer values, or an elements array attr of values.	2024-01-15 09:31:22 -08:00
Han-Chung Wang	10acea71be	Bump LLVM to llvm/llvm-project@0cb024b (#2753 ) - Add fixes for `af78e5daf0` - Add fixes for `bb6d5c2200`	2024-01-15 07:12:12 -08:00
Rob Suderman	dc37616d67	[torch][quant] Support quantize and dequantize for torch (#2731 ) Handle both `torch.dequantize` and `torch.quantize_per_tensor` including the op based quantization parameter tracking. This includes adding `qint32` to torch types as it was missing during the initial type inclusion. For testing we only have `torch.int8` and `torch.float` types on function boundaries as the `qint8` types require passing the scale and zero point quantization information which is not supported yet.	2024-01-12 19:11:14 -08:00
Chi_Liu	c7452af4fa	[MLIR][ONNX] Add OnnxToTorch support for Maxpool Op (#2695 ) Add Maxpool ONNX op support. Add Utils.h/cpp files to create a constant int list for ONNX.	2024-01-12 14:54:38 -08:00
Ze Zhang	670a99ae19	Handle torch.none type in tosa.clamp op (#2739 ) This PR updates the torch-to-tosa conversion with following changes: - Support torch.none as min/max input argument for tosa.clamp op - Support negative value as start index for tosa.slice op - Add tosa.logical_or lowering support e2e test: python -m e2e_testing.main --config=tosa LIT tests: cmake --build build --target tools/torch-mlir/all --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-01-11 10:36:48 -08:00
James Newling	47ffc90db4	signed/unsigned c++ compiler warning fixes (#2742 )	2024-01-11 09:46:46 -08:00
Ilija Kalinić	e1a86e480a	Implement lowering of torch.aten.logit (#2697 ) Closes nod-ai/SHARK-Turbine#290	2024-01-11 20:25:42 +05:30
Andreas Falkenberg	5862854bc8	[ONNX][TORCH-MLIR] LayerNorm (#2716 ) Layer Normalization using the torch.aten.native_layer_norm https://github.com/nod-ai/SHARK-Turbine/issues/325	2024-01-11 14:27:04 +05:30
Frederik Harwath	0860c41ee2	Implement aten.reflection_pad2d lowering to linalg	2024-01-10 21:32:22 -10:00
Xida Ren (Cedar)	aee1fca251	Minor typo fix: in not implemented message for the exclusive and reverse attributes for cumsum (#2740 )	2024-01-10 14:24:37 -08:00
kumardeepakamd	29569713f3	support for onnx.expand operator (#2729 ) maps onnx.expand to torch aten broadcast_to, three tests added --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-10 13:05:37 -08:00
Vivek Khandelwal	208ae35583	[MLIR][ONNX] Add TorchToOnnx Support for DepthToSpace op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 17:50:47 +05:30
Vivek Khandelwal	4707d3bdc6	[MLIR][ONNX] Add OnnxToTorch support for Bernoulli and CastLike op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 16:24:06 +05:30
Vivek Khandelwal	35e8f86792	[MLIR][ONNX] Add OnnxToTorch support for Dropout and Elu op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 16:23:55 +05:30
zjgarvey	07d0645f64	[RFC] general support for Adaptive Pooling Ops (#2661 ) Adaptive pooling ops can only be decomposed into their non-adaptive counterparts in trivial cases. For example, the current decomposition for AtenAdaptiveAvgPool1dOp in DecomposeComplexOps.cpp supports outSize = inSize (i.e., do literally nothing), and outSize = 1 (i.e., do a batched average). The reason adaptive pooling ops are difficult to lower to linalg is that they are not constantly strided. They are computed by taking an input tensor of shape (N, C, Hin), and an output size Hout, and computing the output tensor at position (n,c, h) in the following way: 1. compute st(h) = (hHin)//Hout 2. compute en(h) = 1 + ((h+1)Hin -1)//Hout 3. apply a computation (max or avg) to the slice: INPUT[n, c, st(h):en(h)] The provided sample implementation (for ConvertAtenAdaptiveAvgPool1dOp) uses tensor.extract to access the input tensor inside the payload of a linalg generic op. This is likely an unattractive use of linalg generic ops, which is why I am asking for some more targeted feedback on the validity of this approach before attempting to support the many other adaptive pooling ops. Specifically: - Is the performance of this implementation bad enough to warrant targeting different dialects entirely? e.g. TMtensor/linalg ext/ etc. - If the provided implementation is of acceptable performance to the community, then is it permissable to remove the Adaptive pooling decompositions from DecomposeComplexOps.cpp? Based on the current structure of the -torch-decompose-complex-ops pass, it does not seem possible to only decompose the adaptive ops in special cases (it seems to get stuck in an infinite loop on a match failure). I would be happy to instead incorporate the case logic into the conversion directly, and remove the decompositions once they are rendered completely obsolete. As long as this approach is acceptable, I can clean up the implementation with some helper functions, and quickly add support for each of the remaining Adaptive pooling ops.	2024-01-09 11:14:10 -08:00
Ben Vanik	4dd17f0b71	Fixing implicit double->float truncation warnings. (#2733 ) Floating-point literals should use the correct type specifier.	2024-01-08 17:26:38 -05:00
Rob Suderman	985e7796a4	[linalg] Added `aten.clamp` support with integers to `torch-to-linalg` (#2718 ) The lowering for `aten.clamp` did not support integer types. Added support for integer types including a signed integer test.	2024-01-05 15:16:49 -08:00
Han-Chung Wang	6096fcb347	[OnnxToTorch] Delete unused variables. (#2728 )	2024-01-04 17:30:05 -08:00
John Wu	4e5e34d215	[MLIR][ONNX] Add OnnxToTorch support for Slice Op (#2696 )	2024-01-03 19:41:10 -08:00
Xida Ren (Cedar)	1778314620	add basic cumsum. this doesn't support the exclusive and reverse attrs (#2717 ) fixes #2711	2024-01-03 09:52:59 -08:00
kumardeepakamd	9adad9bc40	Add support for reflection_pad1d (#2706 ) Adds a lowering to Linalg for reflection_pad1d. Based on ideas/code from draft PR https://github.com/llvm/torch-mlir/pull/2693. --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-02 14:05:11 -05:00
Xida Ren (Cedar)	6660a26594	lower torch.aten.isinf to linalg (#2638 ) Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2023-12-28 17:20:32 -08:00
Xida Ren (Cedar)	9fc212ea9a	support Onnx opset 1-13 ReduceMean where axes is supplied as an attr (#2703 ) (instead of an input) Addresses part of #2689. fixes #2702	2023-12-28 09:31:41 -08:00
Xida Ren (Cedar)	d560698e3d	Lower `onnx.split` to `torch.aten` (#2686 )	2023-12-27 17:53:07 -08:00
aldesilv	2d796b7502	lower onnx max op to torch aten maximum op (#2618 ) lower onnx min op to torch aten minimum op	2023-12-27 11:07:35 -08:00
aldesilv	336cfb64b5	OnnxToTorch support for onnx.Mul op (#2699 )	2023-12-27 10:50:08 -08:00
Xida Ren (Cedar)	6847fc1fc6	Fix since-opset too high (#2701 ) Addresses two of the ops from https://github.com/llvm/torch-mlir/issues/2689 https://github.com/llvm/torch-mlir/issues/2700	2023-12-27 10:08:09 -08:00
aldesilv	abc6b0a25a	onnx to torch pow support (#2656 )	2023-12-27 09:34:48 -08:00
Vivek Khandelwal	4f252c88b4	[MLIR][ONNX] Add OnnxToTorch support for GlobalAveragePool op. (#2692 ) This commit adds the OnnxToTorch support for GlobalAveragePool op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-26 10:25:31 -08:00
saienduri	ee75e8d1ae	[MLIR][ONNX] Add OnnxToTorch support for Reshape Op (#2698 ) This commit adds the OnnxToTorch support for Reshape op.	2023-12-26 10:20:13 -08:00
Vivek Khandelwal	0849fd0a06	[MLIR][ONNX] Fix onnx.conv lowering to handle bias tensor Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2023-12-22 16:36:21 +05:30
Vivek Khandelwal	9a72c6584e	[MLIR][ONNX] Add OnnxToTorch support for BatchNormalization and Concat op. This commit adds the OnnxToTorch support for BatchNormalization and Concat op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-22 11:25:33 +05:30
John Wu	46f2cb50dc	[onnx] Lower onnx.HardSigmoid to torch (#2682 ) The expression for HardSigmoid in Onnx (https://onnx.ai/onnx/operators/onnx__HardSigmoid.html): max(0, min(1, alpha * x + beta)) is inherently different from HardSigmoid in Torch (https://pytorch.org/docs/stable/generated/torch.nn.Hardsigmoid.html) which is: if x < -3 -> 0 elif x > 3 -> 1 else x/6 + 1/2 That being said, it was just better to compute out the entire expression when translating the Onnx expression to Torch mlir, which is done in this PR. Some of the logic is shared from the files in `DecomposeComplexOps`. Therefore, refactored some shared logic between `DecomposeComplexOps` and `DefaultDomainGToP` and put it in a `Utils` file.	2023-12-21 07:29:22 -08:00
Vivek Khandelwal	3226241521	[MLIR][ONNX] Add OnnxToTorch support for Conv and ConvTranspose op. This commit adds the OnnxToTorch support for Conv and ConvTranspose op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-21 11:12:14 +05:30
Stella Laurenzo	d75cff6cd1	NFC: Remove unused variable causing a warning.	2023-12-20 19:23:27 -08:00
Rob Suderman	11cc92d4ab	[onnx] Lowerings from `onnx.tan` (#2642 ) Started work on the `tan` lowerings for ONNX to Torch. Uses `sin` and `cos` to represent a `tan`.	2023-12-20 10:09:39 -08:00
Rob Suderman	a24aadbfab	[aten] Make `torch.aten.matmul` to `linalg` work for non-broadcasting case (#2659 ) Broadcasting for `torch.aten.matmul` is optional so a MxN with NxK matmul should be legalized to a `linalg.matmul`.	2023-12-20 10:09:10 -08:00
Andreas Falkenberg	ebaab4200f	[ONNX] ONNX -> TORCH for Erf (#2673 ) TorchOnnxToTorch For Erf function	2023-12-19 08:07:27 -08:00
Vivek Khandelwal	8649b84e3f	[MLIR][ONNX] Add OnnxToTorch support for AveragePool op. (#2672 ) This commit adds the OnnxToTorch support for AveragePool op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-18 18:17:11 -06:00
saienduri	698ff3a736	[MLIR][ONNX] Add OnnxToTorch support for Reduction Ops (#2657 ) This commit adds the OnnxToTorch support for ReduceSum, ReduceMean, and ReduceMin ops.	2023-12-18 12:37:31 -08:00
John Wu	deacb8ef38	[MLIR][ONNX] Add OnnxToTorch support for Gelu (#2647 ) This commit adds the OnnxToTorch support for Gelu op. --------- Co-authored-by: Rob Suderman <suderman@google.com>	2023-12-18 10:57:08 -08:00
Rob Suderman	791c666479	[torch] Lower `torch.aten.sinh` to `linalg` (#2662 )	2023-12-18 09:15:12 -08:00
Rob Suderman	ae1a6e4a5a	[onnx] Lower `onnx.Gemm` to `torch` (#2663 ) General lowering for `onnx.Gemm` to `torch`	2023-12-16 10:47:58 -08:00
Andreas Falkenberg	cee8563060	[onnx] Support of onnx.Greater, onnx.Less, onnx.GreaterOrEqual to Torch (#2649 ) The three remaining compare operations onnx.Greater onnx.Less onnx.GreaterOrEqual Are also added with this push request. This concludes a set of basic tensor compare functions.	2023-12-16 12:42:11 -05:00
Rob Suderman	61888690bb	[onnx] Add support for `onnx.sinh` (#2643 ) Adds a lowering from `onnx.sinh` to `aten.sinh`. This includes adding the `aten.sinh` operator.	2023-12-15 21:23:51 -08:00
Rob Suderman	705ea958ae	[onnx] Lowerings from `onnx.transpose` (#2641 ) Lowerings for `transpose` from ONNX to `aten`. Implementation depends on making multiple `aten.transpose` operations swapping pairs of dimensions. As `onnx.transpose` can swap around any dimensions it may require constructing multiple `aten.transpose`.	2023-12-15 15:30:05 -08:00
Quinn Dawkins	030b0140d4	[TorchToLinalg] Lower aten.cat to tensor.concat (#2650 ) This replaces the lowering of aten.cat with tensor.concat, allowing more efficient handling of concatenations in downstream flows. The refbackend populates concat decomposition patterns that can be used to recover the previous lowering.	2023-12-15 15:45:32 -05:00
Rob Suderman	061af696ce	[onnx] Lowering for `onnx.shape` to `torch` and `tensor` (#2648 ) Includes the lowering from the `aten` equivalent to `tensor` operations.	2023-12-15 11:37:49 -08:00
Sungsoon Cho	55e9401c5c	Implement lowering of aten.cosh op. (#2635 )	2023-12-15 11:19:26 -08:00
Gaurav Shukla	eb9249e601	[ONNX][MLIR] Add support for LeakyRelu and GatherElements op (#2655 ) This commit adds support for `LeakyRelu and GatherElements` op in the onnx pipeline. Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-12-15 11:18:28 -08:00
saienduri	f59c01fd2f	[MLIR][ONNX] Add OnnxToTorch support for q-z ops (specific ops in description) (#2601 ) This commit adds the OnnxToTorch support for Reciprocal, Round, ScatterElements, Sigmoid, Sin, Tanh, Sqrt, Sub, Sum, Where, Xor, Squeeze, Unsqueeze ops. For reviewers, the ops that weren't trivial and probably require extra review are Sum, Squeeze, and Unsqueeze.	2023-12-15 09:36:18 -08:00
Andreas Falkenberg	4ec8b9fc02	[onnx] add support for onnx.LessOrEqual (#2639 ) Added the less or equal operation to OnnxToTorch. onnx.LessOrEqual --------- Co-authored-by: root <andreas.falkenberg@amd.com>	2023-12-14 22:23:23 -05:00
Rob Suderman	4857606ffe	[onnx] Lowerings from `onnx.selu` (#2634 ) Lowerings for `selu` lowerings for ONNX to the corresponding torch implementations. Torch's `selu` implementation has fewer features so we use the a generalized `elu` with the input scale set to `1.0`.	2023-12-14 08:53:47 -08:00
John Wu	42392bc845	[MLIR][ONNX] Add OnnxToTorch support for matmul ops (#2629 ) This commit adds the OnnxToTorch support for Matmul.	2023-12-13 09:35:32 -08:00
Frederik Harwath	b656c674ee	Implement e2e support for aten.acos op This depends on a change in the LLVM core repository which adds acos support to the MLIR Math dialect.	2023-12-12 10:52:02 +01:00
Vivek Khandelwal	0b4422a253	[MLIR][ONNX] Add OnnxToTorch support for bitwise and math ops This commit adds the OnnxToTorch support for BitwiseXor, BitwiseOr, Div, Equal, Cast, Ceil, Floor, Cos, and Clip op. This commit also adds the TorchToLinalg support for aten.clamp.Tensor and aten.clamp_min.Tensor op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-11 19:36:01 +05:30
Felix Schneider	fb21a85874	[TorchToLinalg] Lower grouped conv2d to linalg Op with correct dimension ordering (#2623 ) The linalg Op `linalg.conv_2d_ngchw_fgchw` had a bug where 1. Weights were accessed as G,F,C,H,W instead of as F,G,C,H,W 2. Output was accessed as N,F,G,H,W instead of as N,G,F,H,W Now this has been fixed in https://github.com/llvm/llvm-project/pull/73855 which broke the torch-mlir lowering to that Op. This patch switches lowering in torch-mlir to the newly introduced `linalg.conv_2d_ngchw_gfchw` op which accesses weights in an order that is compatible with PyTorch's memory layout. Fix https://github.com/llvm/torch-mlir/issues/2622	2023-12-08 14:18:23 +01:00
Stella Laurenzo	8252656b6d	Advance llvm-project and stablehlo. (#2619 ) llvm-project: bbd2b08b95fe76bea138c1b03c1cd42ed3ee04df stablehlo: ab709fe48de88c67717abfbd7ef17425eb95ddaf These commits were chosen in order to account for an MLIR API break from `3dbac2c007` which required a patch to stablehlo. We integrate a bit beyond that commit to deal with some revert/reapply cycles in the intervening range which were discovered in another downstream. Further, it requires adaptation to the stablehlo API breaks introduced from https://github.com/openxla/stablehlo/pull/1872 which are along for the ride. Since some stablehlo builders were changed to directly take int64_t array refs, also traced that up some call stacks to eliminate some signed/unsigned mismatches that result. Also adds a few TOSA tests to the passing set that seem to work now.	2023-12-07 23:13:42 -08:00
Quinn Dawkins	63505ad6b2	[TorchToLinalg] Drop constexpr from ifs in argmin/max.dim (#2617 ) MSVC-19 does not support constexprs of lambda captured constexpr values like this: https://godbolt.org/z/ej65rMzdr Instead, this just drops the constexpr from the if statements. See the discussion in https://discord.com/channels/689900678990135345/1062405112292712499/1182338050664185999	2023-12-07 13:08:17 -05:00
Quinn Dawkins	141202bc01	[TorchToLinalg] Fix integer type handling for aten.mm (#2615 ) Despite aten.mm requiring the input and output types match, we still opt to maintain signedness semantics in case later passes try to do any sort of integer type narrowing.	2023-12-07 00:13:53 -05:00
Frederik Harwath	6248216dca	Add aten.min.dim to linalg lowering (#2600 )	2023-12-05 07:16:35 -08:00
Quinn Dawkins	400752ca8d	[TorchToLinalg] NFC: Move Utils.h to an externally accessible location (#2603 )	2023-12-01 19:38:21 -05:00
Ramiro Leal-Cavazos	e568f7e999	Move handling of integer signedness to the backend conversions (#2597 ) The function `getTypeForScalarType` currently takes an argument to specify the signedness of integer types. This is leakage of backend specific requirements into the torch dialect world. Because `getTypeForScalarType` is a utility function for the torch dialect, it should only produce types that match the sign conventions used by PyTorch (regular integers are signed and unsigned integers are unsigned). This commit removes the signedness argument from `getTypeForScalarType`, and moves the backend specific handling of integer types to the backend code.	2023-11-29 09:43:09 -08:00
Vivek Khandelwal	dc9ea08db5	[MLIR][ONNX] Add OnnxToTorch support for atan and bitwise ops This commit adds the OnnxToTorch support for Atan, Bitshift, BitwiseAnd, and BitwiseNot op. This commit also adds the TorchToLinalg support for AtenBitwiseLeftShiftTensorOp. Signed-Off By: vivekkhandelwal@nod-labs.com	2023-11-28 17:19:07 +05:30
Stella Laurenzo	e06efc5136	Initial TorchOnnxToTorch conversion pipeline. (#2585 ) Adds a pipeline to convert custom ops and metadata represented as `torch.operator` custom ops to corresponding `torch` ops where possible. This is part of a multi-part approach for building ONNX import in as a regular feature of torch-mlir. It is focused on the conversions vs the infra. We will end up maintaining a [pure-python importer](https://github.com/nod-ai/SHARK-Turbine/blob/main/python/shark_turbine/importers/onnx_importer.py) to go with this in torch-mlir, and we will also maintain test case generation utilities derived from it. I have left substantial documentation in the README of the conversion directory, including the recommended approach that we will take to keep building this out. (note that this organizes the code to coincide with the refactoring in #2442 versus the current flat arrangement)	2023-11-21 21:02:55 -08:00
James Newling	03e8f99730	Lowering to linalg of prims split_dim op (#2576 ) Adds support for lowering to prims split_op. Similar design to collapse op lowering in https://github.com/llvm/torch-mlir/pull/2572, with some small differences, because the split_dim op (in pytorch) is view-changing whereas the collapse is not. The difference means that 1) it must be registered in the function Torch::isViewLikeOp 2) it must be be added to the "expected fail" set for the torch dynamo backend.	2023-11-21 07:56:09 -08:00
James Newling	647f2f5076	Additional tests for view lowering (#2584 ) The logic for lowering the aten view op to linalg is fairly complex. In this PR I have tried to follow all non-failing paths through the lowering and add unit tests where they're missing. There is 1 logical change to the lowering: redundant tensor.cast ops (same source and destination type) are folded.	2023-11-20 17:35:25 -08:00
Yuanqiang Liu	7b94189e07	[E2E] add nan case in elementwise comparison e2e tests (#2575 )	2023-11-20 11:27:08 +08:00
James Newling	e81282ae8f	Support for prims collapse op (lowering to linalg) (#2572 ) Steps taken: 1) add generator code to torch_ods_gen.py, run update_torch_ods.sh 2) add (custom) shape and type inference generator code to abstract_interp_lib_gen.py, run update_abstract_interp_lib.sh 3) Implement lowering to tensor.collapse_dims. Requires the `start` and `end` values to be constant, else lowering fails 4) Update xfail_sets.py (append to LTC_XFAIL_SET) after running /tools/e2e_test.sh --filter Collapse --verbose -c XX for all support backends (XX). Motivation: - Supporting the collapse operation will be useful for lowering of pixel_shuffle (see Issue #2559)	2023-11-15 08:34:38 -08:00
saienduri	ad18219820	Fix for unused variable failure when trying to bump torch-mlir in IREE (#2560 ) Due to blob being an unused variable, we are not able to bump torch-mlir in iree. With this PR, we remove this unused variable.	2023-11-08 15:55:41 -08:00
JianzheXiao	a42d4c18ff	[Torch Dialect]Support aten.cosine_similarity (#2364 ) As title, add support for aten.cosine_similarity, support broadcast inputA/inputB to the same shape	2023-11-08 15:28:30 +08:00
Yuanqiang Liu	0378da0abd	[Torch Dialect] support aten.isinf (#2544 ) Also fix linalg lowering from `UEQ` to `OEQ`. I will check other comparison's lowering later.	2023-11-04 22:26:01 +08:00
saienduri	88adf384cc	torch-mlir change for dense resource implementation (#2513 ) Co-authored-by: Avinash Sharma <avinash@nod-labs.com>	2023-11-03 11:44:07 -07:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00
Daniel Garvey	4901773f77	add uncovered cases in view lowering (#2524 ) removes unecessary checks from empty strided	2023-11-01 21:56:44 -05:00
Ze Zhang	7cb2db6279	Update dtype check in torch-to-tosa clamp op (#2529 ) As titled. --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2023-10-23 17:04:30 -07:00
Ze Zhang	4279b750da	update AtenClampOp in torch-to-tosa to handle fp inputs (#2516 ) As titled. --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2023-10-17 14:49:47 -07:00
Ze Zhang	f2c53b8ca5	Add aten.isclose support and its torch-to-tosa lowering (#2512 ) Add aten.isclose op Add its torch-to-tosa lowering Update the TorchToTosa/basic.mlir tests To test e2e tosa lowering: `python -m e2e_testing.main -v -c=tosa` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2023-10-16 09:44:53 -07:00
Ze Zhang	e649e06b7b	Add aten.unflatten.int support and its torch-to-tosa lowering (#2509 ) Add aten.unflatten.int op Add its torch-to-tosa lowering Update the TorchToTosa/basic.mlir tests To test e2e tosa lowering: `python -m e2e_testing.main -v -c=tosa` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2023-10-13 18:39:41 -07:00
Quinn Dawkins	6f81ad7293	[TorchToLinalg] Improve broadcast lowerings in strict symbolic modes (#2505 ) With strict symbolic shapes, we can assume numpy-style dynamic broadcasts never occur. This improves the lowering in the presence of this assumption.	2023-10-05 15:15:26 -04:00
Ramiro Leal-Cavazos	2e5d65064c	[linalg] Add handling for leadin and trailing size-1 dims in ViewOp This commit adds to the lowering of `aten.view` handling for the following cases: - `(..., a.size(i))` -> `(..., a.size(i), 1, ..., 1)` - `(..., a.size(i), 1, ..., 1)` -> `(..., a.size(i))` - `(a.size(i), ...)` -> `(1, ..., 1, a.size(i), ...)` - `(1, ..., 1, a.size(i), ...)` -> `(a.size(i), ...)`	2023-10-03 23:04:52 +00:00
Ramiro Leal-Cavazos	1c508af0ba	Revert "[linalg] Fix handling of trailing size-1 dimensions in aten.view (#2474 )" This reverts commit `7c6b9d2445`.	2023-10-03 23:04:52 +00:00
Vivek Khandelwal	ca6ce8974f	[MLIR][TORCH] Add support for int8 dtype for sub, add, and bitwise_and op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-03 22:12:31 +05:30
Vivek Khandelwal	9293326e1e	[MLIR][TORCH] Add support for bitwise_right_shit and bitwise_and.Scalar op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-02 13:06:59 +05:30
Vivek Khandelwal	c434736ee9	[MLIR][TORCH] Add support for conversion to int8 dtype Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-02 09:48:46 +05:30
Stella Laurenzo	860be09a39	Elide dynamic broadcast checks when in strict symbolic shapes mode. (#2496 ) When importing dynamic shaped programs from Dynamo, via torch.compile or torch.export, we can assume that strict symbolic shape checks have been done prior to generating torch IR. Among other shape checking, this eliminates the case where an unknown dimension can be dynamically '1' in a way that signals a broadcast. Adds a `isAssumingStrictSymbolicShapes` utility which consults a `torch.assume_strict_symbolic_shapes` attribute on an enclosing scope and returns true if present. In the linalg pipeline, many runtime checks are elided when this returns true.	2023-09-29 16:45:48 -07:00
saienduri	4e1dd3bf10	add e2e support for torch.log10 (#2479 )	2023-09-28 10:17:03 -07:00
Ramiro Leal-Cavazos	7c6b9d2445	[linalg] Fix handling of trailing size-1 dimensions in aten.view (#2474 ) This commit adds to the lowering of `aten.view` handling for the following cases: - `(..., a.size(i))` -> `(..., a.size(i), 1, ..., 1)` - `(..., a.size(i), 1, ..., 1)` -> `(..., a.size(i))` Fixes: https://github.com/llvm/torch-mlir/issues/2448	2023-09-27 09:09:30 -07:00
Daniel Garvey	ff7f8b21dc	update llvm-project to d13da154a7c7eff77df8686b2de1cfdfa7cc7029 (#2483 )	2023-09-26 16:15:55 -05:00
Ramiro Leal-Cavazos	c9fd78988e	[NFC] Clean-up `ConvertAtenViewOp` in linalg backend (#2470 ) While trying to fix a bug in the `ConvertAtenViewOp` pattern in the linalg backend, I realized that the pattern had become quite complex and had accumulated some dead code, making it hard to reason about. This commit simplifies the pattern quite a bit. The main changes are: 1. All the static helper functions in the `ConvertAtenViewOp` class have been simplified, both in their signature and their body. Each one now performs simple calculations on arrays, and take the least number of arguments necessary. 2. The body of [the `while` loop](`9fce566b0c/lib/Conversion/TorchToLinalg/DataMovement.cpp (L407)`) inside the main pattern has been changed to work on `MutableArrayRef` slices, to avoid having to keep track of `start` and `end` indices for the input and output shape arrays. 3. All the heuristics used to determine the mapping between the input and output dimensions are now in [this relatively short `if-else` section](`9fce566b0c/lib/Conversion/TorchToLinalg/DataMovement.cpp (L428-L460)`), making it easy to see what is going on. 4. Dead code was eliminated + updates to some of the documentation comments This commit does not add any new functionality to the `ConvertAtenViewOp` pattern.	2023-09-26 09:20:01 -07:00

... 3 4 5 6 7 ...

979 Commits (f66908f190377692e9448f33c9ac6f7daf7160fb)