torch-mlir

Commit Graph

Author	SHA1	Message	Date
zjgarvey	d0933b0eb6	[TorchToLinalg] Fix possible OOB access in Interpolate lowering (#3570 ) Following up from the discussion in <https://github.com/llvm/torch-mlir/pull/3550>, I've edited the lowering to prevent OOB extracts in a more direct fashion (i.e., just clamping directly). I don't think this affects the lit tests at all, but I've tested the changes in our external test suite at <https://github.com/nod-ai/SHARK-TestSuite/tree/main/>. I found the issue when I was unexpectedly getting `nan`'s along the output image border for a resize test there.	2024-08-02 13:55:37 -05:00
Rob Suderman	f7b5c13870	Change linalg.matmul_unsigned to linalg.matmul with unsigned type_fn (#3587 ) Change linalg.matmul_unsigned to linalg.matmul with unsigned type_fn Signed-off-by: Max Dawkins <max.dawkins@gmail.com> Co-authored-by: Max Dawkins <max.dawkins@gmail.com>	2024-08-02 11:32:24 -07:00
zjgarvey	af236dab66	Add support for multiple dynamic reassociation dims for unflatten.int (#3504 ) Addresses an issue with onnx.Gather lowering to linalg: <https://github.com/nod-ai/SHARK-Turbine/issues/242> The builder for tensor.expand_shape, without an explicitly provided output shape, fails to infer an output shape in the case of multiple dynamic reassociation dims. I tried adding the output shape explicitly for tensor.expand_shape, but ran into compilation issues later on (see <https://github.com/iree-org/iree/issues/17760>). This PR adds support by lowering this op to tensor.reshape when multiple dynamic reassociation dims are provided.	2024-06-28 09:59:51 -07:00
Matthias Gehre	6678e1a256	TorchToLinalg: Try folding shape computations to keep static shapes when possible (#3475 ) Before this PR, a statically shaped aten.convolution would generate dynamically shaped linalg IR, and even `-canonicalize` would not be able to fold it back into static shapes. This PR ensure that shape calculations are folded on construction to directly generate statically shaped linalg IR. We achieve that by ensuring that `arith` ops involved in computing shapes are created via `createOrFold`, so that later uses of `getAsOpFoldResult` see constants instead of those ops. For example ``` module { func.func @forward(%arg0: !torch.vtensor<[32,336,112,112],f32>, %arg1: !torch.vtensor<[336,168,3,3],f32>, %arg2: !torch.vtensor<[336],f32>) -> !torch.vtensor<[32,336,56,56],f32> { %false = torch.constant.bool false %int2 = torch.constant.int 2 %int1 = torch.constant.int 1 %0 = torch.prim.ListConstruct %int1, %int1 : (!torch.int, !torch.int) -> !torch.list<int> %1 = torch.prim.ListConstruct %int2, %int2 : (!torch.int, !torch.int) -> !torch.list<int> %2 = torch.prim.ListConstruct : () -> !torch.list<int> %3 = torch.aten.convolution %arg0, %arg1, %arg2, %1, %0, %0, %false, %2, %int2 : !torch.vtensor<[32,336,112,112],f32>, !torch.vtensor<[336,168,3,3],f32>, !torch.vtensor<[336],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int -> !torch.vtensor<[32,336,56,56],f32> return %3 : !torch.vtensor<[32,336,56,56],f32> } } ``` would result in ``` [...] %padded = tensor.pad %2 low[%14, %15, %16, %17] high[%14, %15, %16, %17] { ^bb0(%arg3: index, %arg4: index, %arg5: index, %arg6: index): tensor.yield %cst : f32 } : tensor<32x336x112x112xf32> to tensor<?x?x?x?xf32> [...] %45 = linalg.conv_2d_ngchw_gfchw {dilations = dense<1> : vector<2xi64>, strides = dense<2> : vector<2xi64>} ins(%expanded, %expanded_37 : tensor<?x2x?x?x?xf32>, tensor<2x168x168x3x3xf32>) outs(%expanded_44 : tensor<32x2x168x?x?xf32>) -> tensor<32x2x168x?x?xf32> [...] ``` and with this PR all shapes are static.	2024-06-27 08:43:10 +02:00
zjgarvey	694210f429	[TorchToLinalg] Fix Quantized Convolution Accumulator Type (#3459 ) 1. truncates zero-points to i32 2. modifies the default accumulator type for i8 from i64 to i32. 3. now uses the input dtype to infer accumulator dtype.	2024-06-20 13:54:20 -07:00
Peiming Liu	ba16bad8c7	[torch-mlir] bump stablehlo/llvm version (#3471 ) Update to llvm/llvm-project@5207632f86 Update to openxla/stablehlo@d41390c3a7	2024-06-18 16:59:53 -07:00
zjgarvey	7cd3368b20	[ONNX] Fix resize ceil numerics and add half_pixel_symmetric support (#3443 ) This patch fixes several failing tests in our [external test suite](https://github.com/nod-ai/SHARK-TestSuite/tree/main/iree_tests/onnx/node/generated), and addresses some of the issues discussed in #3420	2024-06-11 22:35:50 -05:00
Matthias Gehre	e07a0bfc54	onnx.resize: Add support for coordTfMode "half_pixel" (#3441 ) half_pixel is also the default mode used by ONNX, see https://onnx.ai/onnx/operators/onnx__Resize.html	2024-06-10 20:59:29 +02:00
aldesilv	f794582b18	add resize nearest mode round_prefer_floor, round_prefer_ceil, ceil (#3421 )	2024-06-07 14:04:11 -05:00
zjgarvey	074098d20c	Modifies onnx resize lowering to fix numerical issues (#3381 ) Updates: - some unsupported modes are now going to report a match failure for unsupported coordinate transformation modes. - fixes a bug that was introduced in the last patch for resize (my bad...) - uses actual x and y coordinates for computing weights in bilinear interpolation (rather than eps modified values) - slightly simplifies the bilinear interpolation payload for readability and performance - passes coordinate transformation mode information from an onnx.Resize op to the mode string for the aten._interpolate op. This allows us to perform custom logic in the torch->linalg lowering to support onnx.Resize options without losing the default behaviors of the interpolate op.	2024-05-30 20:34:37 -04:00
zjgarvey	297c270980	onnx.Resize and aten._interpolate : allow n spatial dims. (#3368 ) The old lowering only had logic for 2d (i.e. images). this patch allows interpolation for n spatial dims, which is required for some 3d vision models such as - onnx/models/pytorch-3dunet_vaiq_int8 which successfully compiles and runs with this patch.	2024-05-20 13:35:27 -07:00
zjgarvey	6cba93b16e	[ONNX][TorchToLinalg] Add support for dynamic dims in Interpolate lowering (#3351 ) Addresses [Shark-Turbine #196](https://github.com/nod-ai/SHARK-TestSuite/issues/196) Related tracker [Shark-Turbine #566](https://github.com/nod-ai/SHARK-Turbine/issues/566) Related onnx.Resize issues [Shark-Turbine #616](https://github.com/nod-ai/SHARK-Turbine/issues/616)	2024-05-17 12:18:57 -07:00
Stella Laurenzo	00efec0b73	[linalg] Implement strict mode lowering for aten.view. (#3319 ) * Enables assume_strict_symbolic_shapes on fx_importer imported programs, indicating strict shape semantics. * Reworks the view->reshape lowering to take advantage of strict mode and do one of: * Collapse to 0D * Flatten/Unflatten when there is an inferred dim. * Fallback to tensor.reshape * Splits some test cases up and adds an attribute to control the old pattern (so new corners can be tested in strict mode in isolation). * Dynamic inferred mode needs upstream work to generalize expand_shape (so that case is suppressed here). * Deletes the assert from the existing tensor.reshape lowering if strict shape mode is enabled (since the condition it is dynamically asserting cannot happen).	2024-05-10 13:45:50 -07:00
Andreas Falkenberg	adafd51823	[onnx] Gridsampler addition of nearest mode (#3320 ) Added nearest neighbor selection for onnx.Gridsampler	2024-05-10 11:42:10 -07:00
NeverRaR	1d4859699b	MaxPool1d lowering to linalg (#3295 ) Co-authored-by: root <root@i32b01216.sqa.eu95>	2024-05-10 22:05:26 +05:30
Aart Bik	a033bbfe6c	[torch-mlir][sparse] recognize to_dense primitive (#3308 ) also maps simply to sparse_tensor.convert the sparsity types do the rest!	2024-05-08 22:50:17 -07:00
aldesilv	ec6d7aa5d2	OnnxToTorch lowering resize op (#3013 ) https://github.com/nod-ai/SHARK-Turbine/issues/358 adds a lowering from onnx to linalg for bilinear and nearest resize with support for using scales or sizes to get resize shape. uses coordinate transform half pixel for bilinear mode and asymmetrical for nearest mode. See https://github.com/onnx/onnx/blob/main/docs/Operators.md#Resize. Added two passes -- one for bilinear and the other for nearest.	2024-05-08 21:35:03 +00:00
Benoit Jacob	bce800a3f4	Integrate llvm-project at dabdec1001dc368373dd581cf72f37a440873ce3 (#3300 ) Co-authored-by: Jacques Pienaar <jpienaar@google.com>	2024-05-08 14:43:06 -04:00
Stella Laurenzo	5d4b803914	[NFC reformat] Run pre-commit on all files and format misc. This is part 1 of ~3, formatting all miscellaneous text files and CPP files matched by a first run of pre-commit. These tend to be low change-traffic and are likely not disruptive. Subsequent patches will format Python files and remaining CPP files.	2024-04-27 14:08:09 -07:00
Aart Bik	4361178caa	[torch-mlir][sparse] recognize sparse tensor conversion (#3226 ) Sparse tensor conversions are represented by special aten operators. This PR ensures the conversions are recognized (instead of failing the full torch aten lowering to linalg).	2024-04-26 02:32:07 +08:00
Rob Suderman	0e77de996a	[torch] Add support for `torch.view` with dynamic shapes (#3164 ) We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.	2024-04-18 11:47:19 -07:00
Andreas Falkenberg	b66eabd492	[onnx][torch][linalg] Implementing align-corner modes for gridsampler (#3171 ) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-04-17 13:38:19 -07:00
Thomas Dietert	d2432bbe5a	[MLIR][Torch] Do not convert bias tensor to element type if NoneType (#3072 ) The `convertTensorToElementType` function expects it's argument to have a valid tensor type that is not `Torch::NoneType`. This PR checks that the bias tensor is not of type `Torch::NoneType` before calling `convertTensorToElementType` on the bias tensor argument in the `matchAndRewrite` member function of the `ConvertAtenConvolutionOp` class.	2024-04-02 14:19:26 +05:30
Andreas Falkenberg	55dc8deb92	[torch] GridSample TorchToLinalg lowering (#2883 ) Lowers `torch.grid_sample` to the equilvalent `linalg` representation.	2024-02-23 09:14:38 -08:00
Scott Todd	d6e1d836ca	Drop torch attributes at the end of backend conversion. (#2876 ) Fixes https://github.com/llvm/torch-mlir/issues/2866 Some backends / downstream projects expect that a "fully converted" program has no remaining ops or attributes from the original dialect(s).	2024-02-13 14:32:02 -08:00
Aart Bik	46a25d7241	[torch-mlir][sparse] preserve sparsity during lowering torch to linalg (#2809 ) This preserves sparsity at the most obvious places of lowering TORCH tensors to MLIR RankedTensorType tensors. Other places are marked for audit. With some initial lowering tests.	2024-01-26 10:54:59 -08:00
Aart Bik	0aed231e21	[torch-mlir][conversion-test] cleanup trailing whitespace in mlir files (#2807 )	2024-01-25 14:24:28 -08:00
John Wu	704cfdaf08	Add aten.pool_max3d support to torch-to-linalg (#2735 ) Added verification logic to the abstract_interpreter_lib_gen.py Also made some unit tests Initially, I thought we can use `linalg::pooling_ndhwc_max` to help implement this problem. However, on a 5-dimensional matrix it does the pooling on dimensions (2, 3, 4) which is not what we want. We want pooling on dimensions (3, 4, 5). To achieve this, we would need to lower our code using the `linalg` dialect. Turns out the pooling code in `linalg` looks like this. ``` func @max_pooling_ncdhw(%I: memref<?x?x?x?x?xf32>, %K: memref<3xindex>, %O: memref<?x?x?x?x?xf32>, %strides: memref<3xindex>, %dilations: memref<3xindex>) { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %N = memref.dim %I, %c0 : memref<?x?x?x?x?xf32> %C = memref.dim %I, %c1 : memref<?x?x?x?x?xf32> %D = memref.dim %I, 2 : memref<?x?x?x?x?xf32> %H = memref.dim %I, 3 : memref<?x?x?x?x?xf32> %W = memref.dim %I, 4 : memref<?x?x?x?x?xf32> %kernel_d = memref.load %K[%c0] : memref<3xindex> %kernel_h = memref.load %K[%c1] : memref<3xindex> %kernel_w = memref.load %K[2] : memref<3xindex> %stride_d = memref.load %strides[%c0] : memref<3xindex> %stride_h = memref.load %strides[%c1] : memref<3xindex> %stride_w = memref.load %strides[2] : memref<3xindex> %dilation_d = memref.load %dilations[%c0] : memref<3xindex> %dilation_h = memref.load %dilations[%c1] : memref<3xindex> %dilation_w = memref.load %dilations[2] : memref<3xindex> linalg.generic { indexing_maps = [ affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d * %stride_d + kd * %dilation_d, h * %stride_h + kh * %dilation_h, w * %stride_w + kw * %dilation_w)>, // Map for input tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (kd, kh, kw)>, // Map for kernel tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d, h, w)> // Map for output tensor ], iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"], doc = "3D Max Pooling NCDHW with Strides, Dilations, and Kernel Size" } ins(%I, %K : memref<?x?x?x?x?xf32>, memref<3xindex>) outs(%O : memref<?x?x?x?x?xf32>) { ^bb0(%input_elem: f32, %kernel_elem: index, %output_elem: f32): %max_val = arith.maxf %input_elem, %output_elem : f32 linalg.yield %max_val : f32 } return } ``` This was implemented based on it's source code with the adjustments mentioned above: `4ca1b5e094/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml (L5647)` Issues related to this can be found here https://github.com/nod-ai/SHARK-Turbine/issues/324	2024-01-19 21:09:46 +05:30
Rob Suderman	a24aadbfab	[aten] Make `torch.aten.matmul` to `linalg` work for non-broadcasting case (#2659 ) Broadcasting for `torch.aten.matmul` is optional so a MxN with NxK matmul should be legalized to a `linalg.matmul`.	2023-12-20 10:09:10 -08:00
Rob Suderman	791c666479	[torch] Lower `torch.aten.sinh` to `linalg` (#2662 )	2023-12-18 09:15:12 -08:00
Quinn Dawkins	030b0140d4	[TorchToLinalg] Lower aten.cat to tensor.concat (#2650 ) This replaces the lowering of aten.cat with tensor.concat, allowing more efficient handling of concatenations in downstream flows. The refbackend populates concat decomposition patterns that can be used to recover the previous lowering.	2023-12-15 15:45:32 -05:00
Quinn Dawkins	141202bc01	[TorchToLinalg] Fix integer type handling for aten.mm (#2615 ) Despite aten.mm requiring the input and output types match, we still opt to maintain signedness semantics in case later passes try to do any sort of integer type narrowing.	2023-12-07 00:13:53 -05:00
James Newling	647f2f5076	Additional tests for view lowering (#2584 ) The logic for lowering the aten view op to linalg is fairly complex. In this PR I have tried to follow all non-failing paths through the lowering and add unit tests where they're missing. There is 1 logical change to the lowering: redundant tensor.cast ops (same source and destination type) are folded.	2023-11-20 17:35:25 -08:00
Daniel Garvey	4901773f77	add uncovered cases in view lowering (#2524 ) removes unecessary checks from empty strided	2023-11-01 21:56:44 -05:00
Quinn Dawkins	6f81ad7293	[TorchToLinalg] Improve broadcast lowerings in strict symbolic modes (#2505 ) With strict symbolic shapes, we can assume numpy-style dynamic broadcasts never occur. This improves the lowering in the presence of this assumption.	2023-10-05 15:15:26 -04:00
Stella Laurenzo	860be09a39	Elide dynamic broadcast checks when in strict symbolic shapes mode. (#2496 ) When importing dynamic shaped programs from Dynamo, via torch.compile or torch.export, we can assume that strict symbolic shape checks have been done prior to generating torch IR. Among other shape checking, this eliminates the case where an unknown dimension can be dynamically '1' in a way that signals a broadcast. Adds a `isAssumingStrictSymbolicShapes` utility which consults a `torch.assume_strict_symbolic_shapes` attribute on an enclosing scope and returns true if present. In the linalg pipeline, many runtime checks are elided when this returns true.	2023-09-29 16:45:48 -07:00
Jiawei Wu	60bad54f27	[Torch Dialect] replace none-index in aten.Index.Tensor's param by manually generating it (#2344 ) * [Torch Dialect] replace none-index in aten.Index.Tensor's param by manually generating it Co-authored-by: Jiawei Wu <wujiawei.aml@bytedance.com> Co-authored-by: Jianzhe Xiao <jianzhe.xiao@bytedance.com> * minor typo fix * add new failed e2e tests for ltc * fix typo * Address comments * Add more e2e tests * add failed e2e tests for LTC * address comments * remove decomposition for AtenIndexTensorHackedTwinOp	2023-08-15 19:36:08 +08:00
Vivek Khandelwal	f6a6cfea4e	[MLIR][TORCH] Add support for negative index values for index.Tensor op (#2233 ) This commit adds the support for index.Tensor op when the index values are negative. This commit wraps around the index values by checking their values at run time. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-06-16 14:21:04 -05:00
Vivek Khandelwal	da886280fe	[MLIR][TORCH] Add E2E support for aten.tril op (#2202 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-06-05 16:17:01 -07:00
Yuanqiang Liu	ef6dae6ae2	[Linalg] fix lowering reduce max with -inf (#2097 )	2023-05-08 09:17:49 -07:00
Vivek Khandelwal	d9cbf01d1e	Revert "build: update llvm tag to 147fe9de" This reverts commit `e45ad313d4`.	2022-11-25 12:41:56 +05:30
Vivek Khandelwal	e45ad313d4	build: update llvm tag to 147fe9de Summary of changes: - Update call to `hasNoEffect` utility - `KDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-24 12:44:43 +05:30
Ashay Rane	a11ea93877	build: update llvm tag to f8b84268 (#1528 ) The only change required was to update a test to reflect the changes in https://reviews.llvm.org/D136541.	2022-10-26 15:33:53 -05:00
Ramiro Leal-Cavazos	82a3860e25	build: update llvm tag to 4546397e (#1502 ) This commit makes the following changes needed to update bump LLVM: - Replace `linalg.init_tensor` with `tensor.empty` (see: https://reviews.llvm.org/D135129) - Replace `NoSideEffect` with `Pure` (see https://reviews.llvm.org/D135505) - Replace `body` region accessor for `ReduceOp` and `ReduceWindowOp` with `getBody` - Fix incorrect use of `tosa::ReduceSumOp` in `AtenNativeLayerNormOp` conversion pattern. The result type of `tosa::ReduceSumOp` must have the same rank as the input type. (see: https://www.mlplatform.org/tosa/tosa_spec.html#_reduce_sum) Co-authored-by: Ashay Rane <ashay@users.noreply.github.com> Co-authored-by: Ashay Rane <ashay@users.noreply.github.com>	2022-10-18 04:22:53 +00:00
AmosLewis	940959589b	[MLIR][TORCH] Add Byte and Char Dtype support	2022-09-30 13:19:31 +05:30
JakopinA	8ef0c874c2	Implement Expand/Collapse Functionality for Aten.View (#1353 )	2022-09-27 11:08:14 -07:00
Vivek Khandelwal	65d811e267	[MLIR][TORCH] Fix dynamic cases for aten.index.Tensor	2022-08-19 12:13:20 +05:30
Tanyo Kwok	290d7755fb	importer: add initial support for loading Float16 tensors (#1169 ) follow up #761: This patch updates the `torch_mlir::convertTensorToMlirElementsAttr()` method to enable the creation of tensors whose base type is Float16. This patch also adds a test to validate the IR generation, and it updates the test for importing tensors of various types.	2022-08-08 12:37:31 +08:00
Ashay Rane	234fc7fe0c	linalg: lower `aten.triu` op to `linalg.generic` (#965 ) Prior to this patch, the torch dialect included `AtenTriuOp` for computing the upper triangular part of the input matrix, but there was no code for lowering the op to the linalg dialect. This patch adds code to generate a `linalg.generic` operation that compares indices (computed using `linalg.index`) to choose between zero or the original value (using `arith.select`). The lowering fails if the number of dimensions are less than two. This patch also adds a few end-to-end tests.	2022-06-23 22:45:48 -07:00
Vivek Khandelwal	6f548fc3ad	[MLIR][TORCH] Add decomposition of aten.adaptive_avg_pool2d op This commit adds the decomposition of `aten.adaptive_avg_pool2d` op into `aten.avg_pool2d` op. The current decomposition only supports cases where input size is equal to the output size. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-27 07:56:37 +05:30

1 2

74 Commits (c71728b18217de0eb56afbb0cb6321715d1b79d3)