torch-mlir

Commit Graph

Author	SHA1	Message	Date
zjgarvey	de28c8540b	[ONNX] add int16 quantization support (#3446 ) There is currently no int16 quantization support in torch. This patch adds a new mlir type to correspond to the missing "torch.qint16" type, and enables lowering of quantization-related onnx ops using int16 types. In follow-up patches, custom quantization logic for ops like aten.matmul/aten.mm/aten.convolution may need to be revisited to allow support for qint16. The passes in FuseQuantizedOps.cpp may also need slight modifications.	2024-06-12 10:37:22 +05:30
zjgarvey	7cd3368b20	[ONNX] Fix resize ceil numerics and add half_pixel_symmetric support (#3443 ) This patch fixes several failing tests in our [external test suite](https://github.com/nod-ai/SHARK-TestSuite/tree/main/iree_tests/onnx/node/generated), and addresses some of the issues discussed in #3420	2024-06-11 22:35:50 -05:00
Matthias Gehre	e07a0bfc54	onnx.resize: Add support for coordTfMode "half_pixel" (#3441 ) half_pixel is also the default mode used by ONNX, see https://onnx.ai/onnx/operators/onnx__Resize.html	2024-06-10 20:59:29 +02:00
Aart Bik	d77bab37d1	[torch-mlir][sparse] re-enable all sparse tests (#3444 ) this fixes the following issue: https://github.com/llvm/torch-mlir/issues/3418	2024-06-10 11:19:32 -07:00
Yuanqiang Liu	689efc8917	[Torch] fix toBuiltinTensor() (#3415 ) * Let `toBuiltinTensor()` reflects the original dtype of `!torch.vtensor`. * Backend handles dtype conversion themselves.	2024-06-08 09:36:32 +08:00
aldesilv	f794582b18	add resize nearest mode round_prefer_floor, round_prefer_ceil, ceil (#3421 )	2024-06-07 14:04:11 -05:00
penguin_wwy	d59d0b6e5a	[Linalg] Promote type for compare tensor op (#3416 )	2024-06-04 16:05:39 -07:00
Vivek Khandelwal	661be2d5b0	[MLIR][Torch] Add TorchToLinalg lowering for AtenAvgPool3dOp (#3030 ) This commit also fixes the average pool op' test failing for OnnxToLinalg lowering. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-04 22:12:34 +05:30
Yuanqiang Liu	50f7103098	[Stablehlo] support uint8 (#3367 ) Support lowering unsigned integer type to stablehlo as discussed in https://github.com/llvm/torch-mlir/pull/2184. The things I do in this PR: 1. create `setupBackendTypeConversionForStablehlo()`, `createFuncBackendTypeConversionForStablehloPass` and `createFinalizingBackendTypeConversionForStablehloPass`. 2. remove `InferTypeOpInterface` from `torch_c.to_builtin_tensor`, because it's different result type between linalg backend and stablehlo backend: ``` // linalg backend func.func @forward(%arg0: !torch.vtensor<[3],ui8>) -> tensor<3xf32> { %c = torch_c.to_builtin_tensor %arg0 : (!torch.vtensor<[3], ui8> -> tensor<3xi8> %0 = tensor.empty() : tensor<3xf32> %1 = linalg.generic {indexing_maps = [#map, #map], iterator_types = ["parallel"]} ins(%arg0 : tensor<3xi8>) outs(%0 : tensor<3xf32>) { ^bb0(%in: i8, %out: f32): %2 = arith.uitofp %in : i8 to f32 linalg.yield %2 : f32 } -> tensor<3xf32> return %1 : tensor<3xf32> } // stablehlo backend func.func @forward(%arg0: !torch.vtensor<[3],ui8>) -> tensor<3xf32> { %c = torch_c.to_builtin_tensor %arg0 : (!torch.vtensor<[3], ui8> -> tensor<3xui8> %0 = stablehlo.convert %arg0 : (tensor<3xui8> -> tensor<3xf32> return %0 : tensor<3xf32> } ``` 3. fix stablehlo and linalg's conversion	2024-06-04 09:04:59 +08:00
zjgarvey	8995c90879	[TorchToLinalg] add support for quantized group conv (#3341 ) This addresses 7 of the model failures I'm seeing in the test suite. See [Shark-Turbine issue #566](https://github.com/nod-ai/SHARK-Turbine/issues/566). Need the op ```linalg.conv_2d_ngchw_gfchw_q``` to be added upstream before merging this. See [llvm-project PR #92136 ](https://github.com/llvm/llvm-project/pull/92136). A small additional expansion to operand quantization is included in this patch to address a model failure that occurs when unblocking the quantized group convolutions in one of these onnx models.	2024-06-03 21:57:44 +05:30
Rob Suderman	afca88a058	[NFC] Change to cast instead of .cast variants (#3405 ) Member casts have been deprecated. Changing over a bunch of the member cast calls to the global templated variants to remove deprecation warnings.	2024-05-30 23:45:13 -07:00
zjgarvey	074098d20c	Modifies onnx resize lowering to fix numerical issues (#3381 ) Updates: - some unsupported modes are now going to report a match failure for unsupported coordinate transformation modes. - fixes a bug that was introduced in the last patch for resize (my bad...) - uses actual x and y coordinates for computing weights in bilinear interpolation (rather than eps modified values) - slightly simplifies the bilinear interpolation payload for readability and performance - passes coordinate transformation mode information from an onnx.Resize op to the mode string for the aten._interpolate op. This allows us to perform custom logic in the torch->linalg lowering to support onnx.Resize options without losing the default behaviors of the interpolate op.	2024-05-30 20:34:37 -04:00
penguin_wwy	e4be197efd	[FxImporter] Fix transpose rank zero (#3382 )	2024-05-30 14:31:18 +08:00
penguin_wwy	1f544c37d0	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
Gaurav Shukla	43f961eca4	[MLIR] Fix 64-bit product during aten.view lowering (#3378 ) std::accumulate needs 64-bit init value to perform 64-bit arithmetic on a list of integers. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-05-23 08:59:28 +05:30
zjgarvey	297c270980	onnx.Resize and aten._interpolate : allow n spatial dims. (#3368 ) The old lowering only had logic for 2d (i.e. images). this patch allows interpolation for n spatial dims, which is required for some 3d vision models such as - onnx/models/pytorch-3dunet_vaiq_int8 which successfully compiles and runs with this patch.	2024-05-20 13:35:27 -07:00
zjgarvey	6cba93b16e	[ONNX][TorchToLinalg] Add support for dynamic dims in Interpolate lowering (#3351 ) Addresses [Shark-Turbine #196](https://github.com/nod-ai/SHARK-TestSuite/issues/196) Related tracker [Shark-Turbine #566](https://github.com/nod-ai/SHARK-Turbine/issues/566) Related onnx.Resize issues [Shark-Turbine #616](https://github.com/nod-ai/SHARK-Turbine/issues/616)	2024-05-17 12:18:57 -07:00
Peiming Liu	ccb772cd0f	[sparse] propagate sparsity properly when decompose torch operations. (#3318 )	2024-05-15 10:09:27 -07:00
Stella Laurenzo	00efec0b73	[linalg] Implement strict mode lowering for aten.view. (#3319 ) * Enables assume_strict_symbolic_shapes on fx_importer imported programs, indicating strict shape semantics. * Reworks the view->reshape lowering to take advantage of strict mode and do one of: * Collapse to 0D * Flatten/Unflatten when there is an inferred dim. * Fallback to tensor.reshape * Splits some test cases up and adds an attribute to control the old pattern (so new corners can be tested in strict mode in isolation). * Dynamic inferred mode needs upstream work to generalize expand_shape (so that case is suppressed here). * Deletes the assert from the existing tensor.reshape lowering if strict shape mode is enabled (since the condition it is dynamically asserting cannot happen).	2024-05-10 13:45:50 -07:00
Andreas Falkenberg	adafd51823	[onnx] Gridsampler addition of nearest mode (#3320 ) Added nearest neighbor selection for onnx.Gridsampler	2024-05-10 11:42:10 -07:00
NeverRaR	1d4859699b	MaxPool1d lowering to linalg (#3295 ) Co-authored-by: root <root@i32b01216.sqa.eu95>	2024-05-10 22:05:26 +05:30
penguin_wwy	afe87d62b4	[Linalg] [Stablehlo] Promote type for compare scalar op (#3306 )	2024-05-10 02:20:06 +08:00
Aart Bik	a033bbfe6c	[torch-mlir][sparse] recognize to_dense primitive (#3308 ) also maps simply to sparse_tensor.convert the sparsity types do the rest!	2024-05-08 22:50:17 -07:00
penguin_wwy	0f0f57c960	[Linalg] Refactor compare scalar op (#3294 )	2024-05-09 10:40:19 +08:00
aldesilv	ec6d7aa5d2	OnnxToTorch lowering resize op (#3013 ) https://github.com/nod-ai/SHARK-Turbine/issues/358 adds a lowering from onnx to linalg for bilinear and nearest resize with support for using scales or sizes to get resize shape. uses coordinate transform half pixel for bilinear mode and asymmetrical for nearest mode. See https://github.com/onnx/onnx/blob/main/docs/Operators.md#Resize. Added two passes -- one for bilinear and the other for nearest.	2024-05-08 21:35:03 +00:00
zjgarvey	72349f7522	[TorchToLinalg] Adds Quantization Support for ConvTranspose (#3240 ) I spent a little while debugging numerics issues with some tests similar to the ones in quantized_models.py, only to find that pytorch's quantized conv transpose is catastrophically inaccurate. I'll upstream the issue and only leave the tests here which are of the form quantize -> dequantize -> op.	2024-04-30 09:23:09 -07:00
penguin_wwy	6679728c56	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 ) Like #3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated	2024-04-27 14:00:56 -07:00
Aart Bik	4361178caa	[torch-mlir][sparse] recognize sparse tensor conversion (#3226 ) Sparse tensor conversions are represented by special aten operators. This PR ensures the conversions are recognized (instead of failing the full torch aten lowering to linalg).	2024-04-26 02:32:07 +08:00
Xinyu Yang	7030eacb76	[stablehlo] Support aten.any and aten.all lowering (#3217 )	2024-04-25 11:15:52 +08:00
Avinash Sharma	678c03b762	Fix nan issue for fp16 torch.randn/randn_like in ConvertAtenUniformOp (#3184 ) For ops that use ConvertAtenUniformOp (e.g. torch.randn/randn_like), fp16 datatype returns nan values. Trying to lower [this repro](https://gist.github.com/aviator19941/1c65e658241dea6906ca423f9abaee69) will result in nan's, this PR fixes the issue.	2024-04-24 12:28:08 +05:30
Xinyu Yang	4da3d714cc	[Torch] Support AtenProdOp on linalg and stablehlo (#3215 )	2024-04-24 11:14:04 +08:00
zjgarvey	a8ba865fca	[torch] Adds Quantization Support for `aten.relu` (#3177 ) A choice was made to quantize the return type of Relu with a scale and zero point copied from the input's quantization scheme. With this choice, the torch-to-linalg conversion of quantized Relu essentially computes max(input, zeroPoint) in the elementwise payload.	2024-04-23 11:01:36 -07:00
Rob Suderman	0e77de996a	[torch] Add support for `torch.view` with dynamic shapes (#3164 ) We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.	2024-04-18 11:47:19 -07:00
Rob Suderman	4c21e20caa	[torch] Support rank-0 index for torch index select (#3182 ) Need to perform an expand in the case where the indices is rank-0.	2024-04-18 11:32:31 -07:00
Andreas Falkenberg	b66eabd492	[onnx][torch][linalg] Implementing align-corner modes for gridsampler (#3171 ) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-04-17 13:38:19 -07:00
zjgarvey	7a1ad0d7c0	[TorchToLinalg] Adds Support for Remaining Quantized Matmul Cases (#3167 ) The new cases added for quantized matmuls are: 1. vec-vec 2. vec-mat 3. mat-vec each of which are now lowered to expand(s), quantized_matmul, and collapse.	2024-04-16 09:28:28 -07:00
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
IanWood1	5708ee7ec9	Added 2 Ops: Floor divide scalar and Floor divide scalar mode (#3156 ) - Added linalg lowering for `AtenFloorDivideScalarOp` - Needed `AtenDivScalarModeOp` for the decomp. - Added linalg lowering for `AtenDivScalarModeOp` - Moved linalg payload logic to `createDivModePayload()` since the logic was nearly identical for both `AtenDivScalarModeOp` and `AtenDivTensorModeOp`. Just a template function - Added `AtenDivScalarModeOp` lowering for stablehlo Pytorch's [`torch.floor_divide()`](https://pytorch.org/docs/stable/generated/torch.floor_divide.html) in a previous version (for a reason unknown to me) preformed a truncation instead of "floor". The already implemented op `AtenFloorDivideTensorOp` was done before this change. However, this wasn't caught because our testcases only tested positive floor division. I changed this to floor as well as adding a few test cases.	2024-04-15 13:45:10 -07:00
Xinan Jiang(姜曦楠)	71d90788d3	[MLIR][TORCH] Support parallel dimemsions expand/collapse (#3051 ) This PR support `aten.view` with unique unknown dimension both in input shape and output shape while the pass convert-torch-to-linalg that lowing `aten.view` to `tensor.collapse_shape` or `tensor.expand_shape`. Below is an example ``` func.func @test_reshape(%arg0: !torch.vtensor<[1,?,50,16],f32>) -> !torch.vtensor<[1,?,16],f32> attributes {torch.assume_strict_symbolic_shapes, torch.onnx_meta.ir_version = 9 : si64, torch.onnx_meta.opset_version = 19 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} { %int1 = torch.constant.int 1 %int-1 = torch.constant.int -1 %int16 = torch.constant.int 16 %0 = torch.prim.ListConstruct %int1, %int-1, %int16 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int> %1 = torch.aten.view %arg0, %0 : !torch.vtensor<[1,?,50,16],f32>, !torch.list<int> -> !torch.vtensor<[1,?,16],f32> return %1 : !torch.vtensor<[1,?,16],f32> } ```	2024-04-11 10:43:03 -07:00
penguin_wwy	d4a30b7e67	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3130 ) We should prefer functional style as the method style is deprecated https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated (https://mlir.llvm.org/deprecation/)	2024-04-11 06:47:35 -07:00
zjgarvey	aa5e150313	Adds Some uint8 Quantization Fixes (#3122 ) 1. Changes the linalg lowering for dequantization ops to always sign cast to float to prevent misrepresenting uint32 overflow on subtraction with zero point. 2. Adds a basic quantized model test which only quantizes and dequantizes and now passes with these changes in linalg and onnx configs. 3. Changes the aten.mm lowering to allow mismatched quantized types. 4. If a quantized matmul arg is uint8, we shift by 128 to faithfully represent the quantization as a signed i8 quantization. This worked fine in the AtenMmOp lowering, but I'd be happy to move it to a rewrite in FuseQuantizedOps.cpp instead if that seems more appropriate. With the changes 3 and 4, the QuantizedMLP_basic and QuantizedSingleLayer_basic e2e tests now passes with the onnx config.	2024-04-10 12:36:58 -07:00
Rob Suderman	9d9a05366e	[torch] Fix aten.squeeze lowering to use result shape (#3106 ) Squeezes can be ambiguous without the output shape information. For instance (1, 1, 256) squeezed can be either (1, 256) or (256). We need to check the resulting shape to know what the shape should look like.	2024-04-04 09:43:12 -07:00
Rob Suderman	f97cd4893f	[torch] Improve shape inference for dynamic shapes (#3091 ) Shapes can be processed as tensors to represent the set of dimensions. As reshapes take a list of scalars this can result in a single dynamic dimension blocking the adjacent static dimensions. This pass attempts to de-couple tensor computations related to shapes and propagate values to better support lowering scalar tensor computations.	2024-04-02 16:19:57 -07:00
Xinyu Yang	ac1cd3d78a	[Torch] Support AtenDivTensorModeOp with static int input for linalg and stablehlo backend (#3088 )	2024-04-02 17:28:53 +08:00
Thomas Dietert	d2432bbe5a	[MLIR][Torch] Do not convert bias tensor to element type if NoneType (#3072 ) The `convertTensorToElementType` function expects it's argument to have a valid tensor type that is not `Torch::NoneType`. This PR checks that the bias tensor is not of type `Torch::NoneType` before calling `convertTensorToElementType` on the bias tensor argument in the `matchAndRewrite` member function of the `ConvertAtenConvolutionOp` class.	2024-04-02 14:19:26 +05:30
ptrifunovic98	1c8c47d483	Add complex support for aten.norm and similar operations (#3052 ) Add support for complex-type input tensors for norm, vector norm, and Frobenius norm operations.	2024-04-02 14:03:30 +05:30
zjgarvey	532d297c46	[ONNX] Preliminary Work Towards Supporting QuantizedMLP_basic onnx e2e test (#3089 ) See the related issues here: [SHARK-Turbine#556](https://github.com/nod-ai/SHARK-Turbine/issues/556) 1. Adds uint8 casting to onnx.Cast op 2. Fixes an issue with onnx.DequantizeLinear when the scale comes with shape [1]. 3. Adds support for unsigned types in an AtenItemOp folder 4. Adds a simpler quantized model for easier debugging 5. Adds a fusion pass to convert [quant -> dequant -> transpose -> mm] patterns to [transpose -> quant -> mm]. 6. Moved some xfails that are still not passing, but for different reasons than onnx.cast failures.	2024-04-01 16:21:05 -07:00
Vivek Khandelwal	6844c84702	[MLIR][Torch] Fix OnnxToLinalg lowering for AvgPool op (#3076 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-01 22:14:14 +05:30
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
schnkmwt	1fcbfa87ec	Implement linalg lowering of diag_embed torch op (#2885 ) This PR adds lowering of diag_embed to linalg dilect. Tracked in https://github.com/nod-ai/SHARK-Turbine/issues/288 --------- Co-authored-by: sachink <sachink@xilinx.com>	2024-03-22 16:32:50 -07:00

1 2 3 4 5 ...

393 Commits (41d04a89959d9197e167302fcae375f947848a88)