torch-mlir

Commit Graph

Author	SHA1	Message	Date
David Tanner	02327af998	Adds onnx ConvTranspose support for autopadding. (#3797 ) Adds onnx ConvTranspose support for autopadding (https://github.com/nod-ai/SHARK-ModelDev/issues/839). - Adds support for attribute auto_pad="SAME_UPPER" or "SAME_LOWER" which will automatically calculate padding of input based on output shape. - Adds support, during auto-padding, for output_shape=[H,W] which overrides the default output shape of input_shape[i]*stride[i] (for spatial dimensions only). - Adds lit test for auto-padding. - Tests are added by https://github.com/nod-ai/SHARK-TestSuite/pull/370 NOTE: ConvTranspose still doesn't support asymmetric padding, therefore multiple original onnx tests still won't pass.	2024-10-18 12:31:33 -05:00
zjgarvey	f08bfc4ff8	[ONNX] simplify shapes fed to broadcast in Expand lowering (#3756 ) Addresses ~200 onnx model compile failures in <https://github.com/nod-ai/SHARK-TestSuite> related to <https://github.com/iree-org/iree/issues/18631>. This change simplifies the result of the generated broadcast op substantially, but reduces the case coverage slightly. The case which will become unsupported: - trying to actually broadcast a dynamic dim that is secretly 1. When does this case appear in practical scenarios? - for a model where onnx shape inference cannot figure out that a dim should be 1. Why do I think we should not support this case for now? 1. For all models with dynamic dim expand ops, the previous path uniformly generates uglier linalg IR (making it harder for IREE to fuse properly with other ops). 2. For models failing shape inference castastrophically enough to fail to see a dim is statically 1, we can try to apply constant folding in the onnx model before importing. Leaving this as a draft PR, since it may be more appropriate to fix the compilation failure in IREE rather than torch-mlir. ### Example of broadcast required in previous path: ```mlir %300 = linalg.generic {indexing_maps = [#map11], iterator_types = ["parallel", "parallel", "parallel", "parallel"]} outs(%299 : tensor<?x12x?x?xi1>) { ^bb0(%out: i1): %306 = linalg.index 0 : index %307 = linalg.index 3 : index %308 = arith.index_cast %285 : i64 to index %309 = arith.cmpi eq, %308, %c1 : index %310 = arith.select %309, %c0, %306 : index %311 = arith.index_cast %286 : i64 to index %312 = arith.cmpi eq, %311, %c1 : index %313 = arith.select %312, %c0, %307 : index %extracted_79 = tensor.extract %reshape_78[%310, %c0, %c0, %313] : tensor<?x1x1x?xi1> linalg.yield %extracted_79 : i1 } -> tensor<?x12x?x?xi1> ``` ### Example of broadcast with simplified shape list: ```mlir %409 = linalg.generic {indexing_maps = [#map15, #map11], iterator_types = ["parallel", "parallel", "parallel", "parallel"]} ins(%reshape_135 : tensor<?x1x1x?xi1>) outs(%408 : tensor<?x12x?x?xi1>) { ^bb0(%in: i1, %out: i1): linalg.yield %in : i1 } -> tensor<?x12x?x?xi1> ```	2024-10-03 20:11:51 -05:00
zjgarvey	d2c387dd04	[ONNX] Fix issue with absent value in onnx.ConstantOfShape (#3713 ) Previously, if the value was absent, this conversion was creating a dense resource of value 0 with shape equal to the result shape, then later re-extracting a splat value. This only works if the shape is statically known, and even when the shape is known, this is completely unnecessary since the value's shape should be `[1]` and not the result shape. This patch simply sets the `splatvalue` to a `torch.constant.float 0.0` when the onnx op's `value` attr is absent, and adds `nullptr` checks to the subsequent conditionals to avoid them in the case where an `attr` is not given. Addresses <https://github.com/nod-ai/SHARK-Turbine/issues/831>.	2024-09-17 16:01:01 -05:00
giacs-epic	b35675a78e	[onnx] Add support for `auto_pad` in `onnx.Conv` (#3670 ) Add logic for `auto_pad` attribute in the conversion of `onnx.Conv` torch dialect. Add lit tests covering different configurations of `auto_pad`.	2024-09-10 20:31:53 +05:30
Vivek Khandelwal	4a0bed0ce0	[ONNX] Add training mode support for BatchNormalization op (#3597 ) This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-14 10:46:38 +05:30
Rob Suderman	8358e8c255	[onnx] Add support for `fp8` `onnx.DequantizeLinear` (#3617 ) Fp8 needs a slightly different path for dequantization as the `torch` dequantize operation does not support `fp8` types.	2024-08-08 16:20:53 -07:00
Rob Suderman	6c33ab024e	[onnx] `onnx.CenterCropPad` used an incorrect type for toScalar (#3605 ) To scalar should have a rank-0 tensor type not rank-1 with length 1. Changing allows proper compilation.	2024-08-07 20:33:33 -07:00
Rob Suderman	d273bdfabf	[onnx] Fix default `alpha` for `onnx.Elu` (#3583 ) We were defaulting to `0.0` for `onnx.Elu` when it is supposed to be `1.0`.	2024-08-02 09:29:17 -07:00
Vivek Khandelwal	15cf7106c4	[ONNX] Reduce Onnx.Flatten op version (#3560 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-24 21:27:20 +05:30
zjgarvey	0fb8b017d8	Adds misc fixes for some padding related issues (#3528 ) This patch adds a few misc pad op related changes: 1. Addresses issue <https://github.com/llvm/torch-mlir/issues/3457> 2. Addresses issue <https://github.com/llvm/torch-mlir/issues/3442> 3. Fixes the padding order for asymmetrically padded onnx.Conv ops 4. Enables passing quantization through those onnx.Conv op pre-paddings 5. Modifies the torch-to-linalg lowering of AtenReplicationPad2d op to enable support for input rank != 4 Unfortunately, even with all of these changes, the e2e tests for the ReplicationPad2d still fail the onnx config, since the torch export procedure for rearranging the pad order is complicated enough that the padding ints end up not being able to fold back to constants.	2024-07-11 20:01:45 -05:00
jinchen	3915db0a86	[ONNX] Add OnnxToTorch support for CenterCropPad (#3496 )	2024-06-28 12:47:29 -07:00
Phaneesh Barwaria	5a627c46b7	onnx.DFT basic support (#3463 ) - adds support for DFT v20 on the FFT and IFFT path - adds required skeleton code for IFFT ops to be recognised in TMlir	2024-06-28 20:08:43 +05:30
zjgarvey	d2bc70f188	[TorchToLinalg][ONNX] Add Basic Determinant Support (#3481 ) This adds support for a few ops: - torch.linalg_det - torch._linalg_det (if the LU and pivot returns are unused) - onnx.Det An scf loop is used, since the row reduction algorithm applied here has some loop-carried dependencies. The current support being added here is very basic, and only works if no permutations are required during row reduction, and assumes the matrices are non-singular.	2024-06-25 13:34:19 -05:00
zjgarvey	368fabf0c1	[ONNX] Basic Support for DeformConv (#3469 ) This adds a torchvision op to torch-mlir and a path from onnx.DeformConv to torchvision.deform_conv2d. I'm not implementing the torch->linalg lowering for the torchvision op yet, but posting this PR to get feedback on some of the choices being made here and to flesh out the onnx frontend a bit.	2024-06-25 12:16:51 -05:00
Chi_Liu	fc19709daa	[ONNX] Add averagepool dilations support (#3490 ) - To fix dilations issue: https://github.com/llvm/torch-mlir/issues/3428 - Test by: https://github.com/nod-ai/SHARK-TestSuite/pull/268	2024-06-21 17:24:57 -07:00
Vinayak Dev	39d882f7c9	[torch] Add OnnxToTorch lowering for the Col2Im op (#3424 ) Adds OnnxToTorch lowering for the `onnx.Col2Im` op.	2024-06-13 08:42:06 +00:00
Chi_Liu	ae6f5e8251	[ONNX] Fix AveragePool attributes support (#3235 ) Issues was found here https://github.com/nod-ai/SHARK-Turbine/issues/643 - [ONNX] Fix padding attributes for onnx.AveragePool - [Linalg] Add countIncludePad false support for AtenAvgPool1/2dOp - [Linalg] Add an avg_pool2d countIncludePad False e2e tests - [Linalg] Fix conflict with AtenAvgPool3dOp - [Linalg] Fix e2e crash with AtenAvgPool1dOp - [Linalg] Add dynamic dim support for AtenAvgPool2dOp - [Linalg] Fix AvgPool2dDivisorOverrideModule crash	2024-06-12 12:16:43 -07:00
zjgarvey	de28c8540b	[ONNX] add int16 quantization support (#3446 ) There is currently no int16 quantization support in torch. This patch adds a new mlir type to correspond to the missing "torch.qint16" type, and enables lowering of quantization-related onnx ops using int16 types. In follow-up patches, custom quantization logic for ops like aten.matmul/aten.mm/aten.convolution may need to be revisited to allow support for qint16. The passes in FuseQuantizedOps.cpp may also need slight modifications.	2024-06-12 10:37:22 +05:30
Vivek Khandelwal	5bc626465b	[ONNX] Lower Onnx.Concat lowering version (#3437 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-09 12:07:20 +05:30
Yuanqiang Liu	689efc8917	[Torch] fix toBuiltinTensor() (#3415 ) * Let `toBuiltinTensor()` reflects the original dtype of `!torch.vtensor`. * Backend handles dtype conversion themselves.	2024-06-08 09:36:32 +08:00
Suraj Sudhir	1c2778dd56	[ONNX] Conv op adds support for asymmetric padding. (#3426 ) Supports asymmetric padding by performing a torch.nn.functional.pad on the input before performing the convolution. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2024-06-07 09:54:39 -07:00
Vivek Khandelwal	6382dbbcc0	[ONNX] Add OnnxToTorch lowering for SpaceToDepth op (#3393 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-03 20:29:39 +05:30
Xida Ren (Cedar)	23d2d66a59	Fix error when attempting to read elided onnx constants (#3398 ) Co-authored-by: zjgarvey <zjgarvey@gmail.com>	2024-05-29 16:56:23 -07:00
zjgarvey	27169dcda9	Replace some depreciated uses of cast (#3343 ) Contributing towards #3299	2024-05-23 09:01:47 -07:00
jinchen	4b24909427	Add attributes support for onnx cumsum op (#3241 )	2024-05-11 02:09:01 +08:00
Vinayak Dev	6f911ba3d7	[torch] Add OnnxToTorch lowering for `onnx.HammingWindow` (#3283 ) Adds OnnxToTorch lowering for the `onnx.HammingWindow` op.	2024-05-06 10:21:45 -07:00
Rob Suderman	321b844df7	Revert hyperbolic trigonometric decompositions (#3271 ) We should be using the `torch` path and handling decomposition in the `math` dialect.	2024-05-03 12:06:44 -04:00
Vinayak Dev	67d6a665a4	[torch] Add OnnxToTorch lowering for `onnx.HannWindow` (#3276 ) Adds OnnxToTorch lowering for the `onnx.HannWindow` op. Also factors out common implementation between the window functions.	2024-05-03 12:04:57 -04:00
Vinayak Dev	05f8b69bf6	[MLIR][TORCH] Add OnnxToTorch support for BlackmanWindow function (#3181 ) Implements OnnxToTorch lowering for the BlackmanWindow Function.	2024-04-30 12:21:27 -04:00
jinchen	fbbad2d81e	Fix onnx atanh lowering (#3264 ) iree tests `test_atanh` and `test_atanh_example` passed	2024-04-30 00:50:08 -07:00
jinchen	bf04b53b07	Fix onnx asinh lowering (#3263 ) iree tests `test_asinh` and `test_asinh_example` passed	2024-04-30 00:49:57 -07:00
jinchen	fb499192df	Fix onnx acosh lowering (#3262 ) iree tests `test_acosh` and `test_acosh_example` passed	2024-04-30 00:49:44 -07:00
jinchen	aa471f1d96	Fix onnx cosh lowering (#3254 ) iree tests `test_cosh` and `test_cosh_example` passed	2024-04-30 00:49:29 -07:00
penguin_wwy	6679728c56	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 ) Like #3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated	2024-04-27 14:00:56 -07:00
jinchen	09d42044b4	Support select_last_index attribute of onnx argmin op (#3212 ) The tests listed in https://github.com/nod-ai/SHARK-Turbine/issues/648 all compiled, and the values of results match, but having runtime issue of dtype mismatch of i/si.	2024-04-23 10:43:38 -07:00
jinchen	61e6312c87	Support select_last_index attribute of onnx argmax op (#3192 ) The tests listed in https://github.com/nod-ai/SHARK-Turbine/issues/635 all compiled, but having run issue of dtype mismatch of i/si.	2024-04-23 10:16:08 -07:00
jinchen	ddb29c2c02	[onnx] Add OnnxToTorch support for `onnx.ConvInteger` (#3179 ) All e2e iree tests compiled, but they have the run issue of mismatch of dtype like the following ``` expected: 1x1x2x2xsi32=[[[12 16][24 28]]] actual: 1x1x2x2xi32=[[[12 16][24 28]]] ```	2024-04-23 09:42:02 -07:00
Vivek Khandelwal	3c252cdd44	[onnx] Add `onnx-to-torch` lowering for random ops (#3193 ) This commit adds the OnnxToTorch lowering for Onnx's RandomNormal, RandomNormalLike, RandomUniform, and RandomUniformLike op.	2024-04-22 22:28:07 +05:30
jinchen	83cba8c696	[onnx] Support for `onnx.EyeLike` via torch lowering (#2994 )	2024-04-15 09:23:26 -07:00
jinchen	859f5d280f	Generalize getting index for onnx compress op (#3150 )	2024-04-12 15:18:22 -07:00
penguin_wwy	d4a30b7e67	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3130 ) We should prefer functional style as the method style is deprecated https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated (https://mlir.llvm.org/deprecation/)	2024-04-11 06:47:35 -07:00
Vivek Khandelwal	1d6e4c3d77	[MLIR][TORCH] Add OnnxToTorch lowering for Einsum op (#3117 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-08 22:38:01 +05:30
zjgarvey	532d297c46	[ONNX] Preliminary Work Towards Supporting QuantizedMLP_basic onnx e2e test (#3089 ) See the related issues here: [SHARK-Turbine#556](https://github.com/nod-ai/SHARK-Turbine/issues/556) 1. Adds uint8 casting to onnx.Cast op 2. Fixes an issue with onnx.DequantizeLinear when the scale comes with shape [1]. 3. Adds support for unsigned types in an AtenItemOp folder 4. Adds a simpler quantized model for easier debugging 5. Adds a fusion pass to convert [quant -> dequant -> transpose -> mm] patterns to [transpose -> quant -> mm]. 6. Moved some xfails that are still not passing, but for different reasons than onnx.cast failures.	2024-04-01 16:21:05 -07:00
Vivek Khandelwal	6844c84702	[MLIR][Torch] Fix OnnxToLinalg lowering for AvgPool op (#3076 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-01 22:14:14 +05:30
zjgarvey	6ff71b40c8	[ONNX] onnx.DynamicQuantizeLinear to Torch (#3009 ) This adds support for converting DynamicQuantizeLinear from torch-onnx to torch. I could not get an e2e test to pass, since there seems to be some issues with uint8 casting somewhere lower in the pipeline. For example compiling with IREE for llvm-cpu, I would get either the correct zero point (if zp < 128) or the correct zero-point minus 256 (if zp >= 128). The output tensor seems to always return a tensor of zeros, which also occurs when running uint8 examples through QuantizeLinear. Edit: the first problem can be resolved by casting the output back to uint8 on output, the second problem is resolved with PR #3018	2024-03-20 10:58:25 -07:00
jinchen	9cf6c45a39	Add OnnxToTorch support for Compress op (#3025 )	2024-03-20 17:12:08 +00:00
zjgarvey	7a9608bb69	[ONNX] Reduces onnx.Div sinceVersion to 7 (#3041 ) The only difference between version 7 and newer versions is support for different data types. We should allow this pattern to match as early as 7. Earlier versions have a more manual broadcast specification through attributes, so I did not include those versions. See: [onnx.Div docs](https://onnx.ai/onnx/operators/onnx__Div.html#l-onnx-doc-divl)	2024-03-19 13:35:05 -07:00
Xinan Jiang(姜曦楠)	d8a52e82c2	[onnx] Fix onnx.cast cases between int32 and int64 (#2982 ) 2 modifications: 1. torch.int64 is enum 4 in TORCH_DTYPE_TO_INT 2. add int32 support	2024-03-15 17:14:09 +00:00
aldesilv	6fa21bd8b1	OnnxToTorch lower celu op (#2920 )	2024-03-13 20:34:10 +05:30
Rob Suderman	bd7f1baa42	[onnx] Fix expand operation for dynamic shape max (#3001 ) If the broadcast shape is length-1 at a dim while `?` in the input dim then we need to broadcast to the dynamic dim. This is equivalent to taking a max of two dimensions.	2024-03-08 16:23:07 -08:00

1 2

85 Commits (aca33f1742096e7e6cb3152be15140cf9f71e508)