torch-mlir

Commit Graph

Author	SHA1	Message	Date
jinchen	032a636c35	Fix onnx.If lowering with scalar condition tensor (#3846 ) Fixes https://github.com/nod-ai/SHARK-ModelDev/issues/696#issuecomment-2442016530	2024-10-31 20:34:50 -07:00
Rob Suderman	25738b8c19	[linalg] Broadcast batch for mask on sdpa lowering (#3824 ) Attention often broadcasts a mask across the batch dimension as masking is usually performed the same across attention heads. Added this materialization to the mask dimensions optionally.	2024-10-31 17:59:24 -07:00
Rob Suderman	5aa323dd29	[linalg] Fix torch.aten.add of `torch.bool` (#3820 ) Addition of bools saturate which equates to an `or` operator. Updated to avoid some noticed downstream failures.	2024-10-31 17:37:25 -07:00
yyp0	9ce2a69703	[Torch] support AtenExp2Op (#3832 ) - support AtenExp2Op by decomposing it to aten.pow.scalar - refine stablehlo pow.scalar pow.Tensor_Scalar pow.Tensor_Tensor lowering according to https://github.com/llvm/torch-mlir/pull/2983 - Close https://github.com/llvm/torch-mlir/pull/2983	2024-10-31 19:14:05 +08:00
Justin Ngo	4dd213b042	[TOSA] Expand Torch to TOSA legalization coverage (#3827 ) - Add/Extend Torch to TOSA legalization for the following ops: + Add aten.threshold_backward + Fix aten.threshold + Re-implement aten.broadcast_to using tosa.reshape and tosa.tile + Add support for rank 0 index for aten.index_select + Fix aten.index_put.hacked_twin + Add aten.uniform + Add aten.logical_and - Update xfail_sets.py with new e2e results - Add LIT tests to basic.mlir for newly added ops Change-Id: I8910564a049d18293284fe2e55e82bc1d2cf10e3 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-10-30 16:26:10 -07:00
Sayan Saha	2b01f8b7f3	[Tosa] : Add support for negative indices in index.tensor and index.Tensor_hacked_twin for TorchToTosa lowering. (#3790 ) 1. Negative indices for tensor indexing is handled by wrapping around the index values by checking their values at run time. Without the fix, there was a runtime error. 2. Added a lit test to lock down the behavior. 3. Updated the `xfails_set` for `fx_importer_tosa` config to lockdown the behavior with e2e test as well. "THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY."	2024-10-25 15:37:19 -07:00
Sriram Kumar	d6feb2179c	Added support for Maxpool (Autopad) (#3774 ) Added autopad. and passed 3 tests test_maxpool_2d_precomputed_same_upper test_maxpool_2d_same_lower' test_maxpool_2d_same_upper Address : https://github.com/nod-ai/SHARK-ModelDev/issues/843 2 attributes yet to complete : storage_order, indices output	2024-10-23 13:04:50 +00:00
Felix Schneider	aca33f1742	[TorchToLinalg] Use Op with native channel order for quantized conv2d (#3807 ) I've upstreamed the necessary quantized linalg Op with the "channel-first" ordering used by torch (https://github.com/llvm/llvm-project/pull/107740) for 2d convolution. This patch changes the lowering for the quantized 2d case of `aten.convolution` accordingly, which saves three transpositions per convolution (input, weights, result) and therefore removes the requirement to try to optimize these away in downstream passes.	2024-10-22 20:26:16 +02:00
David Tanner	02327af998	Adds onnx ConvTranspose support for autopadding. (#3797 ) Adds onnx ConvTranspose support for autopadding (https://github.com/nod-ai/SHARK-ModelDev/issues/839). - Adds support for attribute auto_pad="SAME_UPPER" or "SAME_LOWER" which will automatically calculate padding of input based on output shape. - Adds support, during auto-padding, for output_shape=[H,W] which overrides the default output shape of input_shape[i]*stride[i] (for spatial dimensions only). - Adds lit test for auto-padding. - Tests are added by https://github.com/nod-ai/SHARK-TestSuite/pull/370 NOTE: ConvTranspose still doesn't support asymmetric padding, therefore multiple original onnx tests still won't pass.	2024-10-18 12:31:33 -05:00
Justin Ngo	45bb17ebfe	[TOSA] Add legalization for empty, scatter, slice_scatter, diag_embed (#3792 ) - Add Torch to TOSA legalization for the following ops: + aten.empty.memory_format + aten.scatter.src + aten.slice_scatter + aten.diag_embed - Update xfail_sets.py with new e2e results - Update basic.mlir with new LIT tests Change-Id: I817ecf207bcfcf97ca54f30c10c76c4f0f4145ae Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-10-15 08:38:02 -07:00
Hanumanth04	895f490cf5	Remove checking for training specific parameters in EmbeddingBag lowering (#3782 ) Torch-to-linalg pass fails for `EmbeddingBag` when the training only specific properties of the operator are set to `true.` For instance, this operator's `sparse` input/property is training-specific, and if the value of this property is `true,` the existing lowering bails out. However, we don't need to check for training-specific parameters and bailout from the legalization since we don't care about these properties during the eval/inference mode. --------- Co-authored-by: Hanumanth Hanumantharayappa <hhanuman@ah-hhanuman-l.dhcp.mathworks.com>	2024-10-15 09:37:26 -04:00
yyp0	d0041dc310	[stablehlo] support aten.view.dtype lowering (#3778 )	2024-10-10 15:50:17 +08:00
Vivek Khandelwal	94f5410913	[LINALG] Add complex tensor support for `create[Zero\|One]InitTensor` utility (#3777 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-10-09 16:15:08 +05:30
Stephen Baione	d49eabb3fc	Add Op for `torch.aten.unfold` (#3772 ) # Description Implementation of the op for `torch.aten.unfold`: [TorchToLinalg Op Support #347](https://github.com/nod-ai/SHARK-ModelDev/issues/849) Documentation of op can be found here: [PyTorch Docs](https://pytorch.org/docs/stable/generated/torch.Tensor.unfold.html) For this op, we apply a sliding window of some `size` along a single `dimension`, with `step` in between iterations. `Declaration: aten::unfold(Tensor(a) self, int dimension, int size, int step) -> Tensor(a)` The resulting `unfolded` tensor modifies the shape of `dimension` to be equal to the number of blocks that the sliding windows extracts/inserts, with an additional dimension of `size` appended (the number of cols of the output tensor directly translates from the size of the sliding window). So if we had a tensor of rank 3 (A x B x C), with dimension = 1, size = 2 and step = 2: (A x B x C) \|=> (A x (B - size) // step + 1 x C x size) After extracting the window from the input tensor, we insert the (1 x size) slice into the output tensor. We can make this simpler by mapping the output indices from the input indices, like they do in the official implementation: [PyTorch Code](https://github.com/pytorch/pytorch/blob/main/torch/_inductor/lowering.py#L1694)	2024-10-08 21:10:43 +00:00
Phaneesh Barwaria	7830c00ca2	onnx.LSTM - bidirectional, layout attr (#3771 ) - Support Bidirectional LSTM (utilising the forward LSTM layer with flipped Inputs and Outputs) - Support layout 1 - Support default cases for attr `clip` and `input_forget` - Support returning partial outputs (1-3) - fixes for alt_e2e_tests lstm tests (1,2,3)	2024-10-08 11:29:49 -07:00
jinchen	58489faf7f	torch.aten.squeeze.dim lowering with dynamic dims (#3749 ) Address https://github.com/nod-ai/SHARK-ModelDev/issues/846 Assume the dynamic squeezed dim is 1.	2024-10-08 10:37:31 -07:00
Vivek Khandelwal	614fcdd153	[MLIR][TORCH] Add support for 1-d group convolution (#3770 ) This commit adds the support for the 1-d depthwise convolution as a special case of 1-d group convolution. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-10-08 10:48:47 +05:30
Vivek Khandelwal	f6721e5999	[MLIR][TORCH] Add support for negative step in aten.slice.Tensor op (#3763 ) This commit adds the support for negative step values in aten.slice.Tensor op. Although, PyTorch does not allow negative step value for slice op but the Onnx.Slice op supports negative step value which eventually lowers to torch.aten.slice.Tensor op. Hence, the support is added for handling those kind of values during the Torch->Linalg lowering of aten.slice.Tensor op. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-10-08 10:34:27 +05:30
Justin Ngo	b08d08682f	[TOSA] Add legalization for fill, flip, and round (#3768 ) - Add Torch to TOSA lowering for aten.fill.Scalar/Tensor, aten.flip, and aten.round - Fix torchScalarToTosaTensor function to correctly convert Torch scalar input to TOSA tensor - Update xfail_sets.py with new e2e results - Update basic.mlir with LIT tests for new ops Change-Id: If1e42c2e582710dd8ad0465eed29806fbcdbde41 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-10-07 10:28:26 -07:00
Chi_Liu	f4840ed886	[ONNX] Fix onnx.ScatterElements with AtenScatterReduceTwoOp lowering to tm_tensor/linalg_ext dialect (#3754 ) - To fix issue onnx.ScatterElements: https://github.com/nod-ai/SHARK-ModelDev/issues/823 - E2E test: https://github.com/nod-ai/SHARK-TestSuite/pull/363	2024-10-05 22:22:41 -07:00
Rob Suderman	53f7532e76	Revert "[TorchToLinalg] perform rank0 elementwise computations outside linalg generic ops (#3762 )" (#3767 ) Reverted due to downstream model changes. Will reland with fixes post integration. This reverts commit `6e8c7bed4b`.	2024-10-04 14:48:02 -07:00
Justin Ngo	e9ed4af9ce	[TOSA] Add legalization for aten.index_select (#3760 ) - Add Torch to TOSA legalization for aten.index_select - Fix createOneDimTfIndices function in TosaLegalizeCommon.cpp to correctly convert Torch indices to TF-style indices, which is used in convertGatherNdOp - Update e2e tests in xfail_sets.py - Update basic.mlir with new LIT test for aten.index_select Signed-off-by: Justin Ngo <justin.ngo@arm.com> Change-Id: I52519246183949353a3cf22f0a685fe3df8ec8ff Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-10-04 12:24:22 -07:00
Rob Suderman	2374b9e02d	Bump to llvm/llvm-project@e813750354 (#3765 ) Includes stablehlo bump	2024-10-04 12:08:35 -07:00
zjgarvey	6e8c7bed4b	[TorchToLinalg] perform rank0 elementwise computations outside linalg generic ops (#3762 ) This is motivated by the fact that shapes are stored as tensors in ONNX, and IREE tries to perform tensor arithmetic on the device. This causes unnecessary dispatches, and makes it harder for the compiler to reason about shapes. Here is a small snippet of torch-IR that is typical seen coming from ONNX models: ```mlir module { func.func @main_graph(%arg0: !torch.vtensor<[?,?,768],f32>, %arg1: !torch.vtensor<[?,?,768],f32>) -> !torch.vtensor<[],si64> { %int0 = torch.constant.int 0 %0 = torch.vtensor.literal(dense<0> : tensor<1xsi64>) : !torch.vtensor<[1],si64> %1 = torch.aten._shape_as_tensor %arg1 : !torch.vtensor<[?,?,768],f32> -> !torch.vtensor<[3],si64> %2 = torch.aten.index_select %1, %int0, %0 : !torch.vtensor<[3],si64>, !torch.int, !torch.vtensor<[1],si64> -> !torch.vtensor<[1],si64> %3 = torch.aten.squeeze.dim %2, %int0 : !torch.vtensor<[1],si64>, !torch.int -> !torch.vtensor<[],si64> %4 = torch.aten.item %3 : !torch.vtensor<[],si64> -> !torch.int %5 = torch.aten.eq.int %4, %int0 : !torch.int, !torch.int -> !torch.bool %6 = torch.aten.Int.bool %5 : !torch.bool -> !torch.int %7 = torch.aten.size.int %arg0, %int0 : !torch.vtensor<[?,?,768],f32>, !torch.int -> !torch.int %8 = torch.prim.NumToTensor.Scalar %6 : !torch.int -> !torch.vtensor<[],i1> %9 = torch.prim.NumToTensor.Scalar %7 : !torch.int -> !torch.vtensor<[],si64> %10 = torch.prim.NumToTensor.Scalar %4 : !torch.int -> !torch.vtensor<[],si64> %11 = torch.aten.where.self %8, %9, %10 : !torch.vtensor<[],i1>, !torch.vtensor<[],si64>, !torch.vtensor<[],si64> -> !torch.vtensor<[],si64> return %11 : !torch.vtensor<[],si64> } } ``` Without the change in this PR, the result would be: ```mlir #map = affine_map<() -> ()> module { ml_program.global private mutable @global_seed(dense<0> : tensor<i64>) : tensor<i64> func.func @main_graph(%arg0: tensor<?x?x768xf32>, %arg1: tensor<?x?x768xf32>) -> tensor<i64> { %c0_i64 = arith.constant 0 : i64 %c0 = arith.constant 0 : index %dim = tensor.dim %arg1, %c0 : tensor<?x?x768xf32> %0 = arith.index_cast %dim : index to i64 %1 = tensor.empty() : tensor<1xi64> %collapsed = tensor.collapse_shape %1 [] : tensor<1xi64> into tensor<i64> %2 = linalg.fill ins(%0 : i64) outs(%collapsed : tensor<i64>) -> tensor<i64> %extracted = tensor.extract %2[] : tensor<i64> %3 = arith.cmpi eq, %extracted, %c0_i64 : i64 %dim_0 = tensor.dim %arg0, %c0 : tensor<?x?x768xf32> %4 = arith.index_cast %dim_0 : index to i64 %5 = tensor.empty() : tensor<i1> %6 = linalg.fill ins(%3 : i1) outs(%5 : tensor<i1>) -> tensor<i1> %7 = tensor.empty() : tensor<i64> %8 = linalg.fill ins(%4 : i64) outs(%7 : tensor<i64>) -> tensor<i64> %9 = linalg.fill ins(%extracted : i64) outs(%7 : tensor<i64>) -> tensor<i64> %10 = linalg.generic {indexing_maps = [#map, #map, #map, #map], iterator_types = []} ins(%6, %8, %9 : tensor<i1>, tensor<i64>, tensor<i64>) outs(%7 : tensor<i64>) { ^bb0(%in: i1, %in_1: i64, %in_2: i64, %out: i64): %11 = arith.select %in, %in_1, %in_2 : i64 linalg.yield %11 : i64 } -> tensor<i64> return %10 : tensor<i64> } } ``` With the change in this PR, we would instead get: ```mlir module { ml_program.global private mutable @global_seed(dense<0> : tensor<i64>) : tensor<i64> func.func @main_graph(%arg0: tensor<?x?x768xf32>, %arg1: tensor<?x?x768xf32>) -> tensor<i64> { %c0_i64 = arith.constant 0 : i64 %c0 = arith.constant 0 : index %dim = tensor.dim %arg1, %c0 : tensor<?x?x768xf32> %0 = arith.index_cast %dim : index to i64 %1 = tensor.empty() : tensor<1xi64> %collapsed = tensor.collapse_shape %1 [] : tensor<1xi64> into tensor<i64> %2 = linalg.fill ins(%0 : i64) outs(%collapsed : tensor<i64>) -> tensor<i64> %extracted = tensor.extract %2[] : tensor<i64> %3 = arith.cmpi eq, %extracted, %c0_i64 : i64 %dim_0 = tensor.dim %arg0, %c0 : tensor<?x?x768xf32> %4 = arith.index_cast %dim_0 : index to i64 %5 = arith.select %3, %4, %extracted : i64 %6 = tensor.empty() : tensor<i64> %7 = linalg.fill ins(%5 : i64) outs(%6 : tensor<i64>) -> tensor<i64> return %7 : tensor<i64> } } ``` Some related issues for context: 1. <https://github.com/iree-org/iree/issues/18677> 2. <https://github.com/iree-org/iree/issues/18631>	2024-10-04 11:27:00 -05:00
zjgarvey	f08bfc4ff8	[ONNX] simplify shapes fed to broadcast in Expand lowering (#3756 ) Addresses ~200 onnx model compile failures in <https://github.com/nod-ai/SHARK-TestSuite> related to <https://github.com/iree-org/iree/issues/18631>. This change simplifies the result of the generated broadcast op substantially, but reduces the case coverage slightly. The case which will become unsupported: - trying to actually broadcast a dynamic dim that is secretly 1. When does this case appear in practical scenarios? - for a model where onnx shape inference cannot figure out that a dim should be 1. Why do I think we should not support this case for now? 1. For all models with dynamic dim expand ops, the previous path uniformly generates uglier linalg IR (making it harder for IREE to fuse properly with other ops). 2. For models failing shape inference castastrophically enough to fail to see a dim is statically 1, we can try to apply constant folding in the onnx model before importing. Leaving this as a draft PR, since it may be more appropriate to fix the compilation failure in IREE rather than torch-mlir. ### Example of broadcast required in previous path: ```mlir %300 = linalg.generic {indexing_maps = [#map11], iterator_types = ["parallel", "parallel", "parallel", "parallel"]} outs(%299 : tensor<?x12x?x?xi1>) { ^bb0(%out: i1): %306 = linalg.index 0 : index %307 = linalg.index 3 : index %308 = arith.index_cast %285 : i64 to index %309 = arith.cmpi eq, %308, %c1 : index %310 = arith.select %309, %c0, %306 : index %311 = arith.index_cast %286 : i64 to index %312 = arith.cmpi eq, %311, %c1 : index %313 = arith.select %312, %c0, %307 : index %extracted_79 = tensor.extract %reshape_78[%310, %c0, %c0, %313] : tensor<?x1x1x?xi1> linalg.yield %extracted_79 : i1 } -> tensor<?x12x?x?xi1> ``` ### Example of broadcast with simplified shape list: ```mlir %409 = linalg.generic {indexing_maps = [#map15, #map11], iterator_types = ["parallel", "parallel", "parallel", "parallel"]} ins(%reshape_135 : tensor<?x1x1x?xi1>) outs(%408 : tensor<?x12x?x?xi1>) { ^bb0(%in: i1, %out: i1): linalg.yield %in : i1 } -> tensor<?x12x?x?xi1> ```	2024-10-03 20:11:51 -05:00
Rob Suderman	9ab0db5789	[torch] `torch.aten.complex` operation with lowering (#3738 ) Add the operation with lowering to linalg. Includes a test for end-to-end correctness.	2024-10-03 11:09:52 -07:00
Kyle Wang	f0b7ca72f5	Fixed GRU quality issues exposed by e2e tests (#3753 ) Issue: https://github.com/nod-ai/SHARK-ModelDev/issues/856 Related tests: ![Screenshot 2024-10-01 175305](https://github.com/user-attachments/assets/0dc0901b-058f-427c-a596-9e806fd38836)	2024-10-02 17:00:19 -04:00
Samu Tamminen	a2bfe47faa	[onnx] Add IDF and TFIDF modes to TFIDF Vectorizer (#3726 ) Address https://github.com/nod-ai/SHARK-Turbine/issues/833	2024-10-02 08:17:58 -05:00
Justin Ngo	5eab669c4a	[TOSA] Add legalization for aten.diagonal (#3740 ) - Add lowering from Torch to TOSA for aten.diagonal - Clean up some code - Update xfail_sets.py with the new e2e results - Update basic_mlir with the new op mlir test Signed-off-by: Justin Ngo <justin.ngo@arm.com> Change-Id: I99bed685455752d09ed96edd837c4dfbee152701 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-09-30 08:24:31 -07:00
Yuanqiang Liu	5f74de5ba0	[Stablehlo] support aten.all.dim (#3746 )	2024-09-30 15:59:27 +08:00
jinchen	a33d1232c5	[onnx] Fix onnx.Shape lowering with scalar input (#3716 ) Address https://github.com/nod-ai/SHARK-Turbine/issues/826	2024-09-27 13:30:02 -07:00
Xida Ren (Cedar)	9938abf25e	AtenCumprodOp (#3737 )	2024-09-26 18:17:22 -04:00
yyp0	335cf5f6d0	[stablehlo] support aten_adaptive_max_pool1d lowering (#3728 )	2024-09-26 11:42:38 +08:00
giacs-epic	99848265c3	[onnx] Relax constraints on input tensors in `onnx.STFT` conversion to torch dialect (#3676 ) - When the signal tensor is real, onnx allows its shape to be `[batch][length]` as well as `[batch][length][1]`. - Onnx also allows to specify `frame_length` together with `window` (not empty), given that it matches the window size. - Adding checks on signal and result shapes.	2024-09-23 12:09:29 +05:30
Justin Ngo	3f79a2982a	[TOSA] Extend Torch to TOSA legalization coverage (#3718 ) - Add Torch to TOSA legalization for the following ops: + aten.logical_not + aten.logical_xor + aten.cos + aten.sin + aten.pow.Scalar + aten.pow.Tensor_Tensor + aten.erf + aten.bitwise_and.Scalar + aten.bitwise_left_shift.Tensor + aten.bitwise_right_shift.Tensor + aten.le.Tensor + aten.le.Scalar - Update e2e tests in xfail_sets - Update basic.mlir with newly legalized ops Signed-off-by: Justin Ngo <justin.ngo@arm.com> Change-Id: I4aa5790073ef2e5ec0e9b374da42887242f8dabc Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-09-20 14:33:55 -07:00
Justin Ngo	abaff58c6d	[TOSA] Add div rounding mode, remainder, fmod, and ge.Tensor ops support (#3717 ) - Add legalization for aten.div rounding mode: + trunc: rounds division results towards zero + floor: rounds division results down - Add legalization for aten.remainder.Scalar and aten.fmod ops - Add legalization for aten.ge.Tensor op - Update e2e tests in xfail_sets.py - Update basic.mlir with new legalized ops Signed-off-by: Justin Ngo <justin.ngo@arm.com> Change-Id: Icedd23205254fb893ce6f3de08956772b83b4320 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-09-20 13:34:09 -07:00
Rob Suderman	5ce48dfacd	[torch] Fix attention on linalg for dynamic shapes (#3714 ) Current version does not work for a mixture of dynamic and static shaped batch dimensions. Rework to grab the correct dynamic shapes. --------- Co-authored-by: dan <danimal197@gmail.com>	2024-09-18 14:52:54 -05:00
zjgarvey	d2c387dd04	[ONNX] Fix issue with absent value in onnx.ConstantOfShape (#3713 ) Previously, if the value was absent, this conversion was creating a dense resource of value 0 with shape equal to the result shape, then later re-extracting a splat value. This only works if the shape is statically known, and even when the shape is known, this is completely unnecessary since the value's shape should be `[1]` and not the result shape. This patch simply sets the `splatvalue` to a `torch.constant.float 0.0` when the onnx op's `value` attr is absent, and adds `nullptr` checks to the subsequent conditionals to avoid them in the case where an `attr` is not given. Addresses <https://github.com/nod-ai/SHARK-Turbine/issues/831>.	2024-09-17 16:01:01 -05:00
justin-ngo-arm	14ef05a292	[TOSA] Extend Torch to TOSA reduction ops legalization (#3710 ) - Add Torch to TOSA legalization for the following reduction ops: + aten.min.dim + aten.min + aten.max + aten.prod + aten.prod.dim_int + aten.all.dim - Add dtype casting support for reduce sum and prod ops - Extend aten.max.dim legalization to a template to support aten.min.dim legalization - Update end-to-end tests sets in xfail_sets.py Signed-off-by: Justin Ngo <justin.ngo@arm.com> Change-Id: I854dd6c0c55e570c1fb7242f20c85cf64d6e7fe0 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-09-16 12:40:24 -07:00
Srinath Avadhanula	bc70c50373	Delete unnecessary linalg conversion for aten.fmod (#3707 ) Follow up cleanup for [this PR](https://github.com/llvm/torch-mlir/pull/3689), which introduced a decomposition for `aten.fmod.Tensor`. This means that the lowering for this operator in linalg is no longer needed. Thanks to @vivekkhandelwal1 for pointing this out. --------- Co-authored-by: Srinath Avadhanula <srinath.avadhanula@getcruise.com>	2024-09-13 09:39:58 -07:00
Yuanqiang Liu	7b94ced39a	[Stablehlo] fix aten compare ops' promote rules (#3709 ) previous PR(https://github.com/llvm/torch-mlir/pull/3702)	2024-09-13 18:48:41 +08:00
giacs-epic	b35675a78e	[onnx] Add support for `auto_pad` in `onnx.Conv` (#3670 ) Add logic for `auto_pad` attribute in the conversion of `onnx.Conv` torch dialect. Add lit tests covering different configurations of `auto_pad`.	2024-09-10 20:31:53 +05:30
rohan-tan-bhowmik	e86f56bc76	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 ) Enabled mask and is_causal parameters for torch.aten.scaled_dot_product attention + relevant comments + tests. The tests added highlight the new capabilities introduced in this PR, including: Attention with F16 mask Attention with Boolean mask Causal attention with same Q K V shapes Causal attention without Q K V shapes Made sure that one cannot input both mask and is_causal.	2024-09-09 15:51:41 -07:00
Felix Schneider	df6098e43d	[TorchToLinalg] Use `linalg.transpose` instead of `generic` when lowering `aten.T` (#3660 ) The lowering pattern for `aten.T` uses transposition implemented via `linalg.generic`. For downstream passes it is advantageous to use named ops wherever possible, so this patch changes the lowering to use `linalg.transpose` instead.	2024-09-07 08:09:10 +02:00
justin-ngo-arm	d4b5e05ac1	[TOSA] Add Torch to Tosa Legalization for torch.tril (#3678 ) Change-Id: Ie5ba31a27394c3adcea00266a9d562862dbd8b08 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-09-05 11:27:29 -07:00
Ze Zhang	b3942ff984	Add canonicalize pattern for aten.mul.int and aten.floordiv.int (#3680 ) This PR add `floordiv` to the `PY_BUILTIN_TO_TORCH_OP`. For `aten.mul.int` and `aten.floordiv.int` ops, we add new Canonicalization Patterns as follow: ``` %1 = torch.aten.mul.int %input, %const-5 %2 = torch.aten.mul.int %1, %const-6 ``` Will be replaced by `torch.aten.mul.int %input, %const-30` And ``` %1 = torch.aten.mul.int %input, %const-5 %2 = torch.aten.floordiv.int %1, %const-5 ``` Will directly return `%input` This PR also relaxes the `float` type constraint in TorchToTosa for the `AtenRsubScalarOp` conversion. To test: `cmake --build build --target check-torch-mlir-all`	2024-09-03 09:13:59 -07:00
Vivek Khandelwal	567ed44fd0	[MLIR][TORCH] Add E2E support for aten.polar op (#3671 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-09-03 10:51:03 +05:30
jinchen	fd759e4b1f	Fix onnx.Gather lowering with dynamic shapes (#3675 ) Supports the result with dynamic shape and scalar indices like ``` func.func @test_gather_scalar(%arg0: !torch.vtensor<[3,4,5],f32>, %arg1: !torch.vtensor<[], si64>) -> !torch.vtensor<[?,?],f32> attributes {torch.onnx_meta.opset_version = 13 : si64} { %0 = torch.operator "onnx.Gather"(%arg0, %arg1) {torch.onnx.axis = 0 : si64} : (!torch.vtensor<[3,4,5],f32>, !torch.vtensor<[], si64>) -> !torch.vtensor<[?,?],f32> return %0 : !torch.vtensor<[?,?],f32> } ``` `Torch::AtenSqueezeOp` is referring to the result shape, so it will failed on lowering if the result shape is dynamic.	2024-08-29 17:02:16 -07:00
lingzhiz1998	5bc59ce1fa	[TorchToLinalg] Support lowering MaxPool3dWithIndices (#3652 ) Support torch.MaxPool3dWithIndices lowering to linalg backend.	2024-08-27 14:14:25 -05:00
Felix Schneider	638ef14512	[TorchToLinalg] Use `linalg.broadcast` instead of `generic` for conv bias (#3661 ) The current implementation uses a `linalg.generic` to broadcast the bias tensor for the lowering of convolutions. This is suboptimal for later pattern matching. This patch changes it to use the respective named op, `linalg.broadcast`, instead.	2024-08-26 20:29:11 +02:00

1 2 3 4 5 ...

1035 Commits (032a636c359456c80e5912eb53e1a2fe4d34f664)