torch-mlir

Commit Graph

Author	SHA1	Message	Date
Xinyu Yang	ac85338491	[Stablehlo] Support AtenPowScalarOp, AtenTanOp, AtenAsinhOp, AtenAcoshOp, AtenAtanhOp, Atan2Op (#3233 )	2024-04-26 15:47:44 +08:00
Yuanqiang Liu	634a796933	[Torch] fold aten.log (#3223 )	2024-04-26 10:10:02 +08:00
penguin_wwy	122eb69a98	[stablehlo] add aten left/right shift op conversion support (#3234 )	2024-04-26 09:20:49 +08:00
Xinyu Yang	7030eacb76	[stablehlo] Support aten.any and aten.all lowering (#3217 )	2024-04-25 11:15:52 +08:00
Yuanqiang Liu	fab2696489	[Torch] support aten.trunc (#3219 ) decompose `trunc(x)` to `sign(x) * floor(abs(x))`	2024-04-24 14:32:33 +08:00
Xinyu Yang	e18bf42d0e	[stablehlo] Support ConstantPadNdOp in stablehlo (#3211 ) as title	2024-04-24 14:15:11 +08:00
Yuanqiang Liu	8a1dbbd597	[torchscript] export extra library file name to user (#3203 ) * so that it could be specified by user.	2024-04-24 11:34:02 +08:00
Xinyu Yang	42b9eccdb3	[Stablehlo] Fix AtenSumDimIntListOp when dim==None (#3216 ) as titile	2024-04-24 11:25:46 +08:00
Xinyu Yang	4da3d714cc	[Torch] Support AtenProdOp on linalg and stablehlo (#3215 )	2024-04-24 11:14:04 +08:00
zjgarvey	a8ba865fca	[torch] Adds Quantization Support for `aten.relu` (#3177 ) A choice was made to quantize the return type of Relu with a scale and zero point copied from the input's quantization scheme. With this choice, the torch-to-linalg conversion of quantized Relu essentially computes max(input, zeroPoint) in the elementwise payload.	2024-04-23 11:01:36 -07:00
Yuanqiang Liu	db3842f2e8	[Stablehlo] support lowering sinh & cosh to stablehlo (#3213 )	2024-04-23 19:54:58 +08:00
Xinyu Yang	c1967b607f	[Stablehlo] add AtenLog10Op, AtenLog2Op lowering to stablehlo (#3208 )	2024-04-23 19:06:55 +08:00
Yuanqiang Liu	1f8123b5f0	[Stablehlo] support unary ops which promote to floating point (#3209 ) * promote input to output element-type when lowering to stablehlo, so that it could satisfy stablehlo's type constraints. * split promote-to-fp unary ops from fp-only unary ops.	2024-04-23 17:57:12 +08:00
Yuanqiang Liu	797e4cd395	[Stablehlo] lowering asin, acos, atan (#3207 ) * lowering asin, acos and atan to chlo ops.	2024-04-23 16:24:53 +08:00
Vinayak Dev	cff2f084d4	[torch] Add OnnxToTorch lowering for `onnx.ReduceL2` (#3175 ) Adds OnnxToTorch lowering for the ReduceL2 op.	2024-04-23 02:03:05 -04:00
Vivek Khandelwal	3c252cdd44	[onnx] Add `onnx-to-torch` lowering for random ops (#3193 ) This commit adds the OnnxToTorch lowering for Onnx's RandomNormal, RandomNormalLike, RandomUniform, and RandomUniformLike op.	2024-04-22 22:28:07 +05:30
Vivek Khandelwal	6abc7371c8	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for Squeeze and Unsqueeze op (#2991 ) This commit also cleans up the OnnxToTorch lowering for the Squeeze and Unsqueeze op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-22 08:52:42 +00:00
penguin_wwy	e5bdd71baf	[Torch] Emit and decompose prims.iota op (#3132 )	2024-04-21 19:45:01 -07:00
penguin_wwy	a60e84e5ee	[stablehlo] add aten.expm1 op conversion support (#3199 )	2024-04-21 19:20:49 -07:00
Rob Suderman	8222637159	[onnx] Extend op version number of `onnx.ScatterElements` (#3195 ) Version number was set too high. Lowered to support more cases allows more tests to pass. Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:32:18 -04:00
Rob Suderman	733cace1df	[onnx] Fix `onnx.split` by directly handling slicing (#3194 ) Previous implementation erroneously mixed up num_outputs with slice_size. New version correctly computs the slice size and directly performs slicing rather than leveraging `aten.split.tensor`. This is due to `onnx` supporting a fixed number of splits making the size computation more easily computeable when lowering to `aten` rather than deferring to `aten.split.tensor`. --------- Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:31:56 -04:00
penguin_wwy	b6b01602d3	[stablehlo] add aten.fmod.Tensor op conversion support (#3198 )	2024-04-21 08:39:36 +08:00
penguin_wwy	ea0ecb67be	[stablehlo] add aten.remainder.Tensor op conversion support (#3197 )	2024-04-21 00:03:37 +08:00
Rob Suderman	b01245c0e8	[onnx] Fix `onnx.Not` for non-bool inputs (#3187 ) Need to perform a bool cast to support `onnx.Not` on non-bool inputs.	2024-04-19 11:32:24 -07:00
Xinyu Yang	790a697245	[Torch] Add folder for AtenIntOp, AtenFloatOp (#3189 ) See unit test below: ``` // CHECK-LABEL: func.func @torch.aten.tensor.float( // CHECK-NEXT: torch.vtensor.literal(dense<1.000000e+01> : tensor<f32>) : !torch.vtensor<[],f32> func.func @torch.aten.tensor.float() -> !torch.vtensor<[],f32> { %none = torch.constant.none %false = torch.constant.bool false %float1.000000e01 = torch.constant.float 1.000000e+01 %67 = torch.aten.tensor.float %float1.000000e01, %none, %none, %false : !torch.float, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],f32> return %67 : !torch.vtensor<[],f32> } // CHECK-LABEL: func.func @torch.aten.tensor.int( // CHECK-NEXT: torch.vtensor.literal(dense<45> : tensor<si32>) : !torch.vtensor<[],si32> func.func @torch.aten.tensor.int() -> !torch.vtensor<[],si32> { %none = torch.constant.none %false = torch.constant.bool false %int45 = torch.constant.int 45 %67 = torch.aten.tensor.int %int45, %none, %none, %false : !torch.int, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],si32> return %67 : !torch.vtensor<[],si32> } ```	2024-04-19 22:17:06 +08:00
penguin_wwy	5a98c72c7f	[StableHLO] Fix aten.clamp.Tensor in FxImporter2StableHLO (#3190 ) The FX importer will pass static shapes to the Torch dialect, so it needs to generate a StableHLO that satisfies shape inference.	2024-04-19 17:08:29 +08:00
penguin_wwy	0a6073414d	[FxImporter] Add fx importer to stablehlo e2e test config (#3183 )	2024-04-18 21:29:17 -07:00
penguin_wwy	6c4f7deebb	[stablehlo] add aten.clamp.Tensor op conversion support (#3185 )	2024-04-19 10:55:27 +08:00
Rob Suderman	be742a937d	[onnx] Update the failure triage for onnx (#3186 ) Reclassifying what the source of failures are for various bugs so we can reprioritize what failures are common.	2024-04-18 14:58:13 -07:00
Rob Suderman	0e77de996a	[torch] Add support for `torch.view` with dynamic shapes (#3164 ) We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.	2024-04-18 11:47:19 -07:00
Rob Suderman	4c21e20caa	[torch] Support rank-0 index for torch index select (#3182 ) Need to perform an expand in the case where the indices is rank-0.	2024-04-18 11:32:31 -07:00
Xinyu Yang	d4313eed4a	[Torch] Add decomposition of RepeatInterleaveSelfInt Op (#3075 ) Decomposition RepeatInterleaveSelfInt with following ops: ```python def my_repeat_interleave(input, repeats, dim=None): if dim is None: # Flatten the input and then repeat return input.flatten().unsqueeze(-1).tile((1, repeats)).flatten() else: # Calculate the shape after repeat expanded_shape = list(input.shape) expanded_shape[dim] = repeats # Repeat the tensor along the specified dimension repeat_shape = [1] (input.dim() + 1) repeat_shape[dim + 1] = repeats input = input.unsqueeze(-1) # Tile and then reshape tiled = torch.tile(input, repeat_shape) # Rearrange and reshape repeated = tiled.reshape(expanded_shape) return repeated ``` I passed the tests of stablehlo and linalg. When testing onnx, strange things happened. In torch-mlir's CI torch_nightly* and my own environment(torch==2.4.0.dev20240318+cpu), it can pass the pass. In torch-mlir's CI torch_stable, it failed. The test case is `RepeatInterleaveSelfIntNoDimModule_basic`, the result shape should be [120]. ```python class RepeatInterleaveSelfIntNoDimModule(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([3, 4, 5], torch.float32, True), ]) def forward(self, x): return x.repeat_interleave(2) @register_test_case(module_factory=lambda: RepeatInterleaveSelfIntNoDimModule()) def RepeatInterleaveSelfIntNoDimModule_basic(module, tu: TestUtils): module.forward(tu.rand(3, 4, 5)) ``` The error log is as follows: ``` Unexpected outcome summary: (onnx) ****** Failed tests - 1 tests FAIL - "RepeatInterleaveSelfIntNoDimModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([6, 4, 5])) is not equal to golden shape (torch.Size([120])) ``` @rsuderman Would you please help me check what's wrong with my PR? Thanks a lot.	2024-04-18 06:27:51 +08:00
Andreas Falkenberg	b66eabd492	[onnx][torch][linalg] Implementing align-corner modes for gridsampler (#3171 ) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-04-17 13:38:19 -07:00
penguin_wwy	3aa81f78d8	[FxImporter] Replace local_scalar_dense in fx_importer (#3180 )	2024-04-17 22:45:47 +08:00
Xinyu Yang	d2ba956e69	[Torch] Support Aten_CastLongOp. (#3160 ) By canonicalize Aten_CastLongOp into AtenToDtypeOp	2024-04-17 21:58:32 +08:00
penguin_wwy	e4b11a0ab4	[FxImporter] Fix fx importer test config and clean xfail set (#3176 )	2024-04-16 22:36:07 -07:00
penguin_wwy	398aeeec87	[FxImporter] Fix kwarg operands in fx importer (#3166 ) Remove the `kwarg_only` limitation, for example ``` torch.add(x, 3.0, alpha=2) ``` compiled to ``` %0 = torch.aten.add.Scalar %arg0, %float3.000000e00, %int1 ``` fix to ``` %0 = torch.aten.add.Scalar %arg0, %float3.000000e00, %int2 ```	2024-04-16 13:17:05 -07:00
zjgarvey	7a1ad0d7c0	[TorchToLinalg] Adds Support for Remaining Quantized Matmul Cases (#3167 ) The new cases added for quantized matmuls are: 1. vec-vec 2. vec-mat 3. mat-vec each of which are now lowered to expand(s), quantized_matmul, and collapse.	2024-04-16 09:28:28 -07:00
Vinayak Dev	a0232e9ebd	[MLIR][TORCH] Add OnnxToTorch lowering for ReduceL1 Op (#3146 ) Adds OnnxToTorch Lowering for the ReduceL1 op.	2024-04-16 12:24:46 +05:30
penguin_wwy	10b6062d41	[CI] Enable the tests for fx_importer in the CI (#3168 ) Replace the torchdynamo e2e with the fx_importer e2e	2024-04-15 21:20:23 -07:00
Xinyu Yang	ae4724763a	[Stablehlo] Enhance broadcast pattern in matmul Ops (#3161 ) To pass test "MatmulStaticBroadcast_basic" in stablehlo: ```python class MatmulStaticBroadcast(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([4, 1, 6, 7], torch.float32, True), ([8, 1, 5, 7, 6], torch.float32, True), ]) def forward(self, lhs, rhs): return torch.matmul(lhs, rhs) @register_test_case(module_factory=lambda: MatmulStaticBroadcast()) def MatmulStaticBroadcast_basic(module, tu: TestUtils): module.forward(tu.rand(4, 1, 6, 7), tu.rand(8, 1, 5, 7, 6)) ```	2024-04-16 10:10:36 +08:00
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
IanWood1	5708ee7ec9	Added 2 Ops: Floor divide scalar and Floor divide scalar mode (#3156 ) - Added linalg lowering for `AtenFloorDivideScalarOp` - Needed `AtenDivScalarModeOp` for the decomp. - Added linalg lowering for `AtenDivScalarModeOp` - Moved linalg payload logic to `createDivModePayload()` since the logic was nearly identical for both `AtenDivScalarModeOp` and `AtenDivTensorModeOp`. Just a template function - Added `AtenDivScalarModeOp` lowering for stablehlo Pytorch's [`torch.floor_divide()`](https://pytorch.org/docs/stable/generated/torch.floor_divide.html) in a previous version (for a reason unknown to me) preformed a truncation instead of "floor". The already implemented op `AtenFloorDivideTensorOp` was done before this change. However, this wasn't caught because our testcases only tested positive floor division. I changed this to floor as well as adding a few test cases.	2024-04-15 13:45:10 -07:00
penguin_wwy	45eaeaaf36	[FxImporter] Add FxImporter config in e2e-test (#3151 )	2024-04-12 16:07:56 -07:00
Aart Bik	307f49f566	[torch-mlir][sparse] support sparse tensor output (#3152 ) Sparse inputs and outputs are now fully supported! They always consist of their constituents buffers, passed as numpy arrays. Sparse on!	2024-04-12 09:56:32 -07:00
Xinan Jiang(姜曦楠)	71d90788d3	[MLIR][TORCH] Support parallel dimemsions expand/collapse (#3051 ) This PR support `aten.view` with unique unknown dimension both in input shape and output shape while the pass convert-torch-to-linalg that lowing `aten.view` to `tensor.collapse_shape` or `tensor.expand_shape`. Below is an example ``` func.func @test_reshape(%arg0: !torch.vtensor<[1,?,50,16],f32>) -> !torch.vtensor<[1,?,16],f32> attributes {torch.assume_strict_symbolic_shapes, torch.onnx_meta.ir_version = 9 : si64, torch.onnx_meta.opset_version = 19 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} { %int1 = torch.constant.int 1 %int-1 = torch.constant.int -1 %int16 = torch.constant.int 16 %0 = torch.prim.ListConstruct %int1, %int-1, %int16 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int> %1 = torch.aten.view %arg0, %0 : !torch.vtensor<[1,?,50,16],f32>, !torch.list<int> -> !torch.vtensor<[1,?,16],f32> return %1 : !torch.vtensor<[1,?,16],f32> } ```	2024-04-11 10:43:03 -07:00
Rob Suderman	a1fe307a76	[torch] Support implicit batch for index_put (#3128 ) If there is only a single value scattered there can be an implicit batch dimension. This includes a check for the implicit batch dimension when reshaping the update tensor. It includes an e2e test to verify correctness.	2024-04-11 10:18:03 -07:00
Xinyu Yang	308c45e61a	[Torch] Fix PrimListUnpackOp::getCanonicalizationPatterns (#3140 ) Fix the case PrimListUnpackOp's result num is not equal to PrimList length. See the following example: ```python def forward(self, x): if len(x.shape) == 5: b0, t, c0, h0, w0 = x.shape b, c, h, w = torch.mul(b0, t), c0, h0, w0 else: b1, c1, h1, w1 = x.shape b, c, h, w = b1, c1, h1, w1 res = torch.reshape(x, [b, c, h, w]) return res ``` Without this fix, the following error message will occur： ``` /root/torch-mlir/externals/llvm-project/mlir/lib/IR/PatternMatch.cpp:118: virtual void mlir::RewriterBase::replaceOp(mlir::Operation *, mlir::ValueRange): Assertion `op->getNumResults() == newValues.size() && "incorrect # of replacement values"' failed. ```	2024-04-11 19:48:49 +08:00
Xinyu Yang	6524838bcb	[Torch] Add general AdaptiveAvgPool2dOp decompose support (#3111 ) Previously, it could only handle the situations where outputsize == (1, 1) or outputsize == (input_H, input_W). Now it supports all situations where input_H % output_H== 0 && input_W % output_W == 0	2024-04-11 17:02:59 +08:00
Yuanqiang Liu	88533b1968	[Stablehlo] fix aten.arange's lowering to stablehlo (#3138 ) * promote to f64 to do division, avoid division on i64 (floor div) * refactor torch-to-stablehlo-pipeline	2024-04-11 15:55:56 +08:00
zjgarvey	aa5e150313	Adds Some uint8 Quantization Fixes (#3122 ) 1. Changes the linalg lowering for dequantization ops to always sign cast to float to prevent misrepresenting uint32 overflow on subtraction with zero point. 2. Adds a basic quantized model test which only quantizes and dequantizes and now passes with these changes in linalg and onnx configs. 3. Changes the aten.mm lowering to allow mismatched quantized types. 4. If a quantized matmul arg is uint8, we shift by 128 to faithfully represent the quantization as a signed i8 quantization. This worked fine in the AtenMmOp lowering, but I'd be happy to move it to a rewrite in FuseQuantizedOps.cpp instead if that seems more appropriate. With the changes 3 and 4, the QuantizedMLP_basic and QuantizedSingleLayer_basic e2e tests now passes with the onnx config.	2024-04-10 12:36:58 -07:00
Xinyu Yang	5eb0cf9104	[Torch] Add decompose of AtenToPrimDeviceOp (#3131 ) As device information isn't relevant to torch-mlir	2024-04-10 22:26:48 +08:00
Jae Hoon (Antonio) Kim	8951a8cc23	Replace c10::optional with std::optional (#3126 ) They replaced all `c10::optional` usages with `std::optional` in torchgen'd code in `fb90b4d4b2` causing the LTC build to break. Replacing all usages of `c10::optional` with `std::optional` in `projects/ltc` has fixed the issue Issue: #3120	2024-04-09 18:38:33 +00:00
IanWood1	8ff28527cb	Add more descriptive error message to torch_ods_gen.py. (#3108 ) Added error message when adding new torch op to [torch_ods_gen.py](https://github.com/llvm/torch-mlir/compare/main...IanWood1:torch-mlir:ods_gen_error_message?expand=1#diff-889b60b904ed67a5065a14e8de6fc89e00e199577e4d2bfa134ac4d1c89832d2). New message displays which op key is failing and possible matches in the torch `Registry`. ```Op does not match any Torch ops in Registry Given op: "aten::hardtanh_wrong : (Tensor, Scalar) -> (Tensor)" Possible matches: "aten::hardshrink : (Tensor, Scalar) -> (Tensor)" "aten::hardtanh_ : (Tensor, Scalar, Scalar) -> (Tensor)" "aten::hardtanh : (Tensor, Scalar, Scalar) -> (Tensor)" "aten::clamp_min : (Tensor, Scalar) -> (Tensor)" "aten::linalg_cond : (Tensor, Scalar?) -> (Tensor)"``` Also, ran black formatting on file. Based on LLVM style guides this seems to be correct, but I can revert the formatting if needed.	2024-04-09 09:50:34 -07:00
Xinyu Yang	42a16fa912	[Torch] Support Aten_CastFloatOp. (#3115 ) By canonicalize Aten_CastFloatOp into AtenToDtypeOp	2024-04-09 11:06:53 +08:00
Xida Ren (Cedar)	dd967eb199	[ONNX] Support onnx.LSTM (#2969 ) This PR only performs a lit test. In lieu of an e2e test, https://github.com/nod-ai/SHARK-TestSuite/pull/142 makede sure that the lowering works & the numbers check out. Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-04-08 12:23:33 -07:00
Vivek Khandelwal	1d6e4c3d77	[MLIR][TORCH] Add OnnxToTorch lowering for Einsum op (#3117 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-08 22:38:01 +05:30
Xinyu Yang	84c24e5771	[Torch] Support Aten__And__ScalarOp (#3114 )	2024-04-08 20:24:17 +08:00
Yuanqiang Liu	2c56ef9252	[Torch Dialect] canonicalize aten.sign to aten.sgn (#3112 ) * `aten.sign` is a sub-set of `aten.sgn` (`aten.sgn` support complex type).	2024-04-08 20:05:42 +08:00
Yuanqiang Liu	498ab997cd	[Stablehlo] lowering aten.log1p to stablehlo.log_plus_one (#3110 )	2024-04-07 17:01:58 +08:00
Yuanqiang Liu	0a00f38a7e	[Stablehlo] add stablehlo-aggressive-simplification in e2e test (#3109 ) * so that more stablehlo e2e testcases would pass.	2024-04-07 10:48:11 +08:00
Vivek Khandelwal	af54d27820	[MLIR][TORCH] Fix Onnx.TopK lowering (#3103 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 22:12:48 +05:30
Vivek Khandelwal	ce7d4f1660	[MLIR][TORCH] Fix Onnx.ReduceSum lowering for failing e2e tests (#3095 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-03 09:57:19 +05:30
Rob Suderman	f97cd4893f	[torch] Improve shape inference for dynamic shapes (#3091 ) Shapes can be processed as tensors to represent the set of dimensions. As reshapes take a list of scalars this can result in a single dynamic dimension blocking the adjacent static dimensions. This pass attempts to de-couple tensor computations related to shapes and propagate values to better support lowering scalar tensor computations.	2024-04-02 16:19:57 -07:00
Yuanqiang Liu	6cbb2f7ae0	[Stablehlo] add stablehlo-canonicalize-dynamism when lowering (#3097 ) so that many stablehlo e2e testcases could pass	2024-04-02 22:47:24 +08:00
Xinyu Yang	ac1cd3d78a	[Torch] Support AtenDivTensorModeOp with static int input for linalg and stablehlo backend (#3088 )	2024-04-02 17:28:53 +08:00
ptrifunovic98	1c8c47d483	Add complex support for aten.norm and similar operations (#3052 ) Add support for complex-type input tensors for norm, vector norm, and Frobenius norm operations.	2024-04-02 14:03:30 +05:30
Rob Suderman	0f5d5e9f4e	[stablehlo] Fix test stablehlo e2e test suite (#3093 ) There is an issue with stablehlo's linalg compilation. Canonicalization appears to cleanup the issues until we can determine what in mlir/stablehlo is the source of the issue.	2024-04-02 12:40:00 +08:00
Rob Suderman	ec4cb8be44	Bump LLVM to llvm/llvm-project@0030fc4ac7 (#3079 ) Co-authored-by: Peiming Liu <peiming@google.com>	2024-04-01 16:34:59 -07:00
zjgarvey	532d297c46	[ONNX] Preliminary Work Towards Supporting QuantizedMLP_basic onnx e2e test (#3089 ) See the related issues here: [SHARK-Turbine#556](https://github.com/nod-ai/SHARK-Turbine/issues/556) 1. Adds uint8 casting to onnx.Cast op 2. Fixes an issue with onnx.DequantizeLinear when the scale comes with shape [1]. 3. Adds support for unsigned types in an AtenItemOp folder 4. Adds a simpler quantized model for easier debugging 5. Adds a fusion pass to convert [quant -> dequant -> transpose -> mm] patterns to [transpose -> quant -> mm]. 6. Moved some xfails that are still not passing, but for different reasons than onnx.cast failures.	2024-04-01 16:21:05 -07:00
Vivek Khandelwal	6844c84702	[MLIR][Torch] Fix OnnxToLinalg lowering for AvgPool op (#3076 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-01 22:14:14 +05:30
Stella Laurenzo	6d680ff445	[ods] Allow all tensor returns to be optional. (#3082 ) This was found while tracing backwards graphs: the convolution_backwards op will return None if the first result is not needed. Confirmed by defining a custom op with a `Tensor` return signature and having its meta kernel return None.	2024-03-29 23:09:34 -07:00
Xinyu Yang	40008b025a	[Torch] Support prelu decomposition (#3069 )	2024-03-29 08:05:00 +08:00
zjgarvey	c19fc9ba47	[ONNX] Fixes Issue with Dynamic Dims in GlobalAveragePool -> Torch Conversion (#3053 ) Two e2e tests (AdaptiveAveragePool1/2dUnitOutputSizeDynamic) were failing due to numerics. This was as a result of passing -1 as the kernel size in the lowering for the corresponding onnx op GlobalAveragePool.	2024-03-28 09:43:09 -07:00
Xinyu Yang	e6e7689a24	[Torch] support decompose aten.einsum with ellipsis slicing (#3056 )	2024-03-27 12:42:10 -07:00
Yuanqiang Liu	0a581a97a7	[Torch Dialect] enhance aten.int.tensor's canonicalize (#3058 ) support fold with literal vtensor. change it to canonicalize because this pattern will create new op.	2024-03-27 09:51:58 +08:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
Vivek Khandelwal	9ae33e482e	[MLIR][TORCH] Add OnnxToTorch lowering for ops (#3049 ) This commit adds the OnnxToTorch lowering for the Mish, Softplus, HardSwish, Trilu, ThresholdedRelu op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-25 20:29:07 +05:30
schnkmwt	1fcbfa87ec	Implement linalg lowering of diag_embed torch op (#2885 ) This PR adds lowering of diag_embed to linalg dilect. Tracked in https://github.com/nod-ai/SHARK-Turbine/issues/288 --------- Co-authored-by: sachink <sachink@xilinx.com>	2024-03-22 16:32:50 -07:00
zjgarvey	99b3a5f117	Converts all Adaptive Pooling Ops to Linalg (#2808 ) The previous conversions for AtenAdaptiveAvgPool1dOp and AtenAdaptiveMaxPool2dOp are refactored into a general templated conversion that works for all of the AtenAdaptive...PoolNdOp's. New support is added for the following ops: 1. AtenAdaptiveMaxPool1d 2. AtenAdaptiveMaxPool3d 3. AtenAdaptiveAvgPool3d Support is also provided for passing inputs without batch dimensions. For example, applying adaptive_avg_pool2d to an input tensor of rank 3. After [pytorch #118162](https://github.com/pytorch/pytorch/pull/118162) gets down to torch-mlir, I'll add a test for AdaptiveMaxPool1d with return_indices (which will pass with that upstream fix). --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-03-22 11:05:20 -07:00
zjgarvey	6aa481c204	[ONNX] LogSoftmax to Torch (#3024 ) This PR adds support for onnx.LogSoftmax both for old versions (<13, with axis >=0), and new versions (13).	2024-03-22 11:01:39 -07:00
penguin_wwy	7616d637fd	Add stateless fx graph import (#3036 )	2024-03-21 14:44:54 -07:00
Rob Suderman	3a56714bff	[torch] Fix clamp ranges on quantize_per_tensor on unsigned (#3018 ) SExtValue was used for `int` and `uint` clamp values. This caused the result to always be outputed as `zero`.	2024-03-20 13:37:47 -07:00
Xida Ren (Cedar)	cb5cb506df	Fix SCF Forloop fails to convert to linalg when a tensor argument is supplied to the loop block (#3040 ) Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-03-20 11:04:02 -07:00
zjgarvey	6ff71b40c8	[ONNX] onnx.DynamicQuantizeLinear to Torch (#3009 ) This adds support for converting DynamicQuantizeLinear from torch-onnx to torch. I could not get an e2e test to pass, since there seems to be some issues with uint8 casting somewhere lower in the pipeline. For example compiling with IREE for llvm-cpu, I would get either the correct zero point (if zp < 128) or the correct zero-point minus 256 (if zp >= 128). The output tensor seems to always return a tensor of zeros, which also occurs when running uint8 examples through QuantizeLinear. Edit: the first problem can be resolved by casting the output back to uint8 on output, the second problem is resolved with PR #3018	2024-03-20 10:58:25 -07:00
Abhishek-TyRnT	df02692726	Dynamic size support for flatten (#3005 ) Added support for dynamic shapes in `flattenusingints` op in tosa dialect. Due to this some Argmax tests pass This PR fixes this issue https://github.com/llvm/torch-mlir/issues/3004 The following tests pass after this PR ``` 1. "ArgmaxIntModule_basic" 2. "ArgmaxIntModule_multiple_maxs" 3. "ArgmaxModule_basic" ```	2024-03-19 15:19:29 -07:00
Yuanqiang Liu	8b96727d0d	[Stablehlo] lowering chlo to stablehlo in torch-to-stablehlo pipeline (#3037 ) as that stablehlo is better than chlo as the boundary between frontend compiler and backend compiler.	2024-03-19 21:18:54 +08:00
Pavani Chowdary	c51e2130f2	[onnx] support for lowering mod op from onnx to torch (#2859 ) nod-ai/Shark-Turbine#267 --------- Authored-by: boddu.pavani@research.iiit.ac.in Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-18 17:54:37 +05:30
Xinan Jiang(姜曦楠)	d8a52e82c2	[onnx] Fix onnx.cast cases between int32 and int64 (#2982 ) 2 modifications: 1. torch.int64 is enum 4 in TORCH_DTYPE_TO_INT 2. add int32 support	2024-03-15 17:14:09 +00:00
penguin_wwy	f34c187ac4	Normalize type hints to be compatible with multiple Python versions (#3028 ) Although we provide a wheel package for Python 3.8, it may actually throw the following exception: `TypeError: 'type' object is not subscriptable`	2024-03-15 08:29:48 -07:00
Yuanqiang Liu	4282eb9e76	[Torch Dialect] support aten.fake_quantize_per_tensor_affine (#3014 )	2024-03-15 08:53:29 +08:00
Yuanqiang Liu	870e63bc3c	[Torch Dialect] support decomposition of aten.linspace (#3006 )	2024-03-14 08:28:33 +08:00
Yuanqiang Liu	43c6996a31	[Torch Dialect] add folder for aten.ceil and unify patterns of ceil, … (#3010 ) …floor, round	2024-03-14 07:41:58 +08:00
ptrifunovic98	524ff99216	Implement lowering of torch.aten.linalg_cross (#2986 ) Closes [nod-ai/SHARK-Turbine#497](https://github.com/nod-ai/SHARK-Turbine/issues/497)	2024-03-13 12:17:22 -07:00
Yuanqiang Liu	ad6159c7cb	[Stablehlo] lowering aten.round to stablehlo.round_nearest_even (#3011 )	2024-03-12 08:58:20 +08:00
Devjiu	4b1e87ce67	[TorchDynamo] Enable Elemtwise ops for Scalar arg (#2744 ) This commit provides dummy solution to support elmentwise operations (mul, add) with scalar argument. ( op(Tensor, Scalar) ) It replaces `torch.aten.add.Tensor` with `torch.aten.add.Scalar`. ``` Unexpected outcome summary: (torchdynamo) ****** Unexpectedly Passed tests - 22 tests XPASS - "AddCDivModule_basic" XPASS - "BatchNorm1DModule_basic" XPASS - "BatchNorm1DStaticShapeModule_basic" XPASS - "BatchNorm1DWith2DInputModule_basic" XPASS - "BatchNorm2DModule_basic" XPASS - "BatchNorm3DModule_basic" XPASS - "ElementwiseAddScalarInt64Module_basic" XPASS - "ElementwiseAddScalarIntModule_basic" XPASS - "ElementwiseMulScalarModule_basic" XPASS - "ElementwiseMulScalarModule_float" XPASS - "ElementwiseMulScalarModule_int" XPASS - "GroupNormModule_basic" XPASS - "GroupNormNoWeightAndBiasModule_basic" XPASS - "MobilenetV3Module_basic" XPASS - "NativeBatchNorm1DModule_basic" XPASS - "NativeBatchNorm2DModule_basic" XPASS - "NativeBatchNorm3DModule_basic" XPASS - "NativeBatchNormNoneWeightModule_basic" XPASS - "NativeGroupNormBackwardModule_basic" XPASS - "NativeGroupNormModule_basic" XPASS - "ResNet18Module_basic" XPASS - "ResNet18StaticModule_basic" ``` And segfault for test "ElementwiseAddScalar_TensorLiteralInt32_Module_basic". Somehow this change doesn't allow to use Tensors, that are not forward arguments, but local variables of model. e.g. `self.x = torch.tensor(..)` See also: #2745 Signed-off-by: Dmitrii Makarenko <dmitrii.makarenko@intel.com>	2024-03-11 12:22:05 -07:00
Rob Suderman	8fb28661f9	[onnx] Fix onnx.ReduceMean lowering (#3002 ) Reduce mean lowerings did not succesfully lower to `linalg` via torched. There were two separate paths that could be consolidated to a single simpler pass. This resulted in a significant improvement in test coverage.	2024-03-11 11:32:53 -07:00
Yuanqiang Liu	229ca3a9e1	[Torch Dialect] emit aten::mul and add folder (#3007 )	2024-03-11 19:59:34 +08:00
Yuanqiang Liu	a3fe130f73	[Torch Dialect] emit aten::warn (#3003 ) * torch-mlir may not handle `aten.warn`. But it could be handled by custom users' backend which involves torch-mlir.	2024-03-10 08:29:08 +08:00
Rob Suderman	bd7f1baa42	[onnx] Fix expand operation for dynamic shape max (#3001 ) If the broadcast shape is length-1 at a dim while `?` in the input dim then we need to broadcast to the dynamic dim. This is equivalent to taking a max of two dimensions.	2024-03-08 16:23:07 -08:00
Rob Suderman	0723584936	[torch] Add folder for torch.aten.*.Scalar comparisons (#3000 ) This folds small version of the tensor-scalar comparison operators as they are commonly used for shape computations. This includes le, lt, ge, gt, eq, and ne.	2024-03-08 13:44:00 -08:00
Andreas Falkenberg	551a4e45f3	[onnx] Add support for `onnx.Gemm` with no bias (#2993 ) Previous gemm version required a bias vector. This provides an alternate path to `Torch::AtenMm` with no bias operation.	2024-03-07 15:58:38 -08:00
Rob Suderman	1964208d19	[onnx] Fix constant pad for dynamic shape (#2989 ) The current padding operation was not functional for dynamic shapes. Updated and enabled tests so that onnx.pad tests pass. Work TBD for reflection padding.	2024-03-07 13:29:50 -08:00
Scott Todd	7b18646def	[onnx] Handle optional arguments in Clip op pattern. (#2976 ) Spec: https://onnx.ai/onnx/operators/onnx__Clip.html	2024-03-07 17:25:14 +00:00
Vivek Khandelwal	6e84752c39	build: manually update PyTorch version (#2992 ) Set PyTorch and TorchVision version to nightly release 2024-03-07. This commit also removes the deprecated constraints API: `342e7929b8` Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-03-07 21:42:38 +05:30
Rob Suderman	a78659742a	[onnx] Migrate `onnx.ReduceMax` to match `onnx.ReduceMin` (#2981 ) This mostly copy-pastes the reduce minimum implementation to reduce max to improve test coverage. We also improve the aten lowering for min/max dim for unsigned types.	2024-03-06 16:48:21 -08:00
Andreas Falkenberg	ea76dd12ba	[onnx][torch] Gridsampler E2E test and corrections of gridsampler (#2987 ) The addition of an e2e test is actually provided in the Shark-Testsuite. This adds 2 test cases for the gridsampler e2e test. Also as intended there were some items found which needed correction, so the Gridsampler op has also a change.	2024-03-06 10:56:58 -08:00
Rob Suderman	06292d9429	[torch] Rework `aten.repeat` to use flatten and unsqueeze (#2984 ) Current implementation depends on using `aten.view` which has issues inferring tensor collapse/expand operations during the lowering to `linalg`. Using flatten and unsqueeze better infers what the later reshape behavior.	2024-03-06 10:19:18 -08:00
Ze Zhang	aa7c9a9653	e2e support aten.linalg_norm to aten.linalg_vector_norm (#2953 ) Add e2d support for `aten.linalg_norm` by decompose it to `aten.linalg_vector_norm`. Lowering to `aten.linalg_matrix_norm` is still unsupported. To Test: `python -m e2e_testing.main -v` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-03-05 16:31:01 -08:00
Rob Suderman	bc0527676b	[torch] Add support for `torch.split_with_sizes` via decompose (#2979 ) Convert to individiual slices and tuple together as a list. --------- Co-authored-by: Scott Todd <scott.todd0@gmail.com>	2024-03-05 15:01:21 -08:00
Chi_Liu	09875fabd1	[MLIR][ONNX] Add ONNX ReduceProd support (#2943 ) Alternatives to https://github.com/llvm/torch-mlir/pull/2908 Fix https://github.com/nod-ai/SHARK-Turbine/issues/353	2024-03-04 11:07:03 -08:00
Rob Suderman	19d4888278	[torch] Make torch.aten.unflatten lower directly to linalg (#2971 ) Existing lowering via aten.view does not work as well for dynamic shapes as the lowering to tensor.expand must re-infer dynamic shape matching. Better to directly lower.	2024-03-04 10:17:42 -08:00
Rob Suderman	d51e80b648	[onnx] Fix onnx.gather lowering for rank-0 indices (#2973 ) We assumed rank was atleast 1 however it can be rank-0, generating an illegal pair of flatten / unflatten operations. Corrected this.	2024-03-04 08:25:19 -08:00
Rob Suderman	61f0a5facf	[torch] Add an `aten.cat` length-0 canonicalization (#2966 ) If an input is length-0 along the dimension of canonicalization we can remove the tensor from the list	2024-03-01 21:41:12 -08:00
Rob Suderman	d030bffc62	[torch] Support `aten.view` rank-0 collapse (#2965 ) Collapsing to a rank-0 tensor using `aten.view` was currently bailing out. Added the special case.	2024-03-01 12:31:07 -08:00
mmakevic	76b81e0ccd	Implement lowering of torch.aten.fmod.Tensor (#2767 ) Closing https://github.com/nod-ai/SHARK-Turbine/issues/351	2024-02-29 11:22:03 +05:30
Rob Suderman	ed6e75908b	Bump LLVM to llvm/llvm-project@e5ed7b6e2f (#2964 )	2024-02-28 14:13:26 -08:00
Rob Suderman	6f3d62ab04	[torch] Fix folders and `cat` and `view` torch lowerings (#2963 ) A bunch of small fixes are interlinked and trigger crashes if not addressed as a group. This includes: - aten view when expand from a rank-0 tensor - slice folder with negative indices - `aten._shape_as_tensor` folder on a rank-0 tensor - `aten.cat` of a tensor with a length-0 tensor	2024-02-28 12:04:52 -08:00
Rob Suderman	08bc013fcd	[tosa] Fix TOSA batch matmul lowering to correct transpose ordering (#2959 ) The corrective transpose at the end is computed incorrectly. Is it actually computin the inverse transpose. Inverting the permutations fixes the issue.	2024-02-28 09:46:58 -08:00
Rob Suderman	4a7a7d76f8	[onnx] Fix ReduceMean lowering to torch (#2956 ) Torch lowering only supported the most recent version. Refactored the lowering so more easily handle default values and optional operands / attributes.	2024-02-27 22:48:07 -08:00
Abhishek-TyRnT	d541779f37	Add support for torch arange float module (#2749 ) Added Support for float dtype in in torch.arange in TOSA Dialect This resolves the following issue :- https://github.com/llvm/torch-mlir/issues/2762 The following test cases are passing after this change 1. ArangeDtypeIntModule_basic 2. ArangeFloatModule_basic 3. ArangeNegativeStartFloatModule_basic 4. ArangeStartFloatModule_basic 5. ArangeStartNegativeStepFloatModule_basic 6. ArangeStartOutDtypeModule_basic 7. ArangeStartStepFloatModule_basic --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-02-27 13:40:55 -08:00
Rob Suderman	e30a083aff	[torch] Rework lowering to tm_tensor.scatter to stop serialization (#2940 ) We collapsed and broadcasted scatter indices to a single element version. We should instead upport `tm_tensor.scatter`s support for multiple indices and the implicitly broadcasted behavior. This avoids the serialization and materializing a needlessly large indices tensor.	2024-02-27 11:46:57 -08:00
Vivek Khandelwal	d628b5fd06	[MLIR][TORCH] Add support for tanh approximation for Gelu op (#2941 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/461 Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-27 19:26:01 +05:30
Vivek Khandelwal	d81747eadb	[MLIR][TORCH] Extend support for OnnxToLinalg lowering for Dropout and Div op (#2938 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/451, https://github.com/nod-ai/SHARK-Turbine/issues/452	2024-02-27 11:02:05 +05:30
ptrifunovic98	c5a1da1910	Implement lowering of torch.aten.norm.Scalar (#2899 ) Closes [nod-ai/SHARK-Turbine#365](https://github.com/nod-ai/SHARK-Turbine/issues/365)	2024-02-26 08:46:56 -08:00
Andreas Falkenberg	55dc8deb92	[torch] GridSample TorchToLinalg lowering (#2883 ) Lowers `torch.grid_sample` to the equilvalent `linalg` representation.	2024-02-23 09:14:38 -08:00
Rob Suderman	53f6d06ab8	[onnx] Drop `ConstantOfShape` logic form importer, fix torch lowering (#2930 ) There is no reason to treat `ConstantOfShape` as a specialized import any as there exists a onnx-to-torch equivalent. Dropping the import coding and adding support for resource conversion substantially increases test coverage for dynamically shaped tests.	2024-02-21 21:34:43 -08:00
Rob Suderman	df2aa1a369	[torch] Fixed edge conditions for strided slicing (#2929 ) Strided slicing can occur with a negative stride. In these cases we need to bound end differently. This included removing a function that was generating bad limits.	2024-02-21 21:28:44 -08:00
Aart Bik	534b266f2d	[torch-mlir][NFC] remove trailing whitespace (#2936 )	2024-02-20 11:23:14 -08:00
Rob Suderman	13113df33e	[onnx] Enable crashing tests (#2928 ) Crashing tests no longer crash, enable as either passing or xfail tests. Co-authored-by: Xida Ren (Cedar) <cedar.ren@gmail.com>	2024-02-20 18:34:21 +00:00
Rob Suderman	13553d49c9	[onnx] Update the importer to create a `none` for missing operands (#2931 ) Some operands are optional so we require a placeholder for missing operands. We invent an `onnx.None` operation as our placeholder.	2024-02-20 09:30:30 -08:00
Rob Suderman	135c81a416	[torch] Add folder for `prim.NumToTensor.Scalar` (#2921 ) Useful for `slice` lowerings that depend on tensors made form scalars.	2024-02-19 11:55:54 -08:00
Rob Suderman	e80054a3cc	[torch] Folders for `torch.aten.*.tensor` operators [add, sub, mul] (#2878 ) Simple folder for limited size aten tensor operations. This is primarily useful for shape computation folding as they unfortunately can use `aten` operators. Add, sub, mul are common examples of these folders.	2024-02-19 10:28:23 -08:00
Rob Suderman	cea51897a5	[onnx] Simplify onnx.slice lowering (#2919 ) Onnx slice lowering used arange needlessly instead of directly constructing the constant dimension values. This makes lowerings to linalg struggle as multiple folders are required to get what is a constant index value.	2024-02-19 10:26:29 -08:00
aldesilv	d29157b33f	OnnxToTorch support for onnx.InstanceNormalization op (#2710 ) https://github.com/nod-ai/SHARK-Turbine/issues/327	2024-02-19 19:53:48 +05:30
Aart Bik	78e10ff09b	[torch-mlir][sparse] inline sparse helper methods (#2918 ) Even though the reference compiler is not about performance, inlining the generated sparse helper methods has a rather big positive impact on performance, leaving a much better first impression. Therefore, we added this inlining pass (which leaves all other PyTorch modules unaffected, since they tend to be one big main() method to start with). testing: $./tools/e2e_test.sh --config linalg Summary: Passed: 1164 Expectedly Failed: 8 $ python -m e2e_testing.main --config=torchdynamo Summary: Passed: 976 Expectedly Failed: 162	2024-02-16 20:56:42 -08:00
Rob Suderman	d65925a8b4	[onnx] Fix `onnx.sigmoid` for integer inputs/outputs (#2914 ) Sample compilation crashes due to sigmoid with integer inputs/outputs. This fix avoids crashing but still experiences an error.	2024-02-16 13:35:25 -08:00
Rob Suderman	7a0d0e954b	[onnx] Fix onnx.gather lowering to use torch.aten.index_select (#2913 ) Onnx's gather maps directly to `torch.aten.index_select`. We should just use that path.	2024-02-16 16:05:44 -05:00
Rob Suderman	074f112d6a	[onnx] Add testing using the `onnx` compilation using torch tests (#2795 ) We can route the torch tests via `onnx` using the `torch.onnx.export` tooling. We can then reimport, lower to torch, and compile to linalg to validate the onnx path is working correctly. The current implementation exposes some failures in the `onnx` path so we cannot enable the onnx test suite yet due to segmentation faults.	2024-02-15 10:17:13 -08:00
Yuanqiang Liu	5733c84443	[bazel] fix bazel with stablehlo refbackend and fix some typo (#2911 )	2024-02-16 01:38:13 +08:00
Yuanqiang Liu	f3e8199a6d	[Stablehlo] add refbackend (#2712 )	2024-02-16 01:08:48 +08:00
Vivek Khandelwal	d6d1a173dc	[MLIR][Torch] Add OnnxToTorch and TorchToLinalg support for trig ops (#2903 ) This commit adds the OnnxToTorch lowering for cosh, acosh, asin, asinh, and atanh op. This commit also adds the TorchToLinalg lowering for acosh, asin, asinh, and atanh op. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-14 11:58:09 +05:30
Rob Suderman	e9cdd6cbc5	[torch] Fix tm_tensor.attention for end-to-end (#2907 ) Some operations include a backend matcher for specialized operations. We map these back to generics so they appropriately match to the high performance versions. This is done for the attention operation.	2024-02-13 21:18:01 -08:00
Ashay Rane	370d6ac9a2	build: find Protobuf using config mode search (#2900 ) This patch makes the Protobuf package mandatory in addition to forcing a config mode search. The (default) module mode search looks for the CMake-provided FindProtobuf.cmake file, but this file does not list Abseil as a dependency, causing linker issues like the one below: ``` ld: Undefined symbols: absl::lts_20230802::log_internal::LogMessageFatal::LogMessageFatal(char const*, int, std::__1::basic_string_view<char, std::__1::char_traits<char>>), referenced from: google::protobuf::RepeatedPtrField<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>>::TypeHandler::Type const& google::protobuf::internal::RepeatedPtrFieldBase::Get<google::protobuf::RepeatedPtrField<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>>::TypeHandler>(int) const (.cold.1) in OnnxImporter.cpp.o ``` By forcing a config mode search, CMake looks for the file that is installed as part of the protobuf package and which does contain the Abseil dependency. This workaround is also mentioned in a GitHub issue for Protobuf: https://github.com/protocolbuffers/protobuf/issues/12292#issuecomment-1529680040.	2024-02-12 17:31:41 -06:00
Aart Bik	be8375d350	[torch-mlir][sparse] implement first sparse_jit end-to-end path (#2894 ) This PR introduces a sparse_jit wrapper that can run simple models with sparse tensor inputs end-to-end. The implementation shows all required components on modifying sparse tensor types with a 1:N relation on the call sites. Two tests shows that the JIT runs end-to-end while computing the correct results. More details to follow (generalizing to COO and different ranks, as well as support for output sparse tensors), but the general concepts are all here now. _Update: Thanks to Rob, bump to proper LLVM/MLIR hash is done!_ _NOTE that all parameter passing changes are nicely done "downstream" in MLIR, so very little changes are required in torch-mlir code proper_ --------- Co-authored-by: Franz Haniel <77495327+frafranz@users.noreply.github.com> Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2024-02-12 10:04:54 -08:00
Rob Suderman	c0f139be0f	[torch] Add `torch.aten.eq.Tensor` comparison folder (#2889 ) Added a folded for a equals operator. This allows an equivalent comparison folder, primarily for when shape computations occur small size tensor.	2024-02-09 15:02:20 -08:00
Rob Suderman	7d33ba69ac	[torch] Folder for torch.aten.select.int for splat cases (#2890 ) If the input or result is a splat value we can just constant fold the result. This is common for shape computations and can help with shape inference.	2024-02-09 14:02:54 -08:00
Franz Haniel	4cc62aeb24	Implement trace (#2790 ) The lowering decomposes AtenTraceOp into an AtenDiagonalOp followed by AtenSumOp. The progress is tracked in https://github.com/nod-ai/SHARK-Turbine/issues/333. --------- Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2024-02-09 08:00:24 -08:00
Avinash Sharma	9659a436d1	Add lowering support for math::AbsIOp (#2875 ) There is no lowering support for math::AbsIOp, so if the operand is an integer type, it will fail to lower to math::AbsFOp since the op operand #0 must be floating-point-like.	2024-02-08 14:53:40 -08:00
Aart Bik	44f8f89826	[torch-mlir][sparse] add sparsification to linalg reference backend (#2887 ) This adds a few passes that will ensure linalg with sparse tensors are properly lowered to loops and can run using the ExecutionEngine for testing (a few details on parameter passing from PyTorch still TBD) Test results: $ ./tools/e2e_test.sh --config linalg Summary: Passed: 1144 Expectedly Failed: 8 $ python -m e2e_testing.main --config=torchdynamo -v Summary: Passed: 960 Expectedly Failed: 163 Filed issue: https://github.com/pytorch/pytorch/issues/119407	2024-02-08 09:37:31 -08:00

1 2 3 4 5 ...

332 Commits (574143448bab6b66560f26e2396d4068a12fbd0b)