torch-mlir

Commit Graph

Author	SHA1	Message	Date
Yuanqiang Liu	dc470e65c8	add torch.qint32 to dtype-spec in TorchTypes.td (#3206 )	2024-04-24 11:49:26 +08:00
Yuanqiang Liu	8a1dbbd597	[torchscript] export extra library file name to user (#3203 ) * so that it could be specified by user.	2024-04-24 11:34:02 +08:00
Phaneesh Barwaria	f77d88390a	[onnx] handle dynamic padSize tensor in onnx.Pad (#3214 ) - Fix pad size to data_rank for dynamic paddingSize Tensor. - This fix is in accordance with [input specification](https://onnx.ai/onnx/operators/onnx__Pad.html#inputs) for onnx.Pad - Impl will need to be updated for dynamic padSize when support for `axes` is added.	2024-04-24 11:31:37 +08:00
Xinyu Yang	42b9eccdb3	[Stablehlo] Fix AtenSumDimIntListOp when dim==None (#3216 ) as titile	2024-04-24 11:25:46 +08:00
Xinyu Yang	4da3d714cc	[Torch] Support AtenProdOp on linalg and stablehlo (#3215 )	2024-04-24 11:14:04 +08:00
zjgarvey	a8ba865fca	[torch] Adds Quantization Support for `aten.relu` (#3177 ) A choice was made to quantize the return type of Relu with a scale and zero point copied from the input's quantization scheme. With this choice, the torch-to-linalg conversion of quantized Relu essentially computes max(input, zeroPoint) in the elementwise payload.	2024-04-23 11:01:36 -07:00
jinchen	09d42044b4	Support select_last_index attribute of onnx argmin op (#3212 ) The tests listed in https://github.com/nod-ai/SHARK-Turbine/issues/648 all compiled, and the values of results match, but having runtime issue of dtype mismatch of i/si.	2024-04-23 10:43:38 -07:00
jinchen	61e6312c87	Support select_last_index attribute of onnx argmax op (#3192 ) The tests listed in https://github.com/nod-ai/SHARK-Turbine/issues/635 all compiled, but having run issue of dtype mismatch of i/si.	2024-04-23 10:16:08 -07:00
jinchen	ddb29c2c02	[onnx] Add OnnxToTorch support for `onnx.ConvInteger` (#3179 ) All e2e iree tests compiled, but they have the run issue of mismatch of dtype like the following ``` expected: 1x1x2x2xsi32=[[[12 16][24 28]]] actual: 1x1x2x2xi32=[[[12 16][24 28]]] ```	2024-04-23 09:42:02 -07:00
Yuanqiang Liu	db3842f2e8	[Stablehlo] support lowering sinh & cosh to stablehlo (#3213 )	2024-04-23 19:54:58 +08:00
Xinyu Yang	c1967b607f	[Stablehlo] add AtenLog10Op, AtenLog2Op lowering to stablehlo (#3208 )	2024-04-23 19:06:55 +08:00
Yuanqiang Liu	1f8123b5f0	[Stablehlo] support unary ops which promote to floating point (#3209 ) * promote input to output element-type when lowering to stablehlo, so that it could satisfy stablehlo's type constraints. * split promote-to-fp unary ops from fp-only unary ops.	2024-04-23 17:57:12 +08:00
Yuanqiang Liu	797e4cd395	[Stablehlo] lowering asin, acos, atan (#3207 ) * lowering asin, acos and atan to chlo ops.	2024-04-23 16:24:53 +08:00
Vinayak Dev	cff2f084d4	[torch] Add OnnxToTorch lowering for `onnx.ReduceL2` (#3175 ) Adds OnnxToTorch lowering for the ReduceL2 op.	2024-04-23 02:03:05 -04:00
Vivek Khandelwal	3c252cdd44	[onnx] Add `onnx-to-torch` lowering for random ops (#3193 ) This commit adds the OnnxToTorch lowering for Onnx's RandomNormal, RandomNormalLike, RandomUniform, and RandomUniformLike op.	2024-04-22 22:28:07 +05:30
Vivek Khandelwal	6abc7371c8	[MLIR][TORCH] Fix OnnxToLinalg lowering issue for Squeeze and Unsqueeze op (#2991 ) This commit also cleans up the OnnxToTorch lowering for the Squeeze and Unsqueeze op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-22 08:52:42 +00:00
penguin_wwy	e5bdd71baf	[Torch] Emit and decompose prims.iota op (#3132 )	2024-04-21 19:45:01 -07:00
penguin_wwy	a60e84e5ee	[stablehlo] add aten.expm1 op conversion support (#3199 )	2024-04-21 19:20:49 -07:00
Rob Suderman	8222637159	[onnx] Extend op version number of `onnx.ScatterElements` (#3195 ) Version number was set too high. Lowered to support more cases allows more tests to pass. Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:32:18 -04:00
Rob Suderman	733cace1df	[onnx] Fix `onnx.split` by directly handling slicing (#3194 ) Previous implementation erroneously mixed up num_outputs with slice_size. New version correctly computs the slice size and directly performs slicing rather than leveraging `aten.split.tensor`. This is due to `onnx` supporting a fixed number of splits making the size computation more easily computeable when lowering to `aten` rather than deferring to `aten.split.tensor`. --------- Co-authored-by: Robert Suderman <rsuderman@Roberts-MacBook-Pro.local>	2024-04-21 12:31:56 -04:00
penguin_wwy	b6b01602d3	[stablehlo] add aten.fmod.Tensor op conversion support (#3198 )	2024-04-21 08:39:36 +08:00
penguin_wwy	ea0ecb67be	[stablehlo] add aten.remainder.Tensor op conversion support (#3197 )	2024-04-21 00:03:37 +08:00
Rob Suderman	b01245c0e8	[onnx] Fix `onnx.Not` for non-bool inputs (#3187 ) Need to perform a bool cast to support `onnx.Not` on non-bool inputs.	2024-04-19 11:32:24 -07:00
Xinyu Yang	790a697245	[Torch] Add folder for AtenIntOp, AtenFloatOp (#3189 ) See unit test below: ``` // CHECK-LABEL: func.func @torch.aten.tensor.float( // CHECK-NEXT: torch.vtensor.literal(dense<1.000000e+01> : tensor<f32>) : !torch.vtensor<[],f32> func.func @torch.aten.tensor.float() -> !torch.vtensor<[],f32> { %none = torch.constant.none %false = torch.constant.bool false %float1.000000e01 = torch.constant.float 1.000000e+01 %67 = torch.aten.tensor.float %float1.000000e01, %none, %none, %false : !torch.float, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],f32> return %67 : !torch.vtensor<[],f32> } // CHECK-LABEL: func.func @torch.aten.tensor.int( // CHECK-NEXT: torch.vtensor.literal(dense<45> : tensor<si32>) : !torch.vtensor<[],si32> func.func @torch.aten.tensor.int() -> !torch.vtensor<[],si32> { %none = torch.constant.none %false = torch.constant.bool false %int45 = torch.constant.int 45 %67 = torch.aten.tensor.int %int45, %none, %none, %false : !torch.int, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],si32> return %67 : !torch.vtensor<[],si32> } ```	2024-04-19 22:17:06 +08:00
penguin_wwy	5a98c72c7f	[StableHLO] Fix aten.clamp.Tensor in FxImporter2StableHLO (#3190 ) The FX importer will pass static shapes to the Torch dialect, so it needs to generate a StableHLO that satisfies shape inference.	2024-04-19 17:08:29 +08:00
penguin_wwy	0a6073414d	[FxImporter] Add fx importer to stablehlo e2e test config (#3183 )	2024-04-18 21:29:17 -07:00
penguin_wwy	6c4f7deebb	[stablehlo] add aten.clamp.Tensor op conversion support (#3185 )	2024-04-19 10:55:27 +08:00
Rob Suderman	be742a937d	[onnx] Update the failure triage for onnx (#3186 ) Reclassifying what the source of failures are for various bugs so we can reprioritize what failures are common.	2024-04-18 14:58:13 -07:00
Rob Suderman	0e77de996a	[torch] Add support for `torch.view` with dynamic shapes (#3164 ) We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.	2024-04-18 11:47:19 -07:00
Rob Suderman	4c21e20caa	[torch] Support rank-0 index for torch index select (#3182 ) Need to perform an expand in the case where the indices is rank-0.	2024-04-18 11:32:31 -07:00
Vivek Khandelwal	6e5630d696	build: manually update PyTorch version (#3170 ) Set PyTorch and TorchVision version to nightly release 2024-04-16. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-04-18 12:55:10 +05:30
Xinyu Yang	d4313eed4a	[Torch] Add decomposition of RepeatInterleaveSelfInt Op (#3075 ) Decomposition RepeatInterleaveSelfInt with following ops: ```python def my_repeat_interleave(input, repeats, dim=None): if dim is None: # Flatten the input and then repeat return input.flatten().unsqueeze(-1).tile((1, repeats)).flatten() else: # Calculate the shape after repeat expanded_shape = list(input.shape) expanded_shape[dim] = repeats # Repeat the tensor along the specified dimension repeat_shape = [1] (input.dim() + 1) repeat_shape[dim + 1] = repeats input = input.unsqueeze(-1) # Tile and then reshape tiled = torch.tile(input, repeat_shape) # Rearrange and reshape repeated = tiled.reshape(expanded_shape) return repeated ``` I passed the tests of stablehlo and linalg. When testing onnx, strange things happened. In torch-mlir's CI torch_nightly* and my own environment(torch==2.4.0.dev20240318+cpu), it can pass the pass. In torch-mlir's CI torch_stable, it failed. The test case is `RepeatInterleaveSelfIntNoDimModule_basic`, the result shape should be [120]. ```python class RepeatInterleaveSelfIntNoDimModule(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([3, 4, 5], torch.float32, True), ]) def forward(self, x): return x.repeat_interleave(2) @register_test_case(module_factory=lambda: RepeatInterleaveSelfIntNoDimModule()) def RepeatInterleaveSelfIntNoDimModule_basic(module, tu: TestUtils): module.forward(tu.rand(3, 4, 5)) ``` The error log is as follows: ``` Unexpected outcome summary: (onnx) ****** Failed tests - 1 tests FAIL - "RepeatInterleaveSelfIntNoDimModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([6, 4, 5])) is not equal to golden shape (torch.Size([120])) ``` @rsuderman Would you please help me check what's wrong with my PR? Thanks a lot.	2024-04-18 06:27:51 +08:00
Aart Bik	491f4820f5	[torch-mlir][sparse] pre-pend named buffers to parameter list (#3178 ) weights and biases and other model parameters appear as a separate data structure to the traced graph, but are needed when running the MLIR compiled code; this PR implements that extended functionality	2024-04-17 14:44:05 -07:00
Andreas Falkenberg	b66eabd492	[onnx][torch][linalg] Implementing align-corner modes for gridsampler (#3171 ) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-04-17 13:38:19 -07:00
penguin_wwy	3aa81f78d8	[FxImporter] Replace local_scalar_dense in fx_importer (#3180 )	2024-04-17 22:45:47 +08:00
Xinyu Yang	d2ba956e69	[Torch] Support Aten_CastLongOp. (#3160 ) By canonicalize Aten_CastLongOp into AtenToDtypeOp	2024-04-17 21:58:32 +08:00
penguin_wwy	e4b11a0ab4	[FxImporter] Fix fx importer test config and clean xfail set (#3176 )	2024-04-16 22:36:07 -07:00
penguin_wwy	398aeeec87	[FxImporter] Fix kwarg operands in fx importer (#3166 ) Remove the `kwarg_only` limitation, for example ``` torch.add(x, 3.0, alpha=2) ``` compiled to ``` %0 = torch.aten.add.Scalar %arg0, %float3.000000e00, %int1 ``` fix to ``` %0 = torch.aten.add.Scalar %arg0, %float3.000000e00, %int2 ```	2024-04-16 13:17:05 -07:00
zjgarvey	7a1ad0d7c0	[TorchToLinalg] Adds Support for Remaining Quantized Matmul Cases (#3167 ) The new cases added for quantized matmuls are: 1. vec-vec 2. vec-mat 3. mat-vec each of which are now lowered to expand(s), quantized_matmul, and collapse.	2024-04-16 09:28:28 -07:00
Vinayak Dev	a0232e9ebd	[MLIR][TORCH] Add OnnxToTorch lowering for ReduceL1 Op (#3146 ) Adds OnnxToTorch Lowering for the ReduceL1 op.	2024-04-16 12:24:46 +05:30
penguin_wwy	af5509c5d9	[FxImporter] Type conversion to resolve the mismatch between Py type and schema type (#3163 )	2024-04-15 23:14:19 -07:00
penguin_wwy	10b6062d41	[CI] Enable the tests for fx_importer in the CI (#3168 ) Replace the torchdynamo e2e with the fx_importer e2e	2024-04-15 21:20:23 -07:00
Xinyu Yang	ae4724763a	[Stablehlo] Enhance broadcast pattern in matmul Ops (#3161 ) To pass test "MatmulStaticBroadcast_basic" in stablehlo: ```python class MatmulStaticBroadcast(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([4, 1, 6, 7], torch.float32, True), ([8, 1, 5, 7, 6], torch.float32, True), ]) def forward(self, lhs, rhs): return torch.matmul(lhs, rhs) @register_test_case(module_factory=lambda: MatmulStaticBroadcast()) def MatmulStaticBroadcast_basic(module, tu: TestUtils): module.forward(tu.rand(4, 1, 6, 7), tu.rand(8, 1, 5, 7, 6)) ```	2024-04-16 10:10:36 +08:00
zjgarvey	5e564b5864	Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).	2024-04-15 16:06:47 -07:00
IanWood1	5708ee7ec9	Added 2 Ops: Floor divide scalar and Floor divide scalar mode (#3156 ) - Added linalg lowering for `AtenFloorDivideScalarOp` - Needed `AtenDivScalarModeOp` for the decomp. - Added linalg lowering for `AtenDivScalarModeOp` - Moved linalg payload logic to `createDivModePayload()` since the logic was nearly identical for both `AtenDivScalarModeOp` and `AtenDivTensorModeOp`. Just a template function - Added `AtenDivScalarModeOp` lowering for stablehlo Pytorch's [`torch.floor_divide()`](https://pytorch.org/docs/stable/generated/torch.floor_divide.html) in a previous version (for a reason unknown to me) preformed a truncation instead of "floor". The already implemented op `AtenFloorDivideTensorOp` was done before this change. However, this wasn't caught because our testcases only tested positive floor division. I changed this to floor as well as adding a few test cases.	2024-04-15 13:45:10 -07:00
jinchen	83cba8c696	[onnx] Support for `onnx.EyeLike` via torch lowering (#2994 )	2024-04-15 09:23:26 -07:00
penguin_wwy	9ac90ec7b2	Refactor the parameters and usage instructions of the setup script. (#3162 ) As this issuecomment(https://github.com/llvm/torch-mlir/pull/3021#issuecomment-2031248199) suggests, `setup.py` should only be used for building Python packages, so: * disabled the develop command * refactor the environment variable parameters * add more doc for the usage and env var of setup.py	2024-04-14 10:40:25 -07:00
penguin_wwy	45eaeaaf36	[FxImporter] Add FxImporter config in e2e-test (#3151 )	2024-04-12 16:07:56 -07:00
jinchen	859f5d280f	Generalize getting index for onnx compress op (#3150 )	2024-04-12 15:18:22 -07:00
Xida Ren (Cedar)	ed163f49e8	Delete .github/workflows/buildAndTest.yml to solve CI error messages … (#3155 ) …on every push fixes #3144	2024-04-12 12:11:48 -07:00

... 6 7 8 9 10 ...

3107 Commits (c0ec22df2ca9bc1e0d645304e8e46cff84728c45) All Branches Search

3107 Commits (c0ec22df2ca9bc1e0d645304e8e46cff84728c45)

All Branches