torch-mlir

Commit Graph

Author	SHA1	Message	Date
Yuanqiang Liu	fab2696489	[Torch] support aten.trunc (#3219 ) decompose `trunc(x)` to `sign(x) * floor(abs(x))`	2024-04-24 14:32:33 +08:00
Yuanqiang Liu	8a1dbbd597	[torchscript] export extra library file name to user (#3203 ) * so that it could be specified by user.	2024-04-24 11:34:02 +08:00
Xinyu Yang	4da3d714cc	[Torch] Support AtenProdOp on linalg and stablehlo (#3215 )	2024-04-24 11:14:04 +08:00
Yuanqiang Liu	db3842f2e8	[Stablehlo] support lowering sinh & cosh to stablehlo (#3213 )	2024-04-23 19:54:58 +08:00
penguin_wwy	e5bdd71baf	[Torch] Emit and decompose prims.iota op (#3132 )	2024-04-21 19:45:01 -07:00
Xinyu Yang	790a697245	[Torch] Add folder for AtenIntOp, AtenFloatOp (#3189 ) See unit test below: ``` // CHECK-LABEL: func.func @torch.aten.tensor.float( // CHECK-NEXT: torch.vtensor.literal(dense<1.000000e+01> : tensor<f32>) : !torch.vtensor<[],f32> func.func @torch.aten.tensor.float() -> !torch.vtensor<[],f32> { %none = torch.constant.none %false = torch.constant.bool false %float1.000000e01 = torch.constant.float 1.000000e+01 %67 = torch.aten.tensor.float %float1.000000e01, %none, %none, %false : !torch.float, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],f32> return %67 : !torch.vtensor<[],f32> } // CHECK-LABEL: func.func @torch.aten.tensor.int( // CHECK-NEXT: torch.vtensor.literal(dense<45> : tensor<si32>) : !torch.vtensor<[],si32> func.func @torch.aten.tensor.int() -> !torch.vtensor<[],si32> { %none = torch.constant.none %false = torch.constant.bool false %int45 = torch.constant.int 45 %67 = torch.aten.tensor.int %int45, %none, %none, %false : !torch.int, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],si32> return %67 : !torch.vtensor<[],si32> } ```	2024-04-19 22:17:06 +08:00
Xinyu Yang	d4313eed4a	[Torch] Add decomposition of RepeatInterleaveSelfInt Op (#3075 ) Decomposition RepeatInterleaveSelfInt with following ops: ```python def my_repeat_interleave(input, repeats, dim=None): if dim is None: # Flatten the input and then repeat return input.flatten().unsqueeze(-1).tile((1, repeats)).flatten() else: # Calculate the shape after repeat expanded_shape = list(input.shape) expanded_shape[dim] = repeats # Repeat the tensor along the specified dimension repeat_shape = [1] (input.dim() + 1) repeat_shape[dim + 1] = repeats input = input.unsqueeze(-1) # Tile and then reshape tiled = torch.tile(input, repeat_shape) # Rearrange and reshape repeated = tiled.reshape(expanded_shape) return repeated ``` I passed the tests of stablehlo and linalg. When testing onnx, strange things happened. In torch-mlir's CI torch_nightly* and my own environment(torch==2.4.0.dev20240318+cpu), it can pass the pass. In torch-mlir's CI torch_stable, it failed. The test case is `RepeatInterleaveSelfIntNoDimModule_basic`, the result shape should be [120]. ```python class RepeatInterleaveSelfIntNoDimModule(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([3, 4, 5], torch.float32, True), ]) def forward(self, x): return x.repeat_interleave(2) @register_test_case(module_factory=lambda: RepeatInterleaveSelfIntNoDimModule()) def RepeatInterleaveSelfIntNoDimModule_basic(module, tu: TestUtils): module.forward(tu.rand(3, 4, 5)) ``` The error log is as follows: ``` Unexpected outcome summary: (onnx) ****** Failed tests - 1 tests FAIL - "RepeatInterleaveSelfIntNoDimModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([6, 4, 5])) is not equal to golden shape (torch.Size([120])) ``` @rsuderman Would you please help me check what's wrong with my PR? Thanks a lot.	2024-04-18 06:27:51 +08:00
Xinyu Yang	d2ba956e69	[Torch] Support Aten_CastLongOp. (#3160 ) By canonicalize Aten_CastLongOp into AtenToDtypeOp	2024-04-17 21:58:32 +08:00
IanWood1	5708ee7ec9	Added 2 Ops: Floor divide scalar and Floor divide scalar mode (#3156 ) - Added linalg lowering for `AtenFloorDivideScalarOp` - Needed `AtenDivScalarModeOp` for the decomp. - Added linalg lowering for `AtenDivScalarModeOp` - Moved linalg payload logic to `createDivModePayload()` since the logic was nearly identical for both `AtenDivScalarModeOp` and `AtenDivTensorModeOp`. Just a template function - Added `AtenDivScalarModeOp` lowering for stablehlo Pytorch's [`torch.floor_divide()`](https://pytorch.org/docs/stable/generated/torch.floor_divide.html) in a previous version (for a reason unknown to me) preformed a truncation instead of "floor". The already implemented op `AtenFloorDivideTensorOp` was done before this change. However, this wasn't caught because our testcases only tested positive floor division. I changed this to floor as well as adding a few test cases.	2024-04-15 13:45:10 -07:00
Xinyu Yang	5eb0cf9104	[Torch] Add decompose of AtenToPrimDeviceOp (#3131 ) As device information isn't relevant to torch-mlir	2024-04-10 22:26:48 +08:00
IanWood1	8ff28527cb	Add more descriptive error message to torch_ods_gen.py. (#3108 ) Added error message when adding new torch op to [torch_ods_gen.py](https://github.com/llvm/torch-mlir/compare/main...IanWood1:torch-mlir:ods_gen_error_message?expand=1#diff-889b60b904ed67a5065a14e8de6fc89e00e199577e4d2bfa134ac4d1c89832d2). New message displays which op key is failing and possible matches in the torch `Registry`. ```Op does not match any Torch ops in Registry Given op: "aten::hardtanh_wrong : (Tensor, Scalar) -> (Tensor)" Possible matches: "aten::hardshrink : (Tensor, Scalar) -> (Tensor)" "aten::hardtanh_ : (Tensor, Scalar, Scalar) -> (Tensor)" "aten::hardtanh : (Tensor, Scalar, Scalar) -> (Tensor)" "aten::clamp_min : (Tensor, Scalar) -> (Tensor)" "aten::linalg_cond : (Tensor, Scalar?) -> (Tensor)"``` Also, ran black formatting on file. Based on LLVM style guides this seems to be correct, but I can revert the formatting if needed.	2024-04-09 09:50:34 -07:00
Xinyu Yang	42a16fa912	[Torch] Support Aten_CastFloatOp. (#3115 ) By canonicalize Aten_CastFloatOp into AtenToDtypeOp	2024-04-09 11:06:53 +08:00
Xinyu Yang	84c24e5771	[Torch] Support Aten__And__ScalarOp (#3114 )	2024-04-08 20:24:17 +08:00
Yuanqiang Liu	2c56ef9252	[Torch Dialect] canonicalize aten.sign to aten.sgn (#3112 ) * `aten.sign` is a sub-set of `aten.sgn` (`aten.sgn` support complex type).	2024-04-08 20:05:42 +08:00
Rob Suderman	f97cd4893f	[torch] Improve shape inference for dynamic shapes (#3091 ) Shapes can be processed as tensors to represent the set of dimensions. As reshapes take a list of scalars this can result in a single dynamic dimension blocking the adjacent static dimensions. This pass attempts to de-couple tensor computations related to shapes and propagate values to better support lowering scalar tensor computations.	2024-04-02 16:19:57 -07:00
Stella Laurenzo	6d680ff445	[ods] Allow all tensor returns to be optional. (#3082 ) This was found while tracing backwards graphs: the convolution_backwards op will return None if the first result is not needed. Confirmed by defining a custom op with a `Tensor` return signature and having its meta kernel return None.	2024-03-29 23:09:34 -07:00
Yuanqiang Liu	0a581a97a7	[Torch Dialect] enhance aten.int.tensor's canonicalize (#3058 ) support fold with literal vtensor. change it to canonicalize because this pattern will create new op.	2024-03-27 09:51:58 +08:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 ) Reshaping tensors depend on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and support cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
schnkmwt	1fcbfa87ec	Implement linalg lowering of diag_embed torch op (#2885 ) This PR adds lowering of diag_embed to linalg dilect. Tracked in https://github.com/nod-ai/SHARK-Turbine/issues/288 --------- Co-authored-by: sachink <sachink@xilinx.com>	2024-03-22 16:32:50 -07:00
zjgarvey	99b3a5f117	Converts all Adaptive Pooling Ops to Linalg (#2808 ) The previous conversions for AtenAdaptiveAvgPool1dOp and AtenAdaptiveMaxPool2dOp are refactored into a general templated conversion that works for all of the AtenAdaptive...PoolNdOp's. New support is added for the following ops: 1. AtenAdaptiveMaxPool1d 2. AtenAdaptiveMaxPool3d 3. AtenAdaptiveAvgPool3d Support is also provided for passing inputs without batch dimensions. For example, applying adaptive_avg_pool2d to an input tensor of rank 3. After [pytorch #118162](https://github.com/pytorch/pytorch/pull/118162) gets down to torch-mlir, I'll add a test for AdaptiveMaxPool1d with return_indices (which will pass with that upstream fix). --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-03-22 11:05:20 -07:00
penguin_wwy	7616d637fd	Add stateless fx graph import (#3036 )	2024-03-21 14:44:54 -07:00
penguin_wwy	f34c187ac4	Normalize type hints to be compatible with multiple Python versions (#3028 ) Although we provide a wheel package for Python 3.8, it may actually throw the following exception: `TypeError: 'type' object is not subscriptable`	2024-03-15 08:29:48 -07:00
Yuanqiang Liu	4282eb9e76	[Torch Dialect] support aten.fake_quantize_per_tensor_affine (#3014 )	2024-03-15 08:53:29 +08:00
Yuanqiang Liu	870e63bc3c	[Torch Dialect] support decomposition of aten.linspace (#3006 )	2024-03-14 08:28:33 +08:00
Yuanqiang Liu	43c6996a31	[Torch Dialect] add folder for aten.ceil and unify patterns of ceil, … (#3010 ) …floor, round	2024-03-14 07:41:58 +08:00
ptrifunovic98	524ff99216	Implement lowering of torch.aten.linalg_cross (#2986 ) Closes [nod-ai/SHARK-Turbine#497](https://github.com/nod-ai/SHARK-Turbine/issues/497)	2024-03-13 12:17:22 -07:00
Yuanqiang Liu	229ca3a9e1	[Torch Dialect] emit aten::mul and add folder (#3007 )	2024-03-11 19:59:34 +08:00
Yuanqiang Liu	a3fe130f73	[Torch Dialect] emit aten::warn (#3003 ) * torch-mlir may not handle `aten.warn`. But it could be handled by custom users' backend which involves torch-mlir.	2024-03-10 08:29:08 +08:00
Rob Suderman	0723584936	[torch] Add folder for torch.aten.*.Scalar comparisons (#3000 ) This folds small version of the tensor-scalar comparison operators as they are commonly used for shape computations. This includes le, lt, ge, gt, eq, and ne.	2024-03-08 13:44:00 -08:00
Ze Zhang	aa7c9a9653	e2e support aten.linalg_norm to aten.linalg_vector_norm (#2953 ) Add e2d support for `aten.linalg_norm` by decompose it to `aten.linalg_vector_norm`. Lowering to `aten.linalg_matrix_norm` is still unsupported. To Test: `python -m e2e_testing.main -v` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-03-05 16:31:01 -08:00
Rob Suderman	bc0527676b	[torch] Add support for `torch.split_with_sizes` via decompose (#2979 ) Convert to individiual slices and tuple together as a list. --------- Co-authored-by: Scott Todd <scott.todd0@gmail.com>	2024-03-05 15:01:21 -08:00
Rob Suderman	19d4888278	[torch] Make torch.aten.unflatten lower directly to linalg (#2971 ) Existing lowering via aten.view does not work as well for dynamic shapes as the lowering to tensor.expand must re-infer dynamic shape matching. Better to directly lower.	2024-03-04 10:17:42 -08:00
Rob Suderman	61f0a5facf	[torch] Add an `aten.cat` length-0 canonicalization (#2966 ) If an input is length-0 along the dimension of canonicalization we can remove the tensor from the list	2024-03-01 21:41:12 -08:00
mmakevic	76b81e0ccd	Implement lowering of torch.aten.fmod.Tensor (#2767 ) Closing https://github.com/nod-ai/SHARK-Turbine/issues/351	2024-02-29 11:22:03 +05:30
Rob Suderman	6f3d62ab04	[torch] Fix folders and `cat` and `view` torch lowerings (#2963 ) A bunch of small fixes are interlinked and trigger crashes if not addressed as a group. This includes: - aten view when expand from a rank-0 tensor - slice folder with negative indices - `aten._shape_as_tensor` folder on a rank-0 tensor - `aten.cat` of a tensor with a length-0 tensor	2024-02-28 12:04:52 -08:00
Vivek Khandelwal	d81747eadb	[MLIR][TORCH] Extend support for OnnxToLinalg lowering for Dropout and Div op (#2938 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/451, https://github.com/nod-ai/SHARK-Turbine/issues/452	2024-02-27 11:02:05 +05:30
ptrifunovic98	c5a1da1910	Implement lowering of torch.aten.norm.Scalar (#2899 ) Closes [nod-ai/SHARK-Turbine#365](https://github.com/nod-ai/SHARK-Turbine/issues/365)	2024-02-26 08:46:56 -08:00
Andreas Falkenberg	55dc8deb92	[torch] GridSample TorchToLinalg lowering (#2883 ) Lowers `torch.grid_sample` to the equilvalent `linalg` representation.	2024-02-23 09:14:38 -08:00
Aart Bik	534b266f2d	[torch-mlir][NFC] remove trailing whitespace (#2936 )	2024-02-20 11:23:14 -08:00
Rob Suderman	135c81a416	[torch] Add folder for `prim.NumToTensor.Scalar` (#2921 ) Useful for `slice` lowerings that depend on tensors made form scalars.	2024-02-19 11:55:54 -08:00
Rob Suderman	e80054a3cc	[torch] Folders for `torch.aten.*.tensor` operators [add, sub, mul] (#2878 ) Simple folder for limited size aten tensor operations. This is primarily useful for shape computation folding as they unfortunately can use `aten` operators. Add, sub, mul are common examples of these folders.	2024-02-19 10:28:23 -08:00
aldesilv	d29157b33f	OnnxToTorch support for onnx.InstanceNormalization op (#2710 ) https://github.com/nod-ai/SHARK-Turbine/issues/327	2024-02-19 19:53:48 +05:30
Vivek Khandelwal	d6d1a173dc	[MLIR][Torch] Add OnnxToTorch and TorchToLinalg support for trig ops (#2903 ) This commit adds the OnnxToTorch lowering for cosh, acosh, asin, asinh, and atanh op. This commit also adds the TorchToLinalg lowering for acosh, asin, asinh, and atanh op. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-14 11:58:09 +05:30
Rob Suderman	c0f139be0f	[torch] Add `torch.aten.eq.Tensor` comparison folder (#2889 ) Added a folded for a equals operator. This allows an equivalent comparison folder, primarily for when shape computations occur small size tensor.	2024-02-09 15:02:20 -08:00
Rob Suderman	7d33ba69ac	[torch] Folder for torch.aten.select.int for splat cases (#2890 ) If the input or result is a splat value we can just constant fold the result. This is common for shape computations and can help with shape inference.	2024-02-09 14:02:54 -08:00
Franz Haniel	4cc62aeb24	Implement trace (#2790 ) The lowering decomposes AtenTraceOp into an AtenDiagonalOp followed by AtenSumOp. The progress is tracked in https://github.com/nod-ai/SHARK-Turbine/issues/333. --------- Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2024-02-09 08:00:24 -08:00
Rob Suderman	a8aad2a5ab	[torch] Add `torch.aten.where.*` folders (#2886 ) Where operation can be statically computed when involving splats of known value. Added handling these cases with multiple tests.	2024-02-07 19:43:31 -05:00
Dave Liddell	23647ab2d1	[torhc] aten.index_select folder (#2871 ) Folds aten::index_select ops under the following conditions: 1. If the input and output are the same shape, the indexing operation is a NOP, so just return the input. 2. If the input has shape <1x1x...xNx...x1> (all 1's except for one dim), and the output shape is <1x1x...x1> (all 1's), then there is a single index, so extract the single element value and return a tensor with that value. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-02-07 16:17:15 -08:00
mmakevic	32dbf99ce2	Implement lowering of torch.aten.all.dim (#2873 ) Lowering of torch.aten.all.dim to linalg. Per PyTorch documentation: > This function matches the behaviour of NumPy in returning output of dtype bool for all supported dtypes except uint8. For uint8 the dtype of output is uint8 itself. Since there is no support for ui8 in torch-mlir currently (https://github.com/llvm/torch-mlir/pull/1384#issuecomment-1260011334) implementation returns failure for that case.	2024-02-07 12:34:52 -08:00
saienduri	bfcf93ea21	Rename torch_mlir.compile APIs and introduce FX based analogs (#2842 ) Link to related RFC: https://discourse.llvm.org/t/rfc-rename-torch-mlir-compile-apis-and-introduce-fx-based-analogs/76646 This commit updates the documentation, tests, CMake files, and API for the proposed changes in the RFC. There is a new torch_mlir/fx.py for user level APIs related to importing modules and a corresponding test for this path can be found at test/python/fx_importer/basic_test.py. --------- Co-authored-by: MaheshRavishankar <mravisha@amd.com>	2024-02-06 19:07:59 -08:00
Xida Ren (Cedar)	cc06391630	AtenSortOp Folder (#2864 ) A chunk off https://github.com/llvm/torch-mlir/pull/2856 https://github.com/llvm/torch-mlir/pull/2860 --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com> Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-02-06 21:12:12 +00:00
Dave Liddell	1cb14f6879	Rob's atenTensor folder (#2867 ) If a tensor is initialized by a list with a single constant integer, this folder turns it into a torch.vtensor.literal --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-02-05 17:10:42 -08:00
Xida Ren (Cedar)	24b8c8672a	[torch] Add folders for `torch.fill`, `torch.ones`, `torch.zeros` and `aten.getItem` (#2849 ) So that the CumSum Op in OPT can get the constant that it requires to be lowered to TMTensor --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-02 10:46:33 -08:00
Ilija Kalinić	54ef18c556	Implement lowering of torch.aten.lerp.Scalar (#2773 ) Closes nod-ai/SHARK-Turbine#356	2024-01-31 09:39:38 -08:00
Yuanqiang Liu	d778950f45	[Torch Dialect] add fold pattern for aten.clone (#2804 )	2024-01-31 09:43:21 +08:00
Quinn Dawkins	494089d53d	Clang format refresh (#2812 ) After noticing a number of commits with unrelated formatting changes, I think something was changed with clang-format at one point and we're seeing a number of unrelated changes. Doing a refresh can help avoid this. The changes made here came from ``` find lib -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find include -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find projects -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm ```	2024-01-29 12:59:33 -05:00
Rob Suderman	2ef228328f	[torch] `torch.dequantize` for per channel tensors to` linalg` (#2769 ) Support a lowering for dequantization for per channel tensors from `torch` dialect to a linalg decomposition. Tested via a numerical `torch` test.	2024-01-25 16:40:21 -08:00
zjgarvey	c531f5495b	AtenAdaptiveMaxPool2d Conversion to Linalg (#2779 ) The logic here is very similar to the conversion for AdaptiveAvgPool1d #2661 with a few modifications: 1. buffVal = -inf instead of 0 2. the main linalg generic op accumulates a max, instead of a sum, to the first output tensor 3. avg pooling requires dividing the sum pool by the kernel width, which we stored as an auxilliary tensor (kSizeTensor). Here, the auxiliary tensor will be recording the indices. Strangely enough, the only signature available for this function is to return indices, and it appears that they must be computed whether the user desires them or not. See [pytorch/torch/nn/functional.py](https://github.com/pytorch/pytorch/blob/main/torch/nn/functional.py#L1174). Before writing other adaptive pooling conversions, the logic of this decomposition should be rolled into a helper function that will work for both max and avg pooling ops. Even the auxiliary tensor should likely be automated. This code was written in a slightly more tedious way than strictly necessary (often using loops to fill SmallVectors up to rank-2, which is only two in this case), in order to more easily facilitate the transition to a helper function.	2024-01-24 09:09:56 -08:00
Xida Ren (Cedar)	ccaac85788	implement aten.conv1d, aten.conv3d, and aten.conv_tbc (#2757 ) convolution with [time,batch,channel] ordering, as opposed to the default [batch, channel, time]. Currently implementing by transposing the input and output, but may need to get its own implementation in the future because this is supposed to be an op that gives a speedup. This is used by fairseq (https://github.com/facebookresearch/fairseq/issues/172). (in case you were wondering like me, this is different from transposed convolution. Transposed convolution has fractional strides). --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com> Co-authored-by: Frederik Harwath <frederik.harwath@amd.com>	2024-01-23 21:30:03 -08:00
Franz Haniel	b9806cfa38	[TorchToLinalg] Add lowering for torch.aten.diagonal (#2632 )	2024-01-22 12:47:13 -05:00
John Wu	704cfdaf08	Add aten.pool_max3d support to torch-to-linalg (#2735 ) Added verification logic to the abstract_interpreter_lib_gen.py Also made some unit tests Initially, I thought we can use `linalg::pooling_ndhwc_max` to help implement this problem. However, on a 5-dimensional matrix it does the pooling on dimensions (2, 3, 4) which is not what we want. We want pooling on dimensions (3, 4, 5). To achieve this, we would need to lower our code using the `linalg` dialect. Turns out the pooling code in `linalg` looks like this. ``` func @max_pooling_ncdhw(%I: memref<?x?x?x?x?xf32>, %K: memref<3xindex>, %O: memref<?x?x?x?x?xf32>, %strides: memref<3xindex>, %dilations: memref<3xindex>) { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %N = memref.dim %I, %c0 : memref<?x?x?x?x?xf32> %C = memref.dim %I, %c1 : memref<?x?x?x?x?xf32> %D = memref.dim %I, 2 : memref<?x?x?x?x?xf32> %H = memref.dim %I, 3 : memref<?x?x?x?x?xf32> %W = memref.dim %I, 4 : memref<?x?x?x?x?xf32> %kernel_d = memref.load %K[%c0] : memref<3xindex> %kernel_h = memref.load %K[%c1] : memref<3xindex> %kernel_w = memref.load %K[2] : memref<3xindex> %stride_d = memref.load %strides[%c0] : memref<3xindex> %stride_h = memref.load %strides[%c1] : memref<3xindex> %stride_w = memref.load %strides[2] : memref<3xindex> %dilation_d = memref.load %dilations[%c0] : memref<3xindex> %dilation_h = memref.load %dilations[%c1] : memref<3xindex> %dilation_w = memref.load %dilations[2] : memref<3xindex> linalg.generic { indexing_maps = [ affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d * %stride_d + kd * %dilation_d, h * %stride_h + kh * %dilation_h, w * %stride_w + kw * %dilation_w)>, // Map for input tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (kd, kh, kw)>, // Map for kernel tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d, h, w)> // Map for output tensor ], iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"], doc = "3D Max Pooling NCDHW with Strides, Dilations, and Kernel Size" } ins(%I, %K : memref<?x?x?x?x?xf32>, memref<3xindex>) outs(%O : memref<?x?x?x?x?xf32>) { ^bb0(%input_elem: f32, %kernel_elem: index, %output_elem: f32): %max_val = arith.maxf %input_elem, %output_elem : f32 linalg.yield %max_val : f32 } return } ``` This was implemented based on it's source code with the adjustments mentioned above: `4ca1b5e094/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml (L5647)` Issues related to this can be found here https://github.com/nod-ai/SHARK-Turbine/issues/324	2024-01-19 21:09:46 +05:30
Ilija Kalinić	faa4517e83	Implement lowering of torch.aten.remainder.Tensor (#2763 ) Closes nod-ai/SHARK-Turbine#349	2024-01-19 18:09:08 +05:30
Sungsoon Cho	a8538e1e3f	Decompose AtenNormalFunctionalOp into AtenRandn* and other arithmetic. (#2737 )	2024-01-15 22:49:29 -08:00
lonely eagle	f85e5c932b	[Torch Dialect] support aten.isneginf, aten.isposinf, aten.nan_to_num (#2743 )	2024-01-16 14:29:34 +08:00
James Newling	f78ec78ac8	Adjust bound check to be the same as PyTorch native (i.e. stricter) (#2755 ) prims.expand expects the start and end dimensions to be strictly less than the rank of the tensor.	2024-01-15 11:44:45 -08:00
lisaliu1	09421b1cf3	[TorchToLinalg] Add lowering for aten.replication_pad2d (#2715 ) Co-authored-by: Lisa Liu <lingl@xilinx.com>	2024-01-15 14:02:27 -05:00
Rob Suderman	dc37616d67	[torch][quant] Support quantize and dequantize for torch (#2731 ) Handle both `torch.dequantize` and `torch.quantize_per_tensor` including the op based quantization parameter tracking. This includes adding `qint32` to torch types as it was missing during the initial type inclusion. For testing we only have `torch.int8` and `torch.float` types on function boundaries as the `qint8` types require passing the scale and zero point quantization information which is not supported yet.	2024-01-12 19:11:14 -08:00
Ilija Kalinić	e1a86e480a	Implement lowering of torch.aten.logit (#2697 ) Closes nod-ai/SHARK-Turbine#290	2024-01-11 20:25:42 +05:30
Frederik Harwath	0860c41ee2	Implement aten.reflection_pad2d lowering to linalg	2024-01-10 21:32:22 -10:00
zjgarvey	07d0645f64	[RFC] general support for Adaptive Pooling Ops (#2661 ) Adaptive pooling ops can only be decomposed into their non-adaptive counterparts in trivial cases. For example, the current decomposition for AtenAdaptiveAvgPool1dOp in DecomposeComplexOps.cpp supports outSize = inSize (i.e., do literally nothing), and outSize = 1 (i.e., do a batched average). The reason adaptive pooling ops are difficult to lower to linalg is that they are not constantly strided. They are computed by taking an input tensor of shape (N, C, Hin), and an output size Hout, and computing the output tensor at position (n,c, h) in the following way: 1. compute st(h) = (hHin)//Hout 2. compute en(h) = 1 + ((h+1)Hin -1)//Hout 3. apply a computation (max or avg) to the slice: INPUT[n, c, st(h):en(h)] The provided sample implementation (for ConvertAtenAdaptiveAvgPool1dOp) uses tensor.extract to access the input tensor inside the payload of a linalg generic op. This is likely an unattractive use of linalg generic ops, which is why I am asking for some more targeted feedback on the validity of this approach before attempting to support the many other adaptive pooling ops. Specifically: - Is the performance of this implementation bad enough to warrant targeting different dialects entirely? e.g. TMtensor/linalg ext/ etc. - If the provided implementation is of acceptable performance to the community, then is it permissable to remove the Adaptive pooling decompositions from DecomposeComplexOps.cpp? Based on the current structure of the -torch-decompose-complex-ops pass, it does not seem possible to only decompose the adaptive ops in special cases (it seems to get stuck in an infinite loop on a match failure). I would be happy to instead incorporate the case logic into the conversion directly, and remove the decompositions once they are rendered completely obsolete. As long as this approach is acceptable, I can clean up the implementation with some helper functions, and quickly add support for each of the remaining Adaptive pooling ops.	2024-01-09 11:14:10 -08:00
kumardeepakamd	9adad9bc40	Add support for reflection_pad1d (#2706 ) Adds a lowering to Linalg for reflection_pad1d. Based on ideas/code from draft PR https://github.com/llvm/torch-mlir/pull/2693. --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-02 14:05:11 -05:00
Sungsoon Cho	8e389ff2ff	Implement lowering of torch.aten.exponential (#2680 ) https://github.com/llvm/torch-mlir/issues/2646 Decompose aten.exponential() into: -exp(1-x)/lambda	2023-12-27 20:33:18 -08:00
Rik Huijzer	8328998172	Allow printing all IR in `torch_mlir.compile` (#2669 ) This PR adds the `enable_ir_printing` option to `torch_mlir.compile`, which can be used to print the IR for all intermediate passes. When running the added test file via: ```shell $ python test/python/compile.py 2> tiny.stderr ``` the file `tiny.stderr` is about 700 KB.	2023-12-20 15:08:21 -06:00
Rob Suderman	11cc92d4ab	[onnx] Lowerings from `onnx.tan` (#2642 ) Started work on the `tan` lowerings for ONNX to Torch. Uses `sin` and `cos` to represent a `tan`.	2023-12-20 10:09:39 -08:00
Rob Suderman	61888690bb	[onnx] Add support for `onnx.sinh` (#2643 ) Adds a lowering from `onnx.sinh` to `aten.sinh`. This includes adding the `aten.sinh` operator.	2023-12-15 21:23:51 -08:00
Sungsoon Cho	55e9401c5c	Implement lowering of aten.cosh op. (#2635 )	2023-12-15 11:19:26 -08:00
JianzheXiao	6ddeb1a6ef	[torch] Add support for aten.selu (#2640 ) Add `aten.selu` operation to `torch` dialect.	2023-12-13 20:28:08 -08:00
JianzheXiao	7cf52ae73f	[Torch Dialect]Add Support for AtenGroupNormOp and AtenNativeGroupNormOp (#2591 ) Co-authored-by: LiuYuanqiang <liuyuanqiang.yqliu@bytedance.com>	2023-12-13 11:05:12 +08:00
Stella Laurenzo	74f7a0c9d6	Upstream the ONNX importer. (#2636 ) This is part 1 of 2, which will also include upstreaming the FX importer. I started with ONNX because it forces some project layout updates and is more self contained/easier as a first step. Deviating somewhat from the RFCs on project layout, I made the following decisions: * Locating the `onnx_importer.py` into `torch_mlir.extras` as Maks already has opened up that namespace and it seemed to fit. Better to have fewer things at that level. * Setup the build so that the root project only contains MLIR Python and pure Python deps (like the importers), but this can be augmented with the `projects/` adding more depending on which features are enabled. * The default build continues to build everything whereas in `TORCH_MLIR_ENABLE_ONLY_MLIR_PYTHON_BINDINGS=1` mode, it builds a `torch-mlir-core` wheel with the pure contents only. `onnx_importer.py` and `importer_smoke_test.py` are almost verbatim copies from SHARK-Turbine. I made some minor local alterations to adapt to paths and generalize the way they interact with the outer project. I expect I can copy these back to Turbine verbatim from here. I also updated the license boilerplate (they have the same license but slightly different project norms for the headers) but retained the correct copyright. Other updates: * Added the ONNX importer unit test (which also can generate test data) in lit, conditioned on the availability of the Python `onnx` package. In a followup once I know everything is stable, I'll add another env var that the CI can set to always enable this so we know conclusively if tests pass. * Moved the ONNX conversion readme to `docs/`. * Renamed CMake option `TORCH_MLIR_ENABLE_ONLY_MLIR_PYTHON_BINDINGS` -> `TORCH_MLIR_ENABLE_PYTORCH_EXTENSIONS` and inverted the sense. Made the JitIR importer and LTC options `cmake_dependent_options` for robustness.	2023-12-12 19:02:51 -08:00
Frederik Harwath	b656c674ee	Implement e2e support for aten.acos op This depends on a change in the LLVM core repository which adds acos support to the MLIR Math dialect.	2023-12-12 10:52:02 +01:00
Sambhav Jain	7acabafd84	Remove folder from `AtenStackOp` for single element list inputs (#2626 ) `AtenStackOp` defines this folder for list operand containing single element: ``` OpFoldResult AtenStackOp::fold(FoldAdaptor adaptor) { auto list = getOperand(0).getDefiningOp<PrimListConstructOp>(); if (!list \|\| !list->hasOneUse() \|\| list.getElements().size() != 1) return nullptr; return list.getElements()[0]; } ``` However, unlike `AtenCatOp`, `AtenStackOp` cannot be folded away for single element list operand because the result from a stack operation contains an additional dimension (of size 1, like expand_shape). This PR removes the `AtenStackOp::fold` method, and adds an e2e test for single element list input case, which fails on current `main` as follows: ``` Unexpected outcome summary: (linalg) ****** Failed tests - 1 tests FAIL - "TensorsStackSingleElementListModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([10, 32])) is not equal to golden shape (torch.Size([10, 1, 32])) ``` Thanks Chris Lalau Keraly for the bug report.	2023-12-11 10:52:50 -08:00
Vivek Khandelwal	0b4422a253	[MLIR][ONNX] Add OnnxToTorch support for bitwise and math ops This commit adds the OnnxToTorch support for BitwiseXor, BitwiseOr, Div, Equal, Cast, Ceil, Floor, Cos, and Clip op. This commit also adds the TorchToLinalg support for aten.clamp.Tensor and aten.clamp_min.Tensor op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-11 19:36:01 +05:30
JianzheXiao	96fcde4d77	[Torch Dialect] Support Einsum Op (#2230 ) As title, support torch.aten.einsum op Right now only support Static Shape, because of the known issue, the fixed solution is here: https://github.com/llvm/torch-mlir/pull/2154 Co-authored-by: Jiawei Wu [wujiawei.aml@bytedance.com](mailto:wujiawei.aml@bytedance.com)	2023-12-10 12:30:37 +08:00
frafranz	c0115706a0	Add a decomposition for torch.aten.argmin (#2613 ) Adds a lowering for the torch.aten.argmin operator to linalg via decomposition into torch.aten.min.dim. --------- Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2023-12-06 09:45:30 -05:00
Frederik Harwath	6248216dca	Add aten.min.dim to linalg lowering (#2600 )	2023-12-05 07:16:35 -08:00
Frederik Harwath	d0b49a912e	Recommend update_torch_ods.sh for re-generating GeneratedTorchOps.td (#2609 ) Fix #2608	2023-12-05 05:26:05 -08:00
Vivek Khandelwal	dc9ea08db5	[MLIR][ONNX] Add OnnxToTorch support for atan and bitwise ops This commit adds the OnnxToTorch support for Atan, Bitshift, BitwiseAnd, and BitwiseNot op. This commit also adds the TorchToLinalg support for AtenBitwiseLeftShiftTensorOp. Signed-Off By: vivekkhandelwal@nod-labs.com	2023-11-28 17:19:07 +05:30
James Newling	03e8f99730	Lowering to linalg of prims split_dim op (#2576 ) Adds support for lowering to prims split_op. Similar design to collapse op lowering in https://github.com/llvm/torch-mlir/pull/2572, with some small differences, because the split_dim op (in pytorch) is view-changing whereas the collapse is not. The difference means that 1) it must be registered in the function Torch::isViewLikeOp 2) it must be be added to the "expected fail" set for the torch dynamo backend.	2023-11-21 07:56:09 -08:00
Zhekun(Josh) Zhang	d67afa9e95	[Torch] Add fold rule for AtenMaskedFillTensorOp to AtenMaskedFillScalarOp (#2543 )	2023-11-21 13:26:17 +08:00
Stella Laurenzo	5eae0adff1	Breakup python pytorch deps (#2582 ) This lifts the core of the jit_ir_importer and ltc out of the pt1 project, making them peers to it. As a side-effect of this layering, now the "MLIR bits" (dialects, etc) are not commingled with the various parts of the pt1 project, allowing pt1 and ltc to overlay cleanly onto a more fundamental "just MLIR" Python core. Prior to this, the Python namespace was polluted to the point that this could not happen. That "just MLIR" Python core will be introduced in a followup, which will create the space to upstream the FX and ONNX pure Python importers. This primary non-NFC change to the API is: * `torch_mlir.dialects.torch.importer.jit_ir` -> `torch_mlir.jit_ir_importer`. The rest is source code layering so that we can make the pt1 project optional without losing the other features. Progress on #2546.	2023-11-19 12:10:19 -08:00
James Newling	dad1f012f6	Add verification for torch permute op (#2551 ) - adds support for an optional verifier to the generated torch op tablegen (GeneratedTorchOps.td) - uses the above to add a verifier for the torch permute op. Motivation: I hit an unclear error from linalg while developing a decomposition pass for pixel_shuffle. The error would have been clearer if the problem had been detected earlier in the invalid aten.permute op. Testing: new tests added. To run added tests, from the base directory run ``` ./build/bin/llvm-lit test/Dialect/Torch/invalid.mlir ```	2023-11-15 11:47:54 -08:00
James Newling	e81282ae8f	Support for prims collapse op (lowering to linalg) (#2572 ) Steps taken: 1) add generator code to torch_ods_gen.py, run update_torch_ods.sh 2) add (custom) shape and type inference generator code to abstract_interp_lib_gen.py, run update_abstract_interp_lib.sh 3) Implement lowering to tensor.collapse_dims. Requires the `start` and `end` values to be constant, else lowering fails 4) Update xfail_sets.py (append to LTC_XFAIL_SET) after running /tools/e2e_test.sh --filter Collapse --verbose -c XX for all support backends (XX). Motivation: - Supporting the collapse operation will be useful for lowering of pixel_shuffle (see Issue #2559)	2023-11-15 08:34:38 -08:00
Shehroze Khan	dde66e66b0	add bool scalar type to int implicit cast (#2571 ) [LTC] Add bool scalar type to int implicit cast	2023-11-14 08:56:12 -05:00
Yuanqiang Liu	3ab790c50a	[Torch Dialect] add canonicalize for aten.numel (#2562 )	2023-11-11 12:16:53 +08:00
Yuanqiang Liu	60effcee89	[Dtype Function] fix aten.div.Tensor_mode's dtype function (#2555 )	2023-11-09 09:46:53 +08:00
James Newling	b6e551c7b8	Decomposition of aten.pixel_shuffle with static input shape (#2550 ) For static tests (that is when the shape is know) for example: ``` @annotate_args([None, ([3, 18, 2, 2], torch.float32, True)]) ``` The e2e passes. But only if the replacement op's return type is set as undefined (optional shape and type must be explicitly made unset), otherwise there's a error about the function return type. For dynamic cases, for example if the above is replaced with ``` @annotate_args([None, ([-1, -1, -1, -1], torch.float32, True)]) ``` There is a failure to lower to linalg from torch ("view op explicitly labelled as illegal"). This seems to be because the support for lowering from torch to linalg with dynamic shapes is limited.	2023-11-08 08:52:44 -05:00
JianzheXiao	a42d4c18ff	[Torch Dialect]Support aten.cosine_similarity (#2364 ) As title, add support for aten.cosine_similarity, support broadcast inputA/inputB to the same shape	2023-11-08 15:28:30 +08:00
Jiawei Wu	d5ee8ee73a	[Torch Dialect] emit aten.reshape_as op and add decomposition pattern. (#2553 )	2023-11-05 11:38:36 +08:00
Yuanqiang Liu	0378da0abd	[Torch Dialect] support aten.isinf (#2544 ) Also fix linalg lowering from `UEQ` to `OEQ`. I will check other comparison's lowering later.	2023-11-04 22:26:01 +08:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00

1 2 3

150 Commits (45c85c3b34629ee75d6b3a6fd9447894ce7a8ce3)