torch-mlir

Commit Graph

Author	SHA1	Message	Date
zjgarvey	c531f5495b	AtenAdaptiveMaxPool2d Conversion to Linalg (#2779 ) The logic here is very similar to the conversion for AdaptiveAvgPool1d #2661 with a few modifications: 1. buffVal = -inf instead of 0 2. the main linalg generic op accumulates a max, instead of a sum, to the first output tensor 3. avg pooling requires dividing the sum pool by the kernel width, which we stored as an auxilliary tensor (kSizeTensor). Here, the auxiliary tensor will be recording the indices. Strangely enough, the only signature available for this function is to return indices, and it appears that they must be computed whether the user desires them or not. See [pytorch/torch/nn/functional.py](https://github.com/pytorch/pytorch/blob/main/torch/nn/functional.py#L1174). Before writing other adaptive pooling conversions, the logic of this decomposition should be rolled into a helper function that will work for both max and avg pooling ops. Even the auxiliary tensor should likely be automated. This code was written in a slightly more tedious way than strictly necessary (often using loops to fill SmallVectors up to rank-2, which is only two in this case), in order to more easily facilitate the transition to a helper function.	2024-01-24 09:09:56 -08:00
Xida Ren (Cedar)	ccaac85788	implement aten.conv1d, aten.conv3d, and aten.conv_tbc (#2757 ) convolution with [time,batch,channel] ordering, as opposed to the default [batch, channel, time]. Currently implementing by transposing the input and output, but may need to get its own implementation in the future because this is supposed to be an op that gives a speedup. This is used by fairseq (https://github.com/facebookresearch/fairseq/issues/172). (in case you were wondering like me, this is different from transposed convolution. Transposed convolution has fractional strides). --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com> Co-authored-by: Frederik Harwath <frederik.harwath@amd.com>	2024-01-23 21:30:03 -08:00
Ramiro Leal-Cavazos	5883ef0f21	Fix unused variable warnings (#2775 )	2024-01-22 11:05:55 -08:00
Franz Haniel	b9806cfa38	[TorchToLinalg] Add lowering for torch.aten.diagonal (#2632 )	2024-01-22 12:47:13 -05:00
James Newling	50ac3b1912	g++ build fix (#2778 ) Introduced in `704cfdaf08` of @wu-s-john g++ compiler error: Pooling.cpp:177:13: error: explicit specialization in non-namespace scope ‘class Design looks good, g++ is just freaking out for no good reason. Un-nesting the template classes fixes the error. We don't have g++ CI. This hopefully happens infrequently enough that we can just fix manually. My service to those folks who really like building with g++... :)	2024-01-19 19:12:29 -08:00
John Wu	704cfdaf08	Add aten.pool_max3d support to torch-to-linalg (#2735 ) Added verification logic to the abstract_interpreter_lib_gen.py Also made some unit tests Initially, I thought we can use `linalg::pooling_ndhwc_max` to help implement this problem. However, on a 5-dimensional matrix it does the pooling on dimensions (2, 3, 4) which is not what we want. We want pooling on dimensions (3, 4, 5). To achieve this, we would need to lower our code using the `linalg` dialect. Turns out the pooling code in `linalg` looks like this. ``` func @max_pooling_ncdhw(%I: memref<?x?x?x?x?xf32>, %K: memref<3xindex>, %O: memref<?x?x?x?x?xf32>, %strides: memref<3xindex>, %dilations: memref<3xindex>) { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %N = memref.dim %I, %c0 : memref<?x?x?x?x?xf32> %C = memref.dim %I, %c1 : memref<?x?x?x?x?xf32> %D = memref.dim %I, 2 : memref<?x?x?x?x?xf32> %H = memref.dim %I, 3 : memref<?x?x?x?x?xf32> %W = memref.dim %I, 4 : memref<?x?x?x?x?xf32> %kernel_d = memref.load %K[%c0] : memref<3xindex> %kernel_h = memref.load %K[%c1] : memref<3xindex> %kernel_w = memref.load %K[2] : memref<3xindex> %stride_d = memref.load %strides[%c0] : memref<3xindex> %stride_h = memref.load %strides[%c1] : memref<3xindex> %stride_w = memref.load %strides[2] : memref<3xindex> %dilation_d = memref.load %dilations[%c0] : memref<3xindex> %dilation_h = memref.load %dilations[%c1] : memref<3xindex> %dilation_w = memref.load %dilations[2] : memref<3xindex> linalg.generic { indexing_maps = [ affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d * %stride_d + kd * %dilation_d, h * %stride_h + kh * %dilation_h, w * %stride_w + kw * %dilation_w)>, // Map for input tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (kd, kh, kw)>, // Map for kernel tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d, h, w)> // Map for output tensor ], iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"], doc = "3D Max Pooling NCDHW with Strides, Dilations, and Kernel Size" } ins(%I, %K : memref<?x?x?x?x?xf32>, memref<3xindex>) outs(%O : memref<?x?x?x?x?xf32>) { ^bb0(%input_elem: f32, %kernel_elem: index, %output_elem: f32): %max_val = arith.maxf %input_elem, %output_elem : f32 linalg.yield %max_val : f32 } return } ``` This was implemented based on it's source code with the adjustments mentioned above: `4ca1b5e094/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml (L5647)` Issues related to this can be found here https://github.com/nod-ai/SHARK-Turbine/issues/324	2024-01-19 21:09:46 +05:30
Ilija Kalinić	faa4517e83	Implement lowering of torch.aten.remainder.Tensor (#2763 ) Closes nod-ai/SHARK-Turbine#349	2024-01-19 18:09:08 +05:30
lisaliu1	09421b1cf3	[TorchToLinalg] Add lowering for aten.replication_pad2d (#2715 ) Co-authored-by: Lisa Liu <lingl@xilinx.com>	2024-01-15 14:02:27 -05:00
Rob Suderman	dc37616d67	[torch][quant] Support quantize and dequantize for torch (#2731 ) Handle both `torch.dequantize` and `torch.quantize_per_tensor` including the op based quantization parameter tracking. This includes adding `qint32` to torch types as it was missing during the initial type inclusion. For testing we only have `torch.int8` and `torch.float` types on function boundaries as the `qint8` types require passing the scale and zero point quantization information which is not supported yet.	2024-01-12 19:11:14 -08:00
Ilija Kalinić	e1a86e480a	Implement lowering of torch.aten.logit (#2697 ) Closes nod-ai/SHARK-Turbine#290	2024-01-11 20:25:42 +05:30
Frederik Harwath	0860c41ee2	Implement aten.reflection_pad2d lowering to linalg	2024-01-10 21:32:22 -10:00
zjgarvey	07d0645f64	[RFC] general support for Adaptive Pooling Ops (#2661 ) Adaptive pooling ops can only be decomposed into their non-adaptive counterparts in trivial cases. For example, the current decomposition for AtenAdaptiveAvgPool1dOp in DecomposeComplexOps.cpp supports outSize = inSize (i.e., do literally nothing), and outSize = 1 (i.e., do a batched average). The reason adaptive pooling ops are difficult to lower to linalg is that they are not constantly strided. They are computed by taking an input tensor of shape (N, C, Hin), and an output size Hout, and computing the output tensor at position (n,c, h) in the following way: 1. compute st(h) = (hHin)//Hout 2. compute en(h) = 1 + ((h+1)Hin -1)//Hout 3. apply a computation (max or avg) to the slice: INPUT[n, c, st(h):en(h)] The provided sample implementation (for ConvertAtenAdaptiveAvgPool1dOp) uses tensor.extract to access the input tensor inside the payload of a linalg generic op. This is likely an unattractive use of linalg generic ops, which is why I am asking for some more targeted feedback on the validity of this approach before attempting to support the many other adaptive pooling ops. Specifically: - Is the performance of this implementation bad enough to warrant targeting different dialects entirely? e.g. TMtensor/linalg ext/ etc. - If the provided implementation is of acceptable performance to the community, then is it permissable to remove the Adaptive pooling decompositions from DecomposeComplexOps.cpp? Based on the current structure of the -torch-decompose-complex-ops pass, it does not seem possible to only decompose the adaptive ops in special cases (it seems to get stuck in an infinite loop on a match failure). I would be happy to instead incorporate the case logic into the conversion directly, and remove the decompositions once they are rendered completely obsolete. As long as this approach is acceptable, I can clean up the implementation with some helper functions, and quickly add support for each of the remaining Adaptive pooling ops.	2024-01-09 11:14:10 -08:00
Rob Suderman	985e7796a4	[linalg] Added `aten.clamp` support with integers to `torch-to-linalg` (#2718 ) The lowering for `aten.clamp` did not support integer types. Added support for integer types including a signed integer test.	2024-01-05 15:16:49 -08:00
kumardeepakamd	9adad9bc40	Add support for reflection_pad1d (#2706 ) Adds a lowering to Linalg for reflection_pad1d. Based on ideas/code from draft PR https://github.com/llvm/torch-mlir/pull/2693. --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-02 14:05:11 -05:00
Xida Ren (Cedar)	6660a26594	lower torch.aten.isinf to linalg (#2638 ) Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2023-12-28 17:20:32 -08:00
Rob Suderman	11cc92d4ab	[onnx] Lowerings from `onnx.tan` (#2642 ) Started work on the `tan` lowerings for ONNX to Torch. Uses `sin` and `cos` to represent a `tan`.	2023-12-20 10:09:39 -08:00
Rob Suderman	a24aadbfab	[aten] Make `torch.aten.matmul` to `linalg` work for non-broadcasting case (#2659 ) Broadcasting for `torch.aten.matmul` is optional so a MxN with NxK matmul should be legalized to a `linalg.matmul`.	2023-12-20 10:09:10 -08:00
Rob Suderman	791c666479	[torch] Lower `torch.aten.sinh` to `linalg` (#2662 )	2023-12-18 09:15:12 -08:00
Quinn Dawkins	030b0140d4	[TorchToLinalg] Lower aten.cat to tensor.concat (#2650 ) This replaces the lowering of aten.cat with tensor.concat, allowing more efficient handling of concatenations in downstream flows. The refbackend populates concat decomposition patterns that can be used to recover the previous lowering.	2023-12-15 15:45:32 -05:00
Sungsoon Cho	55e9401c5c	Implement lowering of aten.cosh op. (#2635 )	2023-12-15 11:19:26 -08:00
Frederik Harwath	b656c674ee	Implement e2e support for aten.acos op This depends on a change in the LLVM core repository which adds acos support to the MLIR Math dialect.	2023-12-12 10:52:02 +01:00
Vivek Khandelwal	0b4422a253	[MLIR][ONNX] Add OnnxToTorch support for bitwise and math ops This commit adds the OnnxToTorch support for BitwiseXor, BitwiseOr, Div, Equal, Cast, Ceil, Floor, Cos, and Clip op. This commit also adds the TorchToLinalg support for aten.clamp.Tensor and aten.clamp_min.Tensor op. Signed-Off By: vivekkhandelwal1424@gmail.com	2023-12-11 19:36:01 +05:30
Felix Schneider	fb21a85874	[TorchToLinalg] Lower grouped conv2d to linalg Op with correct dimension ordering (#2623 ) The linalg Op `linalg.conv_2d_ngchw_fgchw` had a bug where 1. Weights were accessed as G,F,C,H,W instead of as F,G,C,H,W 2. Output was accessed as N,F,G,H,W instead of as N,G,F,H,W Now this has been fixed in https://github.com/llvm/llvm-project/pull/73855 which broke the torch-mlir lowering to that Op. This patch switches lowering in torch-mlir to the newly introduced `linalg.conv_2d_ngchw_gfchw` op which accesses weights in an order that is compatible with PyTorch's memory layout. Fix https://github.com/llvm/torch-mlir/issues/2622	2023-12-08 14:18:23 +01:00
Quinn Dawkins	63505ad6b2	[TorchToLinalg] Drop constexpr from ifs in argmin/max.dim (#2617 ) MSVC-19 does not support constexprs of lambda captured constexpr values like this: https://godbolt.org/z/ej65rMzdr Instead, this just drops the constexpr from the if statements. See the discussion in https://discord.com/channels/689900678990135345/1062405112292712499/1182338050664185999	2023-12-07 13:08:17 -05:00
Quinn Dawkins	141202bc01	[TorchToLinalg] Fix integer type handling for aten.mm (#2615 ) Despite aten.mm requiring the input and output types match, we still opt to maintain signedness semantics in case later passes try to do any sort of integer type narrowing.	2023-12-07 00:13:53 -05:00
Frederik Harwath	6248216dca	Add aten.min.dim to linalg lowering (#2600 )	2023-12-05 07:16:35 -08:00
Quinn Dawkins	400752ca8d	[TorchToLinalg] NFC: Move Utils.h to an externally accessible location (#2603 )	2023-12-01 19:38:21 -05:00
Ramiro Leal-Cavazos	e568f7e999	Move handling of integer signedness to the backend conversions (#2597 ) The function `getTypeForScalarType` currently takes an argument to specify the signedness of integer types. This is leakage of backend specific requirements into the torch dialect world. Because `getTypeForScalarType` is a utility function for the torch dialect, it should only produce types that match the sign conventions used by PyTorch (regular integers are signed and unsigned integers are unsigned). This commit removes the signedness argument from `getTypeForScalarType`, and moves the backend specific handling of integer types to the backend code.	2023-11-29 09:43:09 -08:00
Vivek Khandelwal	dc9ea08db5	[MLIR][ONNX] Add OnnxToTorch support for atan and bitwise ops This commit adds the OnnxToTorch support for Atan, Bitshift, BitwiseAnd, and BitwiseNot op. This commit also adds the TorchToLinalg support for AtenBitwiseLeftShiftTensorOp. Signed-Off By: vivekkhandelwal@nod-labs.com	2023-11-28 17:19:07 +05:30
James Newling	03e8f99730	Lowering to linalg of prims split_dim op (#2576 ) Adds support for lowering to prims split_op. Similar design to collapse op lowering in https://github.com/llvm/torch-mlir/pull/2572, with some small differences, because the split_dim op (in pytorch) is view-changing whereas the collapse is not. The difference means that 1) it must be registered in the function Torch::isViewLikeOp 2) it must be be added to the "expected fail" set for the torch dynamo backend.	2023-11-21 07:56:09 -08:00
James Newling	647f2f5076	Additional tests for view lowering (#2584 ) The logic for lowering the aten view op to linalg is fairly complex. In this PR I have tried to follow all non-failing paths through the lowering and add unit tests where they're missing. There is 1 logical change to the lowering: redundant tensor.cast ops (same source and destination type) are folded.	2023-11-20 17:35:25 -08:00
Yuanqiang Liu	7b94189e07	[E2E] add nan case in elementwise comparison e2e tests (#2575 )	2023-11-20 11:27:08 +08:00
James Newling	e81282ae8f	Support for prims collapse op (lowering to linalg) (#2572 ) Steps taken: 1) add generator code to torch_ods_gen.py, run update_torch_ods.sh 2) add (custom) shape and type inference generator code to abstract_interp_lib_gen.py, run update_abstract_interp_lib.sh 3) Implement lowering to tensor.collapse_dims. Requires the `start` and `end` values to be constant, else lowering fails 4) Update xfail_sets.py (append to LTC_XFAIL_SET) after running /tools/e2e_test.sh --filter Collapse --verbose -c XX for all support backends (XX). Motivation: - Supporting the collapse operation will be useful for lowering of pixel_shuffle (see Issue #2559)	2023-11-15 08:34:38 -08:00
Yuanqiang Liu	0378da0abd	[Torch Dialect] support aten.isinf (#2544 ) Also fix linalg lowering from `UEQ` to `OEQ`. I will check other comparison's lowering later.	2023-11-04 22:26:01 +08:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00
Daniel Garvey	4901773f77	add uncovered cases in view lowering (#2524 ) removes unecessary checks from empty strided	2023-11-01 21:56:44 -05:00
Quinn Dawkins	6f81ad7293	[TorchToLinalg] Improve broadcast lowerings in strict symbolic modes (#2505 ) With strict symbolic shapes, we can assume numpy-style dynamic broadcasts never occur. This improves the lowering in the presence of this assumption.	2023-10-05 15:15:26 -04:00
Ramiro Leal-Cavazos	2e5d65064c	[linalg] Add handling for leadin and trailing size-1 dims in ViewOp This commit adds to the lowering of `aten.view` handling for the following cases: - `(..., a.size(i))` -> `(..., a.size(i), 1, ..., 1)` - `(..., a.size(i), 1, ..., 1)` -> `(..., a.size(i))` - `(a.size(i), ...)` -> `(1, ..., 1, a.size(i), ...)` - `(1, ..., 1, a.size(i), ...)` -> `(a.size(i), ...)`	2023-10-03 23:04:52 +00:00
Ramiro Leal-Cavazos	1c508af0ba	Revert "[linalg] Fix handling of trailing size-1 dimensions in aten.view (#2474 )" This reverts commit `7c6b9d2445`.	2023-10-03 23:04:52 +00:00
Vivek Khandelwal	ca6ce8974f	[MLIR][TORCH] Add support for int8 dtype for sub, add, and bitwise_and op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-03 22:12:31 +05:30
Vivek Khandelwal	9293326e1e	[MLIR][TORCH] Add support for bitwise_right_shit and bitwise_and.Scalar op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-02 13:06:59 +05:30
Vivek Khandelwal	c434736ee9	[MLIR][TORCH] Add support for conversion to int8 dtype Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-02 09:48:46 +05:30
Stella Laurenzo	860be09a39	Elide dynamic broadcast checks when in strict symbolic shapes mode. (#2496 ) When importing dynamic shaped programs from Dynamo, via torch.compile or torch.export, we can assume that strict symbolic shape checks have been done prior to generating torch IR. Among other shape checking, this eliminates the case where an unknown dimension can be dynamically '1' in a way that signals a broadcast. Adds a `isAssumingStrictSymbolicShapes` utility which consults a `torch.assume_strict_symbolic_shapes` attribute on an enclosing scope and returns true if present. In the linalg pipeline, many runtime checks are elided when this returns true.	2023-09-29 16:45:48 -07:00
saienduri	4e1dd3bf10	add e2e support for torch.log10 (#2479 )	2023-09-28 10:17:03 -07:00
Ramiro Leal-Cavazos	7c6b9d2445	[linalg] Fix handling of trailing size-1 dimensions in aten.view (#2474 ) This commit adds to the lowering of `aten.view` handling for the following cases: - `(..., a.size(i))` -> `(..., a.size(i), 1, ..., 1)` - `(..., a.size(i), 1, ..., 1)` -> `(..., a.size(i))` Fixes: https://github.com/llvm/torch-mlir/issues/2448	2023-09-27 09:09:30 -07:00
Ramiro Leal-Cavazos	c9fd78988e	[NFC] Clean-up `ConvertAtenViewOp` in linalg backend (#2470 ) While trying to fix a bug in the `ConvertAtenViewOp` pattern in the linalg backend, I realized that the pattern had become quite complex and had accumulated some dead code, making it hard to reason about. This commit simplifies the pattern quite a bit. The main changes are: 1. All the static helper functions in the `ConvertAtenViewOp` class have been simplified, both in their signature and their body. Each one now performs simple calculations on arrays, and take the least number of arguments necessary. 2. The body of [the `while` loop](`9fce566b0c/lib/Conversion/TorchToLinalg/DataMovement.cpp (L407)`) inside the main pattern has been changed to work on `MutableArrayRef` slices, to avoid having to keep track of `start` and `end` indices for the input and output shape arrays. 3. All the heuristics used to determine the mapping between the input and output dimensions are now in [this relatively short `if-else` section](`9fce566b0c/lib/Conversion/TorchToLinalg/DataMovement.cpp (L428-L460)`), making it easy to see what is going on. 4. Dead code was eliminated + updates to some of the documentation comments This commit does not add any new functionality to the `ConvertAtenViewOp` pattern.	2023-09-26 09:20:01 -07:00
Stella Laurenzo	278c41e938	Bump llvm-project to f66cd9e9556a53142a26a5c21a72e21f1579217c. (#2466 ) Picks up DenseResourceElementsAttr python support and fixes minf/maxf C++ rename.	2023-09-19 10:50:53 -07:00
Vivek Khandelwal	23b72244b1	[MLIR][TORCH] Add different dtype support for aten.bmm op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-09-12 12:38:46 +05:30
Bruce Kim	27b55b1d5f	implemented complex tensor aten mul (#2444 )	2023-09-07 13:29:15 -07:00
Jerin Philip	9cb5d38cd1	[MLIR][TORCH] Add E2E `torch.aten.prod_dim_int` (#2423 ) Uses the existing reduction codepath, adding modifications or branches required alongside for prod.	2023-09-05 13:38:51 -07:00

1 2 3 4 5 ...

310 Commits (60bf6c25af8a34f8d9356636d90a18c24467c6ec)