torch-mlir

Commit Graph

Author	SHA1	Message	Date
zjgarvey	de28c8540b	[ONNX] add int16 quantization support (#3446 ) There is currently no int16 quantization support in torch. This patch adds a new mlir type to correspond to the missing "torch.qint16" type, and enables lowering of quantization-related onnx ops using int16 types. In follow-up patches, custom quantization logic for ops like aten.matmul/aten.mm/aten.convolution may need to be revisited to allow support for qint16. The passes in FuseQuantizedOps.cpp may also need slight modifications.	2024-06-12 10:37:22 +05:30
Vivek Khandelwal	661be2d5b0	[MLIR][Torch] Add TorchToLinalg lowering for AtenAvgPool3dOp (#3030 ) This commit also fixes the average pool op' test failing for OnnxToLinalg lowering. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-06-04 22:12:34 +05:30
Rob Suderman	afca88a058	[NFC] Change to cast instead of .cast variants (#3405 ) Member casts have been deprecated. Changing over a bunch of the member cast calls to the global templated variants to remove deprecation warnings.	2024-05-30 23:45:13 -07:00
penguin_wwy	1f544c37d0	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
penguin_wwy	6679728c56	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 ) Like #3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated	2024-04-27 14:00:56 -07:00
penguin_wwy	d4a30b7e67	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3130 ) We should prefer functional style as the method style is deprecated https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated (https://mlir.llvm.org/deprecation/)	2024-04-11 06:47:35 -07:00
Rob Suderman	d83b576c6e	Bump LLVM to llvm/llvm-project@bb180856ec (#2895 ) Includes some minor first for `AffineMap::inferFromExprList`	2024-02-09 14:07:49 -08:00
Rob Suderman	25a5a22cbd	[torch] Support `torch.convolution` quantized lowering to `linalg` (#2811 ) Linalg has quantized specific operations. We can lower to these operations when there is a known zeropoint and scale operations. This allows the `convolution` to occur with lower bitwidth's, improving the overall performance.	2024-01-30 13:46:47 -08:00
Quinn Dawkins	494089d53d	Clang format refresh (#2812 ) After noticing a number of commits with unrelated formatting changes, I think something was changed with clang-format at one point and we're seeing a number of unrelated changes. Doing a refresh can help avoid this. The changes made here came from ``` find lib -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find include -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find projects -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm ```	2024-01-29 12:59:33 -05:00
Aart Bik	46a25d7241	[torch-mlir][sparse] preserve sparsity during lowering torch to linalg (#2809 ) This preserves sparsity at the most obvious places of lowering TORCH tensors to MLIR RankedTensorType tensors. Other places are marked for audit. With some initial lowering tests.	2024-01-26 10:54:59 -08:00
Rob Suderman	dc37616d67	[torch][quant] Support quantize and dequantize for torch (#2731 ) Handle both `torch.dequantize` and `torch.quantize_per_tensor` including the op based quantization parameter tracking. This includes adding `qint32` to torch types as it was missing during the initial type inclusion. For testing we only have `torch.int8` and `torch.float` types on function boundaries as the `qint8` types require passing the scale and zero point quantization information which is not supported yet.	2024-01-12 19:11:14 -08:00
Quinn Dawkins	400752ca8d	[TorchToLinalg] NFC: Move Utils.h to an externally accessible location (#2603 )	2023-12-01 19:38:21 -05:00
Ramiro Leal-Cavazos	e568f7e999	Move handling of integer signedness to the backend conversions (#2597 ) The function `getTypeForScalarType` currently takes an argument to specify the signedness of integer types. This is leakage of backend specific requirements into the torch dialect world. Because `getTypeForScalarType` is a utility function for the torch dialect, it should only produce types that match the sign conventions used by PyTorch (regular integers are signed and unsigned integers are unsigned). This commit removes the signedness argument from `getTypeForScalarType`, and moves the backend specific handling of integer types to the backend code.	2023-11-29 09:43:09 -08:00
Quinn Dawkins	6f81ad7293	[TorchToLinalg] Improve broadcast lowerings in strict symbolic modes (#2505 ) With strict symbolic shapes, we can assume numpy-style dynamic broadcasts never occur. This improves the lowering in the presence of this assumption.	2023-10-05 15:15:26 -04:00
Stella Laurenzo	860be09a39	Elide dynamic broadcast checks when in strict symbolic shapes mode. (#2496 ) When importing dynamic shaped programs from Dynamo, via torch.compile or torch.export, we can assume that strict symbolic shape checks have been done prior to generating torch IR. Among other shape checking, this eliminates the case where an unknown dimension can be dynamically '1' in a way that signals a broadcast. Adds a `isAssumingStrictSymbolicShapes` utility which consults a `torch.assume_strict_symbolic_shapes` attribute on an enclosing scope and returns true if present. In the linalg pipeline, many runtime checks are elided when this returns true.	2023-09-29 16:45:48 -07:00
Vivek Khandelwal	23b72244b1	[MLIR][TORCH] Add different dtype support for aten.bmm op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-09-12 12:38:46 +05:30
Abhishek Varma	6c9ba4ce95	[Torch-to-Linalg] Add dynamic dimension support for BroadcastTo op (#2174 ) -- This commit adds support for dynamic dimension in BroadcastTo op. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-07-07 10:01:51 -07:00
Vivek Khandelwal	788efc3180	[MLIR][TORCH] Add support for non-unit stride for conv backward This commit also adds the support for non-unit output padding in the case of transposed convolution. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2023-04-04 17:53:27 +05:30
Vivek Khandelwal	e7edcc62fd	build: update llvm tag to 147fe9de Summary of changes: - Replace call to `MemoryEffectOpInterface::hasNoEffect` with `isMemoryEffectFree`. - Make fix for the dynamic dims, since `kDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` in llvm - `makeShapeLLVMCompatible` and `makeShapeTorchCompatible` utilities convert shapes in order to remain consistent with the Torch and MLIR semantics. - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-01 13:36:50 +05:30
Gaurav Shukla	0d209998d1	llvm: update tag to e864ac6945 (#1600 ) Summary of changes: 1. Replace `string` iterator types by `IteratorType` enum. (`e6598b053d`) 2. Update `includes` wrt new directory layout of MLIR HLO codebase. (`9fd8d251a8`) 3. Update tags llvm: e864ac694540342d5e59f59c525c5082f2594fb8 MHLO: eab364ba2a66bd0613efb94f8a738c1c97aaee92 Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com> Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-11-16 14:40:36 -08:00
George Petterson	92f385bd9f	[MLIR][TORCH] Add E2E support aten.convolution_backward op This commit adds the decomposition for the `aten.convolution_backward` and `aten.convolution_backward_overrideable` op.	2022-11-15 07:38:26 +05:30
Tanyo Kwok	17bc7c89cc	build: update llvm tag to 74fb770d (#1539 ) * build: update llvm tag to 74fb770d This commit makes the following changes needed to update bump LLVM: + replace usages of `tensor::createPadScalarOp`, see https://reviews.llvm.org/D136493 + Update file checks	2022-11-01 15:27:09 +08:00
Ramiro Leal-Cavazos	82a3860e25	build: update llvm tag to 4546397e (#1502 ) This commit makes the following changes needed to update bump LLVM: - Replace `linalg.init_tensor` with `tensor.empty` (see: https://reviews.llvm.org/D135129) - Replace `NoSideEffect` with `Pure` (see https://reviews.llvm.org/D135505) - Replace `body` region accessor for `ReduceOp` and `ReduceWindowOp` with `getBody` - Fix incorrect use of `tosa::ReduceSumOp` in `AtenNativeLayerNormOp` conversion pattern. The result type of `tosa::ReduceSumOp` must have the same rank as the input type. (see: https://www.mlplatform.org/tosa/tosa_spec.html#_reduce_sum) Co-authored-by: Ashay Rane <ashay@users.noreply.github.com> Co-authored-by: Ashay Rane <ashay@users.noreply.github.com>	2022-10-18 04:22:53 +00:00
Ashay Rane	faa9a78e38	build: update llvm tag to 6f46ff37 (#1448 ) Summary of changes: - Updated references to the Arith dialect (https://reviews.llvm.org/D134762) - Switched to prefixed accessors for MemRef dialect (https://reviews.llvm.org/D134995) - Fixed warnings about signed/unsigned comparisons, ignored return values, and unused variables	2022-10-05 08:28:06 -05:00
gpetters94	f012279fa2	Add transposed case for at::convolution (#917 ) Also adds a decomposition for aten::conv_transposed2d.input	2022-08-24 12:19:35 -04:00
Gaurav Shukla	1be604bfd3	[LINALG] Lower `aten.Matmul` to `linalg.BatchMatmul` This commit lowers `aten.matmul` to `linalg.BatchMatmul` under the following conditions: 1. The result of matrix multiplication must have batch dimensions, i.e., rank greater than 2. 2. The resultant matrix must have at most 1 dynamic batch dimension. It also handles broadcasting of batch dimensions when batch dimensions of the matrices are broadcastable. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-06-25 10:58:06 +05:30
Ashay Rane	f18b2be911	torch,linalg: add support for translating aten.linalg.vector_norm (#839 ) This patch adds support for the torch.linalg.vector_norm op to the torch dialect, including the necessary shape function. It also extends the conversion of reduction operators to support lowering of AtenLinalgVectorNormOp, in addition to adding a handful of end-to-end tests to validate the lowering. There exist several opportunities to make this lowering optimal and robust. For instance, in its current form, the translation does not support ord = 0, +inf, or -inf. For L1 norms, we don't need to raise each element to the power 1.0. Similarly, L2 norms could benefit from strength reduction. Since the canonicalization pass is not able to apply these optimizations, we should consider applying them during the linalg lowering itself.	2022-05-19 15:48:15 -07:00
Vivek Khandelwal	f15d257aac	[MLIR][TORCH] Add support for ceil_mode = true for pooling ops This commit adds support for aten.max_pool2d, aten.max_pool2d_with_indices, and aten.avg_pool2d op for the cases where ceil_mode = true. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-11 12:52:47 +05:30
Prashant Kumar	5cdef0213d	[LINALG] Bug fix i64 vs i32 type comparison. Comparing index type instead of integer types solves the problem.	2022-04-22 08:09:58 +05:30
Vivek Khandelwal	769f3a8870	[MLIR][TORCH] Add E2E support for max_pool2d_with_indices op This commit adds lowering of `max_pool2d_with_indices` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-18 21:05:19 +05:30
Sean Silva	5d9222383c	Split up TorchToLinalg.cpp This helps keep things organized and also exposes more parallelism to the build system. It seems though that most of the compile time is actually spent in the headers though, so the wall time doesn't decrease as much as I had hoped (and now that the headers are being included multiple times, the cpu time actually increases a lot, sadly -- will try to dig into this).	2022-03-14 10:19:41 -07:00

31 Commits (c7d52f63b482b2c30f4efb435ce0cc2efeab25d9)