torch-mlir

Author	SHA1	Message	Date
Rob Suderman	e9cdd6cbc5	[torch] Fix tm_tensor.attention for end-to-end (#2907 ) Some operations include a backend matcher for specialized operations. We map these back to generics so they appropriately match to the high performance versions. This is done for the attention operation.	2024-02-13 21:18:01 -08:00
Scott Todd	d6e1d836ca	Drop torch attributes at the end of backend conversion. (#2876 ) Fixes https://github.com/llvm/torch-mlir/issues/2866 Some backends / downstream projects expect that a "fully converted" program has no remaining ops or attributes from the original dialect(s).	2024-02-13 14:32:02 -08:00
Rob Suderman	c0f139be0f	[torch] Add `torch.aten.eq.Tensor` comparison folder (#2889 ) Added a folder for the equals operator. This allows equality comparisons to be folded, primarily when shape computations occur on small tensors.	2024-02-09 15:02:20 -08:00
Rob Suderman	7d33ba69ac	[torch] Folder for torch.aten.select.int for splat cases (#2890 ) If the input or result is a splat value we can just constant fold the result. This is common for shape computations and can help with shape inference.	2024-02-09 14:02:54 -08:00
Dave Liddell	23647ab2d1	[torch] aten.index_select folder (#2871 ) Folds aten::index_select ops under the following conditions: 1. If the input and output are the same shape, the indexing operation is a NOP, so just return the input. 2. If the input has shape <1x1x...xNx...x1> (all 1's except for one dim), and the output shape is <1x1x...x1> (all 1's), then there is a single index, so extract the single element value and return a tensor with that value. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-02-07 16:17:15 -08:00
Xida Ren (Cedar)	fc04bc7ee9	[torch] AtenSliceOp folder that produces splat results (#2869 ) Includes `slice` folder and lit tests --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-07 19:00:46 +00:00
Xida Ren (Cedar)	cc06391630	AtenSortOp Folder (#2864 ) A chunk off https://github.com/llvm/torch-mlir/pull/2856 https://github.com/llvm/torch-mlir/pull/2860 --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com> Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-02-06 21:12:12 +00:00
Dave Liddell	1cb14f6879	Rob's atenTensor folder (#2867 ) If a tensor is initialized by a list with a single constant integer, this folder turns it into a torch.vtensor.literal --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-02-05 17:10:42 -08:00
Rob Suderman	e3faef5224	[onnx] Convert `onnx.QLinearConv` to `torch` (#2851 ) Leaning on the QDQ functionality in torch we can support the QLinearConv operation by piggybacking through `torch.Convolution`. This includes some changes such as allowing the `onnx` rewriter to run recursively. Doing so allows `QLinearConv` to decompose to `onnx.Convolution` which is then lowered to `torch`.	2024-02-05 16:09:41 -08:00
Xida Ren (Cedar)	24b8c8672a	[torch] Add folders for `torch.fill`, `torch.ones`, `torch.zeros` and `aten.getItem` (#2849 ) So that the CumSum Op in OPT can get the constant that it requires to be lowered to TMTensor --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-02 10:46:33 -08:00
Rob Suderman	25a5a22cbd	[torch] Support `torch.convolution` quantized lowering to `linalg` (#2811 ) Linalg has quantization-specific operations. We can lower to these operations when the zero point and scale are known. This allows the `convolution` to occur at lower bitwidths, improving the overall performance.	2024-01-30 13:46:47 -08:00
Aaron St George	4c557847bd	Don't fold `aten.detach` if result isn't same type as input. (#2824 ) We were seeing some assertion failures after some checks around folders were tightened up in LLVM: https://github.com/llvm/llvm-project/pull/75887 . This PR essentially moves the logic that used to be applied at the LLVM level into the folder, which seems to be the suggested fix. I'm not sure if the IR that caused issues for us _should_ be valid? ``` %1 = torch.aten.detach %arg0 : !torch.tensor<[1],f32> -> !torch.tensor ``` A better fix might be to create a verifier ensuring the result of `aten.detach` has the same type as its operand. --------- Co-authored-by: aaron-stgeorge <aaron.stgeorge@getcruise.com>	2024-01-30 09:45:51 -08:00
Aart Bik	fe836ceebf	[torch-mlir][test] cleanup trailing whitespace in mlir files (#2806 )	2024-01-25 14:24:13 -08:00
Aart Bik	e824fbc65c	[torch-mlir][torch] add encoding field to torch type (#2799 ) This adds an encoding field to the torch type, using the interfaces for printing, parsing, and verification. Note that although this change prepares adding sparsity to the torch type (as illustrated by the round trip and invalid tests), nothing in this change depends on the actual contents of the encoding field!	2024-01-25 10:04:04 -08:00
Rob Suderman	f6f890520b	[torch][quant] Quantized `torch.mm` for linalg with end-to-end test (#2750 ) This includes custom op matching for decomposed operations and fusing dequantization into dense operations. As a validation we compare to the dequant+mm torch implementation.	2024-01-24 14:02:50 -08:00
Han-Chung Wang	10acea71be	Bump LLVM to llvm/llvm-project@0cb024b (#2753 ) - Add fixes for `af78e5daf0` - Add fixes for `bb6d5c2200`	2024-01-15 07:12:12 -08:00
Zhekun(Josh) Zhang	d67afa9e95	[Torch] Add fold rule for AtenMaskedFillTensorOp to AtenMaskedFillScalarOp (#2543 )	2023-11-21 13:26:17 +08:00
Stella Laurenzo	5eae0adff1	Breakup python pytorch deps (#2582 ) This lifts the core of the jit_ir_importer and ltc out of the pt1 project, making them peers to it. As a side-effect of this layering, now the "MLIR bits" (dialects, etc) are not commingled with the various parts of the pt1 project, allowing pt1 and ltc to overlay cleanly onto a more fundamental "just MLIR" Python core. Prior to this, the Python namespace was polluted to the point that this could not happen. That "just MLIR" Python core will be introduced in a followup, which will create the space to upstream the FX and ONNX pure Python importers. The primary non-NFC change to the API is: * `torch_mlir.dialects.torch.importer.jit_ir` -> `torch_mlir.jit_ir_importer`. The rest is source code layering so that we can make the pt1 project optional without losing the other features. Progress on #2546.	2023-11-19 12:10:19 -08:00
James Newling	dad1f012f6	Add verification for torch permute op (#2551 ) - adds support for an optional verifier to the generated torch op tablegen (GeneratedTorchOps.td) - uses the above to add a verifier for the torch permute op. Motivation: I hit an unclear error from linalg while developing a decomposition pass for pixel_shuffle. The error would have been clearer if the problem had been detected earlier in the invalid aten.permute op. Testing: new tests added. To run added tests, from the base directory run ``` ./build/bin/llvm-lit test/Dialect/Torch/invalid.mlir ```	2023-11-15 11:47:54 -08:00
Yuanqiang Liu	3ab790c50a	[Torch Dialect] add canonicalize for aten.numel (#2562 )	2023-11-11 12:16:53 +08:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00
Zhekun(Josh) Zhang	88d4c475d3	[Torch] Fix mixP case for non value semantic ops (#2540 ) NonValueSemantic Ops like Add_, div_, etc. expect the result DType to be the same as the first input. However, the current implementation would produce the wrong result type for cases like: ```python a = torch.randn(3, 3).half() # float16 b = torch.randn(3, 3) # float32 a += b # i.e. torch.ops.aten.add_(a, b) ``` torch expects `a` to be float16, but dtype refinement would infer float32 type, since it's replaced by `aten.add`.	2023-11-02 12:40:08 +08:00
Quinn Dawkins	ae72eec224	Improve aten.broadcast_to folder when in strict symbol mode (#2504 ) Strict symbolic shapes allow us to assume numpy-style dynamic broadcasts never occur. This allows us to strengthen the folder for broadcasts to cases where the rank is the same and all shapes match (including dynamic sentinel values).	2023-10-05 09:02:10 -04:00
Stella Laurenzo	a00a0d4bfb	Integrate llvm-project and mlir-hlo. (#2454 ) Corresponding commits: * mlir-hlo: 16886a108eff5197f816ca0f1950cc5ff1b078d9 * stablehlo: 77a59815a82b34f7b08ed2d42a711d9920682d0e * llvm-project: 4acc3ffbb0af5631bc7916aeff3570f448899647 * Adapt to ByteCodeOpInterface changes. * Adapt to RegionBranchPoint changes: https://reviews.llvm.org/D159116 * Adapt inferReturnTypes to get the value from properties. * Adapt invalid.mlir to properties syntax * [TOSA] Align with custom assembly format change. * [TOSA] handle change of axis to int32 type * [TOSA] Restore improper convert to i32 Landing with Windows broken (it cannot be fixed because of the way the mlir-hlo dep is inserted). Will followup with an untangling. --------- Co-authored-by: TatWai Chong <tatwai.chong@arm.com> Co-authored-by: Eric Kunze <eric.kunze@arm.com>	2023-09-12 15:09:57 -07:00
Bruce Kim	cd1c7df8be	[MLIR][TORCH] Add E2E support for view_as_real op (#2419 ) * view_as_real test case, allow dtype in testutils.randn * abstract python upstream func implemented * fixed upstream dtype func, implemented view_as_real backend op * formatted AtenViewAsRealOp, removed change in e2etest/framework * removed test suite from reshape_like.py, because it's moved to basic.py * implemented C-API wrapper for mlirComplexF128 type * fixed torch.complex dtype width in MLIR and Torch MLIR, deleted float16 dtype dict * Changed IR input of aten fft_fft unit test * code refactored * code refactored and fixed ci test * refactored: removed white spaces, and rolled back to having both input/output affine expr * refactored: deleted output affine expr to reduce redundancy * xfail ltc backend * removed ComplexImag and ComplexReal from torchdynamo xfail set * copied and pasted from main branch as there's no change to be made in this file * refactored abstract_interp_lib_gen.py * refactored: torchtypes.td, formatted, removed commented out code	2023-09-01 21:12:01 -07:00
Quinn Dawkins	1fc4314b62	Add folder for aten.broadcast_to on unchanged static shapes (#2421 )	2023-09-01 14:50:34 -04:00
JianzheXiao	17d02811d5	[Torch Dialect] add folder for aten.any.bool (#2388 ) * update * update * update * update * update * update * update	2023-08-30 17:29:03 +08:00
jinchen62	1682b540bf	Prototype passes for lowering quantized group matmul (#2402 ) * Support brevitas custom op (#2320) * f16 change for brevitas * Adapt the change of brevitas quant custom op name * Add unit tests * Make brevitas conversions isolated * Address the comments --------- Co-authored-by: dan <danimal197@gmail.com>	2023-08-29 21:25:45 -07:00
Jiawei Wu	4c9d234b01	revert canonicalizer for PrimListConstructOp (#2408 )	2023-08-22 09:18:39 +08:00
Jiawei Wu	4c12aceb81	[Torch-Dialect] add canonicalizer for prim::ListConstruct op (#2306 ) [Torch-Dialect] add canonicalizer for prim::ListConstruct op	2023-08-08 10:28:11 +08:00
Alexandre Rames	1e468e8294	Fix canonicalization of `torch.prim.TupleUnpack`.	2023-07-20 20:08:46 +02:00
Alexandre Rames	a20422ce65	Support `DerefineOp` in `RefinePublicReturn`.	2023-07-20 20:08:46 +02:00
Alexandre Rames	4847563bed	Clean up verification of calling conventions. The implementation at this place was a remnant of the time when the pipeline was run only once. Rely instead on the backend verification, after optimizations have had an opportunity to resolve some uncertainties (e.g. `!torch.optional`).	2023-07-20 20:08:46 +02:00
Matthias Gehre	64d7626a52	Fixes for split tensor and slice (#2314 ) * RecomposeComplexOps: Remove dead slice op * lib/Dialect/Torch/IR/TorchOps.cpp: Fold slice ops even when they are on non-value tensors * lib/Conversion/TorchToTosa/TorchToTosa.cpp: Fix slice start/end out of range/none * lib/Dialect/Torch/IR/TorchOps.cpp: AtenSliceTensorOp::fold: Fold slices that go from 0:int_max * More tests for aten.split.Tensor	2023-07-20 09:53:54 +02:00
Jiawei Wu	3f843c8fd9	[torch-dialect] fix aten.type_as op's folder (#2283 ) [torch-dialect] fix torch.type_as op's folder by decomposing it to prim.dtype + aten.to_dtype	2023-07-20 09:51:58 +08:00
Ramiro Leal-Cavazos	718f53ff8a	Fix handling of `!torch.number` in abstract interpretation library (#2309 ) In PyTorch, the `NumberType` is equal to `Union[int, float, complex]`. However, the abstract interpretation library was treating the `NumberType` as `Union[int, float]`, resulting in type mismatches when reifying certain dtype functions. This commit fixes the type inconsistency by having the abstract interpretation functions take as an input a `Union[int, float, complex]` for the ops that take `!torch.number` inputs.	2023-07-17 09:52:04 -07:00
Jiawei Wu	c7fa42b7d3	[Torch Dialect] Add canonicalizer for aten.to.other op (#2273 ) Canonicalize aten.to.other to prim.device + prim.dtype + aten.to.device Co-authored-by: wujiawei.aml <wujiawei.aml@bytedance.com>	2023-06-30 09:43:08 +08:00
Yuanqiang Liu	449cfb8375	[Torch Dialect] add more scalar op folders (#2265 )	2023-06-29 10:37:13 +08:00
Yuanqiang Liu	1ea2b57ab7	[Torch Dialect] add folder for aten.add (#2264 ) * [Torch Dialect] add folder for aten.add * update * update * update	2023-06-27 10:55:28 +08:00
Yuanqiang Liu	96b14e952e	[Torch Dialect] Support aten.device.with_index (#2254 )	2023-06-23 01:07:14 +08:00
Yuanqiang Liu	7c6961bcbf	[Torch Dialect] Support aten.cuda and add canonicalizer for aten.cuda (#2231 )	2023-06-14 09:56:39 +08:00
Yuanqiang Liu	ddea56a832	[Torch Dialect] fix torch.uint8's dtype infer (#2227 )	2023-06-13 10:38:20 +08:00
Matthias Gehre	27a3d09917	Torch: Fold RuntimeAssertOp when condition is true (#2198 )	2023-06-09 19:06:25 +08:00
Yuanqiang Liu	5a7bf4e4cb	[Torch Dialect] Add canonicalize pattern for aten.is_floating_point (#2194 ) * [Torch Dialect] Add canonicalize pattern for aten.is_floating_point * implement as fold * add lit test	2023-06-07 17:05:31 +08:00
Ramiro Leal-Cavazos	dff3405d5a	Add alias analysis for cast-like ops to maximize-value-semantics (#2160 ) When `use_tracing=True` is used to import a model into Torch-MLIR, several casts get inserted in the IR to bridge the untyped inputs and outputs with the typed body of the computation. These casts create extra aliases of tensors that cause the current analysis in `maximize-value-semantics` to fail. In particular, the `maximize-value-semantics` analysis assumes that the only valid alias right after an overwrite is the overwritten alias. So, if there is a use of a casted version of the overwritten alias after the overwrite, the analysis fails. This commit improves the analysis by identifying all cast-like aliases of the overwritten alias and allowing such aliases to be used after an overwrite. Because this issue only arises when using tracing, it cannot currently be tested e2e, so only a lit test is added.	2023-05-25 17:05:41 +00:00
Ramiro Leal-Cavazos	de02b56e17	Replace RefineTypes with dtype functions (#2105 ) This commit adds dtype functions for all the torch ops that did not previously have one and removes the pass `RefineTypes`, since the abstract interpretation library now takes care of all the dtype propagation. All dtype functions added are tested except for - `aten.embedding` - `aten._embedding_bag` - `aten.embedding_bag` These functions need a change to the testing framework to allow specifying the actual data inside the tensor used for testing. I will fix this in a follow up patch. Co-authored-by: Jiahao Li <liplus17@163.com>	2023-05-12 13:40:45 -07:00
Zhekun Zhang	0cf9ee340b	[Torch Dialect] Add to.dtype_layout canonicalize patterns (#2062 ) * add to.dtype_layout canonicalize patterns * update comment --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-05-02 20:06:02 -07:00
Yuanqiang Liu	3e83a86354	[Torch Dialect] fix isValidSubtype with dynamic dim (#2018 )	2023-04-11 01:02:18 -07:00
Vivek Khandelwal	98747d09a8	[MLIR][TORCH] Add support for prims::view_of op This op does nothing and just returns the input operand as the result of the op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-04-11 07:58:10 +05:30
Ramiro Leal-Cavazos	d803ab4eeb	Cast `number` to `float` when shape function takes Scalar arg (#1978 ) To keep things simple in shape functions, `Scalar` inputs are considered `float`s. This means that when inserting the shape functions into the IR, we must cast any `!torch.number`s into `float`s so that the operand type matches the expected type in the shape function. This commit adds the cast from `Scalar` to `float`.	2023-03-28 09:30:31 -07:00

340 Commits (074f112d6afbfe48441083fa0e9764114d3c72de)