torch-mlir

Commit Graph

Author	SHA1	Message	Date
Xida Ren (Cedar)	24b8c8672a	[torch] Add folders for `torch.fill`, `torch.ones`, `torch.zeros` and `aten.getItem` (#2849 ) So that the CumSum Op in OPT can get the constant that it requires to be lowered to TMTensor --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-02 10:46:33 -08:00
Rob Suderman	25a5a22cbd	[torch] Support `torch.convolution` quantized lowering to `linalg` (#2811 ) Linalg has quantized specific operations. We can lower to these operations when there is a known zeropoint and scale operations. This allows the `convolution` to occur with lower bitwidth's, improving the overall performance.	2024-01-30 13:46:47 -08:00
Aaron St George	4c557847bd	Don't fold `aten.detach` if result isn't same type as input. (#2824 ) We were seeing some assertion failures after some checks around folders were tightened up in LLVM: https://github.com/llvm/llvm-project/pull/75887 . This PR essentially moves the logic that used to be applied at the LLVM level into the folder, which seems to be the suggested fix. I'm not sure if the IR that caused issues for us _should_ be valid? ``` %1 = torch.aten.detach %arg0 : !torch.tensor<[1],f32> -> !torch.tensor ``` A better fix might be to create a verifier ensuring the result of `aten.detach` has the same type as its operand. --------- Co-authored-by: aaron-stgeorge <aaron.stgeorge@getcruise.com>	2024-01-30 09:45:51 -08:00
Aart Bik	fe836ceebf	[torch-mlir][test] cleanup trailing whitespace in mlir files (#2806 )	2024-01-25 14:24:13 -08:00
Aart Bik	e824fbc65c	[torch-mlir][torch] add encoding field to torch type (#2799 ) This adds an encoding field to the torch type, using the interfaces for printing, parsing, and verification. Note that although this change prepares adding sparsity to the torch type (as illustrated by the round trip and invalid tests), nothing in this change depends on the actual contents of the encoding field!	2024-01-25 10:04:04 -08:00
Rob Suderman	f6f890520b	[torch][quant] Quantized `torch.mm` for linalg with end-to-end test (#2750 ) This includes custom op matching for decomposed operations and fusing dequantization into dense operations. As a validation we compare to the dequant+mm torch implementation.	2024-01-24 14:02:50 -08:00
Han-Chung Wang	10acea71be	Bump LLVM to llvm/llvm-project@0cb024b (#2753 ) - Add fixes for `af78e5daf0` - Add fixes for `bb6d5c2200`	2024-01-15 07:12:12 -08:00
Zhekun(Josh) Zhang	d67afa9e95	[Torch] Add fold rule for AtenMaskedFillTensorOp to AtenMaskedFillScalarOp (#2543 )	2023-11-21 13:26:17 +08:00
Stella Laurenzo	5eae0adff1	Breakup python pytorch deps (#2582 ) This lifts the core of the jit_ir_importer and ltc out of the pt1 project, making them peers to it. As a side-effect of this layering, now the "MLIR bits" (dialects, etc) are not commingled with the various parts of the pt1 project, allowing pt1 and ltc to overlay cleanly onto a more fundamental "just MLIR" Python core. Prior to this, the Python namespace was polluted to the point that this could not happen. That "just MLIR" Python core will be introduced in a followup, which will create the space to upstream the FX and ONNX pure Python importers. This primary non-NFC change to the API is: * `torch_mlir.dialects.torch.importer.jit_ir` -> `torch_mlir.jit_ir_importer`. The rest is source code layering so that we can make the pt1 project optional without losing the other features. Progress on #2546.	2023-11-19 12:10:19 -08:00
James Newling	dad1f012f6	Add verification for torch permute op (#2551 ) - adds support for an optional verifier to the generated torch op tablegen (GeneratedTorchOps.td) - uses the above to add a verifier for the torch permute op. Motivation: I hit an unclear error from linalg while developing a decomposition pass for pixel_shuffle. The error would have been clearer if the problem had been detected earlier in the invalid aten.permute op. Testing: new tests added. To run added tests, from the base directory run ``` ./build/bin/llvm-lit test/Dialect/Torch/invalid.mlir ```	2023-11-15 11:47:54 -08:00
Yuanqiang Liu	3ab790c50a	[Torch Dialect] add canonicalize for aten.numel (#2562 )	2023-11-11 12:16:53 +08:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00
Zhekun(Josh) Zhang	88d4c475d3	[Torch] Fix mixP case for non value semantic ops (#2540 ) NonValueSemantic Ops like Add_, div_, etc. expect result DType to be the same as the first input. However, current implementation would result in wrong result type for case like: ```python a = torch.randn(3, 3).half() # float16 b = torch.randn(3, 3) # float32 a += b # i.e. torch.ops.aten.add_(a, b) ``` torch expects `a` to be float16, but dtype refinement would infer float32 type, since it's replaced by `aten.add`.	2023-11-02 12:40:08 +08:00
Quinn Dawkins	ae72eec224	Improve aten.broadcast_to folder when in strict symbol mode (#2504 ) Strict symbolic shapes allow us to assume numpy-style dynamic broadcasts never occur. This allows us to strengthen the folder for broadcasts to cases where the rank is the same and all shapes match (including dynamic sentinel values).	2023-10-05 09:02:10 -04:00
Stella Laurenzo	a00a0d4bfb	Integrate llvm-project and mlir-hlo. (#2454 ) Corresponding commits: * mlir-hlo: 16886a108eff5197f816ca0f1950cc5ff1b078d9 * stablehlo: 77a59815a82b34f7b08ed2d42a711d9920682d0e * llvm-project: 4acc3ffbb0af5631bc7916aeff3570f448899647 * Adapt to ByteCodeOpInterface changes. * Adapt to RegionBranchPoint changes: https://reviews.llvm.org/D159116 * Adapt inferReturnTypes to get the value from properties. * Adapt invalid.mlir to properties syntax * [TOSA] Align with custom assembly format change. * [TOSA] handle change of axis to int32 type * [TOSA] Restore improper convert to i32 Landing with Windows broken (it cannot be fixed because of the way the mlir-hlo dep is inserted). Will followup with an untangling. --------- Co-authored-by: TatWai Chong <tatwai.chong@arm.com> Co-authored-by: Eric Kunze <eric.kunze@arm.com>	2023-09-12 15:09:57 -07:00
Bruce Kim	cd1c7df8be	[MLIR][TORCH] Add E2E support for view_as_real op (#2419 ) * view_as_real test case, allow dtype in testutils.randn * abstract python upstream func implemented * fixed upstream dtype func, implemented view_as_real backend op * formatted AtenViewAsRealOp, removed change in e2etest/framework * removed test suit from reshape_like.py, because it's moved to basic.py * implemented C-API wrapper for mlirComplexF128 type * fixed torch.complex dtype width in MLIR and Torch MLIR, deleted float16 dtype dict * Changed IR input of aten fft_fft unit test * code refactored * code refactored and fixed ci test * refactored: removed white spaces, and rolled back to having both input/output affine expr * refactored: deleted output affine expr to reduce redundancy * xfail ltc backend * removed ComplexImag and ComplexReal from torchdynamo xfail set * copied and pasted from main branch as there's no change to be made in this file * refactored abstract_interp_lib_gen.py * refactored: torchtypes.td, formatted, removed commented out code	2023-09-01 21:12:01 -07:00
Quinn Dawkins	1fc4314b62	Add folder for aten.broadcast_to on unchanged static shapes (#2421 )	2023-09-01 14:50:34 -04:00
JianzheXiao	17d02811d5	[Torch Dialect] add folder for aten.any.bool (#2388 ) * update * update * update * update * update * update * update	2023-08-30 17:29:03 +08:00
jinchen62	1682b540bf	Prototype passes for lowering quantized group matmul (#2402 ) * Support brevitas custom op (#2320) * f16 change for brevitas * Adapt the change of brevitas quant custom op name * Add unit tests * Make brevitas conversions isolated * Address the comments --------- Co-authored-by: dan <danimal197@gmail.com>	2023-08-29 21:25:45 -07:00
Jiawei Wu	4c9d234b01	revert canonicalizer for PrimListConstructOp (#2408 )	2023-08-22 09:18:39 +08:00
Jiawei Wu	4c12aceb81	[Torch-Dialect] add canonicalizer for prim::ListConstruct op (#2306 ) [Torch-Dialect] add canonicalizer for prim::ListConstruct op	2023-08-08 10:28:11 +08:00
Alexandre Rames	1e468e8294	Fix canonicalization of `torch.prim.TupleUnpack`.	2023-07-20 20:08:46 +02:00
Alexandre Rames	a20422ce65	Support `DerefineOp` in `RefinePublicReturn`.	2023-07-20 20:08:46 +02:00
Alexandre Rames	4847563bed	Clean up verification of calling conventions. The implementation at this place was a remnent of the times the pipeline was run only once. Rely instead on the backend verification, after optimizations have had an opportunity to resolve some uncertainties. (e.g. `!torch.optional`).	2023-07-20 20:08:46 +02:00
Matthias Gehre	64d7626a52	Fixes for split tensor and slice (#2314 ) * RecomposeComplexOps: Remove dead slice op * lib/Dialect/Torch/IR/TorchOps.cpp: Fold slice ops even when they are on non-value tensors * lib/Conversion/TorchToTosa/TorchToTosa.cpp: Fix slice start/end out of range/none * lib/Dialect/Torch/IR/TorchOps.cpp: AtenSliceTensorOp::fold: Fold slices that go from 0:int_max * More tests for aten.split.Tensor	2023-07-20 09:53:54 +02:00
Jiawei Wu	3f843c8fd9	[torch-dialect] fix aten.type_as op's folder (#2283 ) [torch-dialect] fix torch.type_as op's folder by decomposing it to prim.dtype + aten.to_dtype	2023-07-20 09:51:58 +08:00
Ramiro Leal-Cavazos	718f53ff8a	Fix handling of `!torch.number` in abstract interpretation library (#2309 ) In PyTorch, the `NumberType` is equal to `Union[int, float, complex]`. However, the abstract interpretation library was treating the `NumberType` as `Union[int, float]`, resulting in type mismatches when reifying certain dtype functions. This commit fixes the type inconsistency by having the abstract interpretation functions take as an input a `Union[int, float, complex]` for the ops that take `!torch.number` inputs.	2023-07-17 09:52:04 -07:00
Jiawei Wu	c7fa42b7d3	[Torch Dialect] Add canonicalizer for aten.to.other op (#2273 ) Canonicalize aten.to.other to prim.device + prim.dtype + aten.to.device Co-authored-by: wujiawei.aml <wujiawei.aml@bytedance.com>	2023-06-30 09:43:08 +08:00
Yuanqiang Liu	449cfb8375	[Torch Dialect] add more scalar op folders (#2265 )	2023-06-29 10:37:13 +08:00
Yuanqiang Liu	1ea2b57ab7	[Torch Dialect] add folder for aten.add (#2264 ) * [Torch Dialect] add folder for aten.add * update * update * update	2023-06-27 10:55:28 +08:00
Yuanqiang Liu	96b14e952e	[Torch Dialect] Support aten.device.with_index (#2254 )	2023-06-23 01:07:14 +08:00
Yuanqiang Liu	7c6961bcbf	[Torch Dialect] Support aten.cuda and add canonicalizer for aten.cuda (#2231 )	2023-06-14 09:56:39 +08:00
Yuanqiang Liu	ddea56a832	[Torch Dialect] fix torch.uint8's dtype infer (#2227 )	2023-06-13 10:38:20 +08:00
Matthias Gehre	27a3d09917	Torch: Fold RuntimeAssertOp when condition is true (#2198 )	2023-06-09 19:06:25 +08:00
Yuanqiang Liu	5a7bf4e4cb	[Torch Dialect] Add canonicalize pattern for aten.is_floating_point (#2194 ) * [Torch Dialect] Add canonicalize pattern for aten.is_floating_point * implement as fold * add lit test	2023-06-07 17:05:31 +08:00
Ramiro Leal-Cavazos	dff3405d5a	Add alias analysis for cast-like ops to maximize-value-semantics (#2160 ) When `use_tracing=True` is used to import a model into Torch-MLIR, several casts get inserted in the IR to bridge the untyped inputs and outputs with the typed body of the computation. These casts create extra aliases of tensors that cause the current analysis in `maximize-value-semantics` to fail. In particular, the `maximize-value-semantics` analysis assumes that the only valid alias right after an overwrite is the overwritten alias. So, if there is a use of a casted version of the overwritten alias after the overwrite, the analysis fails. This commit improves the analysis by identifying all cast-like aliases of the overwritten alias and allowing such aliases to be used after an overwrite. Because this issue only arises when using tracing, it cannot be currently tested e2e, so only lit test is added.	2023-05-25 17:05:41 +00:00
Ramiro Leal-Cavazos	de02b56e17	Replace RefineTypes with dtype functions (#2105 ) This commit adds dtype functions for all the torch ops that did not previously have one and removes the pass `RefineTypes`, since the abstract interpretation library now takes care of all the dtype propagation. All dtype functions added are tested except for - `aten.embedding` - `aten._embedding_bag` - `aten.embedding_bag` These functions need a change to the testing framework to allow specifying the actual data inside the tensor used for testing. I will fix this in a follow up patch. Co-authored-by: Jiahao Li <liplus17@163.com>	2023-05-12 13:40:45 -07:00
Zhekun Zhang	0cf9ee340b	[Torch Dialect] Add to.dtype_layout canonicalize patterns (#2062 ) * add to.dtype_layout canonicalize patterns * update comment --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-05-02 20:06:02 -07:00
Yuanqiang Liu	3e83a86354	[Torch Dialect] fix isValidSubtype with dynamic dim (#2018 )	2023-04-11 01:02:18 -07:00
Vivek Khandelwal	98747d09a8	[MLIR][TORCH] Add support for prims::view_of op This op does nothing and just returns the input operand as the result of the op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-04-11 07:58:10 +05:30
Ramiro Leal-Cavazos	d803ab4eeb	Cast `number` to `float` when shape function takes Scalar arg (#1978 ) To keep things simple in shape functions, `Scalar` inputs are considered `float`s. This means that when inserting the shape functions into the IR, we must cast any `!torch.number`s into `float`s so that the operand type matches the expected type in the shape function. This commit adds the cast from `Scalar` to `float`.	2023-03-28 09:30:31 -07:00
Maksim Levental	953ea39cb5	handles 2,3,4 from https://github.com/llvm/torch-mlir/issues/1963 (#1964 )	2023-03-24 21:50:01 -05:00
Ramiro Leal-Cavazos	eae3ff7f1c	Change dtype functions interface to take ints tuple for each tensor (#1965 ) The original design for the dtype functions outlined in https://github.com/llvm/torch-mlir/issues/1462 was unable to properly handle ops that take optional tensors as an input when the optional tensor has a value of None. By the time the op gets imported into torch-mlir, if an optional value is None, all information about the original type is lost from the op type signature, preventing torch-mlir from knowing if a value of None was from an optional tensor or not, which was crucial in the original design since each tensor argument must be turned into two separate arguments for the dtype function. This commit changes the interface to dtype functions such that each tensor turns into a tuple of two ints, the first representing the rank of the tensor and the second the dtype of the tensor. Since now there is a one-to-one correspondence between the operands of an op and the operands of its dtype function, there is no ambiguity about which operand of the op corresponds with which operand of the dtype function. To test the implementation, this commit defines dtype function for convolution op, which takes one optional tensor as an argument.	2023-03-23 11:05:39 -07:00
Matthias Gehre	aa5bcb3cf2	LowerToBackendContract: Explicitly error out on unimplemented operator (#1947 ) * LowerToBackendContract: Explicitly error out on unimplemented operator But only reject torch.operator when results are invalid. Otherwise it might be a custom op that the backend supports.	2023-03-20 16:27:08 +01:00
Ramiro Leal-Cavazos	d310bb12bd	Expand definition of tensor subtype to include shape/dtype info (#1929 ) Currently, the op `torch.tensor_static_info_cast` will not get canonicalized away if the result type has any shape or dtype information. This is because `isValidSubtype` only returns true when the tensor types being compared are exactly the same or the supertype has no shape and dtype information. Being unable to canonicalize away the `torch.tensor_static_info_cast` gets in the way of further optimizations, such as shape propagation. This commit improves `isValidSubtype` by adding logic that compares the shapes and dtypes of the two tensor types to determine of one type is indeed a valid subtype of the other. Fixes https://github.com/llvm/torch-mlir/issues/1926	2023-03-10 16:43:57 -08:00
Ziheng Jiang	dca2b8a40a	[TORCH] Improve type refinement for aten.cat. (#1908 ) * [TORCH] Fix type refinement for aten.cat. * Add test. * Address comments. * Update. * Update. * Update. * Update. * Update. --------- Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2023-03-09 16:17:35 -08:00
Zhekun Zhang	1d3a7419c5	[Torch Dialect] add RSub, ScalarImplicit canonicalize (#1899 ) * add rsub, scalarimplit canonicalizer * reformat * address comments * fix bug * fix test * Update elementwise.py * resolve merge conflict * change to 3 * change to 3 * real fix * fix name * add torchdynamo fail test --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-03-06 17:38:27 -08:00
Ramiro Leal-Cavazos	d30af8772b	Handle uninitialized lattice elements in RefineTypes (#1911 ) The data-flow analysis does not always propagate information to the entire graph. This results in some lattice elements being uninitialized. Currently the lattice elements are not checked to see if they are uninitialized before rewriting the graph, potentially resulting in invalid IR (see https://github.com/llvm/torch-mlir/issues/1896). This commit adds handling for uninitialized lattice elements.	2023-03-03 08:55:58 -08:00
Yuanqiang Liu	7a8304f935	[Torch Dialect] add folder for aten.sub.float (#1871 )	2023-03-02 09:07:33 -08:00
Yuanqiang Liu	fc1e091d6a	[Torch Dialect] add aten.pow.int_float op and it's folder (#1872 )	2023-02-28 09:36:05 -08:00
Yuanqiang Liu	6ab990e1e8	[Torch Dialect] add folder for aten.Int.float (#1863 )	2023-02-10 13:59:03 -08:00
Yuanqiang Liu	2f6fdb7f0b	[Torch Dialect] add folder for prim.min.int (#1864 )	2023-02-10 13:58:15 -08:00
Gleb Kazantaev	3930588a7e	Enable VerifyBackendContract in LTC backend (#1798 ) * Enable VerifyBackendContract in LTC backend * Update VerifyBackendContract pass * Move convert_scalar_implicit to jit_utils * Rename VerifyBackendContract to VerifyBackendContractNoDecompositions * Update verify-backend-contract-error.mlir test	2023-01-24 22:14:17 -05:00
Ramiro Leal-Cavazos	6c86bec04f	build: update llvm tag to 9acc2f37 (#1828 ) This commit makes the following changes: - Update dialects to use fold API `kEmitFoldAdaptorFolder` and update signature of `fold` methods (see PSA https://discourse.llvm.org/t/psa-new-improved-fold-method-signature-has-landed-please-update-your-downstream-projects/67618) - Replace `makeArrayRef` with `ArrayRef` (see https://reviews.llvm.org/D140896) - Remove `TypeRange{}` arg from `b.create<scf::IfOp>` since builder no longer takes that argument - Make `func`s in `Torch/invalid.mlir` private, since symbol declarations cannot be public. (see https://discourse.llvm.org/t/rfc-symbol-definition-declaration-x-visibility-checks/2140)	2023-01-25 01:29:42 +00:00
Ramiro Leal-Cavazos	273664ded6	[custom op] Replace `tanh` dtype function with `expm1` (#1769 ) This commit replaces the `tanh` dtype function, which was being used to test the implementation of dtype functions in `a710237437`, with a dtype function for `expm1`. The dtype function for `expm1` is identical to the `tanh` one, so the same level of testing is maintained. Currently, there are ops getting dtype information from the `RefineTypes` pass and ops getting dtype information from the `TorchDtypeRefinementPipeline`. Since each pass can only propagete dtype information for the ops it knows how to handle, some models with many ops handled in both passes require the two dtype propagation passes to execute many times, reaching the iteration limit set in the `LowerToBackendContractPass`. To temporarily avoid this issue while the migration to `TorchDtypeRefinementPipeline` is finished, this commit switches `tanh` to `expm1`, since the latter is used a lot less in large models.	2023-01-03 14:18:26 -08:00
ataheridezfouli-groq	17ee643aeb	[TORCH] Add Complex Number support (#1673 ) Add Complex number dtype support to torch tensors. Add aten.fft_fft op to test complex numbers.	2022-12-15 21:40:01 +00:00
Ramiro Leal-Cavazos	60db793feb	Pass op legality info to `verifyBackendContractPass` (#1705 ) In order to verify if a given IR satisfies the backend contract, the verifier needs to know if decompositions took place, and if so, which ops were decomposed and which were not. This commit adds two arguments to `verifyBackendContractPass` to specify if decompositions took place and which ops to consider backend legal, similar to the arguments of `LowerToBackendContractPass`.	2022-12-15 08:32:52 -08:00
Ahmed S. Taei	b1f6832849	Add aten.slice.Tensor & aten.cat folders (#1691 )	2022-12-13 13:02:47 -08:00
Ramiro Leal-Cavazos	a710237437	[custom op] Generalize shape library logic to work with dtypes (#1594 ) * [custom op] Generalize shape library logic to work with dtypes This commit generalizes the shape library logic, so that dtype rules for ops can also be expressed using the same mechanism. In other words, each op can now have a shape function and a dtype function specified in Python that is imported during lowering to calculate the shapes and dtypes throught a program. For more information about how to specify a dtype function, see the updated `docs/adding_a_shape_and_dtype_function.md`. For those not familiar with how the shape library works, the file `docs/calculations_lib.md` provides an overview.	2022-12-13 08:25:41 -08:00
Ramiro Leal-Cavazos	a54b334578	Allow running DecomposeComplexOps more than once (#1671 ) The current implementation of `DecomposeComplexOps` fails if an op expected to be decomposed does not get decomposed in the first iteration of the `createTorchSimplificationPipeline` in `LowerToBackendContractPass`. However, some graphs require multiple iterations of `createTorchSimplificationPipeline` to fully propagate all statically knowable information, such as dtypes and shapes, to the entire graph, sometimes resulting in the need to run `DecomposeComplexOps` more than once. This commit changes `DecomposeComplexOps` to use a greedy algorithm for pattern application and moves the legalization check of ops to the `LowerToBackendContractPass` to allow for the `DecomposeComplexOps` to run more than once.	2022-12-08 09:26:38 -08:00
Ramiro Leal-Cavazos	76190e8a3f	Remove unnecessary decompose-complex-ops tests (#1693 ) This commit removes lit tests from the `decompose-complex-ops` that are essentially testing a macro expansion, in accordance with https://github.com/llvm/torch-mlir/blob/main/docs/architecture.md#dos-and-donts-for-unit-vs-end-to-end-testing .	2022-12-08 08:22:08 -08:00
Ramiro Leal-Cavazos	dd35488da5	build: update llvm tag to 798fa4b4 (#1684 ) - Support for non-prefixed accessors has been removed. See: https://reviews.llvm.org/D136727 - Rename `operands` to `methodOperands` in `prim.CallMethod` since the name `operands` overlaps with a builtin method name. See: https://reviews.llvm.org/D136727 - Add passes in refbackend to lower memref.subview. See: https://reviews.llvm.org/D136377 - Replace `CopyToValueTensorOps` first in `RewriteViewLikeSubgraph` in maximize-value-semantics. The current implementation of the `RewriteViewLikeSubgraph` pass in maximize-value-semantics creates temporarily invalid IR. In particular, given a forward slice starting from a `CopyToNonValueTensorOp` and ending in `CopyToValueTensorOp`s, the pass first replaces all uses of the `CopyToNonValueTensorOp` with its operand, which results in all the `CopyToValueTensorOp` users having their operand have type `!torch.vtensor`, which is invalid. The correct way to do things is to first replace all the `CopyToValueTensorOp`s with their operand, and then replace all uses of the `CopyToNonValueTensorOp` with its operand. This only started failing now because the generated accessor `getOperand` for the `CopyToValueTensorOp` now returns a `TypedValue<NonValueTensorType>`, which has an assert checking that the value returned is of the expected type.	2022-12-07 12:20:41 -08:00
Vivek Khandelwal	e7edcc62fd	build: update llvm tag to 147fe9de Summary of changes: - Replace call to `MemoryEffectOpInterface::hasNoEffect` with `isMemoryEffectFree`. - Make fix for the dynamic dims, since `kDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` in llvm - `makeShapeLLVMCompatible` and `makeShapeTorchCompatible` utilities convert shapes in order to remain consistent with the Torch and MLIR semantics. - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-01 13:36:50 +05:30
Tanyo Kwok	bbcdb38d99	Revert "Decompose torch.slice_scatter (#1622 )" (#1659 ) This reverts commit `f3f2f10030`.	2022-11-30 12:47:13 +08:00
Vivek Khandelwal	d9cbf01d1e	Revert "build: update llvm tag to 147fe9de" This reverts commit `e45ad313d4`.	2022-11-25 12:41:56 +05:30
Vivek Khandelwal	e45ad313d4	build: update llvm tag to 147fe9de Summary of changes: - Update call to `hasNoEffect` utility - `KDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-24 12:44:43 +05:30
Tanyo Kwok	f3f2f10030	Decompose torch.slice_scatter (#1622 ) * Decompose torch.slice_scatter * fix compilation error * update file check * fix ci * fix i64 torch.tensor dtype	2022-11-23 18:14:12 +08:00
Vivek Khandelwal	da8fdc9f96	[MLIR][TORCH] Fix refine types crash This commit fixes https://github.com/llvm/torch-mlir/issues/1599. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-23 15:17:37 +05:30
Vivek Khandelwal	55c7e66aa7	[MLIR][TORCH] Fix mean and mean.dim op for large-sized inputs This commit fixes the aten.mean and aten.mean.dim op decomposition for supporting large-sized inputs. This commit also fixes the formatting for the file stats.py Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-22 08:38:51 +05:30
Vivek Khandelwal	4cbd3927d7	[MLIR][TORCH] Add aten.sort.int op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-20 19:00:41 +05:30
Daniel Ellis	a7ac0def45	Move single-tensor-tuple-return test to mlir unit test. Also, add multiple return test.	2022-11-10 09:23:53 -05:00
Xiafei Qiu	4f173c6e0f	update llvm tag to a2620e00. (#1567 ) - also update MHLO to 57ba12a2(branch greencommit/2022-11-07-a2620e00) - change -pass-pipeline format to make tests pass.	2022-11-10 18:39:28 +08:00
Ramiro Leal-Cavazos	b723186983	Remove all but one of valsem ops + move fill.Scalar to elementwise (#1531 ) This commit removes almost all of the valsem ops, since the value semantics version of the ops now exist in PyTorch. The only op missing is `aten.bernoulli_.float`. In addition, this commit also simplifies the implementation of `aten.fill.Scalar` by moving it to the pattern that converts elementwise ops.	2022-10-28 15:06:11 +00:00
Vivek Khandelwal	ca87033d2f	[MLIR][TORCH] Add E2E support for aten.mse_loss op This commit adds decomposition for the `aten.mse_loss` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-10-25 21:06:58 +05:30
Ramiro Leal-Cavazos	82a3860e25	build: update llvm tag to 4546397e (#1502 ) This commit makes the following changes needed to update bump LLVM: - Replace `linalg.init_tensor` with `tensor.empty` (see: https://reviews.llvm.org/D135129) - Replace `NoSideEffect` with `Pure` (see https://reviews.llvm.org/D135505) - Replace `body` region accessor for `ReduceOp` and `ReduceWindowOp` with `getBody` - Fix incorrect use of `tosa::ReduceSumOp` in `AtenNativeLayerNormOp` conversion pattern. The result type of `tosa::ReduceSumOp` must have the same rank as the input type. (see: https://www.mlplatform.org/tosa/tosa_spec.html#_reduce_sum) Co-authored-by: Ashay Rane <ashay@users.noreply.github.com> Co-authored-by: Ashay Rane <ashay@users.noreply.github.com>	2022-10-18 04:22:53 +00:00
Gaurav Shukla	da90a25f90	[MLIR][TORCH] Add E2E support for `aten.[div.int\|bitwise_or.Tensor]` ops This commit adds lowering of `aten.div.int` and `aten.bitwise_or.Tensor` ops. Both these ops are required in order to support bloom_560m model. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-10-10 22:28:51 +05:30
武家伟	c03aa63325	[MLIR] Add canonicalizer for aten.slice.t op (#1413 ) * [MLIR] Add canonicalizer for aten.slice.t op * Add mlir tests and strength the canonicalizer * rename variable Co-authored-by: Vremold <xremold@gamil.com>	2022-09-26 14:35:50 -07:00
Tanyo Kwok	061a97c3f2	Replace empty_like && empty_memory_format with full/full_like (#1398 ) * Replace empty_like && empty_memory_format with full/full_like * fix broadcast rank0 tensor	2022-09-23 10:24:36 +08:00
gpetters94	48418b9c22	Fold away type_as (#1358 )	2022-09-12 18:59:12 -04:00
Sean Silva	0e3ddbac91	Remove VerifyInvariantsBeforeBackendLowering LowerToBackendContract now checks all this consistently.	2022-08-26 10:24:43 -07:00
Tanyo Kwok	3d0e18bbe7	Add decomposition for aten.roll (#1170 ) * Add decomposition for aten.roll * add e2e unittest * refine type of torch.roll * fix aten::cat output type	2022-08-24 08:36:05 +08:00
Tanyo Kwok	9176b5ed29	Add decomposition for aten.flatten.using_ints (#1161 )	2022-08-23 11:52:54 +08:00
Sean Silva	01290d134a	Add a way for backends to control which ops are legal for them. We were already hitting many cases where backends different in terms of the legal ops that they wanted. This caused unnecessary coupling between the backends. Examples: - https://github.com/llvm/torch-mlir/pull/1161 - https://github.com/llvm/torch-mlir/pull/862 This PR centralizes all compilation to go through `torch_mlir.compile` so that we can keep the logic centralized there. We should move these lists closer to each backend. Especially cases like https://github.com/llvm/torch-mlir/pull/862 where blocking a decomposition is necessary to avoid a crash emphasize that the set of decompositions is tightly coupled to the backend, and should be "controlled by the backend" and not something arbitrarily tweakable. Also: - Fix a small bug in the way we passed through the backendLegalOps option. - Add better error messages in `torch_mlir.compile` for import errors.	2022-08-22 14:16:13 -07:00
武家伟	99fb4c8637	Add folder for ToF64Op and FromF64Op (#1257 )	2022-08-22 09:49:39 +08:00
Ramiro Leal-Cavazos	9bc606c384	Add support for returning more than one copy of the same tensor (#1228 ) One of the simplifications made by the pass `RefinePublicReturn` currently only happens if the tensor in question only has one user. However, the current method of checking this does not correctly handle the case of a user having multiple uses of the same tensor. This commit makes sure only unique users are considered.	2022-08-18 22:41:45 +00:00
Sean Silva	283e0f141a	Add a concept of "backend legal ops". This is a first step towards formalizing the set of ops in our backend contract. The goal is to eventually formalize `torch` dialect ops into 3 categories: 1. Legal in backend contract 2. Illegal in backend contract 3. Conditionally legal in backend contract The "conditionally legal" set are the ops that we can optionally decompose for backends. This patch adds relevant pass options for this throughout the compiler, in preparation for a new set of traits which will formalize this classification.	2022-08-18 11:46:50 -07:00
Sean Silva	57681f7947	Iteratively run the main simplification pipeline. This introduces a new pass LowerToBackendContract (better name very welcome) which performs the bulk of the simplifications that we do, such as - shape refinement - dtype refinement - maximizing value semantics - inlining global slots - decomposing complex ops The key difference from before is that it iterates the set of transformations, which can help to break a number of "catch-22" issues where one simplification depends on another, the latest example being here: https://github.com/llvm/torch-mlir/issues/1131 This also exposed that RefineTypes was sometimes crashing/asserting for certain inputs. This commit hardens it a bit.	2022-08-17 14:54:33 -07:00
武家伟	3b3cb99ef8	Generalize canonicalization pattern for more aten.sub/div/mul/add op (#1209 ) Generalize canonicalization pattern for more sub/div/mul/add op, but for AtenDivTensorModeOp in 'trunc' rounding mode, we try to fold it.	2022-08-16 13:24:08 +08:00
Sean Silva	504de5e701	Rework how global slot initializers work. Rather than a per-global-slot initializer region, we now have one for the whole module. For example, it might look like this: ``` torch.global_slot "private" @tensor : !torch.tensor torch.global_slot "private" @list : !torch.list<tensor> torch.global_slot.module_initializer { %0 = torch.tensor.literal(dense<0.0> : tensor<f32>) : !torch.tensor %1 = torch.prim.ListConstruct %0 : (!torch.tensor) -> !torch.list<tensor> torch.initialize.global_slots [ @tensor(%0 : !torch.tensor) @list(%1 : !torch.list<tensor>) ] } ``` This new structure allows GlobalizeObjectGraph to create the initializer in a much simpler way, avoiding the need to reason about whether different slots alias each other. Reasoning about whether slots alias each other now is the responsibility of InlineGlobalSlots, which has to do a much more complicated analysis, implemented using MLIR's dataflow analysis framework. Recommended review order: - Check out the new IR constructs in the .mlir files of various passes - Op definitions (*.td) - Changes to GlobalizeObjectGraph pass. - InlineGlobalSlots pass (~total rewrite) - Misc changes: - Moving torchMlirAdjustStaticInformation for sharing with C++ code. - EraseModuleInitializer pass To make this a bit nicer, it would be good to have a `torch.module` op with an initializer region attached. That would be more invasive though. This change has highlighted certain aspects of our project layering which are worth calling out. None of our backends can handle global slots, so we enforce that there are no global slots before backend lowering. At an earlier stage in the project, we had aspirations of transparently handling mutable global state and such, but for reasons described below, that is no longer a goal. So really global slots should be seen as a progressive lowering step as part of inlining all the IValue's in the original program (GlobalizeObjectGraph is also one such step). Over time, with insights from work like IREE-JAX, it has become clear that there isn't a reliable programming model we can compile for users where we just transparently handle mutable global state (and some other things, like lists and dictionaries). There is a need for an "outer program" that orchestrates more restricted subroutines of the kind we can handle in our compile flow here. The benefit of that is that it decouples considerations like shapes, dtypes, etc. from the program constructs used in the outer program. As long as the outer program can efficiently invoke (pipelining/async/etc.) high-performance data-parallel numerical subroutines of the kind we compile in our flow here, then there is a complete programming model. This is also consistent with the direction of upstream PyTorch which is becoming more tracing-based (which inherently loses a lot of program structure, which then has to be applied back with an "outer program" orchestrating the traced subroutines).	2022-08-08 18:12:06 -07:00
Tanyo Kwok	1ee865983b	[MHLO] fix tensor mode aten.div op pattern (#1160 ) * [MHLO] fix tensor mode aten.div op pattern See RFC #999 Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-06 23:38:06 +08:00
PhaneeshB	8b5631d4c5	[MLIR][TORCH] Add decomposition for aten.std.dim Op Signed-Off By: Phaneesh Barwaria <phaneesh@nod-labs.com>	2022-07-29 23:52:54 +05:30
Vivek Khandelwal	d386b8f9e5	[MLIR][TORCH] Add decomposition for aten.var.correction op This commit adds the decomposition for `aten.var.correction` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com	2022-07-29 11:08:57 +05:30
Quinn Dawkins	11a8901078	[MLIR][TORCH] Add support for multiple indexing tensors for aten.index.Tensor (#1097 ) - Includes a canonicalizer for `aten.add.t`needed for successfully lowering the shape function - Only offers support for statically sized index tensors when there is more than one - Dynamic shape support remains for single indexing tensors	2022-07-28 19:00:02 -04:00
Kevin Kiningham	e8f327cc00	Add lowering to linalg for softplus and log1p Follows existing conventions for unary operators.	2022-07-25 21:25:57 +05:30
Ramiro Leal-Cavazos	f271e6a88c	Add verifiers for ToBuiltinTensorOp and FromBuiltinTensorOp (#1089 ) This commit adds verifiers to the ops `ToBuiltinTensorOp` and `FromBuiltinTensorOp` that make sure that the input and output have the same shape and data type.	2022-07-21 21:41:45 +00:00
Vivek Khandelwal	4c25878e64	[MLIR][TORCH] Add canonicalization pattern for prim.ListUnpack op This commit adds the canonicalization pattern for the `prim.ListUnpack` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-18 13:51:25 +05:30
Vivek Khandelwal	3589134d31	[MLIR][TORCH] Add decomposition for aten.var.dim op This commit adds the decomposition for `aten.var.dim` op. This commit also make changes in the decomposition for `aten.var` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-15 09:53:42 +05:30
Ashay Rane	29bc48aedb	torch: add pass to catch non-value tensors (#1052 ) This patch adds a new pass `torch-verify-conversion-to-value-semantics`, which looks for non-value semantics tensors to catch such tensors early during compilation. This pass requires `torch-refine-public-return` pass to ensure that return operations are updated to use value tensors, followed by the canonicalize pass to remove any dead ops that may use or produce non-value tensors.	2022-07-13 17:11:15 -07:00
Ashay Rane	64c04bd5f6	canonicalizer: [nfc] update LIT variable names for consistency (#1051 ) A previous patch used lowercase names for LIT variables. This patch replaces them with uppercase names to maintain consistency with other variables.	2022-07-13 12:28:25 -07:00
Ashay Rane	ac4d7d10e0	canonicalizer: propagate type information across copy and cast ops (#1030 ) Prior to this patch, the canonicalizers for `AtenSizeOp` and `AtenSizeIntOp` succeeded only if the tensor operand's type information included the size of the requested dimension(s). We can extend the set of optimizable cases by propagating types across operations whose result type matches the input tensor type. Specifically, this patch enables the canonicalizers for `AtenSizeOp` and `AtenSizeIntOp` to see past `tensor_static_info_cast`, `copy.to_vtensor`, and `copy.to_tensor` ops until it reaches the first op whose result type contains size information for the requested dimensions, with a maximum bound of 6 parent lookups to avoid indefinite compilation times. All other encountered ops cause the canonicalizer to give up.	2022-07-12 12:38:37 -07:00

1 2 3 4 5 ...

381 Commits (44266ab0c439bcc30b70c92a3ee762618ccfc940)