torch-mlir

Commit Graph

Author	SHA1	Message	Date
MaheshRavishankar	28c7051ceb	Bump LLVM to llvm/llvm-project@5fcf907b34 (#2810 )	2024-01-26 18:38:44 -08:00
Ramiro Leal-Cavazos	dff3405d5a	Add alias analysis for cast-like ops to maximize-value-semantics (#2160 ) When `use_tracing=True` is used to import a model into Torch-MLIR, several casts get inserted in the IR to bridge the untyped inputs and outputs with the typed body of the computation. These casts create extra aliases of tensors that cause the current analysis in `maximize-value-semantics` to fail. In particular, the `maximize-value-semantics` analysis assumes that the only valid alias right after an overwrite is the overwritten alias. So, if there is a use of a casted version of the overwritten alias after the overwrite, the analysis fails. This commit improves the analysis by identifying all cast-like aliases of the overwritten alias and allowing such aliases to be used after an overwrite. Because this issue only arises when using tracing, it cannot be currently tested e2e, so only lit test is added.	2023-05-25 17:05:41 +00:00
Tanyo Kwok	577e38da58	build: update llvm tag to 7ccbb4df (#1736 ) Summary of changes: - LLVM now includes <optional> instead of "llvm/ADT/Optional.h" in most (although not all) places (https://reviews.llvm.org/rG541ef3d61e9341cd38420c0dbca9250c4d0ea04c). This patch replaces the affected instances of `llvm::Optional` with `std::optional`. - In the usages of llvm::Optional that remain, llvm::Optional::value() is deprecated, so this patch replaces them with a dereference.	2022-12-20 18:17:27 +08:00
Ramiro Leal-Cavazos	dd35488da5	build: update llvm tag to 798fa4b4 (#1684 ) - Support for non-prefixed accessors has been removed. See: https://reviews.llvm.org/D136727 - Rename `operands` to `methodOperands` in `prim.CallMethod` since the name `operands` overlaps with a builtin method name. See: https://reviews.llvm.org/D136727 - Add passes in refbackend to lower memref.subview. See: https://reviews.llvm.org/D136377 - Replace `CopyToValueTensorOps` first in `RewriteViewLikeSubgraph` in maximize-value-semantics. The current implementation of the `RewriteViewLikeSubgraph` pass in maximize-value-semantics creates temporarily invalid IR. In particular, given a forward slice starting from a `CopyToNonValueTensorOp` and ending in `CopyToValueTensorOp`s, the pass first replaces all uses of the `CopyToNonValueTensorOp` with its operand, which results in all the `CopyToValueTensorOp` users having their operand have type `!torch.vtensor`, which is invalid. The correct way to do things is to first replace all the `CopyToValueTensorOp`s with their operand, and then replace all uses of the `CopyToNonValueTensorOp` with its operand. This only started failing now because the generated accessor `getOperand` for the `CopyToValueTensorOp` now returns a `TypedValue<NonValueTensorType>`, which has an assert checking that the value returned is of the expected type.	2022-12-07 12:20:41 -08:00
Vivek Khandelwal	6db513c51d	[tosa] Add support for some cases of aten.broadcast_to op (#1429 ) This commit adds support for TorchToTosa lowering of `aten.broadcast_to` op for cases: 1.) When the rank of input and output tensor is equal. 2.) When the rank of input tensor is zero. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-29 09:40:56 -07:00
gpetters94	79b9cf9468	Add lowering for aten.to.device (#1107 )	2022-08-10 19:24:02 -04:00
Ashay Rane	bb47c166a0	llvm: update tag to 061e0189 (#1180 ) Summary of changes: - Switch to C++17 (similar to https://reviews.llvm.org/D131348) - Update MHLO to build with LLVM commit hash 061e0189 - Replace deprecated `hasValue()` and `getValue()` with `has_value()` and `value()` respectively (https://reviews.llvm.org/D131349) - Use `TypedAttr` (https://reviews.llvm.org/D130092) - Use updated assembly format of `mhlo.compare` op (commit d03ef01e70fbf9afd0fa1976fbb7ed31838929b3 in MHLO repo)	2022-08-08 20:17:35 -07:00
Alec	554570f3ab	Implemented a decomposition of aten::narrow	2022-08-01 18:32:14 +05:30
Vivek Khandelwal	77ab31641f	[MLIR][TORCH] Add decomposition of aten.numpy_T op This commit adds the decomposition of `aten.numpy_T` op into `aten.t` or `aten.permute` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-16 00:01:22 +05:30
Prashant Kumar	12b3af70d3	[TORCH] Add folding of aten.detach op. `aten.detach` op is folded and returns the first operand since it's an identity function(kind of identity just remove the has_grad attribute).	2022-05-10 21:54:45 +05:30
Yi Zhang	28be6511d2	Fix type promotion code for scalar only operations Fix the type promotion code for scalar only operation to return TorchType which is the type tracked in ValueKnowledge.scalarType. - Fix `getPromotedResultScalarType` to return Torch type. - Add `getBuiltInTypeForTorchScalar` helper to convert scalar type to builtin type before passing to the next level type promotion helper `updateResultTypeState`. - Add `setScalarType` helper to make setting ValueKnowledge.scalarType easier.	2022-05-07 10:37:21 -04:00
Vivek Khandelwal	c0634bc996	[MLIR][TORCH] Add E2E support for aten.to.dtype_layout op This commit decomposes `aten.to.dtype_layout` op into `aten.to.dtype` op. This commit also fixes the formatting for the file type_conversion.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-03 12:48:58 +05:30
Ashay Rane	9208bf0eb6	llvm: bump tag to e1318078 (#781 ) The updated LLVM code includes a patch to create bfloat16 array attributes, thus enabling a different patch to torch-mlir to flesh out support for the bfloat16 type.	2022-04-26 12:27:51 -07:00
Maksim Levental	25ba51b2af	This commit decomposes aten._reshape_alias op into aten.view op. (#690 )	2022-03-28 23:54:28 -05:00
Vigilans	63fb1e5aad	Bump LLVM at 8361c5da30588d3d4a48eae648f53be1feb5cfad	2022-03-18 13:16:14 -04:00
Ramiro Leal-Cavazos	0bcc6d1075	Add maximize-value-semantics support for multiple non-value tensor inputs (#659 ) This commit adds value semantics support for ops such as `aten.view_as` and `aten.expand_as` that take two non-value tensors as input.	2022-03-15 18:13:45 -07:00
Sean Silva	a5fe0cf063	Introduce new shape library design. See the documentation in `docs/shape_lib.md` and `docs/adding_a_shape_function.md` for an overview of the system. This completely overhauls how we represent shape functions. In particular, RefineTypes does not infer shapes anymore (only dtypes). Shape functions are now written in (TorchScript'able) Python. Recommended review order: 1. Read `docs/shape_lib.md` and `docs/adding_a_shape_function.md`. 1. Code and tests for ReifyShapeCalculations, DropShapeCalculations. 1. Code and tests for SimplifyShapeCalculations. 1. shape_lib_gen.py 1. Code and tests for new RefineTypes pass. 1. Random folders/canonicalizers in TorchOps.cpp and associated test in `canonicalize.mlir`. 1. New ReadOnly trait inferred from the registry. 1. Any miscellaneous remaining stuff. Example `-print-ir-after-all` for ElementwiseUnaryModule: [IR lowering dump](https://gist.github.com/silvasean/e4dc8cbc8d00aac7819602e3cbd8e212). Example `-print-ir-after-all` for ElementwiseBinaryModule: [IR lowering dump](https://gist.github.com/silvasean/daf6860ecced732af3568af6b1899113).	2022-03-15 12:41:58 -07:00
Ramiro Leal-Cavazos	51e267aa37	Combine maximize-value-semantics rewrite patterns into one pattern (#642 ) This commit replaces the two rewrite patterns of maximize-value-semantics with a single pattern that captures the behavior of both as well as other edge cases previously not supported. The new pattern works by first performing alias analysis on a subgraph to see if pattern is applicable, then rewriting all non-value tensors to value tensors in a single go.	2022-03-10 09:36:52 -08:00
Ramiro Leal-Cavazos	ba29d4f250	Add operand type invariant to `torch.overwrite.tensor.contents` (#606 ) This commit adds the invariant to the op `torch.overwrite.tensor.contents` that both of its operands have the same shape and size. In order to maintain the invariant, special handling of this op is added to the `RefineTypes` pass.	2022-02-22 11:41:46 -08:00
Ramiro Leal-Cavazos	ea371a9bf2	Fix handling of view-like ops in `maximize-value-semantics` (#611 ) This commit adds handling to the `maximize-value-semantics` pass for the case where a view-like op depends on a tensor that has been overwritten by a value tensor. The approach for removing the dependency is to change the input to the view-like op to be a copy of the value tensor that is being used to overwrite. This commit also removes `AtenFill_ScalarOp` and `AtenBernoulli_FloatOp` from the list of view-like ops, since these ops now have a corresponding op with value semantics into which they get converted in the `reduce-op-variants` pass.	2022-02-18 10:19:07 -08:00
Prashant Kumar	258660deb6	Add aten.bernoulli decomposition. aten.bernoulli is decomposed to aten.gtTensor(aten.uniform(x), x).	2022-02-11 00:35:33 +05:30
Gaurav Shukla	0079901039	[TORCH][MLIR] Add E2E support for `aten.reshape` op This commit decomposes `aten.reshape` into `aten.view` op in the case of value tensor type operand. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-02 20:41:47 +05:30
Nirvedh	3cb46cecef	Added aten::t() Op	2021-12-22 10:57:10 -05:00
Gaurav Shukla	5a47f92390	[TORCH][MLIR] Add E2E support for `aten.squeeze.dim` op This commit adds lowering of `aten.squeeze.dim` op into `linalg.TensorCollapseShape` op. Here, the dim(th) dimension of the input tensor is not supposed to be dynamic. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-10 17:01:20 +05:30
Daniel Garvey	a52aded0b9	Add lowering for slice and selectInt (#398 )	2021-12-02 22:09:21 -06:00
Gaurav Shukla	73b27b32dc	[MLIR][TORCH] Add E2E support for `aten.squeeze` op This commit adds lowering of `aten.Squeeze` op into `linalg.TensorCollapseShape` op. The size 1 dynamic dimensions are not handled as a part of this commit. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-11-30 23:00:28 +05:30
Prashant Kumar	36afa4a4d3	Add aten.fill.Scalar op lowering The lowering of aten.fill.Scalar has been added. The changes have been made as a part of -torch-convert-to-linalg pass. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-11-30 21:12:15 +05:30
Prateek Gupta	18e8806b14	[TORCH][MLIR] Add E2E support for aten::to.dtype. This commit adds end to end support for AtenToDtypeOp from aten to linalg. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-11-08 12:56:03 -05:00
Gaurav Shukla	2ce47dc8e4	[TORCH][MLIR] Add E2E support for aten.expand This commit adds decomposition of `aten.Expand` to `aten.BroadcastTo` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-11-03 23:58:59 +05:30
Gaurav Shukla	69eaf9a154	[MLIR][TORCH] Add E2E support for `torch.aten.view` - This commit adds lowering of `aten.View` to `linalg.TensorExpandShape`. - This lowering will be successful only when one or more static dimensions are expanded. - It also fixes a typo in `ConvertAtenFlattenUsingIntsOp` conversion pattern. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-10-29 22:33:10 +05:30
George Petterson	2ea2ab518b	Add contiguous	2021-10-29 11:11:50 -04:00
Prateek Gupta	c33a2ca952	[TORCH][MLIR] Add E2E support for aten.permute. This commit adds lowering of aten.permute to linalg.generic operation. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-10-28 10:25:26 -04:00
George Petterson	7c47b9a0c8	Formatting fix	2021-10-19 13:33:31 -04:00
George Petterson	8853dfbc74	Add broadcast	2021-10-19 13:33:31 -04:00
Yi Zhang	a459e09ab7	E2e support for aten.softmax.int and aten.embedding - Added a DecomposeComplexOps pass to decompose complex torchOps. - Refactored `visitAtenArgmaxOp` and `visitAtenAnyDimOp` to `visitReductionAlongDimIntOp`. - Moved some helper functions into torch-mlir/Dialect/Torch/Utils/Utils.h to be shared by multiple files. - Added support for f64 tensor as argument and return types.	2021-10-18 17:57:45 -04:00
Sean Silva	5b6902e31c	Dual license the torch-mlir project. This commit (with approval from all contributors) dual licenses the torch-mlir project under both the standard LLVM license and the standard PyTorch license. This will facilitate moving code between torch-mlir and the two upstream projects. The standard file comment is now: ``` // This file is licensed under the Apache License v2.0 with LLVM Exceptions. // See https://llvm.org/LICENSE.txt for license information. // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception // Also available under a BSD-style license. See LICENSE. ``` See `LICENSE` in the project root for the terms of both licenses.	2021-10-01 10:46:08 -07:00
Sean Silva	4fad753073	Move external/torch-mlir to the root of the repo.	2021-09-27 17:11:08 -07:00
Sean Silva	28a7738189	[torch-mlir earthmoving (1/N)] C/C++ code movement. This creates the `external/torch-mlir` directory as an LLVM_EXTERNAL_PROJECTS-compatible project (analogous to `iree-dialects`) and completes movement/rename of all pure MLIR C/C++ compiler code into there. The next step will be to move all the Python code / code that links/includes PyTorch C++ code (which currently lives in `frontends/pytorch`) into a subdirectory here. I call this "earthmoving" because it is mostly mechanical changes and renames. As a quick summary (we can change this down the road easily) - C++ `mlir::NPCOMP::Torch -> mlir::torch::Torch` - CAPI `npcompTorchListTypeGet -> torchMlirTorchListTypeGet` - preprocessor `#ifndef NPCOMP_ -> #ifndef TORCHMLIR_` - CMake `NPCOMPFoo -> TorchMLIRFoo` The goal of this is to create a standalone project creating a center of mass for entry into the MLIR ecosystem from PyTorch, suitable in scope for eventual inclusion/ownership in PyTorch. The idea is that `external/torch-mlir` will some day be pulled out into its own repository, and then npcomp will simply pull it in as a submodule. Layering-wise, what lives in `torch-mlir` lowers code from PyTorch (currently TorchScript, but TorchFX or pytorch/xla-style tracing are possible extensions) down to what we have been calling the "Torch backend contract" which is cleaned up IR (inlining, simplifcation, conversion to value tensors, ...) entirely in the `torch` dialect. This is the branching off point for further lowering, of which npcomp takes one opinion (outside `torch-mlir` of course!), namely the `TorchConversion` dialect/transforms which lower to IR suitable for IREE and other linalg-on-tensors based lower-level compilers. Summary of changes: - move `{include,lib,test}/Dialect/Torch` into `torch-mlir` - move relevant parts of CAPI into `torch-mlir`. - leave a few things related to the `torch-mlir` Python build commented out, which should be resolved in a subsequent change.	2021-09-10 21:44:37 -07:00
Sean Silva	79928cd2dd	Generalize support for elementwise ops. We plumb through e2e a fair number of interesting cases: - unary, binary, ternary elementwise ops - ops like `torch.aten.add.Tensor` that also take a scalar parameter - static size-1 broadcasting We allow the static size-1 broadcasting case, but emit a runtime error in the case of dynamic size-1 broadcasting. This seems like a sweet spot subset of things that can be lowered directly to linalg, while not being overly constraining to users. This is consistent with what IREE is doing for CHLO->Linalg lowering as well ([code](`50bf7a87e4/iree/compiler/InputConversion/MHLO/BroadcastingToLinalgPatterns.cpp (L1)`)). To test the static size-1 case, we added support for the `torch.aten.unsqueeze` op and lowering for it through `linalg.tensor_expand_shape`. This involved a generalization of `MaximizeValueSemantics` able to handle it (the solution there also works for `torch.aten.flatten.using_ints` which we need for ResNet anyway) Also, a few minor additional changes: - Add `VerifyInvariantsBeforeBackendLowering` pass, which catches a large class of errors before we get to backend lowering (now that we are doing dialect conversion, the errors are way nicer if we just emit them up front rather than in the guts of a random pattern). - Minor change to RefBackend to allow `linalg.tensor_expand_shape`. Recommended review order: - e2e tests in elementwise.py - `ConvertElementwiseOp` in TorchToLinalg.cpp + elementwise.mlir test - `ConvertAtenUnsqueezeOp` in TorchToLinalg.cpp + unsqueeze.mlir test - RefineTypes.cpp + tests - MaximizeValueSemantics changes + test - VerifyInvariantsBeforeBackendLowering pass + test	2021-06-28 13:28:38 -07:00
Sean Silva	79aade33da	Make MaximizeValueSemantics a bit smarter. This adds a pattern to MaximizeValueSemantics which does a simple abstract interpretation within a block, which handles simple cases of `torch.overwrite_tensor`, enough to remove all the unnecessary uses of non-value tensors in ResNet right now. Before/after IR: [gist](https://gist.github.com/silvasean/a3e1ef625b19dfc63579f73cd3b543b6) Also, - Split `torch.copy.tensor` into `torch.copy.to_tensor` and `torch.copy.to_vtensor` which convert between value and non-value semantic tensors. This is a much cleaner factorization as they have very separate use cases and properties (e.g. different side effects) - Remove the various canonicalization patterns they had, which were confusing because they resulted in limited forms of maximizing value semantics throughout the pipeline. We should structure our compilation pipeline such that only MaximizeValueSemantics should be maximizing value semantics. - Adjust pass pipeline to only run MaximizeValueSemantics once. - Make OverwriteTensorOp `$value` always be a value tensor and `$overwritten` be a non-value tensor.	2021-06-22 16:48:57 -07:00
Sean Silva	784156a998	Add `!torch.bool` type. This finishes removing the dependence on the basicpy dialect! Changes: - Add `!torch.bool` type and replace use of `!basicpy.BoolType` in Torch-related code. - Rename BuiltinTensorize to BackendTypeConversion since now it handles bool conversions (and, when we add !torch.int and !torch.float, it will handle those as well), and generalize the related utilities (I also moved them to Torch/Transforms since they aren't really part of Torch/IR). - Add `torch.to_i1` and `torch.from_i1` ops for materializations - [cleanup] Reorganize `torch.constant.*` ops in TorchOps.td - Remove dependency of `torch` dialect on `basicpy` dialect and also `std` dialect. For `std`, we use some call related ops, but the `torch` dialect itself never produces them (we have passes that do though). This is fairly mechanical. Recommended review order: - New stuff in Torch/IR - New BuiltinTypeConversion files. - Mechnical fixups elsewhere.	2021-06-16 13:22:00 -07:00
Sean Silva	370e3270ab	Introduce `!torch.tensor` / `!torch.vtensor` types. This removes our reliance on the numpy dialect and avoids our off-label use of the builtin tnesor type for modeling unknown dtypes. The `!torch.vtensor` (`ValueTensorType`) type is a value-semantic tensor. The `!torch.tensor` (`NonValueTensorType`) type is a non-value-semantic tensor. The new types look as follows syntactically: ``` // Least-static-information, non-value-semantic tensor. !torch.tensor // Explicit form of least-static-information variant. !torch.tensor<,unk> // Least-static-information, value-semantic tensor. !torch.vtensor // Explicit form of least-static-information variant. !torch.vtensor<,unk> // Fixed-set of allowable element types, with first-class support for // Torch's frontend signedness semantics. !torch.tensor<*,si32> // First-class support for unknown dtypes. !torch.tensor<[?,?,?],unk> // Standard MLIR representation of `?` for unknown dimensions. !torch.tensor<[?,2,?,4],unk> // Statically shaped / dtyped example. !torch.vtensor<[1,2,3,4],f32> ``` This required fairly significant changes throughout the compiler, but overall it is a big cleanup. We now have a much clearer layering of "the Torch frontend lowering" vs "lowering to std + linalg + etc.". At the C++ level, there is `ValueTensorType`, `NonValueTensorType`. We also have a helper `BaseTensorType` (kind of like ShapedType) which interoperates with those two. Included changes: - New `torch.tensor(dense<0.0> : tensor<5xf32>) : !torch.tensor` op for creating torch tensor literals in the frontend. - Consistently use signedness for the types (except i1 which I didn't touch -- we need to sort out the situation with !basicpy.BoolType there anyway so will be attending to that soon) - Frontend can annotate whether an argument to the function has value semantics. We currently require this, as our backend contract does not currently allow us to even model the non-value-semantic case. Before, the value-semantic assumption was randomly injected in the middle of the pass pipeline. - Move ArrayToTensor (now called MaximizeValueSemantics) and RefinePublicReturn passes to torch dialect. - The TorchToStd and TorchToLinalg passes are now type conversions from `!torch.vtensor` to `tensor` and use the dialect conversion infra. The overall conversion pipeline is set up following the best practices of the "Type Conversions the Not-So-Hard Way" talk. This required introducing `torch-func-builtin-tensorize` and `torch-finalizing-builtin-tensorize` passes analogous to the upstream bufferization passes with the corresponding names (mostly just copypasta from there). - Misc Torch-level canonicalizations -- we now cleanly layer the lowering to std later in the pipeline, so we are gradually lessening our reliance on random std constant folding before we get to that point. Recommended review order: - New types in TorchTypes.td/TorchTypes.h/TorchDialect.cpp - New ops in TorchOps.td / TorchOps.cpp - Less important / more mechanical stuff - Frontend changes. - Pass changes/additions in `Torch/Transforms` and `Conversion/`	2021-06-10 10:56:48 -07:00

42 Commits (401869e31dc49692e21edd3069072016d46469e2)