torch-mlir

Author	SHA1	Message	Date
Tanyo Kwok	9176b5ed29	Add decomposition for aten.flatten.using_ints (#1161)	2022-08-23 11:52:54 +08:00
Sean Silva	01290d134a	Add a way for backends to control which ops are legal for them. We were already hitting many cases where backends differed in terms of the legal ops that they wanted. This caused unnecessary coupling between the backends. Examples: - https://github.com/llvm/torch-mlir/pull/1161 - https://github.com/llvm/torch-mlir/pull/862 This PR centralizes all compilation to go through `torch_mlir.compile` so that we can keep the logic centralized there. We should move these lists closer to each backend. Especially cases like https://github.com/llvm/torch-mlir/pull/862 where blocking a decomposition is necessary to avoid a crash emphasize that the set of decompositions is tightly coupled to the backend, and should be "controlled by the backend" and not something arbitrarily tweakable. Also: - Fix a small bug in the way we passed through the backendLegalOps option. - Add better error messages in `torch_mlir.compile` for import errors.	2022-08-22 14:16:13 -07:00
武家伟	99fb4c8637	Add folder for ToF64Op and FromF64Op (#1257)	2022-08-22 09:49:39 +08:00
Ramiro Leal-Cavazos	9bc606c384	Add support for returning more than one copy of the same tensor (#1228 ) One of the simplifications made by the pass `RefinePublicReturn` currently only happens if the tensor in question only has one user. However, the current method of checking this does not correctly handle the case of a user having multiple uses of the same tensor. This commit makes sure only unique users are considered.	2022-08-18 22:41:45 +00:00
Sean Silva	283e0f141a	Add a concept of "backend legal ops". This is a first step towards formalizing the set of ops in our backend contract. The goal is to eventually formalize `torch` dialect ops into 3 categories: 1. Legal in backend contract 2. Illegal in backend contract 3. Conditionally legal in backend contract The "conditionally legal" set are the ops that we can optionally decompose for backends. This patch adds relevant pass options for this throughout the compiler, in preparation for a new set of traits which will formalize this classification.	2022-08-18 11:46:50 -07:00
Sean Silva	57681f7947	Iteratively run the main simplification pipeline. This introduces a new pass LowerToBackendContract (better name very welcome) which performs the bulk of the simplifications that we do, such as - shape refinement - dtype refinement - maximizing value semantics - inlining global slots - decomposing complex ops The key difference from before is that it iterates the set of transformations, which can help to break a number of "catch-22" issues where one simplification depends on another, the latest example being here: https://github.com/llvm/torch-mlir/issues/1131 This also exposed that RefineTypes was sometimes crashing/asserting for certain inputs. This commit hardens it a bit.	2022-08-17 14:54:33 -07:00
武家伟	3b3cb99ef8	Generalize canonicalization pattern for more aten.sub/div/mul/add op (#1209) Generalize the canonicalization pattern to cover more sub/div/mul/add ops, but for AtenDivTensorModeOp in 'trunc' rounding mode, we try to fold it.	2022-08-16 13:24:08 +08:00
Sean Silva	504de5e701	Rework how global slot initializers work. Rather than a per-global-slot initializer region, we now have one for the whole module. For example, it might look like this: ``` torch.global_slot "private" @tensor : !torch.tensor torch.global_slot "private" @list : !torch.list<tensor> torch.global_slot.module_initializer { %0 = torch.tensor.literal(dense<0.0> : tensor<f32>) : !torch.tensor %1 = torch.prim.ListConstruct %0 : (!torch.tensor) -> !torch.list<tensor> torch.initialize.global_slots [ @tensor(%0 : !torch.tensor) @list(%1 : !torch.list<tensor>) ] } ``` This new structure allows GlobalizeObjectGraph to create the initializer in a much simpler way, avoiding the need to reason about whether different slots alias each other. Reasoning about whether slots alias each other now is the responsibility of InlineGlobalSlots, which has to do a much more complicated analysis, implemented using MLIR's dataflow analysis framework. Recommended review order: - Check out the new IR constructs in the .mlir files of various passes - Op definitions (*.td) - Changes to GlobalizeObjectGraph pass. - InlineGlobalSlots pass (~total rewrite) - Misc changes: - Moving torchMlirAdjustStaticInformation for sharing with C++ code. - EraseModuleInitializer pass To make this a bit nicer, it would be good to have a `torch.module` op with an initializer region attached. That would be more invasive though. This change has highlighted certain aspects of our project layering which are worth calling out. None of our backends can handle global slots, so we enforce that there are no global slots before backend lowering. At an earlier stage in the project, we had aspirations of transparently handling mutable global state and such, but for reasons described below, that is no longer a goal. So really global slots should be seen as a progressive lowering step as part of inlining all the IValue's in the original program (GlobalizeObjectGraph is also one such step). 
Over time, with insights from work like IREE-JAX, it has become clear that there isn't a reliable programming model we can compile for users where we just transparently handle mutable global state (and some other things, like lists and dictionaries). There is a need for an "outer program" that orchestrates more restricted subroutines of the kind we can handle in our compile flow here. The benefit of that is that it decouples considerations like shapes, dtypes, etc. from the program constructs used in the outer program. As long as the outer program can efficiently invoke (pipelining/async/etc.) high-performance data-parallel numerical subroutines of the kind we compile in our flow here, then there is a complete programming model. This is also consistent with the direction of upstream PyTorch which is becoming more tracing-based (which inherently loses a lot of program structure, which then has to be applied back with an "outer program" orchestrating the traced subroutines).	2022-08-08 18:12:06 -07:00
Tanyo Kwok	1ee865983b	[MHLO] fix tensor mode aten.div op pattern (#1160 ) * [MHLO] fix tensor mode aten.div op pattern See RFC #999 Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-06 23:38:06 +08:00
PhaneeshB	8b5631d4c5	[MLIR][TORCH] Add decomposition for aten.std.dim Op Signed-Off By: Phaneesh Barwaria <phaneesh@nod-labs.com>	2022-07-29 23:52:54 +05:30
Vivek Khandelwal	d386b8f9e5	[MLIR][TORCH] Add decomposition for aten.var.correction op This commit adds the decomposition for `aten.var.correction` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-29 11:08:57 +05:30
Quinn Dawkins	11a8901078	[MLIR][TORCH] Add support for multiple indexing tensors for aten.index.Tensor (#1097) - Includes a canonicalizer for `aten.add.t` needed for successfully lowering the shape function - Only offers support for statically sized index tensors when there is more than one - Dynamic shape support remains for single indexing tensors	2022-07-28 19:00:02 -04:00
Kevin Kiningham	e8f327cc00	Add lowering to linalg for softplus and log1p Follows existing conventions for unary operators.	2022-07-25 21:25:57 +05:30
Ramiro Leal-Cavazos	f271e6a88c	Add verifiers for ToBuiltinTensorOp and FromBuiltinTensorOp (#1089 ) This commit adds verifiers to the ops `ToBuiltinTensorOp` and `FromBuiltinTensorOp` that make sure that the input and output have the same shape and data type.	2022-07-21 21:41:45 +00:00
Vivek Khandelwal	4c25878e64	[MLIR][TORCH] Add canonicalization pattern for prim.ListUnpack op This commit adds the canonicalization pattern for the `prim.ListUnpack` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-18 13:51:25 +05:30
Vivek Khandelwal	3589134d31	[MLIR][TORCH] Add decomposition for aten.var.dim op This commit adds the decomposition for `aten.var.dim` op. This commit also makes changes to the decomposition for `aten.var` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-15 09:53:42 +05:30
Ashay Rane	29bc48aedb	torch: add pass to catch non-value tensors (#1052 ) This patch adds a new pass `torch-verify-conversion-to-value-semantics`, which looks for non-value semantics tensors to catch such tensors early during compilation. This pass requires `torch-refine-public-return` pass to ensure that return operations are updated to use value tensors, followed by the canonicalize pass to remove any dead ops that may use or produce non-value tensors.	2022-07-13 17:11:15 -07:00
Ashay Rane	64c04bd5f6	canonicalizer: [nfc] update LIT variable names for consistency (#1051 ) A previous patch used lowercase names for LIT variables. This patch replaces them with uppercase names to maintain consistency with other variables.	2022-07-13 12:28:25 -07:00
Ashay Rane	ac4d7d10e0	canonicalizer: propagate type information across copy and cast ops (#1030 ) Prior to this patch, the canonicalizers for `AtenSizeOp` and `AtenSizeIntOp` succeeded only if the tensor operand's type information included the size of the requested dimension(s). We can extend the set of optimizable cases by propagating types across operations whose result type matches the input tensor type. Specifically, this patch enables the canonicalizers for `AtenSizeOp` and `AtenSizeIntOp` to see past `tensor_static_info_cast`, `copy.to_vtensor`, and `copy.to_tensor` ops until it reaches the first op whose result type contains size information for the requested dimensions, with a maximum bound of 6 parent lookups to avoid indefinite compilation times. All other encountered ops cause the canonicalizer to give up.	2022-07-12 12:38:37 -07:00
Sean Silva	e5e11e214b	GlobalizeObjectGraph: Clean up handling of unused slots The way we did it previously still created the slot and copied the initializer even if unused.	2022-07-12 10:47:28 -07:00
Ashay Rane	9017be9e9e	torch: copy uses to prevent iterator invalidation (#1033 ) Prior to this patch, the code in the `torch-simplify-shape-calculations` pass iterated on the uses of an op's result while also modifying the value. This caused the iterator to get invalidated, thus terminating the loop early and producing incorrect IR. This patch makes use of `llvm::make_early_inc_range()` to ensure that the iterator is not invalidated while executing the loop body.	2022-07-11 18:47:04 -07:00
Ramiro Leal-Cavazos	11148e60d6	Undo shape lib changes + update function signature of sum + zero (#1035 ) This commit does three things: 1. Reverts some of the shape lib changes merged in https://github.com/llvm/torch-mlir/pull/844 2. Updates the signature of `aten.sum_dim_IntList` that was recently updated in `23bdb570cf` 3. Replaces `aten.zero.functional` with `aten.zero`, updated in `960758b0b7`	2022-07-11 10:56:12 -07:00
Prateek Gupta	2d75654b2c	[TORCH][MLIR] Add lowering of `aten.slice_scatter` and `aten.select_scatter` op. This commit adds: 1. Lowering of `aten.slice_scatter` op into `tensor.insert_slice` op. 2. Decomposition of the `aten.select_scatter` op into `aten.slice_scatter` op. Signed-Off-By: Prateek Gupta <gprateek93@gmail.com>	2022-07-11 14:07:21 +05:30
Ashay Rane	340d8af28a	torch: handle `torch.prim.dtype` ops during type refinement (#1013 ) The canonicalizer converts `torch.prim.dtype` ops into integer constants for valid types, but the type may not be known until type refinement is complete. However, type refinement cannot make progress until `torch.prim.dtype` ops have been resolved to their corresponding integer constants, thus creating a circular dependency. This patch creates a tight coupling between type refinement and the lowering of `torch.prim.dtype` ops by handling such ops as they are encountered during type refinement. The unit test in this patch aims to check whether the type refinement pass can now handle chains of operations that alternate between type construction and type refinement.	2022-07-08 16:38:51 -07:00
Ramiro Leal-Cavazos	6a72ab4502	Add basic support for list of optional tensors in reduce-op-variants (#971 ) This commit adds support for lists of type `list<optional<tensor>>` where each element in the list is either a `!torch.tensor` or a `!torch.none`.	2022-07-08 11:12:15 -07:00
Quinn Dawkins	f0c3b5a7ed	Add E2E support for aten.len.str (#969)	2022-07-07 10:41:55 -07:00
Ashay Rane	88316b3b4e	torch: fold prim.dtype(bf16) to integer constant 15 (#1012 ) A prior patch (`63538de2`) that added support for bfloat16 type did not add the canonicalization pattern to fold `torch.prim.dtype` operations on bfloat16 tensors into the integer constant 15. This patch fixes the problem.	2022-07-06 18:21:43 -07:00
Tanyo Kwok	d4f1f41435	[MLIR][TORCH] Add decomposition of aten.repeat (#932 ) * [MLIR][TORCH] Add decomposition of aten.repeat * refine & rebase * refine static shapes * add e2e test * Rebase and Refine naming style	2022-07-01 13:02:31 +08:00
Sean Silva	227dea7b2e	Add support for ScalarType::QUInt8 I ran into this while poking around at https://github.com/llvm/torch-mlir/issues/959	2022-06-29 15:33:28 -07:00
Ashay Rane	163fa57cde	torch: allow torch dialect ops after running drop-shape pass (#979 ) In the `pyhpc_turbulent_kinetic_energy` TorchBench benchmark, the shape calculation occurs inside loops, but because `DropShapeCalculationsPass` does not explicitly mark the Torch dialect as legal, the pass execution fails. This patch adds Torch to the list of legal dialects, and adds a test to validate the translation.	2022-06-25 07:27:47 -07:00
Tanyo Kwok	143a7bcb76	[MLIR][TORCH] Add folder for torch_c.from_i64 & torch_c.to_i64 (#933 ) * [MLIR][TORCH] Add folder for torch_c.from_i64 & torch_c.to_i64 * add unit tests for each individual fold * fix failure of NumelZeroRankModule & TestMultipleTensorAndPrimitiveTypesReturn	2022-06-24 09:34:39 +08:00
erman-gurses	5cff40c88a	Add canonicalization for aten.add.tensor op	2022-06-23 17:24:59 -04:00
Maksim Levental	829717c96e	Bump LLVM (#958)	2022-06-22 22:23:46 -05:00
Vivek Khandelwal	77ab31641f	[MLIR][TORCH] Add decomposition of aten.numpy_T op This commit adds the decomposition of `aten.numpy_T` op into `aten.t` or `aten.permute` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-16 00:01:22 +05:30
Vivek Khandelwal	33fa8e7761	[MLIR][TORCH] Add decomposition of aten.floor_divide op This commit adds the decomposition of `aten.floor_divide` op into `aten.div.Tensor_mode` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-14 08:56:25 +05:30
Vivek Khandelwal	a11ef674a7	[MLIR][TORCH] Add E2E support for aten.baddbmm op This commit decomposes `aten.baddbmm` op into `aten.bmm`, `aten.mul.Scalar`, and `aten.add.Tensor` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-07 22:26:28 +05:30
Vivek Khandelwal	2718b4d838	[MLIR][TORCH] Add E2E support for aten.clamp_[min\|max] op This commit decomposes `aten.clamp_min` and `aten.clamp_max` op into `aten.clamp` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-06 11:52:29 +05:30
Henry Tu	abf5c94a1b	Replace valsem.aten.zero with aten.zero.functional (#893)	2022-06-03 16:27:31 -04:00
Vivek Khandelwal	6f548fc3ad	[MLIR][TORCH] Add decomposition of aten.adaptive_avg_pool2d op This commit adds the decomposition of `aten.adaptive_avg_pool2d` op into `aten.avg_pool2d` op. The current decomposition only supports cases where input size is equal to the output size. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-27 07:56:37 +05:30
Vivek Khandelwal	56e77d4213	[MLIR][TORCH] Add E2E support for aten.Bool.[float\|int] op This commit adds lowering of `aten.Bool.float` and `aten.Bool.int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-24 21:18:34 +05:30
Vivek Khandelwal	bc9b2156e3	[MLIR][TORCH] Add E2E support for aten.sqrt.int op This commit adds lowering of `aten.sqrt.int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-24 16:50:39 +05:30
Sean Silva	3fb54cba4c	torch.prim.TupleIndex: Adjust tensor types when folding. In cases where a refinement/derefinement was needed, we didn't fold. Fixes https://github.com/llvm/torch-mlir/issues/863	2022-05-19 09:36:27 -07:00
Ashay Rane	bb52a460cb	mlir: bump llvm tag to 5380e3 (#856) In addition to updating the llvm-project submodule, this patch also: 1. updates shape functions and tests so that `func` and `call` operations refer to the `func` dialect 2. avoids duplicate registration of dialects	2022-05-16 12:54:35 -07:00
Ramiro Leal-Cavazos	96f90efd16	Add shape info to `rand_like` + support for `dtype` flag (#851 ) The op `aten.rand_like` was missing a shape function, unit tests, and the `dtype` argument was being ignored in its decomposition. This commit fixes all three things.	2022-05-12 16:00:59 -07:00
Yi Zhang	28be6511d2	Fix type promotion code for scalar only operations Fix the type promotion code for scalar only operation to return TorchType which is the type tracked in ValueKnowledge.scalarType. - Fix `getPromotedResultScalarType` to return Torch type. - Add `getBuiltInTypeForTorchScalar` helper to convert scalar type to builtin type before passing to the next level type promotion helper `updateResultTypeState`. - Add `setScalarType` helper to make setting ValueKnowledge.scalarType easier.	2022-05-07 10:37:21 -04:00
Vivek Khandelwal	96fabc0036	[MLIR][TORCH] E2E support for [ge\|ceil].float, [ge\|ne\|gt].float_int op This commit adds lowering of `aten.ge.float`, `aten.ge.float_int`, `aten.ne.float_int`, `aten.gt.float_int` and `aten.ceil.float` op. This commit also fixes formatting for the file scalar.py and scalar_comparison.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-05 21:48:35 +05:30
Yi Zhang	9f7264a7a4	Add support for scalar type propagation The main changes are: - Added `ValueKnowledge.scalarType` to track scalar type information. - Added `ValueKnowledge.kind` to indicate the value kind. - Modified the meet and join helper functions. The ValueKnowledge has slightly more complicated state now so the meet and join functions need to look at the `kind` field in addition to just the type field.	2022-05-04 16:57:56 -04:00
Sean Silva	32159c4e54	Fix TupleIndex canonicalizer. It would change the result type.	2022-05-03 09:08:49 -07:00
Vivek Khandelwal	c0634bc996	[MLIR][TORCH] Add E2E support for aten.to.dtype_layout op This commit decomposes `aten.to.dtype_layout` op into `aten.to.dtype` op. This commit also fixes the formatting for the file type_conversion.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-03 12:48:58 +05:30
Prateek Gupta	81ee5bb58c	[TORCH][MLIR] Fix ConstantPad2dStaticModule test. This commit fixes the `ConstantPad2dStaticModule` test case by adding the lowering of `aten.pad` operation. Previously the test case mapped to `aten.constant_pad_nd` operation. The `aten.pad` now decomposes into `aten.constant_pad_nd` operation. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2022-04-29 21:57:01 +05:30
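
Commit 57681f7947 above describes iterating a set of simplifications so that "catch-22" cases, where one simplification depends on another, can resolve. A minimal sketch of that idea follows. It is not the LowerToBackendContract implementation: the `Module` type, the two toy passes, and the `max_iterations` bound are all hypothetical names introduced only for illustration.

```python
from typing import Callable, List

# Hypothetical stand-in for an IR module: just a list of op names.
Module = List[str]
Pass = Callable[[Module], Module]


def run_to_fixpoint(module: Module, passes: List[Pass],
                    max_iterations: int = 10) -> Module:
    """Apply `passes` in order, repeating until a full sweep changes nothing.

    `max_iterations` is an assumed safety bound, not taken from the commit.
    """
    for _ in range(max_iterations):
        before = list(module)
        for run_pass in passes:
            module = run_pass(module)
        if module == before:
            return module
    raise RuntimeError("pipeline did not converge within max_iterations")


def refine_types(module: Module) -> Module:
    # Toy pass: can only refine an op after it has been decomposed.
    return ["refined" if op == "decomposed" else op for op in module]


def decompose(module: Module) -> Module:
    # Toy pass: rewrites a complex op into a simpler form.
    return ["decomposed" if op == "complex" else op for op in module]


start = ["complex", "simple"]
passes = [refine_types, decompose]

# One sweep is not enough: refine_types runs before decompose has fired.
single_sweep = decompose(refine_types(list(start)))

# Iterating lets refine_types see the output of decompose on the next sweep.
result = run_to_fixpoint(list(start), passes)
```

In this toy setup a single sweep leaves the first op only decomposed, while the iterated version also refines it, which is the kind of ordering dependency the commit message refers to.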

250 Commits (1106b9aeae867f1ed44fd8f90abf140fc8f9534c)