torch-mlir

Commit Graph

Author	SHA1	Message	Date
Ramiro Leal-Cavazos	51d4d55f8a	Add support for multi-dim input to `index_put_impl` (#722 ) This commit adds support for multi-dimensional tensors as input to the `_index_put_impl_` op. The support was to some degree already there, since `ScatterOp` already supports multi-dimensional tensors. This commit also adds a bit more error checking to `index_put` and refactors the code for creating `ScatterOp`s to mimic the way one would make a `Linalg::GenericOp`.	2022-03-31 09:27:21 -07:00
Anup Gangwar	ccf924d3df	tosa] Support for Aten[Gelu\|GeluBackward] ops (#720 ) Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2022-03-30 17:00:55 -07:00
Sean Silva	c17c0a6ba2	Fix for 0-size dim inferred incorrectly. The issue was in the canonicalizer for torch.aten.ge.int -- in cases where the operands were swapped, it would miscompile. This issue is fixed and folding support generalized to `torch.aten.size.int < 0` as well. Fixes #716	2022-03-30 16:36:15 -07:00
Gaurav Shukla	969785d1b6	[LINALG] Add E2E support for `aten.where.[Scalar\|ScalarSelf\|ScalarOther]` ops This commit decomposes different variants of `aten.where.*` op into `aten.where.Self` op. It covers `aten.where.Scalar`, `aten.where.ScalarSelf` and `aten.where.ScalarOther` ops. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-30 20:36:48 +05:30
Vivek Khandelwal	2597c481f6	[MLIR][TORCH] Add E2E support for aten.new_empty op This commit decomposes `aten.new_empty` op into `aten.empty.memory_format` op. This commit also made a dtype fix to the constant tensor allocation like ops. Earlier the dtype for the result was inferred from the result type; now, it's being evaluated as per the original definition of the op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-30 13:21:01 +05:30
Sean Silva	140babd952	Add minimal support for Union types. A recent PyTorch commit made ConstantPad2d call a helper function with a `Union[int, float]` type annotated. This commit adds minimal support for representing and dealing with that. https://github.com/pytorch/pytorch/pull/73287 Changes: - Adding support for `!torch.union<T1, T2, T3>`/`Torch::UnionType`, along with the importer and CAPI code. - Add support in isValidSubtype for union types. - Adding a canonicalizer for `torch.derefine` to help simplify some code that derefines to a UnionType (this also fixes #664). There is still more work to do for really supporting UnionType well, such as canonicalizing UnionType's so that they can be compared with pointer equality.	2022-03-29 17:45:48 -07:00
Liam Fitzpatrick	f2269ced80	Improve list index normalization SimplifyShapeCalculations. (#710 ) The reified code to compute the shape of torch.aten.constant_pad_nd uses negative indices when setting list elements. This was not converted to a positive offset in one place in SimplifyShapeCalculations which prevented computation of the static shape.	2022-03-29 22:21:47 +02:00
Maksim Levental	25ba51b2af	This commit decomposes aten._reshape_alias op into aten.view op. (#690 )	2022-03-28 23:54:28 -05:00
Sean Silva	520725cdc5	Fix bad rename from "pseudo" to "valsem".	2022-03-28 20:40:42 +00:00
Sean Silva	776426ea4e	[SimplifyShapeCalculations] Fix AbstractlyInterpretListOpsWithinABlock The logic in the rewriting phase had a bug in case of a read-only op coming before mutation ops. The logic would use the op itself as the "latest literal", but that is not correct, because later on we replace the op itself with the final "latest literal", assuming that all uses of the op have been rewritten -- that was working in general, except for any read-only ops at the beginning. Big thanks to @ljfitz for the tiny reproducer! Fixes #704	2022-03-28 13:18:35 -07:00
Anup Gangwar	5d7a6c2976	[tosa] Support for Aten[Unsqueeze\|Contiguous\|Dropout\|Reshape\|View] ops (#700 )	2022-03-25 14:15:07 -07:00
Vivek Khandelwal	88c216da13	[MLIR][TORCH] Add support for same input and output shapes for view op This commit adds support for the cases of view op where the rank and the shapes of the input and result are equal. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-25 22:26:10 +05:30
Gaurav Shukla	02b6d04eb4	[LINALG] Add E2E support for `aten.zero_` op This commit adds decomposition of `aten.zero_` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-25 12:46:50 +05:30
Ramiro Leal-Cavazos	e966112c8d	Add final cast to TorchToLinalg conversions missing it (#692 ) In order to make sure that the TorchToLinalg conversions leave the graph in a valid state, the final result of the conversion has to be casted to the result type of the op. This commit adds this cast to ops that did not have it.	2022-03-23 13:52:32 -07:00
Qiang Fu	f7c7bb800c	Add non-default dtype support for a few elementwise math ops. (#687 ) * fix type inference * fix Torch2Linalg conversion * add test cases	2022-03-23 13:35:43 -07:00
Ahmed Taei	f9d34596e8	[NFC] Split BackendTypeConversion -> (BackendTypeConversion, BackendTypeConversionPasses)	2022-03-22 13:56:18 -07:00
Gaurav Shukla	7c3ba25238	[LINALG] Add decomposition of `aten.dropout` op - This commit adds decomposition of `aten.dropout` op. It also covers the training mode of the same op. - It also adds lowering of `aten.sub.float` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-22 13:14:49 +05:30
Sean Silva	729402c3f4	Reduce compilation time for TorchOps.cpp.inc The `assemblyFormat` stuff (which generates unrolled, per-op C++ code) was taking up a lot of compile time, and all the ops are essentially printed with the same logic. So this PR makes them all call the same helper function. This is done by using `let hasCustomAssemblyFormat = 1` and then implementing `FooOp::parse` and `FooOp::print`. Additionally, the `Generated*Ops.td` files are all collapsed into just `GeneratedTorchOps.td` (there is no reason to have the files separate, since the files are very large anyway so one is always having to search within them -- editors don't care that the file to search is now a bit bigger :) ). This reduces TorchOpsODSGenerated.cpp compile time (which is now GeneratedTorchOps.cpp) from 39 to 31 seconds on my machine. This is actually less than I expected, but this PR is an overall cleanup to the code anyway. The next step will be to introduce (better) functionality upstream for sharding the TorchOps.cpp.inc file, so that we can truly parallelize the O(#ops) costs. This is also necessary, because after this PR, TorchDialect.cpp is now the slowest file to compile, due to the `addOperations<... all the ops ...>` call, which needs to be shareded too.	2022-03-21 14:42:26 -07:00
Vivek Khandelwal	5b9bdfaf3f	[MLIR][TORCH] Add E2E support for aten._to_copy op This commit decomposes `aten._to_copy` op into `valsem.aten.copy` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 19:12:37 +05:30
Vivek Khandelwal	13383b03b8	[MLIR][TORCH] Add value tensor variant to aten::copy_ op This commit adds the op `ValsemVariantAtenCopyOp` that represents `AtenCopy_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenCopyOp`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 19:12:37 +05:30
Vivek Khandelwal	4c0cd5c23d	[MLIR][TORCH] Add E2E support for aten.expand_as op This commit decomposes `aten.expand_as` op into `aten.broadcast_to` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 12:47:39 +05:30
Vigilans	63fb1e5aad	Bump LLVM at 8361c5da30588d3d4a48eae648f53be1feb5cfad	2022-03-18 13:16:14 -04:00
Ramiro Leal-Cavazos	218b4875d5	Make conditions for type refinement of static cast less strict (#680 ) This commit adds support for type refinement when `torch.tensor_static_info_cast`s are involved, even when there are users of the casted tensor that don't allow type refinements. Originally the canonicalization pattern for `torch.tensor_static_info_cast` would check if all the users of the casted tensor allowed type refinements before making any changes. This means that if at least one of the users did not allow type refinements, the pattern would fail. This becomes an issue when doing shape calculations because the calculations need the shape information of each input tensor to be available before the calculation can be simplified.	2022-03-18 09:10:12 -07:00
Prateek Gupta	7256c9e395	[TORCH][MLIR] Fix the return types of `aten.native_layer_norm`. This commit fixes the 2nd and 3rd return types of the `aten.native_layer_norm`. Previously the mean and rSTD were returned with reduction dims removed. This commit fixes this and keeps the reduction dims of the results. Signed-Off-By: Prateek Gupta <prateek@nord-labs.com>	2022-03-17 12:08:32 +05:30
Sean Silva	3b66b4925a	Make TorchOps.cpp faster to iterate on. The ODS-generated code included via the `TorchOps.cpp.inc` file takes a very long time to compile. This PR isolates it into its own file so that the build system can cache it. This PR creates a new file `TorchOpsODSGenerated.cpp` just to include the `TorchOps.cpp.inc` file. Doing so required moving to the "new" way to define verifiers, since the static `verify` free functions in TorchOps.cpp weren't accessible from the .inc file after it was moved to `TorchOpsODSGenerated.cpp`. On my machine, this drops the build time of TorchOps.cpp (such as when iterating on a canonicalizer) from >40 seconds to <10 seconds. 10 seconds still isn't great though, but at least it isn't "go get a coffee" type of waiting.	2022-03-16 09:33:12 -07:00
Vivek Khandelwal	8da7d90611	[MLIR][TORCH] Add E2E support for aten.index_put op This commit decomposes `aten.index_put` op into `valsem.aten.index_put_impl` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-16 22:02:02 +05:30
Vivek Khandelwal	3d95c3d6c9	[MLIR][TORCH] Add value tensor variant to aten::_index_put_impl_ This commit adds the op `ValsemVariantAtenIndexPutImplOp` that represents `Aten_IndexPutImpl_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenIndexPutImplOp` op. This commit also updates the `torch.bincount` op test cases.	2022-03-16 22:02:02 +05:30
Ramiro Leal-Cavazos	0bcc6d1075	Add maximize-value-semantics support for multiple non-value tensor inputs (#659 ) This commit adds value semantics support for ops such as `aten.view_as` and `aten.expand_as` that take two non-value tensors as input.	2022-03-15 18:13:45 -07:00
Sean Silva	92da4988f0	Improve "pseudo" op terminology. The term "pseudo" is very vague and was getting confusing (I felt I had to explain it in every comment referencing it). Instead, rework the "pseudo" ops to instead be named: - MLIR Syntax: `torch.valsem.` - C++ / ODS: `ValsemVariantOp` This makes it clear what the concept is, and avoids confusion with other things that might be called "pseudo", since these are very specific and should be 100% consistently named w.r.t. the non-valsem-variant ops that they correspond to.	2022-03-15 17:57:52 -07:00
Sean Silva	7ea50a537a	Avoid `using` the `torch_upstream` namespace. This is code that we always want to treat as "foreign" and not get too comfortable using in many functions. One way to accomplish that is to make it a bit clunkier to use. Also, fix Utils.cpp to match the LLVM/MLIR coding conventions (don't define functions inside namespaces -- prefer `using` and explicit qualification).	2022-03-15 17:24:17 -07:00
Sean Silva	84a9693006	Elide `!torch.` prefix in nested dialect types. This leads to much more succinct types in many cases: ``` !torch.list<!torch.int> !torch.list<int> !torch.tuple<!torch.list<!torch.int>, !torch.list<!torch.int>> !torch.tuple<list<int>, list<int>> !torch.optional<!torch.list<!torch.int>> !torch.optional<list<int>> !torch.list<list<list<tensor>>> !torch.list<!torch.list<!torch.list<!torch.tensor>>> ``` I would like to take this further and allow omitting the `!torch.` prefix in all cases, but that's harder -- for example, we currently use `FuncOp` for functions, and so I don't think we can customize the printing there. It seems like it will be a longer road to getting that level of customization.	2022-03-15 17:24:08 -07:00
Sean Silva	a5fe0cf063	Introduce new shape library design. See the documentation in `docs/shape_lib.md` and `docs/adding_a_shape_function.md` for an overview of the system. This completely overhauls how we represent shape functions. In particular, RefineTypes does not infer shapes anymore (only dtypes). Shape functions are now written in (TorchScript'able) Python. Recommended review order: 1. Read `docs/shape_lib.md` and `docs/adding_a_shape_function.md`. 1. Code and tests for ReifyShapeCalculations, DropShapeCalculations. 1. Code and tests for SimplifyShapeCalculations. 1. shape_lib_gen.py 1. Code and tests for new RefineTypes pass. 1. Random folders/canonicalizers in TorchOps.cpp and associated test in `canonicalize.mlir`. 1. New ReadOnly trait inferred from the registry. 1. Any miscellaneous remaining stuff. Example `-print-ir-after-all` for ElementwiseUnaryModule: [IR lowering dump](https://gist.github.com/silvasean/e4dc8cbc8d00aac7819602e3cbd8e212). Example `-print-ir-after-all` for ElementwiseBinaryModule: [IR lowering dump](https://gist.github.com/silvasean/daf6860ecced732af3568af6b1899113).	2022-03-15 12:41:58 -07:00
Sean Silva	5d9222383c	Split up TorchToLinalg.cpp This helps keep things organized and also exposes more parallelism to the build system. It seems though that most of the compile time is actually spent in the headers though, so the wall time doesn't decrease as much as I had hoped (and now that the headers are being included multiple times, the cpu time actually increases a lot, sadly -- will try to dig into this).	2022-03-14 10:19:41 -07:00
Ramiro Leal-Cavazos	51e267aa37	Combine maximize-value-semantics rewrite patterns into one pattern (#642 ) This commit replaces the two rewrite patterns of maximize-value-semantics with a single pattern that captures the behavior of both as well as other edge cases previously not supported. The new pattern works by first performing alias analysis on a subgraph to see if pattern is applicable, then rewriting all non-value tensors to value tensors in a single go.	2022-03-10 09:36:52 -08:00
Prateek Gupta	3d9ba5e525	[MLIR][TORCH] Add E2E support for aten.erf op. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2022-03-09 22:22:03 +05:30
Vivek Khandelwal	1a2a9e066f	[MLIR][TORCH] Add TorchToTMTensor pass This pass is added to lower ops, which can not be lowered via the TorchToLinalg pass, such as `torch.bincount` op. This pass also uses torch-mlir's TMTensor Dialect to lower the complex ops. Also add torch.bincount op lowering with the help of TMTensor dialect Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Vivek Khandelwal	b2952b12dd	[MLIR][TORCH] Move common helper functions to Utils.cpp This commit moves the helper function which are common across different torch-mlir conversion passes into a common directory Utils. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Vivek Khandelwal	bf463d1f36	[MLIR][TORCH]Add support for integer-type inputs for sum and max op This commit adds support for integer type inputs for `AtenMaxOp`, `AtenSumOp`, `AtenSumDimIntListOp`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Gaurav Shukla	e57d3f9774	[LINALG] Fix `aten.bernoulli` op lowering - This commit adds E2E support for `aten.rand_like` and `aten.bernoulli_.Tensor` ops. - The `aten.bernoulli(x)` was implemented as: `aten.bernoulli(x) = rand_like(x) < 0.5`, assuming 0.5 as default probability, whereas according to the pytorch documentation: https://pytorch.org/docs/stable/generated/torch.bernoulli.html#torch.bernoulli the input x in `aten.bernoulli(x)` is itself a tensor containing probabilities to be used for drawing the binary random number. - So this commit fixes the `aten.bernoulli(x)` implementation as: `aten.bernoulli(x) = rand_like(x) < x`. - It also fixes the case where the input to `aten.bernoulli_.float` is an integer tensor. In this case the input must be casted to float type before passing it as operand to `aten.rand_like` op. `aten.bernoulli_.float(x, p) = rand_like(float(x)) < p`. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-05 09:38:22 +05:30
Vivek Khandelwal	af551bd9cd	[MLIR][TORCH] Add E2E support for aten.full_like op This commit decomposes `aten.full_like` op into `aten.empty_like` and `aten.fill` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-04 21:58:23 +05:30
Vivek Khandelwal	d61ae92eee	[MLIR][TORCH] Add E2E support for aten.full op This commit decomposes `aten.full` op into `aten.empty` and `aten.fill` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-04 21:58:23 +05:30
Ramiro Leal-Cavazos	9ce62473f9	Add static type information support to `aten.bmm` (#636 ) This commit adds static type information support to `aten.bmm`. This is needed for the forward pass of Bert training.	2022-03-03 13:01:17 -08:00
Ramiro Leal-Cavazos	5ec70c175d	[LINALG] Add torch-to-linalg lowering for `TensorStaticInfoCastOp` (#634 ) This commit adds a lowering for `TensorStaicInfoCastOp` that simply replaces the op with the `tensor::CastOp`.	2022-03-02 13:35:26 -08:00
Ramiro Leal-Cavazos	298eeb79ca	[LINALG] Add handling of unknown dimension in size list of `view` op (#633 ) The view op allows for the new shape argument to have a -1 value for one of the dimensions, and the op is expected to deduce the size of that dimension by looking at the sizes of the other dimensions and comparing it to the total number of elements in the original tensor. This commit adds this functionality.	2022-03-02 13:35:01 -08:00
Yi Zhang	1d285f0153	Add aten.hardtanh e2e support.	2022-03-02 12:28:06 -05:00
Prashant Kumar	819f29316f	Decompose aten.silu op Decomposition of aten.silu.op is added as silu(x) = x * sigmoid(x).	2022-03-01 23:24:19 +05:30
Vivek Khandelwal	ddd45d6068	[MLIR][TORCH] Add E2E support for aten.new_zeros, aten.new_ones op This commit adds lowering of `aten.new_zeros` and `aten.new_ones` op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-01 22:09:47 +05:30
Ramiro Leal-Cavazos	1dba4fcbd7	[LINALG] Support for contiguous memory format in `clone` and `empty` (#628 ) This commit adds support for the contiguous memory format for the ops `AtenCloneOp` and `AtenEmptyMemoryFormatOp`.	2022-02-28 13:58:04 -08:00
Ramiro Leal-Cavazos	58abec5c0a	Add `reduction` support to `torch.nll_loss_forward` (#624 ) This commit does a couple of things. First, it fixes a bug in the `linalg.generic` body of the `nll_loss_forward` lowering where the `ignoreIndex` was being compared with the loop index rather than the current element of the `target` tensor. This was not being caught by the tests because they were not testing the case where `ingnoreIndex` actually corresponds to a value in `target`. This has been fixed. Second, this commit adds support for the `reduction` argument in `torch.nll_loss_forward` as well as support for 1-D inputs. In order to simplify the lowering code, I've refactored the code that creates the `linalg.generic` ops for elementwise and reduction ops into static functions, to avoid having boilerplate code for indexing maps, etc that can be very error prone. Note: The function `convertScalarToDtype` was moved to before all the conversion patterns, but nothing in it was modified.	2022-02-28 11:01:23 -08:00
Prashant Kumar	7c637eebc3	[LINALG] Decompose aten_hardswish op. `aten.hardswish` op is decomposed into (x/6) * Relu6(x+3).	2022-02-25 21:59:27 +05:30

1 2 3 4 5 ...

518 Commits (e1c7c1f9c5f626a63ddf7bb07c50f6997fbdc16c)