torch-mlir

Commit Graph

Author	SHA1	Message	Date
Vivek Khandelwal	88c216da13	[MLIR][TORCH] Add support for same input and output shapes for view op This commit adds support for the cases of view op where the rank and the shapes of the input and result are equal. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-25 22:26:10 +05:30
Gaurav Shukla	02b6d04eb4	[LINALG] Add E2E support for `aten.zero_` op This commit adds decomposition of `aten.zero_` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-25 12:46:50 +05:30
Sean Silva	94df096c11	Add note to not edit upstream_shape_helpers.py	2022-03-24 09:32:19 -07:00
Prashant Kumar	730cdcd071	Add hugging face `albert-base-v2` in torchscript_e2e_heavydep_tests `albert-base-v2` for sequence classification is added in e2e_heavy_test.	2022-03-24 17:43:24 +05:30
Ramiro Leal-Cavazos	e966112c8d	Add final cast to TorchToLinalg conversions missing it (#692 ) In order to make sure that the TorchToLinalg conversions leave the graph in a valid state, the final result of the conversion has to be casted to the result type of the op. This commit adds this cast to ops that did not have it.	2022-03-23 13:52:32 -07:00
Qiang Fu	f7c7bb800c	Add non-default dtype support for a few elementwise math ops. (#687 ) * fix type inference * fix Torch2Linalg conversion * add test cases	2022-03-23 13:35:43 -07:00
max	fe8ac57e6d	This PR implements an eager mode backend for PyTorch through the torch-mlir framework. This is accomplished by overriding the `__torch_dispatch__` class method on wrapper subclass `TorchMLIRTensor(torch.Tensor)`. Effectively, this mode works by compiling op by op as the NN is eagerly executed by PyTorch. Entailed in that compilation is building a representation of the op that can be `torch.jit.script`ed, importing using `ModuleBuilder`, and then executing (e.g., with `RefBackendLinalgOnTensorsBackend`). This mode includes a fallback to conventional PyTorch if anything in the torch-mlir compilation process fails (e.g., unsupported op). Currently, all e2e tests pass execpt for two that involve an upstream PyTorch bug (https://github.com/pytorch/pytorch/issues/74400). High priority next steps: 1. A compile cache in order to speed up reruns of the same NN. 2. Integration with IREE (though not in this repo). 3. Integration with `torch.distributed`.	2022-03-22 14:42:57 -07:00
Ahmed Taei	f9d34596e8	[NFC] Split BackendTypeConversion -> (BackendTypeConversion, BackendTypeConversionPasses)	2022-03-22 13:56:18 -07:00
Sean Silva	6a7cf0c304	Update Torch-MLIR architecture diagram Torch FX was never really a different path, since all FX modules are actually valid TorchScript modules. Instead, replace it with the new torch.dispatch work that we are building.	2022-03-22 11:51:52 -07:00
Gaurav Shukla	7c3ba25238	[LINALG] Add decomposition of `aten.dropout` op - This commit adds decomposition of `aten.dropout` op. It also covers the training mode of the same op. - It also adds lowering of `aten.sub.float` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-22 13:14:49 +05:30
Sean Silva	729402c3f4	Reduce compilation time for TorchOps.cpp.inc The `assemblyFormat` stuff (which generates unrolled, per-op C++ code) was taking up a lot of compile time, and all the ops are essentially printed with the same logic. So this PR makes them all call the same helper function. This is done by using `let hasCustomAssemblyFormat = 1` and then implementing `FooOp::parse` and `FooOp::print`. Additionally, the `Generated*Ops.td` files are all collapsed into just `GeneratedTorchOps.td` (there is no reason to have the files separate, since the files are very large anyway so one is always having to search within them -- editors don't care that the file to search is now a bit bigger :) ). This reduces TorchOpsODSGenerated.cpp compile time (which is now GeneratedTorchOps.cpp) from 39 to 31 seconds on my machine. This is actually less than I expected, but this PR is an overall cleanup to the code anyway. The next step will be to introduce (better) functionality upstream for sharding the TorchOps.cpp.inc file, so that we can truly parallelize the O(#ops) costs. This is also necessary, because after this PR, TorchDialect.cpp is now the slowest file to compile, due to the `addOperations<... all the ops ...>` call, which needs to be shareded too.	2022-03-21 14:42:26 -07:00
Vivek Khandelwal	5b9bdfaf3f	[MLIR][TORCH] Add E2E support for aten._to_copy op This commit decomposes `aten._to_copy` op into `valsem.aten.copy` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 19:12:37 +05:30
Vivek Khandelwal	13383b03b8	[MLIR][TORCH] Add value tensor variant to aten::copy_ op This commit adds the op `ValsemVariantAtenCopyOp` that represents `AtenCopy_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenCopyOp`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 19:12:37 +05:30
Vivek Khandelwal	4c0cd5c23d	[MLIR][TORCH] Add E2E support for aten.expand_as op This commit decomposes `aten.expand_as` op into `aten.broadcast_to` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 12:47:39 +05:30
Vigilans	63fb1e5aad	Bump LLVM at 8361c5da30588d3d4a48eae648f53be1feb5cfad	2022-03-18 13:16:14 -04:00
Ramiro Leal-Cavazos	218b4875d5	Make conditions for type refinement of static cast less strict (#680 ) This commit adds support for type refinement when `torch.tensor_static_info_cast`s are involved, even when there are users of the casted tensor that don't allow type refinements. Originally the canonicalization pattern for `torch.tensor_static_info_cast` would check if all the users of the casted tensor allowed type refinements before making any changes. This means that if at least one of the users did not allow type refinements, the pattern would fail. This becomes an issue when doing shape calculations because the calculations need the shape information of each input tensor to be available before the calculation can be simplified.	2022-03-18 09:10:12 -07:00
Prateek Gupta	7256c9e395	[TORCH][MLIR] Fix the return types of `aten.native_layer_norm`. This commit fixes the 2nd and 3rd return types of the `aten.native_layer_norm`. Previously the mean and rSTD were returned with reduction dims removed. This commit fixes this and keeps the reduction dims of the results. Signed-Off-By: Prateek Gupta <prateek@nord-labs.com>	2022-03-17 12:08:32 +05:30
Sean Silva	3b66b4925a	Make TorchOps.cpp faster to iterate on. The ODS-generated code included via the `TorchOps.cpp.inc` file takes a very long time to compile. This PR isolates it into its own file so that the build system can cache it. This PR creates a new file `TorchOpsODSGenerated.cpp` just to include the `TorchOps.cpp.inc` file. Doing so required moving to the "new" way to define verifiers, since the static `verify` free functions in TorchOps.cpp weren't accessible from the .inc file after it was moved to `TorchOpsODSGenerated.cpp`. On my machine, this drops the build time of TorchOps.cpp (such as when iterating on a canonicalizer) from >40 seconds to <10 seconds. 10 seconds still isn't great though, but at least it isn't "go get a coffee" type of waiting.	2022-03-16 09:33:12 -07:00
Vivek Khandelwal	8da7d90611	[MLIR][TORCH] Add E2E support for aten.index_put op This commit decomposes `aten.index_put` op into `valsem.aten.index_put_impl` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-16 22:02:02 +05:30
Vivek Khandelwal	3d95c3d6c9	[MLIR][TORCH] Add value tensor variant to aten::_index_put_impl_ This commit adds the op `ValsemVariantAtenIndexPutImplOp` that represents `Aten_IndexPutImpl_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenIndexPutImplOp` op. This commit also updates the `torch.bincount` op test cases.	2022-03-16 22:02:02 +05:30
Yi Zhang	8a4388ea7b	Fix convert_to_loops.mlir format	2022-03-16 11:42:37 -04:00
Ramiro Leal-Cavazos	0bcc6d1075	Add maximize-value-semantics support for multiple non-value tensor inputs (#659 ) This commit adds value semantics support for ops such as `aten.view_as` and `aten.expand_as` that take two non-value tensors as input.	2022-03-15 18:13:45 -07:00
Sean Silva	92da4988f0	Improve "pseudo" op terminology. The term "pseudo" is very vague and was getting confusing (I felt I had to explain it in every comment referencing it). Instead, rework the "pseudo" ops to instead be named: - MLIR Syntax: `torch.valsem.` - C++ / ODS: `ValsemVariantOp` This makes it clear what the concept is, and avoids confusion with other things that might be called "pseudo", since these are very specific and should be 100% consistently named w.r.t. the non-valsem-variant ops that they correspond to.	2022-03-15 17:57:52 -07:00
Sean Silva	7ea50a537a	Avoid `using` the `torch_upstream` namespace. This is code that we always want to treat as "foreign" and not get too comfortable using in many functions. One way to accomplish that is to make it a bit clunkier to use. Also, fix Utils.cpp to match the LLVM/MLIR coding conventions (don't define functions inside namespaces -- prefer `using` and explicit qualification).	2022-03-15 17:24:17 -07:00
Sean Silva	84a9693006	Elide `!torch.` prefix in nested dialect types. This leads to much more succinct types in many cases: ``` !torch.list<!torch.int> !torch.list<int> !torch.tuple<!torch.list<!torch.int>, !torch.list<!torch.int>> !torch.tuple<list<int>, list<int>> !torch.optional<!torch.list<!torch.int>> !torch.optional<list<int>> !torch.list<list<list<tensor>>> !torch.list<!torch.list<!torch.list<!torch.tensor>>> ``` I would like to take this further and allow omitting the `!torch.` prefix in all cases, but that's harder -- for example, we currently use `FuncOp` for functions, and so I don't think we can customize the printing there. It seems like it will be a longer road to getting that level of customization.	2022-03-15 17:24:08 -07:00
Sean Silva	3734f69119	Remove basic_mt from the heavydep tests This was an aspirational goal at an earlier stage in the project where the focus was heavily on programs with state, control flow, and lists/dicts. We will circle back to such programs likely 2022H2 at some point, but for now, having this test doesn't add much, since basically nothing works or is being worked on.	2022-03-15 15:25:53 -07:00
Sean Silva	a5fe0cf063	Introduce new shape library design. See the documentation in `docs/shape_lib.md` and `docs/adding_a_shape_function.md` for an overview of the system. This completely overhauls how we represent shape functions. In particular, RefineTypes does not infer shapes anymore (only dtypes). Shape functions are now written in (TorchScript'able) Python. Recommended review order: 1. Read `docs/shape_lib.md` and `docs/adding_a_shape_function.md`. 1. Code and tests for ReifyShapeCalculations, DropShapeCalculations. 1. Code and tests for SimplifyShapeCalculations. 1. shape_lib_gen.py 1. Code and tests for new RefineTypes pass. 1. Random folders/canonicalizers in TorchOps.cpp and associated test in `canonicalize.mlir`. 1. New ReadOnly trait inferred from the registry. 1. Any miscellaneous remaining stuff. Example `-print-ir-after-all` for ElementwiseUnaryModule: [IR lowering dump](https://gist.github.com/silvasean/e4dc8cbc8d00aac7819602e3cbd8e212). Example `-print-ir-after-all` for ElementwiseBinaryModule: [IR lowering dump](https://gist.github.com/silvasean/daf6860ecced732af3568af6b1899113).	2022-03-15 12:41:58 -07:00
Sean Silva	5d9222383c	Split up TorchToLinalg.cpp This helps keep things organized and also exposes more parallelism to the build system. It seems though that most of the compile time is actually spent in the headers though, so the wall time doesn't decrease as much as I had hoped (and now that the headers are being included multiple times, the cpu time actually increases a lot, sadly -- will try to dig into this).	2022-03-14 10:19:41 -07:00
Prashant Kumar	b6d13301fc	[TORCH] Fix the location of packed_params. The location of packed_params.h is changed in aten src.	2022-03-14 17:52:19 +05:30
Ramiro Leal-Cavazos	51e267aa37	Combine maximize-value-semantics rewrite patterns into one pattern (#642 ) This commit replaces the two rewrite patterns of maximize-value-semantics with a single pattern that captures the behavior of both as well as other edge cases previously not supported. The new pattern works by first performing alias analysis on a subgraph to see if pattern is applicable, then rewriting all non-value tensors to value tensors in a single go.	2022-03-10 09:36:52 -08:00
Yi Zhang	3510b2ba9d	Fix scatter op bufferization to alway copy original tensor	2022-03-09 18:19:44 -05:00
Prateek Gupta	3d9ba5e525	[MLIR][TORCH] Add E2E support for aten.erf op. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2022-03-09 22:22:03 +05:30
Vivek Khandelwal	1a2a9e066f	[MLIR][TORCH] Add TorchToTMTensor pass This pass is added to lower ops, which can not be lowered via the TorchToLinalg pass, such as `torch.bincount` op. This pass also uses torch-mlir's TMTensor Dialect to lower the complex ops. Also add torch.bincount op lowering with the help of TMTensor dialect Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Vivek Khandelwal	b2952b12dd	[MLIR][TORCH] Move common helper functions to Utils.cpp This commit moves the helper function which are common across different torch-mlir conversion passes into a common directory Utils. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Vivek Khandelwal	bf463d1f36	[MLIR][TORCH]Add support for integer-type inputs for sum and max op This commit adds support for integer type inputs for `AtenMaxOp`, `AtenSumOp`, `AtenSumDimIntListOp`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Yi Zhang	af7f42fd93	Add a README.md to torch-mlir-dialects	2022-03-07 16:08:30 -05:00
Gaurav Shukla	e57d3f9774	[LINALG] Fix `aten.bernoulli` op lowering - This commit adds E2E support for `aten.rand_like` and `aten.bernoulli_.Tensor` ops. - The `aten.bernoulli(x)` was implemented as: `aten.bernoulli(x) = rand_like(x) < 0.5`, assuming 0.5 as default probability, whereas according to the pytorch documentation: https://pytorch.org/docs/stable/generated/torch.bernoulli.html#torch.bernoulli the input x in `aten.bernoulli(x)` is itself a tensor containing probabilities to be used for drawing the binary random number. - So this commit fixes the `aten.bernoulli(x)` implementation as: `aten.bernoulli(x) = rand_like(x) < x`. - It also fixes the case where the input to `aten.bernoulli_.float` is an integer tensor. In this case the input must be casted to float type before passing it as operand to `aten.rand_like` op. `aten.bernoulli_.float(x, p) = rand_like(float(x)) < p`. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-05 09:38:22 +05:30
Vivek Khandelwal	af551bd9cd	[MLIR][TORCH] Add E2E support for aten.full_like op This commit decomposes `aten.full_like` op into `aten.empty_like` and `aten.fill` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-04 21:58:23 +05:30
Vivek Khandelwal	d61ae92eee	[MLIR][TORCH] Add E2E support for aten.full op This commit decomposes `aten.full` op into `aten.empty` and `aten.fill` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-04 21:58:23 +05:30
Ramiro Leal-Cavazos	9ce62473f9	Add static type information support to `aten.bmm` (#636 ) This commit adds static type information support to `aten.bmm`. This is needed for the forward pass of Bert training.	2022-03-03 13:01:17 -08:00
Yi Zhang	486f95e84f	Add bufferization pass for TMTensor ops The pass is mostly borrowed from the BufferizeAnyLinalgOp pass in mlir upstream with some minor changes. At a high level, it's a naive partial bufferization pass which allocate new buffers for all the output tensors. The initial value of an output buffer is copied from the original buffer if there are uses of the original value. One difference from linalg bufferization pass is the way to tell if the loop body uses the init value of output operand. For TMTensor ops, it differs from op to op because the payload region doesn't represent the entire loop body.	2022-03-03 11:39:14 -05:00
Ramiro Leal-Cavazos	5ec70c175d	[LINALG] Add torch-to-linalg lowering for `TensorStaticInfoCastOp` (#634 ) This commit adds a lowering for `TensorStaicInfoCastOp` that simply replaces the op with the `tensor::CastOp`.	2022-03-02 13:35:26 -08:00
Ramiro Leal-Cavazos	298eeb79ca	[LINALG] Add handling of unknown dimension in size list of `view` op (#633 ) The view op allows for the new shape argument to have a -1 value for one of the dimensions, and the op is expected to deduce the size of that dimension by looking at the sizes of the other dimensions and comparing it to the total number of elements in the original tensor. This commit adds this functionality.	2022-03-02 13:35:01 -08:00
Yi Zhang	1d285f0153	Add aten.hardtanh e2e support.	2022-03-02 12:28:06 -05:00
Prashant Kumar	819f29316f	Decompose aten.silu op Decomposition of aten.silu.op is added as silu(x) = x * sigmoid(x).	2022-03-01 23:24:19 +05:30
Vivek Khandelwal	ddd45d6068	[MLIR][TORCH] Add E2E support for aten.new_zeros, aten.new_ones op This commit adds lowering of `aten.new_zeros` and `aten.new_ones` op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-01 22:09:47 +05:30
Ramiro Leal-Cavazos	1dba4fcbd7	[LINALG] Support for contiguous memory format in `clone` and `empty` (#628 ) This commit adds support for the contiguous memory format for the ops `AtenCloneOp` and `AtenEmptyMemoryFormatOp`.	2022-02-28 13:58:04 -08:00
Ramiro Leal-Cavazos	58abec5c0a	Add `reduction` support to `torch.nll_loss_forward` (#624 ) This commit does a couple of things. First, it fixes a bug in the `linalg.generic` body of the `nll_loss_forward` lowering where the `ignoreIndex` was being compared with the loop index rather than the current element of the `target` tensor. This was not being caught by the tests because they were not testing the case where `ingnoreIndex` actually corresponds to a value in `target`. This has been fixed. Second, this commit adds support for the `reduction` argument in `torch.nll_loss_forward` as well as support for 1-D inputs. In order to simplify the lowering code, I've refactored the code that creates the `linalg.generic` ops for elementwise and reduction ops into static functions, to avoid having boilerplate code for indexing maps, etc that can be very error prone. Note: The function `convertScalarToDtype` was moved to before all the conversion patterns, but nothing in it was modified.	2022-02-28 11:01:23 -08:00
Stephen Neuendorffer	9b2613533b	Ensure that torch-mlir-dialects is built when we're out of tree Its unclear to me what the right layering is here: Are you expecting torch-mlir-dialects to always get built with LLVM? This is pretty breaking for us, if so.	2022-02-28 11:41:31 -05:00
Stephen Neuendorffer	330042aa4c	Don't override MLIR_TABLEGEN_EXE (#622 ) This should be set elsewhere depending on the build configuration. In particular, we need to be careful when cross-compiling to pick up the host mlir-tblgen.	2022-02-25 14:41:09 -08:00

... 4 5 6 7 8 ...

1162 Commits (795479a88d8b5d5e244f59f94b4585b08c677dc0) All Branches Search

1162 Commits (795479a88d8b5d5e244f59f94b4585b08c677dc0)

All Branches