torch-mlir

Commit Graph

Author	SHA1	Message	Date
Gleb Kazantaev	804f9f1f8f	Extended TorchMLIRLoweringContext with virtual CreateComputation method (#1699 ) * Extended TorchMLIRLoweringContext with virtual CreateComputation method * Fix device_data_cast return value	2022-12-08 15:57:07 -05:00
Sean Silva	e8511840c3	[cleanup] Use a single function pipeline for TOSA->Linalg This should run faster and is overall clearer.	2022-12-08 09:02:38 -08:00
Sean Silva	69171c246a	[RefBackend] Add elementwise fusion and buffer deallocation This gives some decent improvements to memory consumption and latency of testing. I would have expected buffer-deallocation to actually make a big difference to the final process RSS but it doesn't appear to. Also running buffer-deallocation later in the pipeline results in miscompiles. I didn't have the time or interest to dig in deeper, but something is off. (numbers below are taken from a single run, but I did do a few runs to make sure that the variance wasn't that great) - Linalg-on-Tensors shows memory consumption improvements and some slight speedups. ``` ./tools/e2e_test.sh -s -v -c refbackend fuse=0 dealloc=0 RSS: 3071.33 MB real 3m58.204s user 6m22.299s sys 0m51.235s fuse=1 dealloc=0 RSS: 2515.89 MB real 3m34.797s user 5m56.902s sys 0m44.933s fuse=1 dealloc=post-bufferize: RSS: 2290.25 MB real 3m42.242s user 6m0.560s sys 0m46.335s ``` - TOSA ResNet18 gets significantly faster and uses significantly less memory. ``` time ./tools/e2e_test.sh -s -v -c tosa -f ResNet18 fuse=0 dealloc=0 rss 1328.56 MB real 0m50.303s user 0m55.355s sys 0m12.260s fuse=1 dealloc=0 rss 859MB real 0m30.454s user 0m35.551s sys 0m11.879s fuse=1 dealloc=post-bufferize: rss 851MB real 0m30.313s user 0m39.889s sys 0m11.941s ``` Big thanks to Ramiro for the methodology here for measuring the RSS with `psutil`: https://gist.github.com/ramiro050/5b5c2501f7389c008d9029210772c3a8	2022-12-08 03:14:42 -08:00
Ramiro Leal-Cavazos	dd35488da5	build: update llvm tag to 798fa4b4 (#1684 ) - Support for non-prefixed accessors has been removed. See: https://reviews.llvm.org/D136727 - Rename `operands` to `methodOperands` in `prim.CallMethod` since the name `operands` overlaps with a builtin method name. See: https://reviews.llvm.org/D136727 - Add passes in refbackend to lower memref.subview. See: https://reviews.llvm.org/D136377 - Replace `CopyToValueTensorOps` first in `RewriteViewLikeSubgraph` in maximize-value-semantics. The current implementation of the `RewriteViewLikeSubgraph` pass in maximize-value-semantics creates temporarily invalid IR. In particular, given a forward slice starting from a `CopyToNonValueTensorOp` and ending in `CopyToValueTensorOp`s, the pass first replaces all uses of the `CopyToNonValueTensorOp` with its operand, which results in all the `CopyToValueTensorOp` users having their operand have type `!torch.vtensor`, which is invalid. The correct way to do things is to first replace all the `CopyToValueTensorOp`s with their operand, and then replace all uses of the `CopyToNonValueTensorOp` with its operand. This only started failing now because the generated accessor `getOperand` for the `CopyToValueTensorOp` now returns a `TypedValue<NonValueTensorType>`, which has an assert checking that the value returned is of the expected type.	2022-12-07 12:20:41 -08:00
Sean Silva	b1f9e09f85	[torchdynamo] Add ResNet18 example with TorchDynamo This is a minor variation on our other resnet18 examples swapping in TorchDynamo. We replicate the refbackend_torchdynamo_backend out of the e2e test config to avoid making that appear like a public API. Also, some minor cleanups to TorchDynamoTestConfig.	2022-12-07 09:25:27 -08:00
Sean Silva	c956c39c86	[cleanup] Remove disabled e2e test This test has been disabled a long time, and since RefBackend is so slow we don't want to add this unnecessarily. I believe it is covered by downstream testing such as the Shark Tank.	2022-12-07 06:36:48 -08:00
Vivek Khandelwal	3e4bb2bd8e	[MLIR][TORCH] Add E2E support for randn and randn.generator op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-06 22:41:24 +05:30
Sean Silva	485c18bb2f	[torchdynamo] Add "lockstep" numerical accuracy debugger. Thanks to TorchDynamo's great layering and design, this is only about 100 lines of code for a basic lockstep debugger. This should allow us to deprecate eager_mode, since AFAIK the only interesting use case that it was really supporting is for downstream users to write lockstep debuggers. NOTE: The exact reporting and interface here is subject to change. Please try it out and provide feedback (or patches :) ). - make_fx should not drop source locations: https://github.com/pytorch/pytorch/issues/90276 - Report tensors better (huge tensors should be summarized) - Maybe don't abort, but just warn? - Allow customizing atol/rtol. - How best to print the failing node? And include surrounding graph context?	2022-12-06 07:57:45 -08:00
Vivek Khandelwal	ef39b9ebb4	build: manually update PyTorch version Set PyTorch and TorchVision version to nightly release 2022-12-05. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-05 22:44:32 +05:30
Vivek Khandelwal	f416953600	[MLIR][TORCH] Add TorchConversionToMLProgram and MLProgramBufferize pass This commit changes the `InsertRngGlobalsPass` to `TorchConversionToMLProgram` pass. This commit also adds the `MLProgramBufferize` pass for the bufferization of ml_program dialect ops to run on refbackend. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-02 13:20:46 +05:30
Sean Silva	88db99946b	[torchdynamo] Use decompositions to support a few ops	2022-12-01 11:25:20 -08:00
Ramiro Leal-Cavazos	b4b92c990e	Replace LCG algorithm with squares64 algorithm in AtenUniformOp (#1633 ) This commit replaces the LCG algorithm that was being used by the `TorchToLinalg` lowering of `AtenUniformOp` to generate random numbers with the `squares64` algorithm, for the LCG algorithm was producing tensors that were highly correlated with one another. Squares64 algorithm: https://arxiv.org/abs/2004.06278 Closes https://github.com/llvm/torch-mlir/issues/1608	2022-12-01 08:30:10 -08:00
Ramiro Leal-Cavazos	0983a7f93a	Fix modulus calculation in LCG algorithm of refbackend (#1658 ) The current implementation sets the `nextSeed` value to `temp & 127`, which is wrong. The last step of the LCG algorithm for the multiplier and increment chosen should be `temp % 2^{64} = temp & (1 << 63)`. However, because we are dealing with i64 values, the modulus operation happens automatically, so it is not needed. See Donald Knuth's values for LCG here: https://en.wikipedia.org/wiki/Linear_congruential_generator	2022-11-30 08:46:52 -08:00
Abhishek Varma	c27c1791f1	[MLIR][TORCH] Add e2e support for `aten.amax` op -- This commit adds e2e support for `aten.amax` op. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-11-30 17:54:37 +05:30
Tanyo Kwok	bbcdb38d99	Revert "Decompose torch.slice_scatter (#1622 )" (#1659 ) This reverts commit `f3f2f10030`.	2022-11-30 12:47:13 +08:00
Daniel Ellis	e2de20575f	Automatically strip overloads for FX-based models.	2022-11-29 22:19:09 -05:00
Ramiro Leal-Cavazos	a8cbfff95b	Reduce memory usage of e2e tests by reducing input sizes (#1653 ) There are a few e2e tests that take several very large tensors as input, which leads to the e2e test suite leaking too much memory. Running things locally resulted in a total memory usage of 12.5 GB when running the suite sequentially on the refbackend. Many of the tests that take large tensors don't actually need such large tensors to pass, and some that take several large tensors as input are just doing the same thing multiple times. This commit reduces the size of some of the tensors and removes repetitive parts of tests to reduce the memory usage to a total of 3 GB.	2022-11-29 10:03:36 -08:00
Sean Silva	5a488ff085	Remove deprecated np.bool `np.bool is bool` and will never be returned as a dtype of an `np.ndarray`, so we don't need to handle it here. ``` >>> a = np.ndarray([1], dtype=bool) >>> a.dtype.type is np.bool_ True ``` More info here: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations	2022-11-29 01:46:21 -08:00
Sean Silva	5a27f826b8	Fix multiprocessing for `--config=torchdynamo` For reasons that I haven't yet fully tracked down, the TorchDynamo TestConfig seems to result in tensors that cannot be pickled. They seem to be holding some sort of weak handles to a `torch.fx.graph.Graph`. Here is the object structure that leads to the unpickleable object: ``` (<function _rebuild_tensor_v2 at 0x7f56346d56c0>, <class 'torch.Tensor'>, ( 1.0... {<object object at 0x7f557529e6b0>: <WeakKeyDictionary at 0x7f556a3efbb0>} {'data': {<weakref at 0x7f5615372ed0; to 'PythonKeyTracer' at 0x7f556a3ee5c0>: _... <class 'torch.fx.graph.Graph'> <class 'torch._ops.OpOverloadPacket'> TypeError("cannot pickle 'torch._C.FunctionSchema' object") ``` Upstream bug filed: https://github.com/pytorch/pytorch/issues/89626	2022-11-28 04:03:11 -08:00
Shivam Gupta	853fd5c965	Fix RuntimeError while running examples/eager_mode.py (#1647 )	2022-11-25 10:21:56 -06:00
Vivek Khandelwal	d9cbf01d1e	Revert "build: update llvm tag to 147fe9de" This reverts commit `e45ad313d4`.	2022-11-25 12:41:56 +05:30
Vivek Khandelwal	9cac480a18	Revert "[MLIR][TORCH] Fix indentation and spacing for E2E tests" This reverts commit `3790a4270e`.	2022-11-25 12:41:56 +05:30
Sean Silva	28957adaac	[torchdynamo] Initial TorchDynamo support This adds a basic e2e Config for TorchDynamo using Linalg-on-Tensors/RefBackend. But TorchDynamo is pretty orthogonal to various other pieces, so it should compose nicely with variations like: - Switching out all the backends (Linalg-on-Tensors, TOSA, MHLO) - PyTorch functionalization and decompositions - Taking the example inputs and compiling with all dynamic or all static shapes without duplicating tests. This adds it to the CI, but there are still a lot of XFAIL's. This also adds a helper `from torch_mlir.dynamo import make_simple_dynamo_backend` which simplifies some of the steps for making a Torch-MLIR-based TorchDynamo backend. We include "simple" in the name because we are going to be exploring various things next from the long-term roadmap. The next steps are: - Burn down all the XFAIL's. - Start working on the pieces from the [long-term roadmap](https://github.com/llvm/torch-mlir/blob/main/docs/long_term_roadmap.md). - Add functionalization/decompositions into the TorchDynamo flow and remove reliance on the current Torch-MLIR "frontend". - Write a pure-Python direct FX->MLIR importer. - Hook up the new PyTorch symbolic shape stuff. - Explore PrimTorch decompositions for simplifying backends.	2022-11-24 04:10:25 -08:00
Vivek Khandelwal	3790a4270e	[MLIR][TORCH] Fix indentation and spacing for E2E tests Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-24 12:44:43 +05:30
Vivek Khandelwal	e45ad313d4	build: update llvm tag to 147fe9de Summary of changes: - Update call to `hasNoEffect` utility - `KDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-24 12:44:43 +05:30
Tanyo Kwok	14f1260ac4	Add more mhlo basic converters (#1628 ) * Add more mhlo basic converters * remove unused pinnedMemory constraints * refine naming	2022-11-24 14:28:34 +08:00
Maksim Levental	bfcfd60d55	[MLIR][TORCH] Refix differentiable view (#1639 ) * `BatchMlpLayerModule_basic` passes * Fix https://github.com/llvm/torch-mlir/issues/1618 by stripping `requires_grad` from results of view ops.	2022-11-23 15:35:39 -06:00
Tanyo Kwok	f3f2f10030	Decompose torch.slice_scatter (#1622 ) * Decompose torch.slice_scatter * fix compilation error * update file check * fix ci * fix i64 torch.tensor dtype	2022-11-23 18:14:12 +08:00
Vivek Khandelwal	68f568b704	[MLIR][TORCH] Add E2E support for prims.convert_element_type op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-22 09:36:36 +05:30
Vivek Khandelwal	55c7e66aa7	[MLIR][TORCH] Fix mean and mean.dim op for large-sized inputs This commit fixes the aten.mean and aten.mean.dim op decomposition for supporting large-sized inputs. This commit also fixes the formatting for the file stats.py Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-22 08:38:51 +05:30
Maksim Levental	ed901094c1	Fix https://github.com/llvm/torch-mlir/issues/1618 by stripping `requires_grad` from results of view ops. (#1624 )	2022-11-21 19:15:53 -06:00
Sean Silva	22307a1427	Clean up some parts of the test suite The purpose of the test suite is to accelerate the development of the compiler. However, we had various tests there that were not expected to work, had no in-progress work being tested by the test, and nobody was actively working on them. Having such tests in our test suite just adds clutter and slows down development on the compiler.	2022-11-21 06:14:31 -08:00
Vivek Khandelwal	25ab8fcc1f	[MLIR][TORCH] Fix numel tests for Roll PyTorch action Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-20 19:19:42 +05:30
Vivek Khandelwal	4cbd3927d7	[MLIR][TORCH] Add aten.sort.int op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-20 19:00:41 +05:30
Abhishek Varma	1d949f3ac2	[MLIR][TORCH] Fix aten.upsample_nearest2d op -- aten.upsample_nearest2d.vec op is not present owing to https://github.com/pytorch/pytorch/pull/85638 -- So this commit adds a lowering on aten.upsample_nearest2d. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-11-18 13:41:47 +05:30
Sean Silva	39de4d6265	[cleanup] Make diagnostics better Also remove some unused imports.	2022-11-17 02:09:54 -08:00
Vivek Khandelwal	5f7177da35	[MLIR][TORCH] Add decomposition for aten.var_mean.correction op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-17 13:00:09 +05:30
Sean Silva	3695ca83e6	[torch_mlir.compile] Handle the case of already-scripted models better Closes #1582	2022-11-16 10:47:13 -08:00
Vivek Khandelwal	a1d3afdba9	[MLIR][TORCH] Add E2E support for aten.randint.low op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-16 09:54:18 +05:30
George Petterson	92f385bd9f	[MLIR][TORCH] Add E2E support aten.convolution_backward op This commit adds the decomposition for the `aten.convolution_backward` and `aten.convolution_backward_overrideable` op.	2022-11-15 07:38:26 +05:30
Gleb Kazantaev	6909eaf7fc	Update TorchMlirBackendImpl Methods (#1580 ) * Fix LTC build * Remove passing test from xfail set	2022-11-14 00:37:49 -05:00
Vivek Khandelwal	a558034c1a	[MLIR][TORCH] Fix aten.upsample_nearest2d_backward op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-12 00:05:36 +05:30
Vivek Khandelwal	d571d050fd	[torch_mlir.compile] Fixes issue with the https://github.com/llvm/torch-mlir/issues/1557 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-11 18:05:15 +05:30
Sean Silva	cc468d2d16	[cleanup] Be consistent about apostrophe	2022-11-10 07:42:15 -08:00
Daniel Ellis	a7ac0def45	Move single-tensor-tuple-return test to mlir unit test. Also, add multiple return test.	2022-11-10 09:23:53 -05:00
Xiafei Qiu	4f173c6e0f	update llvm tag to a2620e00. (#1567 ) - also update MHLO to 57ba12a2(branch greencommit/2022-11-07-a2620e00) - change -pass-pipeline format to make tests pass.	2022-11-10 18:39:28 +08:00
Sean Silva	64914603fa	[torch_mlir.compile] Add support for multiple exported methods For AoT deployments models often have multiple exported methods. This patch enables something like this: ``` class TwoMethodsModule(torch.nn.Module): def sin(self, x): return torch.ops.aten.sin(x) def cos(self, x): return torch.ops.aten.cos(x) example_args = torch_mlir.ExampleArgs() example_args.add_method("sin", torch.ones(2, 3)) example_args.add_method("cos", torch.ones(2, 4)) print(torch_mlir.compile(TwoMethodsModule(), example_args)) ``` In the [long-term](https://github.com/llvm/torch-mlir/blob/main/docs/long_term_roadmap.md#tools-for-advanced-aot-deployments) we will need to reconcile this with our story for stateful models and the backend contract being purely functional. For now, this provides some basic infra that seems harmless. Arguably, we could tighten up the backend contract even more to only allow a single compiled function which would prohibit this or require building out a layer above. Fixes #1557	2022-11-10 02:10:22 -08:00
Jae Hoon (Antonio) Kim	2ec4b06bbb	Remove MakeView from IR Builder (#1552 ) * Remove MakeView from IR Builder * Update PyTorch requirements	2022-11-09 13:46:34 -05:00
Ashay Rane	d99b2ddb1b	importer: fix usage after PyTorch update (#1555 ) Unless requested otherwise, PyTorch no longer installs most of the header files under the caffe2 directory (see https://github.com/pytorch/pytorch/pull/87986). This breaks our importer code since we need to use the `MakeGuard()` function to execute statements in the event of exceptions. To fix this issue, this patch implements a rudimentary version of PyTorch's ScopeGuard, where once the class variable goes out of scope, it executes a predefined method.	2022-11-04 15:02:23 -05:00
Vivek Khandelwal	fedf8c0640	[MLIR][TORCH] Add E2E support for aten.upsample_nearest2d_backward.vec op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-04 22:10:07 +05:30
Jae Hoon (Antonio) Kim	0701464c47	Remove view ops from IR builder (#1534 ) * Remove view ops from IR builder * Update PyTorch requirements	2022-10-30 21:42:44 -04:00
Vivek Khandelwal	c86177730d	[MLIR][TORCH] Add E2E support for aten.fill.Tensor op This commit adds the decomposition for `aten.fill.Tensor` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-10-30 18:40:47 +05:30
Ramiro Leal-Cavazos	b723186983	Remove all but one of valsem ops + move fill.Scalar to elementwise (#1531 ) This commit removes almost all of the valsem ops, since the value semantics version of the ops now exist in PyTorch. The only op missing is `aten.bernoulli_.float`. In addition, this commit also simplifies the implementation of `aten.fill.Scalar` by moving it to the pattern that converts elementwise ops.	2022-10-28 15:06:11 +00:00
Vivek Khandelwal	ea602127b6	[MLIR][TORCH] Add E2E support for aten.addcmul_ and aten.addcdiv_ op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-10-28 16:07:50 +05:30
Daniel Ellis	3e199aaf11	Add better error message for single-tensor tuple returns.	2022-10-25 12:48:55 -04:00
Vivek Khandelwal	ca87033d2f	[MLIR][TORCH] Add E2E support for aten.mse_loss op This commit adds decomposition for the `aten.mse_loss` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-10-25 21:06:58 +05:30
Jae Hoon (Antonio) Kim	2f300935bf	Reference lazy graph executor (#1507 ) * Add LazyGraphExecutor registration * Update PyTorch version to 1.14.0.dev20221024 Co-authored-by: Roll PyTorch Action <torch-mlir@users.noreply.github.com>	2022-10-24 17:15:11 -04:00
Chi_Liu	ad6f5848cb	[MLIR][TORCH] Add TorchToTosa lowering for aten.where.self op (#1454 )	2022-10-18 09:39:39 -07:00
Ashay Rane	a9942f343a	Cache PyTorch source builds to reduce CI time (#1500 ) * ci: cache PyTorch source builds This patch reduces the time spent in regular CI builds by caching PyTorch source builds. Specifically, this patch: 1. Makes CI lookup the cache entry for the PyTorch commit hash in pytorch-version.txt 2. If lookup was successful, CI fetches the previously-generated WHL file into the build_tools/python/wheelhouse directory 3. CI sets the `TM_PYTORCH_INSTALL_WITHOUT_REBUILD` variable to `true` 4. The build_libtorch.sh script then uses the downloaded WHL file instead of rebuilding PyTorch * ci: warm up PyTorch source cache during daily RollPyTorch action This patch makes the RollPyTorch action write the updated WHL file to the cache, so that it can be later retrieved by CI that runs for each PR. We deliberately add the caching step to the end of the action since the RollPyTorch action never needs to read from the cache, although executing this step earlier in the process should not cause problems either.	2022-10-18 00:42:42 -05:00
Ramiro Leal-Cavazos	82a3860e25	build: update llvm tag to 4546397e (#1502 ) This commit makes the following changes needed to update bump LLVM: - Replace `linalg.init_tensor` with `tensor.empty` (see: https://reviews.llvm.org/D135129) - Replace `NoSideEffect` with `Pure` (see https://reviews.llvm.org/D135505) - Replace `body` region accessor for `ReduceOp` and `ReduceWindowOp` with `getBody` - Fix incorrect use of `tosa::ReduceSumOp` in `AtenNativeLayerNormOp` conversion pattern. The result type of `tosa::ReduceSumOp` must have the same rank as the input type. (see: https://www.mlplatform.org/tosa/tosa_spec.html#_reduce_sum) Co-authored-by: Ashay Rane <ashay@users.noreply.github.com> Co-authored-by: Ashay Rane <ashay@users.noreply.github.com>	2022-10-18 04:22:53 +00:00
Ramiro Leal-Cavazos	86095dd432	Replace linear transformation with `low` and `high` in test inputs (#1485 ) This commit replaces test inputs that were being linearly transformed by multiplying and adding/subtracting to the input tensor with inputs that use the `low` and `high` keyword arguments instead.	2022-10-14 18:52:07 +00:00
Gleb Kazantaev	bdb5083d33	New ops support & enhancements (#1494 ) * New ops support & enhancements * Enabled xfail ltc tests	2022-10-14 10:28:21 -04:00
Prashant Kumar	3a2cd23380	[LINALG] Add lowering for aten::round op. -- Added the lowering for aten::round op. -- Added the folding for integer cases.	2022-10-13 02:41:26 +05:30
Sean Silva	c8280d67bd	Remove the heavydep tests We originally added these to help bring up more complex models with heavier dependencies. However, over time it has become clear that these models usually require more than just heavier dependencies -- they often require a nontrivial amount of "one-off" code to extract the relevant parts of the model and compile them. This is not a good fit for a component in the core Torch-MLIR repo. However, in the community, nod.ai has developed the ["Shark Tank"](https://github.com/nod-ai/SHARK/tree/main/tank) which has all the appropriate code to wrangle these models and organize them. We intend to lean more heavily on that as a community and improve the symbiosis there to serve the role that these heavydep tests were meant to play.	2022-10-12 05:19:36 -07:00
Sean Silva	6403c0e56f	torch_mlir.compile: allow custom backend_legal_ops set Allow customizing `backend_legal_ops` for "torch" output type, since we don't know which backend will be used (it might be a custom backend). We don't allow customizing the `backend_legal_ops` for the other output types (Linalg, TOSA, MHLO) since those backends control their set of legal ops directly. Fixes #1418	2022-10-12 04:21:22 -07:00
Abhishek Varma	61db1b5c4d	[MLIR][TORCH] Add e2e support for `aten.Mish` op (#1470 ) -- This commit adds e2e support for `aten.Mish` op. -- `aten.Mish` op is decomposed as following :- Mish(x) = x * Tanh(Softplus(x)) Signed-off-by: Abhishek Varma <avarma094@gmail.com> Signed-off-by: Abhishek Varma <avarma094@gmail.com>	2022-10-11 14:03:10 -07:00
Jae Hoon (Antonio) Kim	3e08f5a779	Fix `fromIntArrayRef` call (#1479 ) * Fix fromSymint call * Update PyTorch requirement * Re-enable LTC	2022-10-11 13:29:07 -04:00
Ashay Rane	aefbf65e27	Disable LTC and update PyTorch (#1472 ) * build: disable LTC again so that we can bump PyTorch version When built using PyTorch's master branch, the LTC code has been failing to build for a few days. As a result, the PyTorch version referenced by Torch-MLIR is stalled to the one from October 4th. In an effort to advance the PyTorch version, this patch disables LTC, and a subsequent patch will advance the PyTorch version. * update PyTorch version to 1.14.0.dev20221010 Also disables the `UpSampleNearest2dDynamicFactor_basic` e2e test, since the (PyTorch) oracle differs from the computed value for both the refbackend and the eager_mode backends.	2022-10-10 23:05:40 -05:00
Gaurav Shukla	da90a25f90	[MLIR][TORCH] Add E2E support for `aten.[div.int\|bitwise_or.Tensor]` ops This commit adds lowering of `aten.div.int` and `aten.bitwise_or.Tensor` ops. Both these ops are required in order to support bloom_560m model. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-10-10 22:28:51 +05:30
Vivek Khandelwal	d3cc3f1aff	[tosa] Add lowering for aten.to.dtype and aten._to_copy op This commit adds the TorchToTosa lowering for `aten.to.dtype` and `aten._to_copy` op. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-10-06 12:00:25 +05:30
Daniel Ellis	e7b2b84a66	Update torch-mlir-opt error message.	2022-10-05 15:02:10 -04:00
Jae Hoon (Antonio) Kim	c57d801260	Fix functionalize_aten_op calls for symint ops (#1459 ) * Fix functionalize_aten_op calls for symint ops * Update PyTorch version	2022-10-05 10:23:48 -04:00
Gleb Kazantaev	708fa346a6	Fix Base Lazy Backend Type Conversion (#1412 ) * Fix c10::prim::Constant conversion; Added CAPI for passes; Added passes to base lazy backend * Update ivalue_importer to use ImportOptions; Added tests for non-value/value tensor types * Added tests for scalar Constant import; Updated MB::importFunction to use ImportOptions * Test updates * Move back module variable name * Remove RefineTypes from TorchMlirLoweringContext::Build() * Rename pass; Remove passes from base lazy backend * Rename pass to VerifyBackendContractPass * Aligned cmd pass name; Fixed TorchConversion passes registration	2022-10-04 15:53:28 -07:00
Daniel Ellis	2ba71af651	Add support for mv decomposition.	2022-10-04 11:34:45 -04:00
Prashant Kumar	6777a9484d	[LINALG] Add lowering for the aten.upsample_nearest2d op.	2022-10-04 17:20:29 +05:30
Daniel Ellis	4d47f1671a	Reject dictionary inputs when tracing. The underlying error message was misleading. See https://github.com/llvm/torch-mlir/issues/1425	2022-09-30 16:02:35 -04:00
AmosLewis	940959589b	[MLIR][TORCH] Add Byte and Char Dtype support	2022-09-30 13:19:31 +05:30
Ashay Rane	0b46462528	Miscellaneous fixes for Windows builds (#1376 ) * test: allow spaces in path to Python executable On Windows, the path to the Python binary may contain spaces, so this patch adds quotes around the path to the python executable. Thanks to @sstamenova for suggesting the fix! * python: remove header file that causes Windows build failures Similar to https://reviews.llvm.org/D125284, we can safely remove this header file without affecting the build on either Linux. It is necessary to remove this header file on Windows builds since otherwise it causes build errors. * python: drop `TORCH_API` from function defined in Torch-MLIR `TORCH_API` should apply to functions that are either exported by libtorch.so or ones that are imported from libtorch.so by its downstream consumers (like Torch-MLIR). Neither case applies to the `importJitFunctionAsFuncOp()` function, since it is defined in Torch-MLIR (and thus outside libtorch.so). This patch fixes the problem by dropping `TORCH_API` from that function's declaration. * python: make output of class annotations deterministic The `class-annotator-repr.py` test checks for class annotations in a specific order, but prior to this patch, the order was non-deterministic, since the code iterated on an _unordered_ map. This patch makes the iteration order deterministic through two changes: 1. using a sorted map 2. using the class qualified name instead of the address of the class in memory * test: use Python3_EXECUTABLE as interpreter path for consistency This ensures that tests use the Python3 version that was detected using CMake, instead of whichever python version that happens to be in the PATH variable when invoking the test. * test: fix RUN string The parenthesis syntax does not run on Windows (the shell interprets the `(` character as part of the path). Moreover, the ODR violation in the comment no longer seems to apply. * python: port parallel test framework to Windows Since Windows does not support `fork` natively, Python's `multiprocessing` module needs to use `spawn` on Windows. However, to use `spawn`, the multiprocessing module serializes (or pickles) the worker function and its arguments. Sadly, the multiprocessing module (both the default one in Python and the one that is extended in PyTorch) is unable to serialize lambda functions (see https://stackoverflow.com/a/19985580 for details). Unfortunately, given how our tests are structured, we require that the function under test is passed as an argument to another function, so we cannot sidestep our use of lambda functions. To resolve this problem, this patch makes use of the `multiprocess` and `dill` Python modules, which together offer a multiprocessing mechanism that can serialize lambda functions. The multiprocess module also offers a process pool, which simplifies the code for our parallel testing framework.	2022-09-29 12:07:43 -05:00
Vivek Khandelwal	6db513c51d	[tosa] Add support for some cases of aten.broadcast_to op (#1429 ) This commit adds support for TorchToTosa lowering of `aten.broadcast_to` op for cases: 1.) When the rank of input and output tensor is equal. 2.) When the rank of input tensor is zero. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-29 09:40:56 -07:00
Jae Hoon (Antonio) Kim	fa5a8e21a3	Propagate parameter names to TorchMlirComputation (#1420 ) * Propagate parameter name to MLIR * Add TorchMlirNode Constructor Hook * Make func_op mutable - Purpose of this is to allow modification of func_op by subclass backend * Clean up unnecessary changes * Remove unnecessary attribute case * Address PR comments	2022-09-29 11:43:39 -04:00
JakopinA	8ef0c874c2	Implement Expand/Collapse Functionality for Aten.View (#1353 )	2022-09-27 11:08:14 -07:00
武家伟	c03aa63325	[MLIR] Add canonicalizer for aten.slice.t op (#1413 ) * [MLIR] Add canonicalizer for aten.slice.t op * Add mlir tests and strength the canonicalizer * rename variable Co-authored-by: Vremold <xremold@gamil.com>	2022-09-26 14:35:50 -07:00
Jae Hoon (Antonio) Kim	3e27aa2be3	Fix as_strided/slice symint (#1401 ) * Fix as_strided symint * Re-enable LTC tests * Re-enable LTC * Add hardtanh shape inference function * Fix slice symint	2022-09-26 12:16:49 -04:00
武家伟	ab7aa01b1e	[MHLO] Add torch-to-mhlo e2e support for aten.gather op (#1410 ) * Add torch-to-mhlo e2e support for aten.gather op * Add more e2e tests for torch.aten.gather op	2022-09-25 22:07:46 +08:00
Vivek Khandelwal	bc11e1aba6	[tosa] Add "-tosa-to-tensor" pass in the lowering pipeline Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-24 10:03:07 +05:30
Tanyo Kwok	72e422b589	Add relu6 and binary broadcasts (#1408 ) * Add relu6 and binary broadcasts	2022-09-23 20:39:15 +08:00
Sean Silva	7a77f9fe3d	Add a way to turn off crashing tests This adds a very long and obnoxious option to disable crashing tests. The right fix here is to use the right multiprocessing techniques to ensure that segfaulting tests can be XFAILed like normal tests, but we currently don't know how to implement "catch a segfault" in Python (patches or even just ideas welcome). Motivated by #1361, where we ended up removing two tests from all backends due to a failure in one backend, which is undesirable.	2022-09-23 05:01:39 -07:00
Vivek Khandelwal	5090ac9359	[MLIR][TORCH] Add a test for sum.dim_IntList op working for tosa (#1387 ) Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com> Co-authored-by: Suraj Sudhir <16977902+sjarus@users.noreply.github.com>	2022-09-20 11:38:09 -07:00
Vivek Khandelwal	1ffd42bbde	[MLIR][TORCH] Add TorchToTosa lowering for aten.broadcast_to op (#1386 ) Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-20 10:04:51 -07:00
武家伟	0e2e94d542	Add torch-to-mhlo e2e support for AtenArangeStartStepOp (#1385 ) Co-authored-by: Vremold <xremold@gamil.com>	2022-09-20 22:31:24 +08:00
Jae Hoon (Antonio) Kim	8967463980	Fix symint ops and blacklist `lift_fresh_copy` (#1373 ) * Add symint to native functions yaml * Re-enable LTC * Fix new_empty_strided and narrow_copy	2022-09-20 10:16:04 -04:00
武家伟	4f3cd236dd	Strength the shape inference for aten.arange-like op (#1367 ) Strength the shape inference for aten.arange-like op by 1. registering aten.sub and aten.ceil.Scalar op and design folders for them. 2. register a new constant-like op: Torch::ConstantNumberOp and design canonicalizer for it.	2022-09-20 12:40:19 +08:00
Vivek Khandelwal	04f3a4ffce	[MLIR][TORCH] Add support for bool element type for aten.sum[.dim_IntList] op This commit adds bool element type support for `aten.sum` and `aten.sum.dim_IntList` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-17 09:18:34 +05:30
Ashay Rane	1895b581c4	shape-lib: generate string as multiple lines to work with MSVC (#1370 ) As @oroppas identified, literal strings that are over 16,380 characters cause the MSVC compiler to throw an error (C2026), eventually causing the Windows build of Torch-MLIR to fail because the length of the generated MLIR for the shape library crosses the allowed threshold. This patch fixes the problem by making the Python script generate one literal string per line to satisfy the MSVC compiler. Thanks to @oroppas for the bulk of the effort required to resolve this!	2022-09-16 15:16:01 -05:00
Ashay Rane	2bb5f4d8fe	build: update llvm tag to 4d4ca6c9 (#1359 ) Summary of changes: - Updated emitAccessorPrefix since the default value has changed (https://reviews.llvm.org/D133179) - Updated RefineTypes pass since Lattice::isUninitialized() is removed (https://reviews.llvm.org/D132800) - Updated MHLO tag so that it builds with the updated LLVM tag - Disabled two tests that cause segfaults in the TOSA backend (see Issue #1361)	2022-09-13 21:24:43 -05:00
gpetters94	48418b9c22	Fold away type_as (#1358 )	2022-09-12 18:59:12 -04:00
Vivek Khandelwal	71b1f0dd7a	[MLIR][TORCH] Add E2E support for aten.index.Tensor_hacked_twin op This commit adds lowering of `index.Tensor_hacked_twin` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-12 21:47:18 +05:30
George Petterson	a12b9c4492	Add lowering for aten::cumsum	2022-09-12 09:28:07 +05:30
Vivek Khandelwal	326f21229e	[MLIR][TORCH] Fix shape calculation for aten::pow.Tensor_Tensor op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-08 21:14:12 +05:30
Vivek Khandelwal	e35741fb1d	[MLIR][TORCH] Add E2E support for aten.bitwise_not op This commit adds lowering of `aten.bitwise_not` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-08 17:52:12 +05:30

661 Commits (67ab708b636d300db003fbec902411315e222bfb)
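
Note on commit 0983a7f93a (LCG modulus fix): the following is a minimal Python sketch of the arithmetic that message describes, not the refbackend code itself. It assumes the multiplier and increment are Knuth's MMIX values from the linked Wikipedia page; the function names `lcg_next` and `lcg_next_buggy` are hypothetical. Because Python integers do not wrap, the reduction modulo 2^64 that i64 arithmetic performs implicitly is written out here as a mask with 2^64 - 1.

```python
# Assumed constants: Knuth's MMIX LCG parameters (modulus 2**64).
MULTIPLIER = 6364136223846793005
INCREMENT = 1442695040888963407
MASK_64 = (1 << 64) - 1  # for non-negative x, x % 2**64 == x & MASK_64


def lcg_next(seed: int) -> int:
    """Advance the generator one step, emulating 64-bit wraparound."""
    return (MULTIPLIER * seed + INCREMENT) & MASK_64


def lcg_next_buggy(seed: int) -> int:
    """The `temp & 127` variant from the message: at most 128 distinct states."""
    return (MULTIPLIER * seed + INCREMENT) & 127


if __name__ == "__main__":
    state = 42
    for _ in range(3):
        state = lcg_next(state)
        print(state)
    # The buggy variant collapses every seed into the range 0..127.
    print(len({lcg_next_buggy(s) for s in range(1000)}))
```

Masking with 127 keeps only the low 7 bits of each state, which is why that version could not behave like a 64-bit generator.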