torch-mlir

Commit Graph

Author	SHA1	Message	Date
Sambhav Jain	3e836d8dad	[fx_importer] Convert non-persistent buffers lifted as tensor constants (#2902 ) The investigation is largely recorded in https://github.com/llvm/torch-mlir/pull/2881, but this change allows us to capture non-persistent buffers that were lifted as tensor constants (after https://github.com/pytorch/pytorch/pull/118969 landed in upstream PyTorch), and propagate them to `Torch` dialect as "frozen" `torch.vtensor.literal`. I believe this patch should work with both nightly and stable PyTorch, but will let CI confirm the same. Thanks @stellaraccident for the valuable pointers and guidance. --------- Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-13 12:38:32 -08:00
Aart Bik	b6f4ca512e	[torch-mlir][sparse] sparsity metadata refinement (#2901 ) Various improvements on sparsity metadata: (1) define single data structure for all sparsity related metadata (2) handle batched dense dimensions, as well as dense subtensor dimensions (3) refine sparsity propagation for deeper networks	2024-02-12 16:10:57 -08:00
Aart Bik	be8375d350	[torch-mlir][sparse] implement first sparse_jit end-to-end path (#2894) This PR introduces a sparse_jit wrapper that can run simple models with sparse tensor inputs end-to-end. The implementation shows all required components on modifying sparse tensor types with a 1:N relation on the call sites. Two tests show that the JIT runs end-to-end while computing the correct results. More details to follow (generalizing to COO and different ranks, as well as support for output sparse tensors), but the general concepts are all here now. _Update: Thanks to Rob, bump to proper LLVM/MLIR hash is done!_ _NOTE that all parameter passing changes are nicely done "downstream" in MLIR, so very few changes are required in torch-mlir code proper_ --------- Co-authored-by: Franz Haniel <77495327+frafranz@users.noreply.github.com> Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2024-02-12 10:04:54 -08:00
saienduri	bfcf93ea21	Rename torch_mlir.compile APIs and introduce FX based analogs (#2842 ) Link to related RFC: https://discourse.llvm.org/t/rfc-rename-torch-mlir-compile-apis-and-introduce-fx-based-analogs/76646 This commit updates the documentation, tests, CMake files, and API for the proposed changes in the RFC. There is a new torch_mlir/fx.py for user level APIs related to importing modules and a corresponding test for this path can be found at test/python/fx_importer/basic_test.py. --------- Co-authored-by: MaheshRavishankar <mravisha@amd.com>	2024-02-06 19:07:59 -08:00
Daniel Garvey	faf7d4aaa5	[fx_importer] Add support for 0D tensors (#2870) Adds an escape hatch from creating a DenseResourceElementsAttr for single-value tensors, using DenseElementsAttr instead. For 0-D or single-element tensors, splats are better represented as DenseElementsAttr, so DenseResourceElementsAttr is not used for them.	2024-02-06 00:19:31 -06:00
Dave Liddell	04be6ba773	Make the onnx importer more robust for internal/external and large models (#2794 ) Fix for https://github.com/llvm/torch-mlir/issues/2765 The onnx docs say that you can't do shape inference using the in-memory API for models > 2 GB. This fix replaces that API with the file-based API. Since the new API generates an intermediate file, also added a --keep switch to keep that file, which I delete by default. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-01-31 21:58:43 -08:00
Rob Suderman	54e258792c	[onnx] Import `onnx` constants as `onnx.Constant` instead of literals (#2831 ) To handle the conversion from raw bytes to `DenseElementsAttr` we need to handle the endianness conversion during `torch-onnx-to-torch`. Therefore when importing `onnx.Constant` it is better to represent using the `onnx` constant operation so that only one location requires the endianness correction.	2024-01-31 11:41:06 -08:00
Aart Bik	105aad6f57	[torch-mlir] provide FX traced graph importer for sparse tensors (#2817) Note that we are waiting for actual FX traced graph support for sparse tensors. For details see https://github.com/pytorch/pytorch/issues/117188 Until then, however, we provide this clever importer that builds the FX traced graph for the dense case and then puts a sparse annotation back on the parameters. With import test.	2024-01-30 21:22:12 -08:00
Stella Laurenzo	77c14ab22b	[ci] Upgrade to new runners and disable unsupported jobs. (#2818) Per the RFC and numerous conversations on Discord, this rebuilds the torch-mlir CI and discontinues the infra and coupling to the binary releases (https://discourse.llvm.org/t/rfc-discontinuing-pytorch-1-binary-releases/76371). I iterated on this to get latency back to about what it was with the old (much larger and non-ephemeral) runners: About 4m - 4.5m for an incremental change. Behind the scenes changes: * Uses a new runner pool operated by AMD. It is currently set to manual scaling and has two runners (32-core, 64GiB RAM) while we get some traction. We can either fiddle with some auto-scaling or use a schedule to give it an increase during certain high traffic hours. * Builds are now completely isolated and cannot have run-to-run interference like we were getting before (i.e. lock file/permissions stuff). * The GHA runner is installed directly into a manylinux 2.28 container with upgraded dev tools. This eliminates the need to do sub-invocations of docker on Linux in order to run on the same OS that is used to build wheels. * While not using it now, this setup was cloned from another project that posts the built artifacts to the job and fans out testing. Might be useful here later. * Uses a special git cache that lets us have ephemeral runners and still check out the repo and deps (incl. llvm) in ~13s. * Running in an Azure VM Scale Set. In-repo changes: * Disables (but does not yet delete): * Old buildAndTest.yml jobs * releaseSnapshotPackage.yml * Adds a new `ci.yml` pipeline and scripts the steps in `build_tools/ci` (by decomposing the existing `build_linux_packages.sh` for in-tree builds and modularizing it a bit better). * Test framework changes: * Adds a `TORCH_MLIR_TEST_CONCURRENCY` env var that can be used to bound the multiprocess concurrency. Ended up not using this in the final version but is useful to have as a knob. * Changes the default concurrency to `nproc * 0.8 + 1` vs `nproc * 1.1`. We're running on systems with significantly less virtual memory and I did a bit of fiddling to find a good tradeoff. * Changed multiprocess mode to spawn instead of fork. Otherwise, I was getting instability (as discussed on discord). * Added MLIR configuration to disable multithreaded contexts globally for the project. Constantly spawning `nproc * nproc` threads (more than that actually) was OOM'ing. * Added a test timeout of 5 minutes. If a multiprocess worker crashes, the framework can get wedged indefinitely (and then will just be reaped after multiple hours). We should fix this, but this at least keeps the CI pool from wedging with stuck jobs. Functional changes needing followup: * No matter what I did, I couldn't get the LTC tests to work, and I'm not 100% sure they were being run in the old setup as the scripts were a bit twisty. I disabled them and left a comment. * Dropped out-of-tree build variants. These were not providing much signal and increase CI needs by 50%. * Dropped MacOS and Windows builds. Now that we are "just a library" and not building releases, there is less pressure to test these commit by commit. Further, since we bump torch-mlir to known good commits on these platforms, it has been a long time since either of these jobs have provided much signal (and they take ~an hour+ to run). We can add them back later post-submit if ever needed.	2024-01-27 18:35:45 -08:00
Yuanqiang Liu	e73c5368fb	[FxImporter] Make FxImporter compatible with python<=3.9 (#2802) Torch with py3.9 is still widely used.	2024-01-26 09:01:47 +08:00
Dave Liddell	d452c4f4c0	Fix onnx importer to treat Constant values as static (#2780 ) Fixes https://github.com/llvm/torch-mlir/issues/2764 In the case of OPT, there are ConstantOfShape ops whose input shape is not static (that is, an initializer), but rather comes from a Constant op. The importer can't handle such non-static input shapes. The fix here is to create initializers for a subset of Constant ops (ones with "value" attributes), so that their outputs can be used statically. Additionally, there was no case for creating a splat of int64, so I added that as well. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-01-22 13:00:05 -08:00
Rob Suderman	85b86b36a2	[onnx] Fix importer variable names to make `mlir` legal (#2690 ) Some names for `onnx` identifiers are not legal in `mlir-ir`. Sanitize so that the generated `ir` is legal.	2023-12-21 17:05:18 -08:00
Stella Laurenzo	ccd469ca0d	[fx] Upstream the turbine FxImporter to torch-mlir. (#2681 ) Changes made during upstreaming: * Removed comments attributing some copied code back to torch-mlir (since it is now repatriated). * Re-organized imports. * Inlined RefMapping/RefTracker and TypeSubclassMap from an external utility module. * Added FxImporter class comments. * Updated stack trace extraction to be fail safe. * Added an entry-point for `import_frozen_exported_program` which uses the shiny new upstream `torch.export.export()` API (versus the lower-level/older API that Turbine is presently using). This necessitated a small FX rewrite to line external state management up with current conventions. * Adapted one of Turbine's importer tests to go with this initial submission. Turbine unfortunately has a lot of more-integration-ey tests, and I would like to extract those as more of unit tests of the importer features and upstream them that way vs trying to copy directly. For now, one overall test with the initial submission gets us moving. I acknowledge that there are some code quality things that could be improved in this submission: this was authored over the course of many months (and often via some trial and error). I would like to keep it relatively converged with the downstream for the next few steps while getting the test suite upstreamed. And then it will be easier to take a hygienic pass through the code. Including co-authors for contributors in the git log of the original repository. Co-authored-by: Ean Garvey <87458719+monorimet@users.noreply.github.com> Co-authored-by: Avinash Sharma <aviator1994@gmail.com> Co-authored-by: Arham Khan <arhammkhan@gmail.com> Co-authored-by: brucekimrokcmu <kwangkyk@alumni.cmu.edu> Co-authored-by: saienduri <77521230+saienduri@users.noreply.github.com>	2023-12-21 08:40:10 -08:00
Stella Laurenzo	ed4df38e8d	[onnx] Add torch-mlir-import-onnx tool. (#2637 ) Simple Python console script to import an ONNX protobuf to the torch dialect for additional processing. For installed wheels, this can be used with something like: ``` torch-mlir-import-onnx test/python/onnx_importer/LeakyReLU.onnx ``` Or from a dev setup: ``` python -m torch_mlir.tools.import_onnx ... ```	2023-12-12 22:01:30 -08:00
Stella Laurenzo	74f7a0c9d6	Upstream the ONNX importer. (#2636 ) This is part 1 of 2, which will also include upstreaming the FX importer. I started with ONNX because it forces some project layout updates and is more self contained/easier as a first step. Deviating somewhat from the RFCs on project layout, I made the following decisions: * Locating the `onnx_importer.py` into `torch_mlir.extras` as Maks already has opened up that namespace and it seemed to fit. Better to have fewer things at that level. * Setup the build so that the root project only contains MLIR Python and pure Python deps (like the importers), but this can be augmented with the `projects/` adding more depending on which features are enabled. * The default build continues to build everything whereas in `TORCH_MLIR_ENABLE_ONLY_MLIR_PYTHON_BINDINGS=1` mode, it builds a `torch-mlir-core` wheel with the pure contents only. `onnx_importer.py` and `importer_smoke_test.py` are almost verbatim copies from SHARK-Turbine. I made some minor local alterations to adapt to paths and generalize the way they interact with the outer project. I expect I can copy these back to Turbine verbatim from here. I also updated the license boilerplate (they have the same license but slightly different project norms for the headers) but retained the correct copyright. Other updates: * Added the ONNX importer unit test (which also can generate test data) in lit, conditioned on the availability of the Python `onnx` package. In a followup once I know everything is stable, I'll add another env var that the CI can set to always enable this so we know conclusively if tests pass. * Moved the ONNX conversion readme to `docs/`. * Renamed CMake option `TORCH_MLIR_ENABLE_ONLY_MLIR_PYTHON_BINDINGS` -> `TORCH_MLIR_ENABLE_PYTORCH_EXTENSIONS` and inverted the sense. Made the JitIR importer and LTC options `cmake_dependent_options` for robustness.	2023-12-12 19:02:51 -08:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00
Zhekun(Josh) Zhang	88d4c475d3	[Torch] Fix mixP case for non value semantic ops (#2540 ) NonValueSemantic Ops like Add_, div_, etc. expect result DType to be the same as the first input. However, current implementation would result in wrong result type for case like: ```python a = torch.randn(3, 3).half() # float16 b = torch.randn(3, 3) # float32 a += b # i.e. torch.ops.aten.add_(a, b) ``` torch expects `a` to be float16, but dtype refinement would infer float32 type, since it's replaced by `aten.add`.	2023-11-02 12:40:08 +08:00
Daniel Garvey	4901773f77	add uncovered cases in view lowering (#2524) removes unnecessary checks from empty strided	2023-11-01 21:56:44 -05:00
Yuanqiang Liu	365655ca29	[Torch Dialect] add canonicalize pattern for aten.floor with integer … (#2534 ) …type	2023-11-02 09:51:31 +08:00
saienduri	a2e694df40	add e2e support for torch.eye operations (aten.eye, aten.eye.m) (#2478 )	2023-11-01 11:23:28 -07:00
Daniel Garvey	1d41f7b6fe	Rework AtenEmptyStridedOp checks (#2537 ) Now using Value instead of Ints. Trades compile failure for a runtime assert	2023-10-31 22:56:54 -05:00
xiaolou86	4199feffed	Fix typos in comments (#2539 ) Fix typos in comments	2023-10-31 20:10:47 -07:00
JianzheXiao	e8706957c0	[Torch Dialect] Add Support for aten.unflatten.int (#2475 ) As title, Add support for aten.unflatten.int, support dim to be negative and one of the sizes' elements to be -1	2023-10-31 15:36:16 +08:00
Yuanqiang Liu	e7282487ea	[Torch Dialect] support aten.glu (#2531 )	2023-10-26 10:36:18 +08:00
Sarthak Gupta	7633619ed2	[torch] Implement stronger verifiers for non-value semantic ops (#2519 ) Attempt to solve https://github.com/llvm/torch-mlir/issues/2490 Changes for Non Value Semantic Ops having the `IsTrailingUnderscoreInplaceVariant` trait : - AnyTorchTensorType -> Torch_NonValueTensorType - AnyTorchOptionalTensorType -> AnyTorchOptionalNonValueTensorType - AnyTorchListOfOptionalTensorType -> AnyTorchListOfOptionalNonValueTensorType - AnyTorchListOfTensorType -> AnyTorchListOfNonValueTensorType Created three new tensor types for optional and list non value tensors.	2023-10-21 09:09:55 -07:00
Ze Zhang	f2c53b8ca5	Add aten.isclose support and its torch-to-tosa lowering (#2512 ) Add aten.isclose op Add its torch-to-tosa lowering Update the TorchToTosa/basic.mlir tests To test e2e tosa lowering: `python -m e2e_testing.main -v -c=tosa` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2023-10-16 09:44:53 -07:00
Ze Zhang	e649e06b7b	Add aten.unflatten.int support and its torch-to-tosa lowering (#2509 ) Add aten.unflatten.int op Add its torch-to-tosa lowering Update the TorchToTosa/basic.mlir tests To test e2e tosa lowering: `python -m e2e_testing.main -v -c=tosa` --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2023-10-13 18:39:41 -07:00
Ramiro Leal-Cavazos	2e5d65064c	[linalg] Add handling for leading and trailing size-1 dims in ViewOp This commit adds to the lowering of `aten.view` handling for the following cases: - `(..., a.size(i))` -> `(..., a.size(i), 1, ..., 1)` - `(..., a.size(i), 1, ..., 1)` -> `(..., a.size(i))` - `(a.size(i), ...)` -> `(1, ..., 1, a.size(i), ...)` - `(1, ..., 1, a.size(i), ...)` -> `(a.size(i), ...)`	2023-10-03 23:04:52 +00:00
Ramiro Leal-Cavazos	1c508af0ba	Revert "[linalg] Fix handling of trailing size-1 dimensions in aten.view (#2474 )" This reverts commit `7c6b9d2445`.	2023-10-03 23:04:52 +00:00
Vivek Khandelwal	ca6ce8974f	[MLIR][TORCH] Add support for int8 dtype for sub, add, and bitwise_and op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-03 22:12:31 +05:30
Jae Hoon (Antonio) Kim	32d9b20bde	Add linspace/cumprod/roll ops (#2498 ) Add linspace/cumprod/roll ops to ODS and add shape inference functions to make it work with LTC. Also, add some tensor utils to LTC library for searching for non-detach copy nodes.	2023-10-03 11:01:07 -04:00
Vivek Khandelwal	9293326e1e	[MLIR][TORCH] Add support for bitwise_right_shift and bitwise_and.Scalar op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-02 13:06:59 +05:30
Vivek Khandelwal	c434736ee9	[MLIR][TORCH] Add support for conversion to int8 dtype Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-10-02 09:48:46 +05:30
Vivek Khandelwal	71ac62f3a8	build: manually update PyTorch version Set PyTorch and TorchVision version to nightly release 2023-09-28. aten.baddbmm changes done because upstream PyTorch has now added support for fp16 gemm on CPU. Refer: `9399e0b1ff`	2023-10-02 09:48:32 +05:30
saienduri	4e1dd3bf10	add e2e support for torch.log10 (#2479 )	2023-09-28 10:17:03 -07:00
Ramiro Leal-Cavazos	7c6b9d2445	[linalg] Fix handling of trailing size-1 dimensions in aten.view (#2474 ) This commit adds to the lowering of `aten.view` handling for the following cases: - `(..., a.size(i))` -> `(..., a.size(i), 1, ..., 1)` - `(..., a.size(i), 1, ..., 1)` -> `(..., a.size(i))` Fixes: https://github.com/llvm/torch-mlir/issues/2448	2023-09-27 09:09:30 -07:00
Vivek Khandelwal	7760bda8ee	build: manually update PyTorch version Set PyTorch and TorchVision version to nightly release 2023-09-26. aten._convolution.deprecated changes done because upstream PyTorch has now added support for fp16 native convolution on CPU. Refer: `7c9052165a` Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-09-27 16:24:58 +05:30
Bruce Kim	a520d39f84	[MLIR][TORCH] Add device "cpu" support for aten.to.dtype_layout op (#2481) This PR adds device="cpu" support for the `aten.to.dtype_layout` op and a corresponding e2e test suite. (refer: PR https://github.com/llvm/torch-mlir/pull/812/)	2023-09-25 10:00:19 -04:00
Gleb Kazantaev	059041e0fe	[LTC] Support torch.ones/zeros/arange ops (#2440 )	2023-09-21 13:25:14 -04:00
David Gens	023fc90072	[Torch Dialect] add avg_pool 2d and 3d op variants (#2473 ) Adds ODS for `avg_pool2d` and `avg_pool3d`, including their backward and `adaptive_` variants.	2023-09-20 13:47:08 -04:00
Bruce Kim	40913a36c2	[MLIR][TORCH] Add E2E support for aten.empty_strided decomposition op (redo PR) (#2459) Re-submits the same change as #2457: I mistakenly thought the review was complete and merged it, so it was reverted. Adds the empty_strided decomposition. Per #1776, this decomposition only supports default stride values, because the indices used when accessing or indexing the tensor are determined by the strides. MLIR does not implicitly support custom strides; it assumes the strides are default while iterating over the tensor.	2023-09-13 10:04:31 -07:00
Vivek Khandelwal	4b4c38da46	build: manually update PyTorch version Set PyTorch and TorchVision version to nightly release 2023-09-13. Ref: `464f9c3725` Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-09-13 21:25:21 +05:30
Stella Laurenzo	078d1e1a1d	Remove mlir-hlo (replace with stablehlo). (#2460 ) We just have to do this: I ran into an issue today where I needed to make a one line patch to stablehlo to work around a compiler issue, and it is completely unapparent how to do so given that the mlir-hlo repo is a read-only export and is at the tail end of a multi-week integration chain from the open-source stablehlo repo. We've discussed this often enough and gotten +1 from everyone that they are ok with taking the e2e testing hit if it becomes necessary: It is necessary as the current situation is unmanageable. Looking at it, I expect it wouldn't actually be very difficult to build a little runner binary out of the stablehlo interpreter and subprocess call that in order to get the testing coverage back. I leave that as an exercise to the users of this part of the stack and recommend following the breadcrumbs from the deleted python/torch_mlir_e2e_test/stablehlo_backends/linalg_on_tensors.py file and the main.py changes. Note that I am pointing us at a stablehlo fork for the moment until it is apparent that we don't need to carry any local patches to it. We can update this in a few days if everything is clear.	2023-09-12 19:10:02 -07:00
Ramiro Leal-Cavazos	106b58597a	Revert "[MLIR][TORCH] Add E2E support for aten.empty_strided decomposition op (#2457 )" (#2458 ) This reverts commit `97bec86a8b`.	2023-09-12 13:57:47 -07:00
Bruce Kim	97bec86a8b	[MLIR][TORCH] Add E2E support for aten.empty_strided decomposition op (#2457 ) * implemented e2e test case, shape, dtype func * AtenEmptyStrided decompose op implemented * xfailed test module in ltc	2023-09-12 13:37:02 -07:00
Arham Khan	82456eefed	[MLIR][TORCH] add E2E support for aten.new_full (#2425 ) * implement aten.new_full * remove extraneous tests	2023-09-12 09:29:08 -05:00
Vivek Khandelwal	23b72244b1	[MLIR][TORCH] Add different dtype support for aten.bmm op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-09-12 12:38:46 +05:30
Yuanqiang Liu	1f20b7275d	[Torch Dialect] add canonicalize for aten.min.other (#2452 )	2023-09-11 17:28:22 +08:00
Bruce Kim	27b55b1d5f	implemented complex tensor aten mul (#2444 )	2023-09-07 13:29:15 -07:00
Jiawei Wu	b411a40b3d	[Torch Dialect] emit aten.__or__.Tensor Op (#2437) * emit aten.__or__.Tensor Op * bug fix * remove convert to stablehlo * code style refinement	2023-09-06 14:21:51 +08:00
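
The test-concurrency change described in commit 77c14ab22b can be illustrated with a small sketch. This is not the code from the torch-mlir test framework: the function names are hypothetical, rounding down to an integer is an assumption, and treating `TORCH_MLIR_TEST_CONCURRENCY` as an upper cap is one reading of "bound" in the commit message. Only the two formulas and the env var name come from the log.

```python
import os


def old_default_concurrency(nproc: int) -> int:
    # Previous default quoted in the commit message: nproc * 1.1.
    # Truncating to int is an assumption of this sketch.
    return max(1, int(nproc * 1.1))


def new_default_concurrency(nproc: int) -> int:
    # New default quoted in the commit message: nproc * 0.8 + 1.
    return max(1, int(nproc * 0.8 + 1))


def resolve_concurrency(env=None, nproc=None) -> int:
    # Hypothetical helper: apply the new default, then cap it with
    # TORCH_MLIR_TEST_CONCURRENCY when that variable is set.
    env = os.environ if env is None else env
    nproc = (os.cpu_count() or 1) if nproc is None else nproc
    workers = new_default_concurrency(nproc)
    override = env.get("TORCH_MLIR_TEST_CONCURRENCY")
    if override:
        workers = min(workers, max(1, int(override)))
    return workers


if __name__ == "__main__":
    # 32 cores matches the runner size mentioned in the commit message.
    print(old_default_concurrency(32), new_default_concurrency(32))
    print(resolve_concurrency({"TORCH_MLIR_TEST_CONCURRENCY": "4"}, 32))
```

On a 32-core runner this lowers the default worker count from 35 to 26 under the stated rounding assumption, which is in line with the commit's goal of reducing memory pressure.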


865 Commits (3e836d8dad551b6e5302de1b84840b90ee039c83)