torch-mlir

Commit Graph

Author	SHA1	Message	Date
Sean Silva	0e3ddbac91	Remove VerifyInvariantsBeforeBackendLowering. LowerToBackendContract now checks all this consistently.	2022-08-26 10:24:43 -07:00
Sean Silva	b1fa7a2b9d	Fix a few build warnings	2022-08-26 10:24:22 -07:00
Ramiro Leal-Cavazos	e153694c94	Add TestUtils.randint + replace torch.randint with tu.randint (#1276 ) This commit adds a method to `TestUtils` that generates random integer tensors with a similar interface to the `TestUtils.rand`. This commit also replaces with `tu.randint` all test inputs generated with `torch.randint`.	2022-08-26 08:50:16 -07:00
Henry Tu	e869e68559	Fix LTC lib_torch_mlir_ltc.so import error (#1283 ) * Build LTC to _mlir_libs directory * Update CMakeLists.txt	2022-08-25 18:25:01 -04:00
Henry Tu	a1ace0657d	Revert updating mlir_native_functions.cpp signature (#1281 ) * Revert updating mlir_native_functions.cpp signature, due to `a7edf71360` * Restored NewZeros to LTC XFAIL set	2022-08-25 13:00:33 -04:00
Ashay Rane	233fd1246b	doc: fix instructions for LLVM and MHLO updates (#1273 ) Now that we depend on MLIR-HLO, we have a slightly more involved process for updating the LLVM submodule. This patch updates the instructions in `development.md` with both the rationale for the process and the individual steps of the process.	2022-08-24 17:33:46 -05:00
Ashay Rane	1d9d925f6e	mlir: fix replacement of `OpaqueElementsAttr` (#1274 ) An earlier patch (`bb47c166`) incorrectly replaced the now-dropped `OpaqueElementsAttr` with `SparseElementsAttr` in one place and with `DenseElementsAttr` in another. This patch fixes the problem by making both replacements use the dense-equivalent type.	2022-08-24 17:10:40 -05:00
Henry Tu	e2f862cb85	Fix LTC build warnings (#1272 ) * Resolved Wunused-variable * Fix Wunneeded-internal-declaration * Address review comment * Update autogen_ltc_backend.py * Update mlir_native_functions.cpp to work with updated PyTorch * Remove NewZeros from LTC XFAIL set	2022-08-24 15:04:28 -04:00
gpetters94	f012279fa2	Add transposed case for at::convolution (#917 ) Also adds a decomposition for aten::conv_transposed2d.input	2022-08-24 12:19:35 -04:00
Sean Silva	d7d67979b2	[cleanup] Change OutputType enum values to strings. The use of numbers was arbitrary and was preventing the enum values from being put in the natural order.	2022-08-23 17:59:39 -07:00
Tanyo Kwok	3d0e18bbe7	Add decomposition for aten.roll (#1170 ) * Add decomposition for aten.roll * add e2e unittest * refine type of torch.roll * fix aten::cat output type	2022-08-24 08:36:05 +08:00
武家伟	1106b9aeae	[MHLO] bert-tiny and resnet18 example from torchscript to mhlo (#1266 ) Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com> Co-authored-by: Vremold <xremold@gamil.com>	2022-08-23 16:44:36 -07:00
Tanyo Kwok	2374098d71	[MHLO] Init end to end unit tests (#1223 )	2022-08-23 16:47:21 +08:00
Vivek Khandelwal	8cad02f87e	[MLIR][TORCH] Add torch.Device type to backend contract scalar types Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-08-23 10:50:09 +05:30
Tanyo Kwok	9176b5ed29	Add decomposition for aten.flatten.using_ints (#1161 )	2022-08-23 11:52:54 +08:00
Sean Silva	01290d134a	Add a way for backends to control which ops are legal for them. We were already hitting many cases where backends differed in terms of the legal ops that they wanted. This caused unnecessary coupling between the backends. Examples: - https://github.com/llvm/torch-mlir/pull/1161 - https://github.com/llvm/torch-mlir/pull/862 This PR centralizes all compilation to go through `torch_mlir.compile` so that we can keep the logic centralized there. We should move these lists closer to each backend. Especially cases like https://github.com/llvm/torch-mlir/pull/862 where blocking a decomposition is necessary to avoid a crash emphasize that the set of decompositions is tightly coupled to the backend, and should be "controlled by the backend" and not something arbitrarily tweakable. Also: - Fix a small bug in the way we passed through the backendLegalOps option. - Add better error messages in `torch_mlir.compile` for import errors.	2022-08-22 14:16:13 -07:00
powderluv	ef89dadf52	Update Torch-MLIR Architecture Diagram (#1254 ) Add MHLO path. Add custom accelerator dialects. Rename Torch Dialect back to original Torch-MLIR Dialect (surrounding text still refers to Torch-MLIR dialect). Check in source for Excalidraw (https://excalidraw.com/) so anyone can use / update it using the open source version.	2022-08-22 13:09:32 -07:00
Marius Brehler	1f1abda179	Don't explicitly set MLIR_PDLL_TABLEGEN_EXE (#1262 ) With llvm/llvm-project@91b6f76, the variable `MLIR_PDLL_TABLEGEN_EXE` is set as a cache variable in MLIR upstream. Likely requires an update of externals/mlir-hlo to tensorflow/mlir-hlo@4f2a00b or later.	2022-08-22 16:45:56 +02:00
Vivek Khandelwal	3815cfa7a5	[MLIR][TORCH] Fix CI failure due to failing tests Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-08-22 17:51:01 +05:30
Alex Tsao	c38308f3ef	Add lowering for _convolution.deprecated (#1259 )	2022-08-22 11:17:36 +08:00
武家伟	99fb4c8637	Add folder for ToF64Op and FromF64Op (#1257 )	2022-08-22 09:49:39 +08:00
Henry Tu	ba17a4d6c0	Reenable LTC in out-of-tree build (for real this time) (#1205 ) * Fix OOT LTC CI build failure * Disable LTC during macOS package gen * Add more details about static TorchMLIRJITIRImporter library	2022-08-19 15:25:00 -04:00
Vivek Khandelwal	65d811e267	[MLIR][TORCH] Fix dynamic cases for aten.index.Tensor	2022-08-19 12:13:20 +05:30
Sambhav Jain	1e1759c2eb	[Bazel] Run buildifier (#1250 ) Lint all Bazel files (BUILD, WORKSPACE, .bzl, or .sky). Steps followed: ```shell ./utils/bazel/run_docker.sh # run within container git clone https://github.com/bazelbuild/buildtools.git cd buildtools bazel build //buildifier # buildifier formatter ./bazel-bin/buildifier/buildifier_/buildifier -r ../utils/bazel/ # buildifier linter ./bazel-bin/buildifier/buildifier_/buildifier --lint=fix -r ../utils/bazel/ ```	2022-08-18 22:42:58 -07:00
武家伟	7bd173a1c4	[MHLO] Eliminate explicit dynamic output shape generating in converting AtenSliceTensorOp (#1245 )	2022-08-19 10:14:57 +08:00
powderluv	0d1aa43764	Drop Python 3.7x from the nightly binary builds (#1246 )	2022-08-18 16:34:12 -07:00
Sean Silva	f601435fdf	Add white background to diagram. This makes it easier to read in dark mode browsers.	2022-08-18 15:53:43 -07:00
Ramiro Leal-Cavazos	9bc606c384	Add support for returning more than one copy of the same tensor (#1228 ) One of the simplifications made by the pass `RefinePublicReturn` currently only happens if the tensor in question only has one user. However, the current method of checking this does not correctly handle the case of a user having multiple uses of the same tensor. This commit makes sure only unique users are considered.	2022-08-18 22:41:45 +00:00
Sean Silva	1a7fc3915c	[docs] Add architecture doc. This attempts to get out of my head most of the critical layering and project structure decisions for Torch-MLIR.	2022-08-18 13:29:49 -07:00
Sean Silva	283e0f141a	Add a concept of "backend legal ops". This is a first step towards formalizing the set of ops in our backend contract. The goal is to eventually formalize `torch` dialect ops into 3 categories: 1. Legal in backend contract 2. Illegal in backend contract 3. Conditionally legal in backend contract The "conditionally legal" set are the ops that we can optionally decompose for backends. This patch adds relevant pass options for this throughout the compiler, in preparation for a new set of traits which will formalize this classification.	2022-08-18 11:46:50 -07:00
Ramiro Leal-Cavazos	f07f7d20f9	Clean up shape functions that use `sum_mean_dim` (#1217 ) I recently fixed the handling of the `dim` argument in `sum_mean_dim` (`59fccab857`). Therefore, the checks that the `dim` input is `None` or `[]` are no longer needed.	2022-08-18 08:23:43 -07:00
Sambhav Jain	7d4a0d0e2b	[Bazel] Add LowerToBackendContract.cpp to TorchMLIRTorchPasses bazel target (#1243 ) The pass was introduced in commit `57681f7947`. Including it in the bazel targets to get a green build.	2022-08-17 18:15:23 -07:00
Sambhav Jain	114f48e96c	[Bazel] Check cache directory exists before changing owners (#1241 ) This fixes a seeding issue with the [previous PR](https://github.com/llvm/torch-mlir/pull/1240) where bazel build's GHA cache is not present to begin with and one of the commands (chown) fails on it. Should get the Bazel build back to green.	2022-08-17 17:04:50 -07:00
Sean Silva	57681f7947	Iteratively run the main simplification pipeline. This introduces a new pass LowerToBackendContract (better name very welcome) which performs the bulk of the simplifications that we do, such as - shape refinement - dtype refinement - maximizing value semantics - inlining global slots - decomposing complex ops The key difference from before is that it iterates the set of transformations, which can help to break a number of "catch-22" issues where one simplification depends on another, the latest example being here: https://github.com/llvm/torch-mlir/issues/1131 This also exposed that RefineTypes was sometimes crashing/asserting for certain inputs. This commit hardens it a bit.	2022-08-17 14:54:33 -07:00
Sambhav Jain	9c8b962720	Dockerize and Cache Bazel {Local, CI} Builds (#1240 ) This PR adds: - A minimal docker wrapper to the bazel GHA workflow to make it reproducible locally - Bazel cache to speed up GHA workflows (down to ~5 minutes from ~40+minutes) This is a no-op for non-bazel workflows and an incremental improvement.	2022-08-17 12:46:17 -07:00
Yan Xu	9be8997536	Revert "add native_dropout and related ops pattern (#1211 )" (#1230 ) This reverts commit `c935795086`.	2022-08-17 13:48:10 +08:00
Quinn Dawkins	85f383ce0b	Bump the shape lib to match the upstream functions currently in PyTorch (#1236 ) Bumps the shape library: - Updates the function signature for aten.arange.start_step - upstream_shape_functions.mean_dim -> upstream_shape_functions.sum_mean_dim	2022-08-17 00:11:04 -04:00
武家伟	11a5b5ac52	[MHLO] Add AtenRSubScalarOp conversion pattern to MHLO (#1233 ) * [MHLO] Add AtenRSubScalarOp conversion pattern Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-17 09:07:36 +08:00
nithinsubbiah	fde390c766	Re-enable custom op support	2022-08-16 22:49:08 +05:30
Jae Hoon (Antonio) Kim	0af55781ae	Propagate device data names (#1157 ) * Propagate device data names * Address PR comment * Add example usage * Add test for device data names * Make TorchMlirComputation fields protected * Add lazy backend device data name unit tests * Disable lazy backend tests if LTC is disabled * Add comments	2022-08-16 09:30:22 -04:00
Ashay Rane	84d345c650	build: update llvm tag to 2dde4ba6 (#1229 ) Summary of changes: - Tensor dialect now sets `emitAccessorPrefix` to prefixed, thus requiring updates to methods that retrieve arguments [https://reviews.llvm.org/D131361] - Update MHLO to build with LLVM commit hash 2dde4ba6 - Replace `AbsOp` with `AbsFOp` [https://reviews.llvm.org/D131325] - Replace deprecated `getValue()` with `value()` [https://reviews.llvm.org/D131349] - Remove `AnalysisState::defaultInitialize()` [https://reviews.llvm.org/D131746] - Update MHLO MLIR tests to use the updated assembly format - Disabled two failing TOSA tests (Github Issue link: https://github.com/llvm/torch-mlir/issues/1231)	2022-08-15 23:54:45 -07:00
武家伟	3b3cb99ef8	Generalize canonicalization pattern for more aten.sub/div/mul/add ops (#1209 ) Generalize the canonicalization pattern to more sub/div/mul/add ops; for AtenDivTensorModeOp in 'trunc' rounding mode, we try to fold it instead.	2022-08-16 13:24:08 +08:00
Ramiro Leal-Cavazos	9d6ee48661	Fix unused-variables warnings about EmbeddingBag ops (#1220 ) According to the documentation for `torch.embedding_bag` (https://pytorch.org/docs/stable/generated/torch.nn.functional.embedding_bag.html), the default value for `scale_grad_by_freq` is False.	2022-08-15 09:43:55 -07:00
Yan Xu	c935795086	add native_dropout and related ops pattern (#1211 )	2022-08-15 09:28:47 +08:00
Sambhav Jain	41aa562fb4	s/external/externals/g (#1222 ) Fix remaining instances of `external/llvm-project`.	2022-08-13 07:13:56 -07:00
Ashay Rane	606f4d2c0e	build: streamline options for enabling LTC and MHLO (#1221 )	2022-08-12 23:49:28 -07:00
Sambhav Jain	34478ab1c7	[Build] Add concurrency groups to address long queue times (#1219 ) We're seeing large CI queue times ([example](https://discord.com/channels/636084430946959380/742573221882364009/1007631811184164944)) especially with MacOS VMs on GHA. Part of the problem is follow-on commits to the same branch which trigger new runs while the previous runs are still in progress, hogging the scarce VMs. This PR adds concurrency groups to the GHA workflow which ensures that only a single job or workflow using the same concurrency group will run at a time. This would cancel any in-progress jobs in the same github workflow and github ref (e.g. `refs/heads/main` or `refs/pull/<pr_number>/merge`). As discussed on discord [thread](https://discord.com/channels/636084430946959380/1007787336848912386/1007787338895740928), once this lands we may have to closely monitor the workflows to see whether this introduced unintended consequences. If so, we could either revert, or decide to selectively cancel particular runs (e.g. macos only which is the main bottleneck right now) instead of the entire workflow. This will also require some expectation management. As in, if you see an ❌ on the main branch, it may not necessarily mean things broke, it could mean the run was killed by a more recent run. This makes it a bit harder to trace back a failure to a commit in a sequence of commits (requiring those builds to be run again). Thanks @powderluv for the proposal and pointer to this! It should help with the scarce VMs on GHA and save on queue time. References: * https://docs.github.com/en/actions/using-jobs/using-concurrency#example-only-cancel-in-progress-jobs-or-runs-for-the-current-workflow * https://docs.github.com/en/actions/using-workflows/workflow-syntax-for-github-actions#example-only-cancel-in-progress-jobs-or-runs-for-the-current-workflow	2022-08-12 17:38:48 -07:00
Ashay Rane	1581d6a84c	build: fix typo in path (#1218 ) When we renamed the directory containing submodules from `external` to `externals`, we accidentally left the original name in the Github workflow. This patch fixes the problem.	2022-08-12 15:38:25 -07:00
Sambhav Jain	aed0ec3a2c	Merge matrix runs to fail fast globally (#1216 ) My earlier [PR](https://github.com/llvm/torch-mlir/pull/1213) had (among other things) decoupled ubuntu and macos builds into separate matrix runs. This is not working well due to the limited number of MacOS GHA VMs causing long queue times and backlog. There are two reasons for this backlog: 1. macos arm64 builds with pytorch source are getting erratically cancelled due to resource / network constraints. This is addressed by https://github.com/llvm/torch-mlir/pull/1215 > "macos-arm64 (in-tree, OFF) The hosted runner: GitHub Actions 3 lost communication with the server. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error." 2. macos runs don't fail-fast when ubuntu runs fail due to being in separate matrix setups. This PR couples them again.	2022-08-12 11:30:09 -07:00
Sambhav Jain	b8bd0a46cc	use pytorch binary for macos-arm64 builds (#1215 )	2022-08-12 06:33:57 -07:00
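
Commit `57681f7947` above describes iterating a set of simplifications so that "catch-22" dependencies between them get resolved. The idea can be sketched in a few lines of Python. This is an illustrative toy only, not the torch-mlir implementation: the function `run_to_fixed_point`, the three toy passes, and the stopping rule (stop when a full round changes nothing) are all assumptions made for this example, and the log does not state the real pass's stopping criterion.

```python
from typing import Callable, FrozenSet, List, Tuple

# Hypothetical model: the "program state" is just the set of facts known so far.
State = FrozenSet[str]
Pass = Callable[[State], State]


def run_to_fixed_point(
    state: State, passes: List[Pass], max_iterations: int = 10
) -> Tuple[State, int]:
    """Apply every pass in order, repeating until a full round changes nothing.

    Returns the final state and the number of rounds that were run.
    """
    for iteration in range(1, max_iterations + 1):
        previous = state
        for simplify in passes:
            state = simplify(state)
        if state == previous:
            return state, iteration
    raise RuntimeError(f"no fixed point reached after {max_iterations} rounds")


# Toy passes with a dependency chain: shape needs dtype, dtype needs value semantics.
def refine_shape(state: State) -> State:
    return state | {"shape"} if "dtype" in state else state


def refine_dtype(state: State) -> State:
    return state | {"dtype"} if "value_semantics" in state else state


def maximize_value_semantics(state: State) -> State:
    return state | {"value_semantics"}


# The passes are deliberately listed in the "wrong" order, so one round is not enough.
final_state, rounds = run_to_fixed_point(
    frozenset(), [refine_shape, refine_dtype, maximize_value_semantics]
)
print(sorted(final_state), rounds)
```

With the passes run only once in this order, neither the shape nor the dtype fact would be derived; repeating the rounds lets each pass pick up what the others produced, and the iteration cap guards against a pipeline that never settles.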

1434 Commits (aa31be720a3f7ebdddf5534a1689720be4a8a4a8)
