torch-mlir

Commit Graph

Author	SHA1	Message	Date
Stella Laurenzo	45c6ee1090	In-tree build only builds torch-mlir.	2023-11-02 13:29:38 -07:00
Stella Laurenzo	66bba85d38	Remove CI references to torch-mlir-dialects.	2023-11-02 13:29:38 -07:00
xiaolou86	4199feffed	Fix typos in comments (#2539 ) Fix typos in comments	2023-10-31 20:10:47 -07:00
Ashay Rane	5f772e8cb4	CI: reconcile differences between RollPyTorch and pre-merge checks (#2482 )	2023-09-23 07:00:16 -07:00
Gleb Kazantaev	059041e0fe	[LTC] Support torch.ones/zeros/arange ops (#2440 )	2023-09-21 13:25:14 -04:00
Stella Laurenzo	078d1e1a1d	Remove mlir-hlo (replace with stablehlo). (#2460 ) We just have to do this: I ran into an issue today where I needed to make a one line patch to stablehlo to work around a compiler issue, and it is completely unapparent how to do so given that the mlir-hlo repo is a read-only export and is at the tail end of a multi-week integration chain from the open-source stablehlo repo. We've discussed this often enough and gotten +1 from everyone that they are ok with taking the e2e testing hit if it becomes necessary: It is necessary as the current situation is unmanageable. Looking at it, I expect it wouldn't actually be very difficult to build a little runner binary out of the stablehlo interpreter and subprocess call that in order to get the testing coverage back. I leave that as an exercise to the users of this part of the stack and recommend following the breadcrumbs from the deleted python/torch_mlir_e2e_test/stablehlo_backends/linalg_on_tensors.py file and the main.py changes. Note that I am pointing us at a stablehlo fork for the moment until it is apparent that we don't need to carry any local patches to it. We can update this in a few days if everything is clear.	2023-09-12 19:10:02 -07:00
Matthias Gehre	a3ac4513e4	build_tools/python_deploy/build_linux_packages.sh: Disable dynamo testing for stable pytorch (#2426 )	2023-09-04 10:02:07 +02:00
Yuanqiang Liu	e9ab8ceb1c	[Torch Dialect] support aten.split_with_sizes (#2431 ) * [Torch Dialect] support aten.split_with_sizes * update	2023-09-04 09:59:26 +08:00
Gleb Kazantaev	6b02e9a926	[LTC] Tensor[]? support operands type support using partial codegen (#2410 ) * Tensor[]? support operands type support using partial codegen * aten.index.Tensor support via partial codegen * Add torch.index_put tracing support * Added optional tensor list type support for LTC/TorchMLIR lowering * Added comments Co-authored-by: Gleb Kazantaev <gleb.kazantaev@cerebras.net>	2023-08-30 06:29:39 -04:00
Ashay Rane	8f28d933e1	CI: disable LTC e2e tests in stable PyTorch builds (#2414 ) This way, we can keep CI green without being forced to ignore _all_ errors that arise in stable PyTorch builds	2023-08-23 11:11:17 -05:00
Gleb Kazantaev	3dd29f9d5d	Update Torch ODS list with new ops (#2361 ) * [LTC] Add shape_inference_(add\|uniform) * Add torch.multinomial op. * Update ods gen; add normal_functional and erfinv ops support * New TorchMLIR ops: clamp_min.Tensor, clamp_max.Tensor, xlogy, binary_cross_entropy, log_sigmoid_forward, sigmoid_backward, cosine_embedding_loss, scatter.reduce * Improve the shape inference logic of whereOp - Infer the result tensor according to the broadcasting semantics Signed-off-by: rahul shrivastava <rahul.shrivastava@cerebras.net> * Added aten::sgn * Add shape inference logic for hardtanh_backward op * Added new Torch-MLIR ops Co-authored-by: GlebKazantaev <gleb.nnstu@gmail.com> * Add support for elu lowering * Add support for elu_backward lowering * Support fmod, remainder, and floor_divide Emit generated op defs for the remainder.Tensor and fmod.Tensor Add shape inference impelementations for remainder.Scalar, fmod.Scalar and floor_divide.Tensor * Add shape inference logic for im2col - pytorch.nn.unfold gets decomposed into im2col Signed-off-by: rahul shrivastava <rahul.shrivastava@cerebras.net> * Add aten::eye and aten::eye.m support * Add tracing for linalg_qr * Update GeneratedTorchOps.td * Update xfails * Fix unbound variable issue in torch_ods_gen --------- Signed-off-by: rahul shrivastava <rahul.shrivastava@cerebras.net> Co-authored-by: Mark Browning <mark@cerebras.net> Co-authored-by: zihaoc-cerebras <zihao.chen@cerebras.net> Co-authored-by: rahul shrivastava <rahul.shrivastava@cerebras.net> Co-authored-by: Gokul Ramakrishnan <gokul.ramakrishnan@cerebras.net> Co-authored-by: glebk-cerebras <111300564+glebk-cerebras@users.noreply.github.com> Co-authored-by: Behzad Abghari <behzad.abghari@gmail.com> Co-authored-by: Ahmed Elkoushy <ahmed.elkoushy@cerebras.net>	2023-08-21 06:36:39 -04:00
Gleb Kazantaev	5743b6d4ac	LTC multi-output operations support (#2362 ) * LTC/TorchMLIR multi-output operations support * Update torch-mlir jit lowering to support ops with dynamic number of outputs * Added support for aten::split_copy, aten::split_with_sizes_copy * Fix native function for aten::split; cleanup code * Fix TorchMlirTensorList lowering * Remove xfails	2023-08-20 16:32:11 -04:00
Matthias Gehre	f8e75f659d	Add make_fx_tosa variant to end2end tests (#2240 ) * Add make_fx_tosa variant to end2end tests * e2e_testing/xfail_sets.py: Add make_fx_tosa xfail for stable	2023-07-13 15:07:54 +02:00
Zhekun Zhang	8af3e50662	[Torch Dialect] Add support for AtenScalarTensorOp (#2085 ) * add scalar_tensor op * add dynamo pass test; needs PR2062 * try to fix * Empty commit, trigger test * Empty commit, trigger test * address comments * use dtype function * fix decompose rule * remove unused include * Empty commit, trigger test * fix test * disable ltc * fix dtype --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-06-01 11:38:50 +08:00
maxbartel	db3f2e3fde	Add Stable PyTorch CI Pipeline (#2038 ) * feat: split pytorch requirements into stable and nightly * fix: add true to tests to see full output * refactor: add comments to explain true statement * feat: move some tests to experimental mode * refactor: refactor pipeline into more fine grained difference * feat: add version differentiation for some tests * feat: activate more configs * refactor: change implementation to use less requirement files * refactor: remove contraints used for testing * fix: revert some requirement file names * refactor: remove unnecessary ninja install * fix: fix version parsing * refactor: remove dependency on torchvision in main requirements file * refactor: remove index url * style: remove unnecesary line switch * fix: readd index url	2023-05-30 12:16:24 -07:00
powderluv	9b7909b599	Add ARM64 release builds (#2159 ) Creates a build_linux_arm64 job that builds the release on an arm64 self-hosted runner. Drop Python 3.10 support Pass TM_TORCH_VERSION to choose the Stable PyTorch version (since arm64 doesn't have nightly builds) Borrows nightly / stable Pytorch switch from the WIP https://github.com/llvm/torch-mlir/pull/2038	2023-05-25 20:39:19 -07:00
Zhekun Zhang	69e993b03f	[Torch Op] Add AtenChunkOp support (#2152 ) * add chunkOp support * update LTC xfail list * address comments * address comments --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-05-26 10:05:19 +08:00
powderluv	b9b3af8003	[arm64] Fix release builds for ARM64 (#2157 ) Tested on Ubuntu 23.04 on Ampere Altra instance.	2023-05-24 13:52:13 -07:00
Zhekun Zhang	a426363b7d	[Torch Dialect] Add split.tensor support + recompose rules (#2102 ) * add split.tensor support + recompose rules * add e2e test * address comments * address comments * erase op in recomposeOp --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-05-23 12:43:33 -07:00
Zhekun Zhang	aa97c8383e	[Torch Op] Add unbind.int support with ListUnpack (#2058 ) * add unbind int * reformat * use unpack canonicalize * address comments * Empty commit, trigger test * add ltc blacklist * clean up * address comments * check permute list * erase in recompose --------- Co-authored-by: zhekun.zhang <zhekun.zhang@bytedance.com>	2023-05-18 19:07:58 -07:00
Ashay Rane	28bb866260	CI: prepare CI for ccache updates for MSVC/Windows (#2120 ) This patch, by itself, doesn't fix caching on Windows, but once a new release of ccache is available, caching for Windows builds should start working again (validated by building ccache from source and using it with LLVM builds). Ccache rejects caching when either the `/Zi` or `/ZI` flags are used during compilation on Windows, since these flags tell the compiler to embed debug information in a PDB file (separate from the object file produced by the compiler). In particular, our CI builds add the `/Zi` flag, making ccache mark these compiler invocations as uncacheable. But what caused our CI to add debug flags, especially when we specified `-DCMAKE_BUILD_TYPE=Release`? On Windows, unless we specify the `--config Release` flag during the CMake build step, CMake assumes a debug build. So all this while, we had been producing debug builds of torch-mlir for every PR! No doubt it took so long to build the Windows binaries. The reason for having to specify the configuration during the _build_ step (as opposed to the _configure_ step) of CMake on Windows is that CMake's Visual Studio generators will produce _both_ Release and Debug profiles during the CMake configure step (thus requiring a build-time value that tells CMake whether to build in Release or Debug mode). Luckily, on Linux and macOS, the `--config` flag seems to be simply ignored, instead of causing build errors. Strangely, based on cursory tests, it seems like on Windows we need to specify the Relase configuration as both `-DCMAKE_BUILD_TYPE=Release` as well as `--config Release`. Dropping either made my build switch to a Debug configuration. Additionally, there is a bug in ccache v4.8 (although this is addressed in trunk) that causes ccache to reject caching if the compiler invocation includes any flag that starts with `/Z`, including /`Zc`, which is added by LLVM's HandleLLVMOptions.cmake and which isn't related to debug info or PDB files. The next release of ccache should include the fix, which is to reject caching only for `/Zi` and `/ZI` flags and not all flags that start with `/Z`. As a side note, debugging this problem was possible because of ccache's log file, which is enabled by: `ccache --set-config="log_file=log.txt"`.	2023-05-12 12:45:01 -05:00
Maksim Levental	c718f87c5d	- rename no-jit -> core (#1920 ) - add windows release	2023-03-07 00:20:06 -06:00
Maksim Levental	ac1f03e6f7	add jit,no-jit release matrix (#1916 )	2023-03-05 22:13:33 -08:00
Maksim Levental	415265a64c	Add `torch-mlir-no-jit-importer` build case for mac os wheels (#1902 ) * add flags to setup.py for out-of-tree build * - fix build_ext bug - add wheels script cases for mac wheels	2023-03-05 12:23:43 -06:00
Ashay Rane	67ab708b63	python: separate build- and test-related pip dependencies (#1874 ) We want to ensure that pip packages required for building torch-mlir should be included in the dependencies of torch-mlir, but we don't want the pip packages required for _testing_ of torch-mlir to be included among the dependencies. To be able to specify and install one set of dependencies and not the other, this patch separates the pip packages into two files: build-requirements.txt and test-requirements.txt. This patch also updates references to the requirements.txt file so that CI builds that run end-to-end tests install test-related pip dependencies while everything else (including WHL builds) sticks to just the build-related pip dependencies. Despite this change, this patch should not affect a torch-mlir developer's workflow. More precisely, since this patch makes the top-level requirements.txt file refer to both build-requirements.txt and test-requirements.txt files, a torch-mlir developer should be able to continue referring to the requirements.txt file without any impact.	2023-02-13 21:22:09 -06:00
powderluv	320e67ff34	Python 3.11 support (#1848 ) * Python 3.11 support * test without torchvision * Update pytorch-requirements.txt * Update buildRelease.yml * Update action.yml * Update install_macos_deps.sh * Update build_macos_packages.sh	2023-02-10 07:16:37 -08:00
Ashay Rane	711646d095	mhlo: migrate conversion to stablehlo (#1840 ) This patch replaces all MHLO operations with their StableHLO counterparts and adds a validation pass to ensure that no MHLO operations remain before translating all Stablehlo operations to the MHLO dialect for further lowering to the Linalg dialect. This patch also updates all lit tests so that they refer to the `convert-torch-to-stablehlo` pass and so that they check for StableHLO operations.	2023-02-02 07:29:47 -06:00
Ashay Rane	aefa7be6fa	CI: build torch-mlir python package for Python v3.8 (#1827 ) Previously, torchvision had not released WHL files for Python v3.8, causing failures in torch-mlir python package builds, so we had disabled building for Python v3.8. Now that the WHL files are back, this patch re-enables v3.8 builds.	2023-01-23 17:58:01 -06:00
powderluv	556620e349	Use delvewheel to package windows DLLs (#1820 )	2023-01-22 02:47:26 -08:00
Ashay Rane	a897c49803	CI: miscellaneous fixes for Release builds (#1781 ) - Use v3 of actions/checkout, since the version we use (v2) uses Node.js 12, which is deprecated by GitHub. - Source the PowerShell venv sctipt (instead of the bash sript) since the calling script is a PowerShell script. Without this, the build doesn't use venv at all. - Make the build dependencies in whl-requirements.txt (used by setup.py) match those in requirements.txt. To that end, this patch creates a build-requirements.txt that is referenced by requirements.txt and whl-requirements.txt.	2023-01-06 20:41:43 -06:00
Gleb Kazantaev	8f01072099	Fix OptionalCType class name (#1779 ) * Fix OptionalCType class name * Rmove LTC xfail tests	2023-01-06 17:03:24 -05:00
powderluv	4dbdc179b7	Bump to Python 3.8 (#1756 )	2022-12-29 19:04:54 -08:00
Daniel Ellis	07a65961dd	Disable pypi publishing. See https://github.com/llvm/torch-mlir/issues/1709	2022-12-13 11:45:41 -05:00
Ramiro Leal-Cavazos	a710237437	[custom op] Generalize shape library logic to work with dtypes (#1594 ) * [custom op] Generalize shape library logic to work with dtypes This commit generalizes the shape library logic, so that dtype rules for ops can also be expressed using the same mechanism. In other words, each op can now have a shape function and a dtype function specified in Python that is imported during lowering to calculate the shapes and dtypes throught a program. For more information about how to specify a dtype function, see the updated `docs/adding_a_shape_and_dtype_function.md`. For those not familiar with how the shape library works, the file `docs/calculations_lib.md` provides an overview.	2022-12-13 08:25:41 -08:00
Sean Silva	7731211d02	Remove eager_mode This was an experimental attempt at rolling out own op-by-op executor with `__torch_dispatch__`, but it proved difficult to make it robust. Op-by-op execution is very easy to implement robustly now with the PyTorch 2.0 stack, so we don't need eager_mode. Downstream users were using eager_mode to implement lockstep numerical accuracy debuggers. We implemented the same functionality with TorchDynamo in https://github.com/llvm/torch-mlir/pull/1681 so now there is not much reason to continue maintaining it.	2022-12-09 03:50:00 -08:00
Sean Silva	29c8823464	[e2e tests] Rename default config from "refbackend" to "linalg" This more accurately reflects what it is. The previous name was conflating the use of RefBackend (which `linalg`, `tosa`, and `mhlo` configs all use) with the use of the linalg backend (e.g. TorchToLinalg). This conflation was artifically giving the linalg backend a "privileged" position, which we want to avoid. We still keep it as the default backend, and it remains the most complete, but at least there's not artificial boosting.	2022-12-08 01:34:46 -08:00
Daniel Ellis	98d80a642a	Publish releases to PyPI after build	2022-12-07 10:01:55 -05:00
Sean Silva	28957adaac	[torchdynamo] Initial TorchDynamo support This adds a basic e2e Config for TorchDynamo using Linalg-on-Tensors/RefBackend. But TorchDynamo is pretty orthogonal to various other pieces, so it should compose nicely with variations like: - Switching out all the backends (Linalg-on-Tensors, TOSA, MHLO) - PyTorch functionalization and decompositions - Taking the example inputs and compiling with all dynamic or all static shapes without duplicating tests. This adds it to the CI, but there are still a lot of XFAIL's. This also adds a helper `from torch_mlir.dynamo import make_simple_dynamo_backend` which simplifies some of the steps for making a Torch-MLIR-based TorchDynamo backend. We include "simple" in the name because we are going to be exploring various things next from the long-term roadmap. The next steps are: - Burn down all the XFAIL's. - Start working on the pieces from the [long-term roadmap](https://github.com/llvm/torch-mlir/blob/main/docs/long_term_roadmap.md). - Add functionalization/decompositions into the TorchDynamo flow and remove reliance on the current Torch-MLIR "frontend". - Write a pure-Python direct FX->MLIR importer. - Hook up the new PyTorch symbolic shape stuff. - Explore PrimTorch decompositions for simplifying backends.	2022-11-24 04:10:25 -08:00
Tanyo Kwok	a9fb0c5459	fix mhlo e2e ci crashes (#1620 ) * fix mhlo e2e ci crashes * add passed tests * calc dynamic positive dim	2022-11-21 21:50:35 +08:00
Ashay Rane	eec9a7e022	ci: make pip skip cached packages while installing dependencies (#1570 ) We want each build to be reproducible regardless of prior builds and prior package installations, but pip, by default, uses cached packages from previous invocations of `pip install`. As a result, the incorrect dependencies downloaded in the RollPyTorch workflow in the main repository cannot be reproduced in private forks of the repository. To resolve this problem, this patch adds a `--no-cache-dir` flag to pip, so that it fetches and inspects each requested package independent or prior installations.	2022-11-11 20:31:38 -06:00
Ashay Rane	e16ccce373	ci: re-add powershell script for windows release builds (#1561 ) This file was removed as part of the PR that added build caching for Windows.	2022-11-06 12:48:38 -06:00
Ashay Rane	db5a496eb4	build: enable update scripts to work with out-of-tree builds (#1553 ) Before this patch, the update_shape_lib.sh and update_torch_ods.sh scripts only worked on in-tree builds, which implied that the RollPyTorch action was forced to run the longer-running in-tree build. As a result of this patch, we should be able to run through the basic checks in the RollPyTorch action faster, while running the full suite of tests off the critical path. The key change in this patch is that the update scripts now look for the directory that is most recently modified between in-tree or out-of-tree build directories. The change also correctly handles the case when only one of the two directories exists.	2022-11-04 08:13:02 -05:00
Ashay Rane	2846776897	ci: enable ccache on Windows (#1548 ) This patch makes a few small, but key, changes to enable ccache on Windows. First, it replaces the hendrikmuhs/ccache-action action with command line invocations to the ccache binary, since the action has two bugs, one of which causes CI to refer to different ccache artifacts before versus after the build on Windows whereas the other bug can sometimes cause the action to incorrectly infer that the cache is empty. Second, this patch slightly alters the cache key, so that our old cache artifacts, which have grown too big, are eventually discarded in favor of the new, smaller cache artifacts. Along the way, this patch also keeps the RollPyTorch's cache artifact separate from the regular build's cache artifact so as to keep these artifacts small, and also because the RollPyTorch action is off the critical path for most contributors. Finally, this patch makes small changes to the CMake file so that on Windows, the ccache binary is added as a prefix, as recommended on the [ccache Wiki](https://github.com/ccache/ccache/wiki/MS-Visual-Studio).	2022-11-03 12:17:22 -05:00
Ashay Rane	79871040c9	Revert "ci: build PyTorch before building Torch-MLIR (#1542 )" (#1545 ) This reverts commit `805d728194`.	2022-11-01 20:40:09 -05:00
Ashay Rane	805d728194	ci: build PyTorch before building Torch-MLIR (#1542 ) This patch updates the build_linux_packages.sh script so that when PyTorch needs to be built from source, it is built _before_ building LLVM and before building Torch-MLIR. The rationale behind this change is that previously, when the PyTorch build was triggered through the Torch-MLIR build, the PyTorch compilation added more entries to the ccache artifacts. However, since we cache the PyTorch _binary_ (i.e. the WHL file), there is no need to add the PyTorch compilation to the ccache artifacts. By removing the PyTorch compilation files, we keep the ccache artifact size small, thus reducing the number of evictions when we exceed GitHub's allowed limit.	2022-11-01 17:03:58 -05:00
Ramiro Leal-Cavazos	b723186983	Remove all but one of valsem ops + move fill.Scalar to elementwise (#1531 ) This commit removes almost all of the valsem ops, since the value semantics version of the ops now exist in PyTorch. The only op missing is `aten.bernoulli_.float`. In addition, this commit also simplifies the implementation of `aten.fill.Scalar` by moving it to the pattern that converts elementwise ops.	2022-10-28 15:06:11 +00:00
powderluv	1c579c8c39	Drop 3.9 binaries to keep under 6hrs build (#1533 )	2022-10-28 06:14:08 -07:00
powderluv	bbde4e163f	Add Windows Builder (#1521 ) Add a powershell script to build windows .whl packages Disable LTC as it doesn't build on Windows. Add GHA hooks Use Python 3.10.8	2022-10-25 16:13:31 -07:00
Ashay Rane	4a776be156	build: make PyTorch caching more robust (#1510 ) Whether or not the PyTorch build is cached should not affect the success of the Torch-MLIR build, but based on the existing code, a build may fail if the `TM_PYTORCH_INSTALL_WITHOUT_REBUILD` variable was set but the build cache doesn't exist. Although that variable is set by CI upon a cache hit, nuances of Github's caching behavior can create situations where the coupling between `TM_PYTORCH_INSTALL_WITHOUT_REBUILD` and the cache lookup fails. Specifically, a branch other than our default branch (`main`) may create the cache entry, but because Github doesn't share this cache entry with builds running on the `main` branch, the `main` branch build tries to create it's own cache entry. However, since cache identifiers are unique and because caches are immutable, the caching step running in the `main` branch appears to create an invalid cache entry (of 233 bytes, instead of the expected ~60 MB). Consequently, subsequent builds observe a cache "hit", since caches created by the `main` branch are shared with all other branches, but because this cache entry is invalid (since it doesn't actually contain the ~60 MB PyTorch WHL file), the builds fail. One workaround would be to let only the `main` branch create caches, but in doing so, we would also prevent other branches from _reading_ the cache, making the builds in those branches terribly slow. So this patch uses a different workaround, which is to check whether the PyTorch WHL file exists, even if the build observed a cache hit. If the file doesn't exist, even if it was a purported cache hit, the code builds PyTorch from source, which is probably intuitive. A longer term fix will follow, after a discussion with the wider team.	2022-10-20 08:50:18 -05:00
Ashay Rane	b86ec38541	ci: use the LLVM linker instead of GNU ld (#1501 ) Without this patch, CI logs contained the line: -- Linker detection: GNU ld GNU ld is notoriously slow at linking large binaries, so this patch swaps GNU ld with the LLVM linker. Since the linker invocation is driven through the compiler, perhaps the best way to use the LLVM linker is to tell the compiler which linker binary to use. This patch adds the `-fuse-ld=lld` flag to all Linux builds of Torch-MLIR in CI to make it use lld.	2022-10-18 00:43:04 -05:00

1 2 3 4

196 Commits (45c6ee10905df26baa58edbd3301f39eed223c74)