torch-mlir

Commit Graph

Author	SHA1	Message	Date
Vivek Khandelwal	22c9008bb9	build: Update Roll PyTorch version (#3548 ) This commit also updates the PyTorch and Torchvision nightly links since they are now moved to a different location. PyTorch Nightly: https://download.pytorch.org/whl/nightly/cpu/torch/ Torchvision Nightly: https://download.pytorch.org/whl/nightly/cpu/torchvision/ Disables dtype checks for some ops, tracked by https://github.com/llvm/torch-mlir/issues/3552 Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-19 21:38:57 +05:30
Vivek Khandelwal	e18fcebd3a	[CI] Change Roll PyTorch runner (#2828 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-30 16:42:18 +05:30
Stella Laurenzo	4a4d80a6ad	[ci] Add lint job and enable yaml linting of GH files. (#2819 )	2024-01-27 15:48:06 -08:00
Vivek Khandelwal	311b6b0286	CI: Fix Roll PyTorch CI failure at determining commit hash (#2796 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-24 15:55:12 +05:30
Ashay Rane	5f772e8cb4	CI: reconcile differences between RollPyTorch and pre-merge checks (#2482 )	2023-09-23 07:00:16 -07:00
Ashay Rane	2480cb7a51	CI: Update script to (mock) merge of RollPyTorch PRs (#2213 ) Before enabling the actual merge, this patch dumps to the console the bash commands that it plans to execute.	2023-06-06 12:38:16 -05:00
Ashay Rane	c804dac925	CI: Introduce workflow to auto-merge RollPyTorch updates (#2196 ) This patch adds a new workflow that runs when an update to the rollpytorch branch by silvasean (in whose name the RollPyTorch action runs) causes the regular CI build to complete without errors. Upon execution, this workflow currently just prints the PR number(s) of the PR created by the RollPyTorch action, but once this is working as expected, we will add the step to merge the PR changes.	2023-06-05 08:48:20 -05:00
Ashay Rane	558f12f05c	CI: Use GitHub app token for creating PRs (#2137 ) Since PRs created by the GitHub action bot cannot trigger workflows (and thus build tests), this patch uses the token for a GitHub app that was specifically created for the RollPyTorch action.	2023-05-19 23:18:03 -05:00
Ashay Rane	19a08d51f3	CI: [nfc] Use actions/cache instead of modified fork (#2124 ) We previously used a fork of the action/cache repository for the PyTorch cache since the actions/cache repo did not support read-only caches. Now that actions/cache supports separate read and write steps, this patch switches back to the actions/cache repo.	2023-05-12 23:25:17 -05:00
Ashay Rane	e161f2511a	CI: let GitHub action create commit (#2114 ) The GitHub action for creating the PR expects that either the changes are not committed (in which case it commits them with the specified commit message) or that the commit exists but that it is also pushed to remote. Prior to this patch, we created the commit but did not push it to remote, causing failures. This patch leaves the changes uncommitted so that they're committed and pushed to remote as part of the PR creation.	2023-05-11 19:19:32 -05:00
Ashay Rane	377720af87	CI: create PR for RollPyTorch updates (#2106 ) Currently, we run just the Linux in-tree tests when the RollPyTorch workflow runs, but this is insufficient since WHL files for macOS or Windows are sometimes not uploaded by PyTorch, causing the RollPyTorch action to pass but all subsequent torch-mlir CI tests to fail because of the broken build. The easiest way to validate the RollPyTorch action on all platforms is to run the standard set of tests that we run for each submitted PR, so this patch makes the RollPyTorch action submit a PR instead of committing the changes to the main branch directly. The PR is assigned to a handful of folks for review, although this can be changed in the future.	2023-05-10 09:25:59 -05:00
powderluv	0497f0b08d	Revert "CI: drop deletion of workspace and limit submodule fetch concurrency (#1921 )" (#2007 ) This reverts commit `07f5f042c7`.	2023-04-06 10:36:30 -07:00
Ashay Rane	07f5f042c7	CI: drop deletion of workspace and limit submodule fetch concurrency (#1921 ) Despite using sudo to delete the workspace directory, we still occasionally run into checkout errors. This patch thus drops the deletion of the workspace prior to checkout. It also restricts the number of parallel jobs in the submodule fetch step to just one, to try and resolve the checkout issue ("index.lock: File exists.").	2023-04-04 12:58:52 -05:00
powderluv	f83c516b15	Update RollPyTorch.yml to use Pytorch 3.11 (#1999 )	2023-04-03 23:53:26 -07:00
Ashay Rane	987d5ab335	CI: use `sudo` to remove Docker-created files (#1905 )	2023-02-27 17:44:50 -06:00
Ashay Rane	ea00371d85	CI: clear workspace directory before checkout (#1900 ) We have recently started seeing errors like: ``` Synchronizing submodule url for 'externals/llvm-project' Synchronizing submodule url for 'externals/mlir-hlo' /usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 Error: fatal: Unable to create '/home/anush/actions-runner/_work/torch-mlir/torch-mlir/.git/modules/externals/llvm-project/index.lock': File exists. ``` As a workaround, this patch removes the workspace directory before the checkout step.	2023-02-24 14:44:35 -06:00
Ashay Rane	268364e061	CI: install `unzip` before using it (#1893 ) The RollPyTorch action needs the `unzip` command to peek into WHL files for fetching metadata. This patch makes sure that the command is installed before referencing it.	2023-02-19 17:49:08 -06:00
powderluv	5710871f4f	Update buildAndTest.yml (#1881 ) * Update buildAndTest.yml * Update oneshotSnapshotPackage.yml * Update buildRelease.yml * Update RollPyTorch.yml * Update oneshotSnapshotPackage.yml * Update buildAndTest.yml	2023-02-15 09:17:12 -08:00
Ashay Rane	67ab708b63	python: separate build- and test-related pip dependencies (#1874 ) We want to ensure that pip packages required for building torch-mlir should be included in the dependencies of torch-mlir, but we don't want the pip packages required for _testing_ of torch-mlir to be included among the dependencies. To be able to specify and install one set of dependencies and not the other, this patch separates the pip packages into two files: build-requirements.txt and test-requirements.txt. This patch also updates references to the requirements.txt file so that CI builds that run end-to-end tests install test-related pip dependencies while everything else (including WHL builds) sticks to just the build-related pip dependencies. Despite this change, this patch should not affect a torch-mlir developer's workflow. More precisely, since this patch makes the top-level requirements.txt file refer to both build-requirements.txt and test-requirements.txt files, a torch-mlir developer should be able to continue referring to the requirements.txt file without any impact.	2023-02-13 21:22:09 -06:00
Ashay Rane	f6b6069a34	ci: post comment on RollPyTorch tracker issue upon build failure (#1730 ) Now that the RollPyTorch tracker issue exists, we can automate the job of notifying folks of failures instead of having to do it manually. This patch adds a step to the workflow to post such a message.	2022-12-18 13:45:30 -06:00
Ashay Rane	731c313231	ci: run `git pull` before committing pytorch version updates (#1716 ) The RollPyTorch action often takes more than 1.5 hours to finish. During this time, if another PR is merged, then the RollPyTorch action needs to first pull the merged changes before committing the updates to the PyTorch commit hash and version files. This patch adds the required `git pull` statement, without which, the subsequent `git push` statement fails, causing the RollPyTorch action to fail as well.	2022-12-13 13:41:41 -06:00
Ramiro Leal-Cavazos	a710237437	[custom op] Generalize shape library logic to work with dtypes (#1594 ) * [custom op] Generalize shape library logic to work with dtypes This commit generalizes the shape library logic, so that dtype rules for ops can also be expressed using the same mechanism. In other words, each op can now have a shape function and a dtype function specified in Python that is imported during lowering to calculate the shapes and dtypes throught a program. For more information about how to specify a dtype function, see the updated `docs/adding_a_shape_and_dtype_function.md`. For those not familiar with how the shape library works, the file `docs/calculations_lib.md` provides an overview.	2022-12-13 08:25:41 -08:00
Ashay Rane	b43965d8d3	build: fetch PyTorch version using downloaded WHL file (#1632 ) Until recently, the metadata file in the torchvision package included the nightly version of the torch package, but since that is no longer the case, our RollPyTorch workflow is broken. As a workaround, this patch uses the `pip download` command's ability to fetch the dependent torch package for the specified version of torchvision, before peeking into the WHL file for the torch package to determine the release version and the commit hash.	2022-11-23 13:54:54 -06:00
Ashay Rane	4eead74232	ci: delay RollPyTorch action by 1 hour to use latest torchvision package (#1603 ) The upload timestamp of the nightly torchvision package has drifted beyond the scheduled time of the RollPyTorch action because of the time change due to daylight saving. As a result, the RollPyTorch action now picks the torchvision package from a day earlier instead of the most recent package. This patch schedules the RollPyTorch action to start one hour later than before so that it continues to pick the most recent nightly package.	2022-11-23 11:31:02 -06:00
Ashay Rane	f1ef5681cc	build: pin torchvision to latest nightly (#1584 ) We currently pin the `torch` package to the latest nightly version, but since `torchvision` depends on the `torch` package, the pip resolver then has to run through an extensive list of `torchvision` packages that can be installed with the pinned `torch` package. This search fails in the RollPyTorch action, causing pip to settle on an old version of `torchvision` that does not work with our tests. In reality, we are only interested in a specific version of the `torchvision` package. To make the dependency explicit and to prevent test failures because of incorrect package installations, this patch makes two key changes: 1. `torchvision` is now pinned to the latest nightly release in pytorch-requirements.txt along with the version of `torch` that is necessary to install the requested `torchvision` package 2. The RollPyTorch action now looks for the latest `torchvision` package instead of the latest `torch` package before writing the version numbers for pinning in pytorch-requirements.txt	2022-11-14 15:56:02 -06:00
Ashay Rane	2846776897	ci: enable ccache on Windows (#1548 ) This patch makes a few small, but key, changes to enable ccache on Windows. First, it replaces the hendrikmuhs/ccache-action action with command line invocations to the ccache binary, since the action has two bugs, one of which causes CI to refer to different ccache artifacts before versus after the build on Windows whereas the other bug can sometimes cause the action to incorrectly infer that the cache is empty. Second, this patch slightly alters the cache key, so that our old cache artifacts, which have grown too big, are eventually discarded in favor of the new, smaller cache artifacts. Along the way, this patch also keeps the RollPyTorch's cache artifact separate from the regular build's cache artifact so as to keep these artifacts small, and also because the RollPyTorch action is off the critical path for most contributors. Finally, this patch makes small changes to the CMake file so that on Windows, the ccache binary is added as a prefix, as recommended on the [ccache Wiki](https://github.com/ccache/ccache/wiki/MS-Visual-Studio).	2022-11-03 12:17:22 -05:00
Ashay Rane	031d127940	ci: introduce read-only and read-write PyTorch build caches (#1546 ) Until recently, we had to either risk feature branches creating PyTorch build caches (which were unusable by the main branch or other parallel feature branches because of GitHub's rules around sharing caches among branches) or we had to limit the PyTorch build caches to only the main branch, causing CI runs on feature branches to be terribly slow because they had to rebuild PyTorch each time. This patch enables the best of both worlds, by using a fork (github.com/ashay/cache) of the GitHub's cache action, where the fork adds an option (called `save`) which, when set, uploads a new cache entry. We thus set this `save` flag only when we're building PyTorch from source in Torch-MLIR's main branch, whereas all other builds set this `save` flag to `false`. The ability to conditionally update the cache has been an oft-requested feature on the original (github.com/actions/cache) repository and multiple unmerged PRs exist to allow conditional cache updates, so it is likely that using the fork is only a temporary solution.	2022-11-01 23:26:17 -07:00
Ashay Rane	a8970101dc	pytorch: rename pytorch-version.txt to pytorch-hash.txt (#1541 ) This patch is part of a larger set of improvements to the CI/build system. In the code, we refer to the version as the string that contains the release identifier such as 1.14.0.dev20221028, so calling the file that contains the commit hash as pytorch-version.txt creates confusion. For the sake of simplicity, this patch renames that file to be pytorch-hash.txt.	2022-10-31 22:03:05 -05:00
Ashay Rane	2cf1092d4d	ci: restrict PyTorch cache to just the main branch (#1540 ) If PyTorch build caches are created on a branch other than the main branch, then GitHub does not share those caches with the main branch, making every CI run that runs for each PR slow. This patch resolves the problem by letting only the main branch create and use PyTorch build caches.	2022-10-31 15:14:53 -05:00
Ashay Rane	801452b2f4	ci: make RollPyTorch run only on the Torch-MLIR repo (#1516 )	2022-10-25 17:56:59 -05:00
Ashay Rane	a9942f343a	Cache PyTorch source builds to reduce CI time (#1500 ) * ci: cache PyTorch source builds This patch reduces the time spent in regular CI builds by caching PyTorch source builds. Specifically, this patch: 1. Makes CI lookup the cache entry for the PyTorch commit hash in pytorch-version.txt 2. If lookup was successful, CI fetches the previously-generated WHL file into the build_tools/python/wheelhouse directory 3. CI sets the `TM_PYTORCH_INSTALL_WITHOUT_REBUILD` variable to `true` 4. The build_libtorch.sh script then uses the downloaded WHL file instead of rebuilding PyTorch * ci: warm up PyTorch source cache during daily RollPyTorch action This patch makes the RollPyTorch action write the updated WHL file to the cache, so that it can be later retrieved by CI that runs for each PR. We deliberately add the caching step to the end of the action since the RollPyTorch action never needs to read from the cache, although executing this step earlier in the process should not cause problems either.	2022-10-18 00:42:42 -05:00
Ashay Rane	0374a6da4e	ci: re-enable auto execution of the RollPyTorch action (#1488 ) Now that the RollPyTorch action seems to have become stable, this patch enables that action to be run at around 4am Pacific Time every day.	2022-10-12 19:18:54 -05:00
Ashay Rane	8a8e779529	Disable auto-update of PyTorch version until CI script stabilizes (#1456 ) Instead of letting the auto-update script either fail because of script errors or letting it commit bad versions, this patch makes the update process manual, for now. Once the script stabilizes, I will its re-enable periodic execution.	2022-10-04 03:02:44 -05:00
Ashay Rane	da02390188	build: update ODS and shape library when updating PyTorch (#1450 ) Updating the PyTorch version may break the Torch-MLIR build, as it did recently, since the PyTorch update caused the shape library to change, but the shape library was not updated in the commit for updating PyTorch. This patch introduces a new default-off environment variable to the build_linux_packages.sh script called `TM_UPDATE_ODS_AND_SHAPE_LIB` which instructs the script to run the update_torch_ods.sh and update_shape_lib.sh scripts. However, running these scripts requires an in-tree build and the tests that run for an in-tree build of Torch-MLIR are more comprehensive than those that run for an out-of-tree build, so this patch also swaps out the out-of-tree build for an in-tree build.	2022-10-02 18:02:34 -05:00
Ashay Rane	005d40f4d7	build: check exit code without causing script to fail (#1447 ) A bug in the CI script caused the entire script to fail if the exit code of the command for comparing with the existing hash returned a non-zero exit status. The non-zero exit status for this comparison does not imply failed execution, since it only indicates that the hash has changed.	2022-10-02 16:04:26 -05:00
Ashay Rane	b3345e69e2	build: miscellaneous performance improvements (#1443 )	2022-09-30 12:47:43 -05:00
Ashay Rane	cf41a2582e	Final changes necessary to auto-update PyTorch version (#1438 ) * build: push directly from CI to main branch This avoids the need to create, approve, and merge a separate PR, in addition to avoiding unnecessary CI runs for the PyTorch version update. * build: schedule cronjob to run RollPyTorch action This patch schedules the RollPyTorch action to be run at noon UTC, which roughly corresponds to 4am Pacific Time. We pick this time since the commit for PyTorch nightly releases are picked just after midnight Pacific Time and the nightly release artifacts are produced in about 2 to 3 hours after the commit is picked.	2022-09-29 17:15:32 -05:00
Ashay Rane	8f608c048d	build: use Github Actions for creating PR (#1433 )	2022-09-29 07:09:16 -05:00
Ashay Rane	53e76b8ab6	build: create RollPyTorch to update PyTorch version in Torch-MLIR (#1419 ) This patch fetches the most recent nightly (binary) build of PyTorch, before pinning it in pytorch-requirements.txt, which is referenced in the top-level requirements.txt file. This way, end users will continue to be able to run `pip -r requirements.txt` without worrying whether doing so will break their Torch-MLIR build. This patch also fetches the git commit hash that corresponds to the nightly release, and this hash is passed to the out-of-tree build so that it can build PyTorch from source. If we were to sort the torch versions as numbers (in the usual descending order), then 1.9 appears before 1.13. To fix this problem, we use the `--version-sort` flag (along with `--reverse` for specifying a descending order). We also filter out lines that don't contain version numbers by only considering lines that start with a digit. As a matter of slight clarity, this patch renames the variable `torch_from_src` to `torch_from_bin`, since that variable is initialized to `TM_USE_PYTORCH_BINARY`. Co-authored-by: powderluv <powderluv@users.noreply.github.com>	2022-09-28 15:38:30 -05:00

39 Commits (1c4b9d6a0e9fb0e9a611281fd35283eb3e0c67b4)