torch-mlir

Commit Graph

Author	SHA1	Message	Date
Ashay Rane	c202cb5263	CI: Checkout repo so that gh knows where to look for the PR (#2223 ) Without this patch, the gh command (for merging the PR) doesn't know which repo we're referring to.	2023-06-09 21:50:19 -05:00
Ashay Rane	33ac7c3ad1	CI: Use GitHub token when calling gh for merging RollPyTorch PR (#2220 )	2023-06-08 15:07:43 -05:00
Ashay Rane	3c1a796f7e	CI: Merge RollPyTorch PR upon successful completion (#2218 ) This patch removes the mock commands, so that once the Build And Test workflow has successfully completed on the RollPyTorch action, the PR is merged and the branch is deleted.	2023-06-07 14:06:50 -05:00
Ashay Rane	2480cb7a51	CI: Update script to (mock) merge of RollPyTorch PRs (#2213 ) Before enabling the actual merge, this patch dumps to the console the bash commands that it plans to execute.	2023-06-06 12:38:16 -05:00
Ashay Rane	173050ec8a	CI: Fix yaml syntax in merge-rollpytorch.yml (#2201 ) This patch fixes the indentation in the yaml file.	2023-06-05 09:43:00 -05:00
Ashay Rane	c804dac925	CI: Introduce workflow to auto-merge RollPyTorch updates (#2196 ) This patch adds a new workflow that runs when an update to the rollpytorch branch by silvasean (in whose name the RollPyTorch action runs) causes the regular CI build to complete without errors. Upon execution, this workflow currently just prints the PR number(s) of the PR created by the RollPyTorch action, but once this is working as expected, we will add the step to merge the PR changes.	2023-06-05 08:48:20 -05:00
Ashay Rane	755d0c46da	CI: Spot fixes related to nightly and stable PyTorch builds (#2190 ) * CI: Skip (redundant) libtorch build when using stable PyTorch version When we use PyTorch stable builds, there is no need to build libtorch from source, making the stable-pytorch-with-torch-binary-OFF configuration redundant with stable-pytorch-with-torch-binary-ON. This patch drops the redundant configuration from CI. * CI: Simplify guard conditions for creating and using libtorch cache Whether libtorch is enabled or not is predicated on a host of conditions such as the platform, in-tree versus out-of-tree build, and stable versus nightly PyTorch builds. Instead of repeating these conditions to guard whether to create or use the libtorch cache artifacts (and getting them almost incorrect), this patch predicates the relevant pipeline steps to whether libtorch is enabled, thus making the conditions far simpler.	2023-06-01 22:58:25 -07:00
maxbartel	db3f2e3fde	Add Stable PyTorch CI Pipeline (#2038 ) * feat: split pytorch requirements into stable and nightly * fix: add true to tests to see full output * refactor: add comments to explain true statement * feat: move some tests to experimental mode * refactor: refactor pipeline into more fine grained difference * feat: add version differentiation for some tests * feat: activate more configs * refactor: change implementation to use less requirement files * refactor: remove contraints used for testing * fix: revert some requirement file names * refactor: remove unnecessary ninja install * fix: fix version parsing * refactor: remove dependency on torchvision in main requirements file * refactor: remove index url * style: remove unnecesary line switch * fix: readd index url	2023-05-30 12:16:24 -07:00
powderluv	2f02ae1ebe	Delete another spurious pip (#2173 )	2023-05-26 00:02:21 -07:00
powderluv	9b7909b599	Add ARM64 release builds (#2159 ) Creates a build_linux_arm64 job that builds the release on an arm64 self-hosted runner. Drop Python 3.10 support Pass TM_TORCH_VERSION to choose the Stable PyTorch version (since arm64 doesn't have nightly builds) Borrows nightly / stable Pytorch switch from the WIP https://github.com/llvm/torch-mlir/pull/2038	2023-05-25 20:39:19 -07:00
powderluv	f5e0287aaa	Remove spurious pip in Release builds (#2172 ) (left over from a previous commit that was approved and landed in a branch on accident)	2023-05-25 18:59:21 -07:00
Ashay Rane	9f65a8a961	CI: disable caching for release builds (#2168 ) This patch adds a (default-true) input called `cache-enabled` to the setup-build action, so that when the input is false, ccache is not setup on the host machine. This patch also sets the input to be false for the release builds.	2023-05-25 11:01:46 -05:00
Ashay Rane	558f12f05c	CI: Use GitHub app token for creating PRs (#2137 ) Since PRs created by the GitHub action bot cannot trigger workflows (and thus build tests), this patch uses the token for a GitHub app that was specifically created for the RollPyTorch action.	2023-05-19 23:18:03 -05:00
Ashay Rane	19a08d51f3	CI: [nfc] Use actions/cache instead of modified fork (#2124 ) We previously used a fork of the action/cache repository for the PyTorch cache since the actions/cache repo did not support read-only caches. Now that actions/cache supports separate read and write steps, this patch switches back to the actions/cache repo.	2023-05-12 23:25:17 -05:00
Ashay Rane	28bb866260	CI: prepare CI for ccache updates for MSVC/Windows (#2120 ) This patch, by itself, doesn't fix caching on Windows, but once a new release of ccache is available, caching for Windows builds should start working again (validated by building ccache from source and using it with LLVM builds). Ccache rejects caching when either the `/Zi` or `/ZI` flags are used during compilation on Windows, since these flags tell the compiler to embed debug information in a PDB file (separate from the object file produced by the compiler). In particular, our CI builds add the `/Zi` flag, making ccache mark these compiler invocations as uncacheable. But what caused our CI to add debug flags, especially when we specified `-DCMAKE_BUILD_TYPE=Release`? On Windows, unless we specify the `--config Release` flag during the CMake build step, CMake assumes a debug build. So all this while, we had been producing debug builds of torch-mlir for every PR! No doubt it took so long to build the Windows binaries. The reason for having to specify the configuration during the _build_ step (as opposed to the _configure_ step) of CMake on Windows is that CMake's Visual Studio generators will produce _both_ Release and Debug profiles during the CMake configure step (thus requiring a build-time value that tells CMake whether to build in Release or Debug mode). Luckily, on Linux and macOS, the `--config` flag seems to be simply ignored, instead of causing build errors. Strangely, based on cursory tests, it seems like on Windows we need to specify the Relase configuration as both `-DCMAKE_BUILD_TYPE=Release` as well as `--config Release`. Dropping either made my build switch to a Debug configuration. Additionally, there is a bug in ccache v4.8 (although this is addressed in trunk) that causes ccache to reject caching if the compiler invocation includes any flag that starts with `/Z`, including /`Zc`, which is added by LLVM's HandleLLVMOptions.cmake and which isn't related to debug info or PDB files. The next release of ccache should include the fix, which is to reject caching only for `/Zi` and `/ZI` flags and not all flags that start with `/Z`. As a side note, debugging this problem was possible because of ccache's log file, which is enabled by: `ccache --set-config="log_file=log.txt"`.	2023-05-12 12:45:01 -05:00
Ashay Rane	e161f2511a	CI: let GitHub action create commit (#2114 ) The GitHub action for creating the PR expects that either the changes are not committed (in which case it commits them with the specified commit message) or that the commit exists but that it is also pushed to remote. Prior to this patch, we created the commit but did not push it to remote, causing failures. This patch leaves the changes uncommitted so that they're committed and pushed to remote as part of the PR creation.	2023-05-11 19:19:32 -05:00
Ashay Rane	377720af87	CI: create PR for RollPyTorch updates (#2106 ) Currently, we run just the Linux in-tree tests when the RollPyTorch workflow runs, but this is insufficient since WHL files for macOS or Windows are sometimes not uploaded by PyTorch, causing the RollPyTorch action to pass but all subsequent torch-mlir CI tests to fail because of the broken build. The easiest way to validate the RollPyTorch action on all platforms is to run the standard set of tests that we run for each submitted PR, so this patch makes the RollPyTorch action submit a PR instead of committing the changes to the main branch directly. The PR is assigned to a handful of folks for review, although this can be changed in the future.	2023-05-10 09:25:59 -05:00
powderluv	0a3ab07c8f	Set fetch-depth 0 for CI builds too (#2034 )	2023-04-14 11:36:41 -07:00
powderluv	6cab740603	Set fetch-depth 0 (#2009 ) This is to potentially workaround the index.lock issue in git when we checkout new depth 1 submodules of recently updated mhlo.	2023-04-06 14:29:59 -07:00
powderluv	0497f0b08d	Revert "CI: drop deletion of workspace and limit submodule fetch concurrency (#1921 )" (#2007 ) This reverts commit `07f5f042c7`.	2023-04-06 10:36:30 -07:00
Ashay Rane	07f5f042c7	CI: drop deletion of workspace and limit submodule fetch concurrency (#1921 ) Despite using sudo to delete the workspace directory, we still occasionally run into checkout errors. This patch thus drops the deletion of the workspace prior to checkout. It also restricts the number of parallel jobs in the submodule fetch step to just one, to try and resolve the checkout issue ("index.lock: File exists.").	2023-04-04 12:58:52 -05:00
powderluv	f83c516b15	Update RollPyTorch.yml to use Pytorch 3.11 (#1999 )	2023-04-03 23:53:26 -07:00
Maksim Levental	c718f87c5d	- rename no-jit -> core (#1920 ) - add windows release	2023-03-07 00:20:06 -06:00
Maksim Levental	ac1f03e6f7	add jit,no-jit release matrix (#1916 )	2023-03-05 22:13:33 -08:00
Maksim Levental	415265a64c	Add `torch-mlir-no-jit-importer` build case for mac os wheels (#1902 ) * add flags to setup.py for out-of-tree build * - fix build_ext bug - add wheels script cases for mac wheels	2023-03-05 12:23:43 -06:00
Ashay Rane	987d5ab335	CI: use `sudo` to remove Docker-created files (#1905 )	2023-02-27 17:44:50 -06:00
Ashay Rane	ea00371d85	CI: clear workspace directory before checkout (#1900 ) We have recently started seeing errors like: ``` Synchronizing submodule url for 'externals/llvm-project' Synchronizing submodule url for 'externals/mlir-hlo' /usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 Error: fatal: Unable to create '/home/anush/actions-runner/_work/torch-mlir/torch-mlir/.git/modules/externals/llvm-project/index.lock': File exists. ``` As a workaround, this patch removes the workspace directory before the checkout step.	2023-02-24 14:44:35 -06:00
Ashay Rane	268364e061	CI: install `unzip` before using it (#1893 ) The RollPyTorch action needs the `unzip` command to peek into WHL files for fetching metadata. This patch makes sure that the command is installed before referencing it.	2023-02-19 17:49:08 -06:00
powderluv	5710871f4f	Update buildAndTest.yml (#1881 ) * Update buildAndTest.yml * Update oneshotSnapshotPackage.yml * Update buildRelease.yml * Update RollPyTorch.yml * Update oneshotSnapshotPackage.yml * Update buildAndTest.yml	2023-02-15 09:17:12 -08:00
Ashay Rane	67ab708b63	python: separate build- and test-related pip dependencies (#1874 ) We want to ensure that pip packages required for building torch-mlir should be included in the dependencies of torch-mlir, but we don't want the pip packages required for _testing_ of torch-mlir to be included among the dependencies. To be able to specify and install one set of dependencies and not the other, this patch separates the pip packages into two files: build-requirements.txt and test-requirements.txt. This patch also updates references to the requirements.txt file so that CI builds that run end-to-end tests install test-related pip dependencies while everything else (including WHL builds) sticks to just the build-related pip dependencies. Despite this change, this patch should not affect a torch-mlir developer's workflow. More precisely, since this patch makes the top-level requirements.txt file refer to both build-requirements.txt and test-requirements.txt files, a torch-mlir developer should be able to continue referring to the requirements.txt file without any impact.	2023-02-13 21:22:09 -06:00
powderluv	320e67ff34	Python 3.11 support (#1848 ) * Python 3.11 support * test without torchvision * Update pytorch-requirements.txt * Update buildRelease.yml * Update action.yml * Update install_macos_deps.sh * Update build_macos_packages.sh	2023-02-10 07:16:37 -08:00
Ashay Rane	711646d095	mhlo: migrate conversion to stablehlo (#1840 ) This patch replaces all MHLO operations with their StableHLO counterparts and adds a validation pass to ensure that no MHLO operations remain before translating all Stablehlo operations to the MHLO dialect for further lowering to the Linalg dialect. This patch also updates all lit tests so that they refer to the `convert-torch-to-stablehlo` pass and so that they check for StableHLO operations.	2023-02-02 07:29:47 -06:00
Ashay Rane	a897c49803	CI: miscellaneous fixes for Release builds (#1781 ) - Use v3 of actions/checkout, since the version we use (v2) uses Node.js 12, which is deprecated by GitHub. - Source the PowerShell venv sctipt (instead of the bash sript) since the calling script is a PowerShell script. Without this, the build doesn't use venv at all. - Make the build dependencies in whl-requirements.txt (used by setup.py) match those in requirements.txt. To that end, this patch creates a build-requirements.txt that is referenced by requirements.txt and whl-requirements.txt.	2023-01-06 20:41:43 -06:00
Ashay Rane	f6b6069a34	ci: post comment on RollPyTorch tracker issue upon build failure (#1730 ) Now that the RollPyTorch tracker issue exists, we can automate the job of notifying folks of failures instead of having to do it manually. This patch adds a step to the workflow to post such a message.	2022-12-18 13:45:30 -06:00
powderluv	cd90c0aaf5	Update buildAndTest.yml (#1723 )	2022-12-15 05:42:01 -08:00
Ashay Rane	64f9a0e978	ci: print ccache statistics and configuration at end of CI run (#1719 ) There appear to be two problems with the caching layer in our CI runs: (a) the sizes of some of the caches have grown to multiples of the 300 MB limit and (b) caching on Windows seems to be provide little to no benefit. To help understand the reasons for these problems, this patch adds a line item to the list of steps run in CI to dump the ccache configuration and statistics just prior to uploading the cache artifact.	2022-12-14 09:50:43 -06:00
Ashay Rane	731c313231	ci: run `git pull` before committing pytorch version updates (#1716 ) The RollPyTorch action often takes more than 1.5 hours to finish. During this time, if another PR is merged, then the RollPyTorch action needs to first pull the merged changes before committing the updates to the PyTorch commit hash and version files. This patch adds the required `git pull` statement, without which, the subsequent `git push` statement fails, causing the RollPyTorch action to fail as well.	2022-12-13 13:41:41 -06:00
Daniel Ellis	07a65961dd	Disable pypi publishing. See https://github.com/llvm/torch-mlir/issues/1709	2022-12-13 11:45:41 -05:00
Ramiro Leal-Cavazos	a710237437	[custom op] Generalize shape library logic to work with dtypes (#1594 ) * [custom op] Generalize shape library logic to work with dtypes This commit generalizes the shape library logic, so that dtype rules for ops can also be expressed using the same mechanism. In other words, each op can now have a shape function and a dtype function specified in Python that is imported during lowering to calculate the shapes and dtypes throught a program. For more information about how to specify a dtype function, see the updated `docs/adding_a_shape_and_dtype_function.md`. For those not familiar with how the shape library works, the file `docs/calculations_lib.md` provides an overview.	2022-12-13 08:25:41 -08:00
Sambhav Jain	109c91ae9b	[CI] Verify bazel buildifier is run and changes committed (#1700 ) Ensures the buildifier (linter for bazel build files) is run and changes are pushed.	2022-12-08 15:56:57 -08:00
Daniel Ellis	98d80a642a	Publish releases to PyPI after build	2022-12-07 10:01:55 -05:00
Ashay Rane	b43965d8d3	build: fetch PyTorch version using downloaded WHL file (#1632 ) Until recently, the metadata file in the torchvision package included the nightly version of the torch package, but since that is no longer the case, our RollPyTorch workflow is broken. As a workaround, this patch uses the `pip download` command's ability to fetch the dependent torch package for the specified version of torchvision, before peeking into the WHL file for the torch package to determine the release version and the commit hash.	2022-11-23 13:54:54 -06:00
Ashay Rane	4eead74232	ci: delay RollPyTorch action by 1 hour to use latest torchvision package (#1603 ) The upload timestamp of the nightly torchvision package has drifted beyond the scheduled time of the RollPyTorch action because of the time change due to daylight saving. As a result, the RollPyTorch action now picks the torchvision package from a day earlier instead of the most recent package. This patch schedules the RollPyTorch action to start one hour later than before so that it continues to pick the most recent nightly package.	2022-11-23 11:31:02 -06:00
Sambhav Jain	ba5b90ee27	Enable bazel LIT tests in CI (#1596 ) Bazel LIT test support was added in https://github.com/llvm/torch-mlir/pull/1585. This PR enables the tests in CI. ``` INFO: Build completed successfully, 254 total actions @torch-mlir//test/Conversion:TorchToArith/basic.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToLinalg/basic.mlir.test PASSED in 0.5s @torch-mlir//test/Conversion:TorchToLinalg/elementwise.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToLinalg/flatten.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToLinalg/pooling.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToLinalg/unsqueeze.mlir.test PASSED in 0.2s @torch-mlir//test/Conversion:TorchToLinalg/view.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToMhlo/basic.mlir.test PASSED in 0.5s @torch-mlir//test/Conversion:TorchToMhlo/elementwise.mlir.test PASSED in 0.9s @torch-mlir//test/Conversion:TorchToMhlo/gather.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToMhlo/linear.mlir.test PASSED in 0.6s @torch-mlir//test/Conversion:TorchToMhlo/pooling.mlir.test PASSED in 0.3s @torch-mlir//test/Conversion:TorchToMhlo/reduction.mlir.test PASSED in 0.4s @torch-mlir//test/Conversion:TorchToMhlo/view_like.mlir.test PASSED in 0.6s @torch-mlir//test/Conversion:TorchToSCF/basic.mlir.test PASSED in 0.2s @torch-mlir//test/Conversion:TorchToTosa/basic.mlir.test PASSED in 1.1s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/basic.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/error.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/free-functions.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/initializers.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/methods.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/module-uses-error.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/module-uses.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/multiple-instances-error.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/multiple-instances-multiple-module-args.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/multiple-instances.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/submodules.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/GlobalizeObjectGraph/visibility.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/adjust-calling-conventions.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/canonicalize.mlir.test PASSED in 0.4s @torch-mlir//test/Dialect:Torch/decompose-complex-ops-legal.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/decompose-complex-ops.mlir.test PASSED in 0.9s @torch-mlir//test/Dialect:Torch/drop-shape-calculations.mlir.test PASSED in 0.4s @torch-mlir//test/Dialect:Torch/erase-module-initializer.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/inline-global-slots-analysis.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/inline-global-slots-transform.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/invalid.mlir.test PASSED in 0.4s @torch-mlir//test/Dialect:Torch/lower-to-backend-contract-error.mlir.test PASSED in 17.3s @torch-mlir//test/Dialect:Torch/maximize-value-semantics.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/ops.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/prepare-for-globalize-object-graph.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/promote-types.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/reduce-op-variants-error.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/reduce-op-variants.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/refine-public-return.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:Torch/refine-types-branch.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/refine-types-ops.mlir.test PASSED in 0.6s @torch-mlir//test/Dialect:Torch/refine-types.mlir.test PASSED in 0.4s @torch-mlir//test/Dialect:Torch/reify-shape-calculations.mlir.test PASSED in 2.9s @torch-mlir//test/Dialect:Torch/simplify-shape-calculations.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:Torch/torch-function-to-torch-backend-pipeline.mlir.test PASSED in 0.6s @torch-mlir//test/Dialect:TorchConversion/canonicalize.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:TorchConversion/finalizing-backend-type-conversion.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:TorchConversion/func-backend-type-conversion.mlir.test PASSED in 0.2s @torch-mlir//test/Dialect:TorchConversion/ops.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:TorchConversion/verify-linalg-on-tensors-backend-contract.mlir.test PASSED in 0.3s @torch-mlir//test/Dialect:TorchConversion/verify-tosa-backend-contract.mlir.test PASSED in 0.2s @torch-mlir//test/RefBackend:insert-rng-globals.mlir.test PASSED in 0.2s INFO: Build completed successfully, 2[54](https://github.com/sjain-stanford/torch-mlir/actions/runs/3476816449/jobs/5812368489#step:7:55) total actions @torch-mlir//test/RefBackend:munge-calling-conventions.mlir.test PASSED in 0.2s Executed [59](https://github.com/sjain-stanford/torch-mlir/actions/runs/3476816449/jobs/5812368489#step:7:60) out of 59 tests: 59 tests pass. ``` GHA workflow: https://github.com/sjain-stanford/torch-mlir/actions/runs/3476816449/jobs/5812368489	2022-11-16 11:59:33 -08:00
Sambhav Jain	4aa1e90b34	Fix cache bug with Bazel builds in CI (#1593 ) Some time ago, bazel builds in CI were being sped up fine with caching. However, over time the cache got stale because `actions/cache@v3` apparently doesn't update caches when it "hits" unless it is configured to do so specifically. This requires using a uniqued per-commit `key` (to force it to update cache after each successful run) and a relaxed `restore-keys` which is not unique per-commit so newer commits can restore from the nearest hit. Test GHA run 1 (no cache hit): [1h 1m 52s](https://github.com/sjain-stanford/torch-mlir/actions/runs/3474770334/usage) Test GHA run 2 (cache hit, same commit): [5m 14s](https://github.com/sjain-stanford/torch-mlir/actions/runs/3475132135/usage) Test GHA run 3 (cache hit, different commit): [6m 6s](https://github.com/sjain-stanford/torch-mlir/actions/runs/3475161009/usage)	2022-11-15 18:48:31 -08:00
Sambhav Jain	99ec6039f6	Fix bazel CI (#1591 ) I accidentally broke bazel CI by forgetting to update the GHA workflow in my [previous PR](https://github.com/llvm/torch-mlir/pull/1587). This should get it back to green, my apologies. Qualifying CI run: https://github.com/sjain-stanford/torch-mlir/actions/runs/3472523982	2022-11-15 09:51:52 -08:00
Ashay Rane	f1ef5681cc	build: pin torchvision to latest nightly (#1584 ) We currently pin the `torch` package to the latest nightly version, but since `torchvision` depends on the `torch` package, the pip resolver then has to run through an extensive list of `torchvision` packages that can be installed with the pinned `torch` package. This search fails in the RollPyTorch action, causing pip to settle on an old version of `torchvision` that does not work with our tests. In reality, we are only interested in a specific version of the `torchvision` package. To make the dependency explicit and to prevent test failures because of incorrect package installations, this patch makes two key changes: 1. `torchvision` is now pinned to the latest nightly release in pytorch-requirements.txt along with the version of `torch` that is necessary to install the requested `torchvision` package 2. The RollPyTorch action now looks for the latest `torchvision` package instead of the latest `torch` package before writing the version numbers for pinning in pytorch-requirements.txt	2022-11-14 15:56:02 -06:00
Ashay Rane	2846776897	ci: enable ccache on Windows (#1548 ) This patch makes a few small, but key, changes to enable ccache on Windows. First, it replaces the hendrikmuhs/ccache-action action with command line invocations to the ccache binary, since the action has two bugs, one of which causes CI to refer to different ccache artifacts before versus after the build on Windows whereas the other bug can sometimes cause the action to incorrectly infer that the cache is empty. Second, this patch slightly alters the cache key, so that our old cache artifacts, which have grown too big, are eventually discarded in favor of the new, smaller cache artifacts. Along the way, this patch also keeps the RollPyTorch's cache artifact separate from the regular build's cache artifact so as to keep these artifacts small, and also because the RollPyTorch action is off the critical path for most contributors. Finally, this patch makes small changes to the CMake file so that on Windows, the ccache binary is added as a prefix, as recommended on the [ccache Wiki](https://github.com/ccache/ccache/wiki/MS-Visual-Studio).	2022-11-03 12:17:22 -05:00
Ashay Rane	f847642495	CI script improvements (#1547 ) * ci: update versions of external actions Node.js 12 actions are deprecated and will eventually go away, so this patch bumps the old actions to their latest versions that use Node.js 16. * ci: replace deprecated action with bash commands The llvm/actions/install-ninja action uses Node.js 12, which is deprecated. Since that action is not updated to work with Node.js 16, this patch replaces that action with equivalent bash commands to install Ninja. * ci: use smaller ccache artifacts to reduce evictions Over time, our ccache sizes have grown quite large (some as large as 1.3 GB), which results in us routinely exceeding GitHub's limits, thus triggering frequent cache evictions. As a result, cache downloads and uploads take unnecessary long, in addition to fewer cache entries being available. Based on experiments on a clean cache state, it appears that we need less than 300 MB of (compressed) ccache artifacts for each build type. Anything larger than that will accrue changes from the past that aren't needed. To alleviate the cache burden, this patch sets the maximum ccache size to be 300 MB. This change should not affect the success or failure of our builds. I will monitor the build times to check whether this change causes any performance degradation. * ci: use consistent platform identifiers Prior to this patch, some of our builds ran on `ubuntu-latest`, while some others ran on `ubuntu-20.04` and others ran on `ubuntu-22.04`, with similar situations for macOS and windows. This patch instead sets all Linux builds to run on `ubuntu-latest`, all macOS builds to run on `macos-latest`, and all Windows builds to run on `windows-latest`, to make debugging future CI failures a little easier.	2022-11-02 21:37:01 -05:00
Ashay Rane	031d127940	ci: introduce read-only and read-write PyTorch build caches (#1546 ) Until recently, we had to either risk feature branches creating PyTorch build caches (which were unusable by the main branch or other parallel feature branches because of GitHub's rules around sharing caches among branches) or we had to limit the PyTorch build caches to only the main branch, causing CI runs on feature branches to be terribly slow because they had to rebuild PyTorch each time. This patch enables the best of both worlds, by using a fork (github.com/ashay/cache) of the GitHub's cache action, where the fork adds an option (called `save`) which, when set, uploads a new cache entry. We thus set this `save` flag only when we're building PyTorch from source in Torch-MLIR's main branch, whereas all other builds set this `save` flag to `false`. The ability to conditionally update the cache has been an oft-requested feature on the original (github.com/actions/cache) repository and multiple unmerged PRs exist to allow conditional cache updates, so it is likely that using the fork is only a temporary solution.	2022-11-01 23:26:17 -07:00

1 2 3 4

170 Commits (45e2188615711a0db70cb7ad0ca92b95a46687e2)