torch-mlir

Commit Graph

Author	SHA1	Message	Date
Ashay Rane	95ffa27733	release: pin PyTorch version in release requirements (#1435 ) Prior to this patch, the release process (`pip wheel`) retrieved dependencies from the pyproject.toml file, which specified a version of PyTorch that defaulted to the most recent nightly release. Instead, we want the release process to use the same pinned PyTorch version as the development build of PyTorch. Since TOML files can't reference the pytorch-requirements.txt file, this patch puts the dependencies from pyproject.toml into whl-requirements.txt, which references pytorch-requirements.txt.	2022-09-29 14:09:31 -05:00
Ramiro Leal-Cavazos	2509641cab	Add `--no-index` to CI's git-diff check on generated files (#1428 ) `git diff` does not work by default on untracked files. Since the function `_check_file_not_changed_by` stores the new generated file in an untracked file, `git diff` was not catching any modifications in the new generated file. This commit adds the flag `--no-index` to make `git diff` work with untracked files.	2022-09-29 10:31:40 -07:00
Ashay Rane	53e76b8ab6	build: create RollPyTorch to update PyTorch version in Torch-MLIR (#1419 ) This patch fetches the most recent nightly (binary) build of PyTorch, before pinning it in pytorch-requirements.txt, which is referenced in the top-level requirements.txt file. This way, end users will continue to be able to run `pip -r requirements.txt` without worrying whether doing so will break their Torch-MLIR build. This patch also fetches the git commit hash that corresponds to the nightly release, and this hash is passed to the out-of-tree build so that it can build PyTorch from source. If we were to sort the torch versions as numbers (in the usual descending order), then 1.9 appears before 1.13. To fix this problem, we use the `--version-sort` flag (along with `--reverse` for specifying a descending order). We also filter out lines that don't contain version numbers by only considering lines that start with a digit. As a matter of slight clarity, this patch renames the variable `torch_from_src` to `torch_from_bin`, since that variable is initialized to `TM_USE_PYTORCH_BINARY`. Co-authored-by: powderluv <powderluv@users.noreply.github.com>	2022-09-28 15:38:30 -05:00
Ashay Rane	78bfbf2474	build: re-enable TOSA tests after upstream LLVM rollback (#1417 )	2022-09-27 07:35:33 -05:00
Jae Hoon (Antonio) Kim	3e27aa2be3	Fix as_strided/slice symint (#1401 ) * Fix as_strided symint * Re-enable LTC tests * Re-enable LTC * Add hardtanh shape inference function * Fix slice symint	2022-09-26 12:16:49 -04:00
Daniel Ellis	1dfe5efe9e	Create github action for creating pip-compatible releases index	2022-09-23 15:26:19 -04:00
Sean Silva	7a77f9fe3d	Add a way to turn off crashing tests This adds a very long and obnoxious option to disable crashing tests. The right fix here is to use the right multiprocessing techniques to ensure that segfaulting tests can be XFAILed like normal tests, but we currently don't know how to implement "catch a segfault" in Python (patches or even just ideas welcome). Motivated by #1361, where we ended up removing two tests from all backends due to a failure in one backend, which is undesirable.	2022-09-23 05:01:39 -07:00
Sean Silva	566234f97a	Disable LTC again (#1400 ) https://github.com/llvm/torch-mlir/issues/1396	2022-09-22 17:49:13 -05:00
Jae Hoon (Antonio) Kim	8967463980	Fix symint ops and blacklist `lift_fresh_copy` (#1373 ) * Add symint to native functions yaml * Re-enable LTC * Fix new_empty_strided and narrow_copy	2022-09-20 10:16:04 -04:00
Sean Silva	7fa31817c5	Fix generated file checks (#1338 ) No idea how this slipped by. Sorry about that. Fixes #1334	2022-09-02 12:12:42 -07:00
powderluv	234b2f2bd4	Fix release builds to only build release (#1333 ) We were defaulting to building Release and running tests. Tests are spawned separately.	2022-09-02 03:37:57 -07:00
powderluv	729609831c	Remove setting ulimit for docker runs (#1325 ) We added both ipc=host and explicit ulimits. This _may_ be causing slow downs on GHA. Remove the ulimit setting still passes all the CI tests locally. `--ipc=host` is still required.	2022-08-31 20:37:53 -07:00
powderluv	9dbe41a85c	Drop Python3.8 binary releases. Still builds from source. (#1329 ) Shows low download count and we can add it back if people ask for it. Should save release artifacts space and Release build time.	2022-08-31 20:30:01 -07:00
Sean Silva	a924de3e1a	Slightly tweak generated file checks The new logic has the following benefits: 1. It does not clobber the working tree state. We expect testing to not change the work tree. 2. It correctly handles the case where a user has changes to the generated files, but hasn't checked them in yet (this happens frequently when adding new ops).	2022-08-31 20:03:25 -07:00
powderluv	3704363892	Use pre-compiled headers for PyTorch Source builds (#1327 ) This should speed up source builds and ccache. May cause issues on macOS (https://github.com/pytorch/pytorch/issues/80018)	2022-08-31 16:09:16 -07:00
powderluv	928c815ce2	Add shapelib and Torch ODS gen tests (#1318 )	2022-08-31 15:01:59 -07:00
powderluv	9f061ea97d	Dockerize CI + Release builds (#1234 ) Gets both CI and Release builds integrated in one workflow. Mount ccache and pip cache as required for fast iterative builds Current Release docker builds still run with root perms, fix it in the future to run as the same user. There may be some corner cases left especially when switching build types etc. Docker build TEST plan: tl;dr: Build everythin: Releases (Python 3.8, 3.9, 3.10) and CIs. TM_PACKAGES="torch-mlir out-of-tree in-tree" 2.57s user 2.49s system 0% cpu 30:33.11 total Out of Tree + PyTorch binaries: Fresh build (purged cache): TM_PACKAGES="out-of-tree" 0.47s user 0.51s system 0% cpu 5:24.99 total Incremental with ccache: TM_PACKAGES="out-of-tree" 0.09s user 0.08s system 0% cpu 34.817 total Out of Tree + PyTorch from source Incremental TM_PACKAGES="out-of-tree" TM_USE_PYTORCH_BINARY=OFF 1.58s user 1.81s system 2% cpu 1:59.61 total In-Tree + PyTorch binaries: Fresh build and tests: (purge ccache) TM_PACKAGES="in-tree" 0.53s user 0.49s system 0% cpu 6:23.35 total Fresh build/ but with prior ccache TM_PACKAGES="in-tree" 0.45s user 0.66s system 0% cpu 3:57.47 total Incremental in-tree with all tests and regression tests TM_PACKAGES="in-tree" 0.16s user 0.09s system 0% cpu 2:18.52 total In-Tree + PyTorch from source Fresh build and tests: (purge ccache) TM_PACKAGES="in-tree" TM_USE_PYTORCH_BINARY=OFF 2.03s user 2.28s system 0% cpu 11:11.86 total Fresh build/ but with prior ccache TM_PACKAGES="in-tree" TM_USE_PYTORCH_BINARY=OFF 1.58s user 1.88s system 1% cpu 4:53.15 total Incremental in-tree with all tests and regression tests TM_PACKAGES="in-tree" TM_USE_PYTORCH_BINARY=OFF 1.09s user 1.10s system 1% cpu 3:29.84 total Incremental without tests TM_PACKAGES="in-tree" TM_USE_PYTORCH_BINARY=OFF TM_SKIP_TESTS=ON 1.52s user 1.42s system 3% cpu 1:15.82 total In-tree+out-of-tree + Pytorch Binaries TM_PACKAGES="out-of-tree in-tree" 0.25s user 0.18s system 0% cpu 3:01.91 total To clear all artifacts: rm -rf build build_oot llvm-build libtorch docker_venv externals/pytorch/build	2022-08-30 11:07:25 -07:00
Sean Silva	e16b43e20b	Remove "torchscript" association from the e2e framework. We use it for more than TorchScript testing now. This is a purely mechanical change to adjust some file paths to remove "torchscript". The most perceptible change here is that now e2e tests are run with ``` ./tools/e2e_test.sh instead of: ./tools/torchscript_e2e_test.sh ```	2022-08-29 14:10:03 -07:00
Jae Hoon (Antonio) Kim	8e880a2d00	Fix symint related functionalization ops (#1289 ) * Fix symint related functionalization ops * Remove zeros xfail from LTC tests	2022-08-26 16:13:28 -04:00
Henry Tu	e2f862cb85	Fix LTC build warnings (#1272 ) * Resolved Wunused-variable * Fix Wunneeded-internal-declaration * Address review comment * Update autogen_ltc_backend.py * Update mlir_native_functions.cpp to work with updated PyTorch * Remove NewZeros from LTC XFAIL set	2022-08-24 15:04:28 -04:00
Henry Tu	ba17a4d6c0	Reenable LTC in out-of-tree build (for real this time) (#1205 ) * Fix OOT LTC CI build failure * Disable LTC during macOS package gen * Add more details about static TorchMLIRJITIRImporter library	2022-08-19 15:25:00 -04:00
powderluv	0d1aa43764	Drop Python 3.7x from the nightly binary builds (#1246 )	2022-08-18 16:34:12 -07:00
Quinn Dawkins	85f383ce0b	Bump the shape lib to match the upstream functions currently in PyTorch (#1236 ) Bumps the shape library: - Updates the function signature for aten.arange.start_step - upstream_shape_functions.mean_dim -> upstream_shape_functions.sum_mean_dim	2022-08-17 00:11:04 -04:00
nithinsubbiah	fde390c766	Re-enable custom op support	2022-08-16 22:49:08 +05:30
Sambhav Jain	f00ca91db0	Simplify matrix configuration for CI workflows (#1213 ) Addresses https://github.com/llvm/torch-mlir/issues/1207. #### Provisioned jobs: ``` # ubuntu - x86_64 - llvm in-tree - pytorch binary - build+test # most used dev flow and fastest signal # ubuntu - x86_64 - llvm out-of-tree - pytorch source - build+test # most elaborate build # macos - arm64 - llvm in-tree - pytorch source - build only # cross compile, can't test arm64 ``` #### Main changes - [x] Spawn macos builds from a separate matrix (in the same workflow). It made sense to do this as they are fairly different from ubuntu (cross compile, use a different cmake configuration). This simplifies the matrix configuration and exclusions quite a bit, and makes the workflow a bit more tractable and maintenance friendly. - [x] Remove the submodule md5sum step for ccache config. This was [broken](https://github.com/llvm/torch-mlir/runs/7779288734?check_suite_focus=true#step:3:145) for a while now. - [x] Removes unused matrix options - `os`, `targetarch`, `python-version`, `llvmtype`. - [x] Address ZSTD [comment](https://github.com/llvm/torch-mlir/pull/1204#discussion_r942349282) on @powderluv's cross compile [PR](https://github.com/llvm/torch-mlir/pull/1204). #### Further improvements (to be addressed in follow-on): * ubuntu-x86_64 out-of-tree integration tests fail ([error](https://github.com/sjain-stanford/torch-mlir/runs/7781264029?check_suite_focus=true)); only run unit tests for now (tests are excluded in current CI too) #### Passing workflow: https://github.com/sjain-stanford/torch-mlir/actions/runs/2840676309 ![image](https://user-images.githubusercontent.com/19234106/184194535-f3807991-401a-4cb9-b030-0ee8c334eba3.png)	2022-08-11 16:35:15 -07:00
Sean Silva	5618890ca0	development.md: Avoid name collisions with PYTORCH_ variables	2022-08-05 19:41:08 -07:00
Henry Tu	2c3b3606d0	Resolve remaining LTC CI failures (#1110 ) * Replace CHECK_EQ with TORCH_CHECK_EQ * Check value of TORCH_MLIR_USE_INSTALLED_PYTORCH during LTC build * Update LTC XFAIL with NewZerosModule ops * Explicitly blacklist _like ops * Automatically blacklist new_/_like ops * Prune away unused Python dependencies from LTC * Add flag to disable LTC * Autogen dummy _REFERENCE_LAZY_BACKEND library when LTC is disabled * Implement compute_shape_var * Removed Var tests from XFAIL Set * XFAIL tests using _local_scalar_dense or index.Tensor * Add StdDim tests to XFAIL set * Autogen aten::cat	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	425362263b	Clean up Autogen (#1112 ) * Remove unnecessary sed in autogen * Remove .pyc files frrom VCS	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	0d16a91656	Add support for lift_fresh op (#1101 )	2022-07-30 09:40:02 -04:00
Antonio Kim	de6c135dc3	Fix LTC autogen for CI with nightly PyTorch - Update llvm-project pin to match main	2022-07-30 09:40:02 -04:00
Henry Tu	cec74b8d37	Blacklist _convolution op (#1048 ) * Blacklist _convolution op in LTC * Removed duplicate Torch_AtenSelectScatterOp instance from autogen .td * Removed duplicate Torch_AtenSliceScatterOp instance from autogen .td	2022-07-30 09:40:02 -04:00
Henry Tu	47bb38d180	Reference Lazy Backend (#1045 ) * Changed Example MLIR backend to Reference MLIR backend * Moved reference_ltc_backend into csrc * Merged sys_utils.h * Renamed reference_ltc_backend to reference_lazy_backend * Addressed review comments * Update docs with new library name * Removed _REFERENCE_LAZY_BACKEND from .gitignore * Added reference_lazy_backend to the TorchMLIRPythonModules dependency list Fixed typo in `ltc_examples.md` Missed instance where `ltc_backend` was used instead of `lazy_backend`.	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	fb21c9e6cb	Integrate Functionalization Pass (#998 ) * Fix autogen build dir issue * Got functionalization pass to compile * Add slice/diagonal backwards functionalization * Fix codegen invocation in CMakeLists.txt * Add functionalization view ops * Fix logsumexp out functionalization * Fix ComputationPtr * Blacklist new_empty op * Add op comparison * Remove unnecessary ops Co-authored-by: Henry Tu <henry.tu@cerebras.net>	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	a62d60829c	Refactor autogen (#925 )	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	d9aee0d7a7	E2E HuggingFace Bert using LTC Backend (#912 ) * Update native function definitions * Add ops to support bert lowering - Add empty_strided and as_strided - Restore zeros_like to op blacklist (Without this, tensors will be unintentionally created with a CPU device rather than lazy) - Check for composite implicit ops and add device data IR - Also fix codegen for functionalization * Add autogen to CMakeList * Remove PyTorch submodule * Reduced BERT model size * Print Mark Step status in Torch MLIR LTC debug string * Apply fixes to work with latest upstream/main - Pass importOptions into getMlirTypeFromTorchType during NodeImporter::importNode Without this, the tensor type created may have a mismatched type as ImportOptions may cause vtensor to be used instead of tensor * Update shape inference functions - Fixed compute_shape_native_batch_norm when mean and var are uninitialized Previously, the number of shapes returned would be <3 if either mean or val was didn't exist. Instead, we now initialize them with a vector matching the number of channels. - Implemented compute_shape_mul - Fixed bug in reshape shape inference error message * Get MLIR backend more consistent with TS backend - Remove LazyNativeFunctions::_unsafe_view from autogen - Blacklist ops to make JIT graph more like output of TS backend - Print graph when SSA value has mismatch of types and results - Remove normalize_index from LazyShapeInference - Fix seeds for LTC example models * Update and clean up shape inference functions - Prune shape inference functions - Add shape inference function for GenerateSlice - Add shape inference function for GenerateCopy Co-authored-by: Henry Tu <henry.tu@cerebras.net>	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	1bde00c73d	Fix LTC Decoupling (#815 ) * Initial changes * Fix up native functions * Further fix decoupling * Remove unnecessary ops * Formatting and copyright banners: * Add pytorch submodule	2022-07-30 09:40:02 -04:00
Henry Tu	cca9fe126e	Enable support for LTC Input/Output Mapping (#764 ) * Save InputOutputAliases to TorchMlirComputation * Implement GetResultShape for TorchMlirLoweringContext * Use optional return type for GetResultShape * Remove support for aten::detach With this op enabled, tensors were being copied, which resulted in incorrect aliasing. * Add newline before printing I/O alias mapping * Changed printout to use "Input param" as label instead of "Input" * Remote shape inference function for aten::detach * Moved implementation of SetUpAlias to MlirLoweringContext As part of this change, TorchMlirComputation has been moved to the end of mlir_lowering_context.h so that it can access some new structs in TorchMlirLoweringContext * Use updated PyTorch API * Remove GetResultShape Complements this upstream PyTorch PR: pytorch/pytorch#75828 This PR adds support for mapping input and output tensors which alias each other. (e.g. maps input weight tensor in parameter to the same tensor in output after a training iteration) MLIR: func @graph(%arg0: !torch.vtensor<[1,5],f32>, %arg1: !torch.vtensor<[1],si64>, ..., %arg6: !torch.vtensor<[10,5],f32>, %arg7: !torch.vtensor<[10],f32>, ...) { ... return %arg0, %arg1, %17, %23, ... : !torch.vtensor<[1,5],f32>, !torch.vtensor<[1],si64>, !torch.vtensor<[10,5],f32>, !torch.vtensor<[10],f32>, ... } Input/Output Alias Mapping: Output: 0 -> Input: 0 Output: 1 -> Input: 1 Output: 2 -> Input: 6 Output: 3 -> Input: 7 The aten::detach op has also been disabled in this PR to fix the issue of tensors not aliasing properly due to copying.	2022-07-30 09:40:02 -04:00
Antonio Kim	615ff1d31c	Generate MLIR with shape information via LTC frontend (#742 )	2022-07-30 09:40:02 -04:00
Henry Tu	3e9b1cbd36	Added JIT to MLIR lowering (#724 ) * Added JIT to MLIR lowering Lowering to JIT is performed in a way similar to how it's done in the TS LTC backend. After a jit::Graph is constructed, it gets converted to a jit::Function, which is fed into the existing utility to generate an MlirModule in torch-mlir. * Renamed `csrc/backend` to `csrc/base_lazy_backend`	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	65cf1465ef	Fix Torch-MLIR LTC Backend based off latest PyTorch master (#723 ) * Changes as a result of the LTC TS backend decoupling * Fix bugs in BackendImpl and codegen * Fix based on latest PyTorch master	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	c3b20e444c	Got LTC working until compile (#689 )	2022-07-30 09:40:02 -04:00
powderluv	31fd812acf	Add linux and macOS source builds in CI (#1070 ) This enables building Pytorch from source in the CI. The build should mostly hit the ccache. Release builds will follow once we have some runtime on the CI.	2022-07-21 14:16:03 -07:00
powderluv	baa4383c44	Revert to using Pytorch paths for delocate (#1065 ) Remove the linking of libtorch/ paths in delocate for CI builds	2022-07-15 19:51:59 -07:00
powderluv	479a8a8963	Remove libtorch downloads (#1058 ) Remove all the libtorch downloads. If the user sets -DTORCH_MLIR_USE_INSTALLED_PYTORCH=OFF then just build from src. Doesn't change developer workflow since we still default to local PyTorch versions. TEST: Build and verify all tests (except one xfail quant) pass on linux	2022-07-14 17:16:51 -07:00
Ramana Radhakrishnan	6e68f27399	Fail to install x86_64 linux libtorch.so on other architectures. (#1053 ) Found while trying to build torch-mlir on an AArch64 Linux VM, worth a belts and braces to prevent such cases. Change-Id: I89c6fccb62e666dbda0d9acac2d0ee43c2899e9b	2022-07-14 10:01:21 -07:00
Maksim Levental	1bb990afc7	Speed up libtorch build. (#1031 )	2022-07-11 20:46:49 -05:00
powderluv	ea2afce29a	Fix OSX nightly builds (#1032 ) Set default OSX arch to x86_64. Release builds will override it. Also update to the latest point release on Python 3.9x and 3.10x	2022-07-10 22:17:01 -07:00
Ashay Rane	874fdb7e42	build: improve robustness of cmake and shell scripts (#1018 ) On my local machine, `unzip` didn't exist (producing a "command not found" error), but CMake ignored the error. Although the build did succeed (because it found a previously-built version of libtorch), it seems better to abort builds on such failures, so this patch checks the return code of all external process invocations. Along similar lines, this patch also updates the shell scripts in `build_tools` to extensively use double-quoting to prevent unintentional word splitting or globbing. Since some of the scripts execute `rm` while using shell variables, this patch also adds the preamble `set -u` to abort execution if an undefined variable is referenced, so that we reduce the chances of executing `rm -rf /` if the path expression happens to refer to an undefined variable.	2022-07-06 14:39:30 -07:00
powderluv	33bfeda4c5	Enable libtorch caching and source builds (#1004 ) Add an option to cache libtorch/ releases if you don't want to download the latest. Add an option to enable source builds. TESTS: macOS: verify with / without cache downloads verify source builds -- shared and static Linux: Build Tests and Release builds	2022-07-05 10:25:43 -07:00
powderluv	2b52da951b	Link against libtorch (#955 ) This moves torch-mlir to link against libtorch on macOS and linux TESTS: Tests pass. Tested release builds on linux and macOS	2022-06-30 12:40:17 -07:00

1 2 3

136 Commits (796c317aa9f16b79c4833c518df27e7cb28efd71)