torch-mlir

Commit Graph

Author	SHA1	Message	Date
Sambhav Jain	f00ca91db0	Simplify matrix configuration for CI workflows (#1213 ) Addresses https://github.com/llvm/torch-mlir/issues/1207. #### Provisioned jobs: ``` # ubuntu - x86_64 - llvm in-tree - pytorch binary - build+test # most used dev flow and fastest signal # ubuntu - x86_64 - llvm out-of-tree - pytorch source - build+test # most elaborate build # macos - arm64 - llvm in-tree - pytorch source - build only # cross compile, can't test arm64 ``` #### Main changes - [x] Spawn macos builds from a separate matrix (in the same workflow). It made sense to do this as they are fairly different from ubuntu (cross compile, use a different cmake configuration). This simplifies the matrix configuration and exclusions quite a bit, and makes the workflow a bit more tractable and maintenance friendly. - [x] Remove the submodule md5sum step for ccache config. This was [broken](https://github.com/llvm/torch-mlir/runs/7779288734?check_suite_focus=true#step:3:145) for a while now. - [x] Removes unused matrix options - `os`, `targetarch`, `python-version`, `llvmtype`. - [x] Address ZSTD [comment](https://github.com/llvm/torch-mlir/pull/1204#discussion_r942349282) on @powderluv's cross compile [PR](https://github.com/llvm/torch-mlir/pull/1204). #### Further improvements (to be addressed in follow-on): * ubuntu-x86_64 out-of-tree integration tests fail ([error](https://github.com/sjain-stanford/torch-mlir/runs/7781264029?check_suite_focus=true)); only run unit tests for now (tests are excluded in current CI too) #### Passing workflow: https://github.com/sjain-stanford/torch-mlir/actions/runs/2840676309 ![image](https://user-images.githubusercontent.com/19234106/184194535-f3807991-401a-4cb9-b030-0ee8c334eba3.png)	2022-08-11 16:35:15 -07:00
Sean Silva	5618890ca0	development.md: Avoid name collisions with PYTORCH_ variables	2022-08-05 19:41:08 -07:00
Henry Tu	2c3b3606d0	Resolve remaining LTC CI failures (#1110 ) * Replace CHECK_EQ with TORCH_CHECK_EQ * Check value of TORCH_MLIR_USE_INSTALLED_PYTORCH during LTC build * Update LTC XFAIL with NewZerosModule ops * Explicitly blacklist _like ops * Automatically blacklist new_/_like ops * Prune away unused Python dependencies from LTC * Add flag to disable LTC * Autogen dummy _REFERENCE_LAZY_BACKEND library when LTC is disabled * Implement compute_shape_var * Removed Var tests from XFAIL Set * XFAIL tests using _local_scalar_dense or index.Tensor * Add StdDim tests to XFAIL set * Autogen aten::cat	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	425362263b	Clean up Autogen (#1112 ) * Remove unnecessary sed in autogen * Remove .pyc files frrom VCS	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	0d16a91656	Add support for lift_fresh op (#1101 )	2022-07-30 09:40:02 -04:00
Antonio Kim	de6c135dc3	Fix LTC autogen for CI with nightly PyTorch - Update llvm-project pin to match main	2022-07-30 09:40:02 -04:00
Henry Tu	cec74b8d37	Blacklist _convolution op (#1048 ) * Blacklist _convolution op in LTC * Removed duplicate Torch_AtenSelectScatterOp instance from autogen .td * Removed duplicate Torch_AtenSliceScatterOp instance from autogen .td	2022-07-30 09:40:02 -04:00
Henry Tu	47bb38d180	Reference Lazy Backend (#1045 ) * Changed Example MLIR backend to Reference MLIR backend * Moved reference_ltc_backend into csrc * Merged sys_utils.h * Renamed reference_ltc_backend to reference_lazy_backend * Addressed review comments * Update docs with new library name * Removed _REFERENCE_LAZY_BACKEND from .gitignore * Added reference_lazy_backend to the TorchMLIRPythonModules dependency list Fixed typo in `ltc_examples.md` Missed instance where `ltc_backend` was used instead of `lazy_backend`.	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	fb21c9e6cb	Integrate Functionalization Pass (#998 ) * Fix autogen build dir issue * Got functionalization pass to compile * Add slice/diagonal backwards functionalization * Fix codegen invocation in CMakeLists.txt * Add functionalization view ops * Fix logsumexp out functionalization * Fix ComputationPtr * Blacklist new_empty op * Add op comparison * Remove unnecessary ops Co-authored-by: Henry Tu <henry.tu@cerebras.net>	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	a62d60829c	Refactor autogen (#925 )	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	d9aee0d7a7	E2E HuggingFace Bert using LTC Backend (#912 ) * Update native function definitions * Add ops to support bert lowering - Add empty_strided and as_strided - Restore zeros_like to op blacklist (Without this, tensors will be unintentionally created with a CPU device rather than lazy) - Check for composite implicit ops and add device data IR - Also fix codegen for functionalization * Add autogen to CMakeList * Remove PyTorch submodule * Reduced BERT model size * Print Mark Step status in Torch MLIR LTC debug string * Apply fixes to work with latest upstream/main - Pass importOptions into getMlirTypeFromTorchType during NodeImporter::importNode Without this, the tensor type created may have a mismatched type as ImportOptions may cause vtensor to be used instead of tensor * Update shape inference functions - Fixed compute_shape_native_batch_norm when mean and var are uninitialized Previously, the number of shapes returned would be <3 if either mean or val was didn't exist. Instead, we now initialize them with a vector matching the number of channels. - Implemented compute_shape_mul - Fixed bug in reshape shape inference error message * Get MLIR backend more consistent with TS backend - Remove LazyNativeFunctions::_unsafe_view from autogen - Blacklist ops to make JIT graph more like output of TS backend - Print graph when SSA value has mismatch of types and results - Remove normalize_index from LazyShapeInference - Fix seeds for LTC example models * Update and clean up shape inference functions - Prune shape inference functions - Add shape inference function for GenerateSlice - Add shape inference function for GenerateCopy Co-authored-by: Henry Tu <henry.tu@cerebras.net>	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	1bde00c73d	Fix LTC Decoupling (#815 ) * Initial changes * Fix up native functions * Further fix decoupling * Remove unnecessary ops * Formatting and copyright banners: * Add pytorch submodule	2022-07-30 09:40:02 -04:00
Henry Tu	cca9fe126e	Enable support for LTC Input/Output Mapping (#764 ) * Save InputOutputAliases to TorchMlirComputation * Implement GetResultShape for TorchMlirLoweringContext * Use optional return type for GetResultShape * Remove support for aten::detach With this op enabled, tensors were being copied, which resulted in incorrect aliasing. * Add newline before printing I/O alias mapping * Changed printout to use "Input param" as label instead of "Input" * Remote shape inference function for aten::detach * Moved implementation of SetUpAlias to MlirLoweringContext As part of this change, TorchMlirComputation has been moved to the end of mlir_lowering_context.h so that it can access some new structs in TorchMlirLoweringContext * Use updated PyTorch API * Remove GetResultShape Complements this upstream PyTorch PR: pytorch/pytorch#75828 This PR adds support for mapping input and output tensors which alias each other. (e.g. maps input weight tensor in parameter to the same tensor in output after a training iteration) MLIR: func @graph(%arg0: !torch.vtensor<[1,5],f32>, %arg1: !torch.vtensor<[1],si64>, ..., %arg6: !torch.vtensor<[10,5],f32>, %arg7: !torch.vtensor<[10],f32>, ...) { ... return %arg0, %arg1, %17, %23, ... : !torch.vtensor<[1,5],f32>, !torch.vtensor<[1],si64>, !torch.vtensor<[10,5],f32>, !torch.vtensor<[10],f32>, ... } Input/Output Alias Mapping: Output: 0 -> Input: 0 Output: 1 -> Input: 1 Output: 2 -> Input: 6 Output: 3 -> Input: 7 The aten::detach op has also been disabled in this PR to fix the issue of tensors not aliasing properly due to copying.	2022-07-30 09:40:02 -04:00
Antonio Kim	615ff1d31c	Generate MLIR with shape information via LTC frontend (#742 )	2022-07-30 09:40:02 -04:00
Henry Tu	3e9b1cbd36	Added JIT to MLIR lowering (#724 ) * Added JIT to MLIR lowering Lowering to JIT is performed in a way similar to how it's done in the TS LTC backend. After a jit::Graph is constructed, it gets converted to a jit::Function, which is fed into the existing utility to generate an MlirModule in torch-mlir. * Renamed `csrc/backend` to `csrc/base_lazy_backend`	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	65cf1465ef	Fix Torch-MLIR LTC Backend based off latest PyTorch master (#723 ) * Changes as a result of the LTC TS backend decoupling * Fix bugs in BackendImpl and codegen * Fix based on latest PyTorch master	2022-07-30 09:40:02 -04:00
Jae Hoon (Antonio) Kim	c3b20e444c	Got LTC working until compile (#689 )	2022-07-30 09:40:02 -04:00
powderluv	31fd812acf	Add linux and macOS source builds in CI (#1070 ) This enables building Pytorch from source in the CI. The build should mostly hit the ccache. Release builds will follow once we have some runtime on the CI.	2022-07-21 14:16:03 -07:00
powderluv	baa4383c44	Revert to using Pytorch paths for delocate (#1065 ) Remove the linking of libtorch/ paths in delocate for CI builds	2022-07-15 19:51:59 -07:00
powderluv	479a8a8963	Remove libtorch downloads (#1058 ) Remove all the libtorch downloads. If the user sets -DTORCH_MLIR_USE_INSTALLED_PYTORCH=OFF then just build from src. Doesn't change developer workflow since we still default to local PyTorch versions. TEST: Build and verify all tests (except one xfail quant) pass on linux	2022-07-14 17:16:51 -07:00
Ramana Radhakrishnan	6e68f27399	Fail to install x86_64 linux libtorch.so on other architectures. (#1053 ) Found while trying to build torch-mlir on an AArch64 Linux VM, worth a belts and braces to prevent such cases. Change-Id: I89c6fccb62e666dbda0d9acac2d0ee43c2899e9b	2022-07-14 10:01:21 -07:00
Maksim Levental	1bb990afc7	Speed up libtorch build. (#1031 )	2022-07-11 20:46:49 -05:00
powderluv	ea2afce29a	Fix OSX nightly builds (#1032 ) Set default OSX arch to x86_64. Release builds will override it. Also update to the latest point release on Python 3.9x and 3.10x	2022-07-10 22:17:01 -07:00
Ashay Rane	874fdb7e42	build: improve robustness of cmake and shell scripts (#1018 ) On my local machine, `unzip` didn't exist (producing a "command not found" error), but CMake ignored the error. Although the build did succeed (because it found a previously-built version of libtorch), it seems better to abort builds on such failures, so this patch checks the return code of all external process invocations. Along similar lines, this patch also updates the shell scripts in `build_tools` to extensively use double-quoting to prevent unintentional word splitting or globbing. Since some of the scripts execute `rm` while using shell variables, this patch also adds the preamble `set -u` to abort execution if an undefined variable is referenced, so that we reduce the chances of executing `rm -rf /` if the path expression happens to refer to an undefined variable.	2022-07-06 14:39:30 -07:00
powderluv	33bfeda4c5	Enable libtorch caching and source builds (#1004 ) Add an option to cache libtorch/ releases if you don't want to download the latest. Add an option to enable source builds. TESTS: macOS: verify with / without cache downloads verify source builds -- shared and static Linux: Build Tests and Release builds	2022-07-05 10:25:43 -07:00
powderluv	2b52da951b	Link against libtorch (#955 ) This moves torch-mlir to link against libtorch on macOS and linux TESTS: Tests pass. Tested release builds on linux and macOS	2022-06-30 12:40:17 -07:00
Bob Adolf	b90837ee24	Temporarily revert support for custom op extensions. (#944 ) The MacOS builders are having linking trouble with the extension library. Until it's fixed, all support for op extensions is disabled. It should be easy to restore once the issue is resolved.	2022-06-14 18:24:40 -07:00
Bob Adolf	0a7ba62438	Allow torch-mlir to support PyTorch extensions. (#895 ) PyTorch allows new operators to be registered dynamically in modules. Torch-mlir already makes it fairly straightforward to add support for new operators, and this commit just extends that support to allow new PyTorch ops to come from a external module. This does not allow ops to be dynamically loaded into torch-mlir. Torch-mlir must still be compiled with support built-in. Add a `_torch_mlir_custom_op_example` subpackage to `torch_mlir` which registers an demonstration op. It will not be imported by default when importing torch_mlir. It's strictly for testing and documentation. Adds an end-to-end test for the `torch_mlir_custom_op_example::identity` op. With all these changes, we should now be actively testing PyTorch extension support with all future patches.	2022-06-13 14:51:30 -07:00
Prashant Kumar	10c8e3c593	Add simple neural_net and bert_training scripts. 1. With the help of `make_fx` we are able to get the full training graph with weight updates. 2. NeuralNet_training passes. Bert_training passes after cherry-picking https://github.com/llvm/torch-mlir/pull/844. 3. TODO: Remove the functorch's dependency after make_fx moves to pytorch core.	2022-05-19 06:18:42 +05:30
powderluv	d872f3e2ca	Build each OSX python version in an venv (#852 ) Previously only system default versions were built. Now we build binaries for both 3.9 and 3.10	2022-05-12 16:39:35 -07:00
powderluv	e7f306ec2f	Use delocate to make portable wheels on OSX (#850 ) Fix up wheels per python version on OSX	2022-05-12 14:16:32 -07:00
powderluv	0fb7a03ac9	Update build_macos_packages.sh Set default OSX SDK to 11.0 not 11:0	2022-05-04 08:44:43 -07:00
powderluv	fe1237b2a4	Provide a way to override MacOS target and arch (#818 ) Useful when we are only building for one architecture.	2022-05-02 09:04:12 -07:00
Prashant Kumar	5192a4e9f3	Remove heavy_deps models that don't get serialized. BART, BigBird and GPT2 are not being serialized and hence removed. Also, changed the script to obtain the resnest model.	2022-04-29 17:21:25 +05:30
powderluv	ef546e1137	Add a script to build and upload M1 snapshots (#801 ) Uses the latest snapshot tags and adds the releases to same asset directories so it can be run on a cronjob without a GH runner.	2022-04-28 14:50:58 -07:00
Vivek Khandelwal	4635d36efb	[MLIR][TORCH] Add heavydep tests for torch benchmarks This commit adds e2e heavydep tests for the torch benchmarks. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-26 13:22:08 +05:30
powderluv	6d09c98b2f	Fix version information in Release builds (#788 ) env vars seems to be lost in manylinux docker. Use a version file like IREE does.	2022-04-25 14:13:17 -07:00
powderluv	7d9138f497	Update build_macos_packages.sh (#787 ) Set the environment variable and export it since it doesn't seem to get passed down.	2022-04-22 15:48:03 -07:00
Prashant Kumar	e9c785b04b	Generate backward graph via functorch-aot module Example to demonstrate the extraction of forward as well as backward graph via Functorch's AOT module is added.	2022-04-22 20:58:35 +05:30
powderluv	4ef61aa27f	Minor buildsystem fixes (#778 ) Sets up auto-pinning of latest torch-nightly	2022-04-21 15:53:00 -07:00
powderluv	b03eac4224	Enable OSX (Intel, Apple Silicon Builds) (#776 ) Update pinned pytorch version. Will submit a follow on PR to bump. Also update artifacts directory	2022-04-21 10:47:28 -07:00
powderluv	cc3a4a58ef	Add oneshot release snapshot for test/ondemand (#768 ) * Add oneshot release snapshot for test/ondemand Add some build scripts to test new release flow based on IREE. Wont affect current builds, once this works well we can plumb it in. Build with manylinux docker * Fixes a few issues found when debugging powderluv's setup. * It is optional to link against Python3_LIBRARIES. Check that and don't do it if they don't exist for this config. * Clean and auditwheel need to operate on sanitized package names. So "torch_mlir" vs "torch-mlir". * Adds a pyproject.toml file that pins the build dependencies needed to detect both Torch and Python (the MLIR Python build was failing to detect because Numpy wasn't in the pip venv). * Commented out auditwheel: These wheels are not PyPi compliant since they weak link to libtorch at runtime. However, they should be fine to deploy to users. * Adds the --extra-index-url to the pip wheel command, allowing PyTorch to be found. * Hack setup.py to remove the _mlir_libs dir before building. This keeps back-to-back versions from accumulating in the wheels for subsequent versions. IREE has a more principled way of doing this, but what I have here should work. Co-authored-by: Stella Laurenzo <stellaraccident@gmail.com>	2022-04-21 02:19:12 -07:00
Sean Silva	b69db60f85	Pin the Python package to the exact PyTorch nightly. This avoids issues where PyTorch version drift has made things incompatible. One caveat is that you will need to specify `-f https://download.pytorch.org/whl/nightly/cpu/torch_nightly.html --pre` on the command line for pip to know where to find the nightly packages (there is no way around this) -- this is easiest to do by simultaneously passing `-r requirements.txt` on the pip command line.	2022-04-20 16:47:38 -07:00
powderluv	91d3e7ba15	Remove CCACHE settings and validate on OSX Builds whl package for OSX. Need to validate smoke tests next	2022-04-14 01:32:49 -07:00
Sean Silva	3a96078571	Pin the CI to the latest working PyTorch. I am investigating the breakage. Also, fix "externals" rename in setup.py and some cases where we weren't using `requirements.txt` consistently. Also, fix a case where the packaging script would get confused due to ".." in the path name.	2022-03-29 15:02:17 -07:00
Sean Silva	52c330cca2	Fix some more uses of "e2e" that I missed in the last commit.	2022-03-28 19:09:56 +00:00
Sean Silva	0378c75b35	Centralize all test serialization logic.	2022-03-28 10:17:13 -07:00
Ahmed S. Taei	8383497704	[NFC] Rename external -> externals (#699 )	2022-03-26 09:12:27 -07:00
Prashant Kumar	730cdcd071	Add hugging face `albert-base-v2` in torchscript_e2e_heavydep_tests `albert-base-v2` for sequence classification is added in e2e_heavy_test.	2022-03-24 17:43:24 +05:30
Sean Silva	729402c3f4	Reduce compilation time for TorchOps.cpp.inc The `assemblyFormat` stuff (which generates unrolled, per-op C++ code) was taking up a lot of compile time, and all the ops are essentially printed with the same logic. So this PR makes them all call the same helper function. This is done by using `let hasCustomAssemblyFormat = 1` and then implementing `FooOp::parse` and `FooOp::print`. Additionally, the `Generated*Ops.td` files are all collapsed into just `GeneratedTorchOps.td` (there is no reason to have the files separate, since the files are very large anyway so one is always having to search within them -- editors don't care that the file to search is now a bit bigger :) ). This reduces TorchOpsODSGenerated.cpp compile time (which is now GeneratedTorchOps.cpp) from 39 to 31 seconds on my machine. This is actually less than I expected, but this PR is an overall cleanup to the code anyway. The next step will be to introduce (better) functionality upstream for sharding the TorchOps.cpp.inc file, so that we can truly parallelize the O(#ops) costs. This is also necessary, because after this PR, TorchDialect.cpp is now the slowest file to compile, due to the `addOperations<... all the ops ...>` call, which needs to be shareded too.	2022-03-21 14:42:26 -07:00

1 2 3

112 Commits (84d345c650adac1645ea4dcb5dae88b4c91d3413)