torch-mlir

Commit Graph

Author	SHA1	Message	Date
Sambhav Jain	3e836d8dad	[fx_importer] Convert non-persistent buffers lifted as tensor constants (#2902 ) The investigation is largely recorded in https://github.com/llvm/torch-mlir/pull/2881, but this change allows us to capture non-persistent buffers that were lifted as tensor constants (after https://github.com/pytorch/pytorch/pull/118969 landed in upstream PyTorch), and propagate them to `Torch` dialect as "frozen" `torch.vtensor.literal`. I believe this patch should work with both nightly and stable PyTorch, but will let CI confirm the same. Thanks @stellaraccident for the valuable pointers and guidance. --------- Co-authored-by: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-13 12:38:32 -08:00
saienduri	9b967f6b5a	[MLIR][ONNX] Add OnnxToTorch support for Mean, IsInf, IsNaN, PRelu op (#2801 ) This commit adds the OnnxToTorch support for Mean, IsInf, IsNaN, and PRelu ops. All high priority ops were taken so went with these. The non trivial ones are Mean and IsInf which might require extra review --------- Co-authored-by: MaheshRavishankar <mravisha@amd.com>	2024-02-13 12:38:21 +05:30
Aart Bik	b6f4ca512e	[torch-mlir][sparse] sparsity metadata refinement (#2901 ) Various improvements on sparsity metadata: (1) define single data structure for all sparsity related metadata (2) handle batched dense dimensions, as well as dense subtensor dimensions (3) refine sparsity propagation for deeper networks	2024-02-12 16:10:57 -08:00
Ashay Rane	370d6ac9a2	build: find Protobuf using config mode search (#2900 ) This patch makes the Protobuf package mandatory in addition to forcing a config mode search. The (default) module mode search looks for the CMake-provided FindProtobuf.cmake file, but this file does not list Abseil as a dependency, causing linker issues like the one below: ``` ld: Undefined symbols: absl::lts_20230802::log_internal::LogMessageFatal::LogMessageFatal(char const*, int, std::__1::basic_string_view<char, std::__1::char_traits<char>>), referenced from: google::protobuf::RepeatedPtrField<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>>::TypeHandler::Type const& google::protobuf::internal::RepeatedPtrFieldBase::Get<google::protobuf::RepeatedPtrField<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>>::TypeHandler>(int) const (.cold.1) in OnnxImporter.cpp.o ``` By forcing a config mode search, CMake looks for the file that is installed as part of the protobuf package and which does contain the Abseil dependency. This workaround is also mentioned in a GitHub issue for Protobuf: https://github.com/protocolbuffers/protobuf/issues/12292#issuecomment-1529680040.	2024-02-12 17:31:41 -06:00
Aart Bik	be8375d350	[torch-mlir][sparse] implement first sparse_jit end-to-end path (#2894 ) This PR introduces a sparse_jit wrapper that can run simple models with sparse tensor inputs end-to-end. The implementation shows all required components on modifying sparse tensor types with a 1:N relation on the call sites. Two tests shows that the JIT runs end-to-end while computing the correct results. More details to follow (generalizing to COO and different ranks, as well as support for output sparse tensors), but the general concepts are all here now. _Update: Thanks to Rob, bump to proper LLVM/MLIR hash is done!_ _NOTE that all parameter passing changes are nicely done "downstream" in MLIR, so very little changes are required in torch-mlir code proper_ --------- Co-authored-by: Franz Haniel <77495327+frafranz@users.noreply.github.com> Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2024-02-12 10:04:54 -08:00
Xida Ren (Cedar)	bfb93cb99f	Fix test_add_uint8 failure to lower to linalg (#2893 ) By updating convertScalarToDtype invocation pass original source and destination datatypes for the add op. Also fixes a potential problem with the sub op. --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-12 09:19:39 -08:00
Yuanqiang Liu	b8c48cf283	Bump stablehlo to openxla/stablehlo@e191eb4c3c3f3144503a8a117d760de5d… (#2891 ) …dcc7e89. * to involve `chlo-legalize-to-stablehlo` pass.	2024-02-12 01:05:00 +08:00
Rob Suderman	c0f139be0f	[torch] Add `torch.aten.eq.Tensor` comparison folder (#2889 ) Added a folded for a equals operator. This allows an equivalent comparison folder, primarily for when shape computations occur small size tensor.	2024-02-09 15:02:20 -08:00
Rob Suderman	d83b576c6e	Bump LLVM to llvm/llvm-project@bb180856ec (#2895 ) Includes some minor first for `AffineMap::inferFromExprList`	2024-02-09 14:07:49 -08:00
Rob Suderman	7d33ba69ac	[torch] Folder for torch.aten.select.int for splat cases (#2890 ) If the input or result is a splat value we can just constant fold the result. This is common for shape computations and can help with shape inference.	2024-02-09 14:02:54 -08:00
Franz Haniel	4cc62aeb24	Implement trace (#2790 ) The lowering decomposes AtenTraceOp into an AtenDiagonalOp followed by AtenSumOp. The progress is tracked in https://github.com/nod-ai/SHARK-Turbine/issues/333. --------- Co-authored-by: Franz Haniel <franz.haniel@amd.com>	2024-02-09 08:00:24 -08:00
Avinash Sharma	9659a436d1	Add lowering support for math::AbsIOp (#2875 ) There is no lowering support for math::AbsIOp, so if the operand is an integer type, it will fail to lower to math::AbsFOp since the op operand #0 must be floating-point-like.	2024-02-08 14:53:40 -08:00
Aart Bik	44f8f89826	[torch-mlir][sparse] add sparsification to linalg reference backend (#2887 ) This adds a few passes that will ensure linalg with sparse tensors are properly lowered to loops and can run using the ExecutionEngine for testing (a few details on parameter passing from PyTorch still TBD) Test results: $ ./tools/e2e_test.sh --config linalg Summary: Passed: 1144 Expectedly Failed: 8 $ python -m e2e_testing.main --config=torchdynamo -v Summary: Passed: 960 Expectedly Failed: 163 Filed issue: https://github.com/pytorch/pytorch/issues/119407	2024-02-08 09:37:31 -08:00
Ashay Rane	21f070e95f	onnx: fix checks in TorchOnnxToTorch pass to match the ONNX spec (#2848 ) This PR contains three commits to update the validation checks in the ONNX -> Torch conversion pass for the AveragePool, Pad, and Slice operators: > onnx: fix preconditions for lowering AveragePool ops > > The `pads` attribute of the AveragePool operator specifies the value to > pad at both the beginning as well as the end of the axis (see > https://onnx.ai/onnx/operators/onnx__AveragePool.html#attributes), so > the size of this attribute should be twice the rank of the input tensor. > However, our TorchOnnxToTorch bails out early since it incorrectly > compares the pads attribute with the rank (not twice the rank) of the > input tensor. > > This patch fixes the code to match the spec and adds a lit test. > onnx: allow optional constant value for Pad operator > > The `constant_value` input of the onnx.Pad operator is optional (see > https://onnx.ai/onnx/operators/onnx__Pad.html#inputs), but the existing > logic for lowering the operator into the Torch dialect assumes that it > is mandatory. > > This patch makes the attribute optional and constructs a default value > (a list of zeros the size of the input tensor) if the attribute was not > specified. > onnx: fix checks for axes and steps inputs of Slice operator > > The ONNX Spec for the Slice operator allows the `starts` and `ends` > inputs to have fewer indices that the dimensions of the `data` tensor > (see https://onnx.ai/onnx/operators/onnx__Slice.html), but our code > expects these inputs to be as many as the `data` tensor's dimensions. > > More precisely, the spec requires that the `starts` and `ends` inputs > are only as long as the `axes` input, but since the `axes` input is > optional, the default type for the `axes` input has to match the type > for the `starts` and `ends` inputs. Moreover, the number of indices in > the `steps` input also has to match those in the `axes` inputs (instad > of matching the dimensions of the `data` input). > > This patch fixes the checks in the TorchOnnxToTorch conversion so that > they match the ONNX spec.	2024-02-07 21:19:27 -08:00
Vivek Khandelwal	4df96616db	[MLIR][TORCH] Modify Onnx.Reshape lowering for static shape cases (#2852 ) This commit modifies the OnnxToTorch lowering of Onnx.Reshape op by creating the result shape list for the aten.reshape using the result shape values inferred from the op's result shape. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-02-07 17:44:07 -08:00
Rob Suderman	a8aad2a5ab	[torch] Add `torch.aten.where.*` folders (#2886 ) Where operation can be statically computed when involving splats of known value. Added handling these cases with multiple tests.	2024-02-07 19:43:31 -05:00
Dave Liddell	23647ab2d1	[torhc] aten.index_select folder (#2871 ) Folds aten::index_select ops under the following conditions: 1. If the input and output are the same shape, the indexing operation is a NOP, so just return the input. 2. If the input has shape <1x1x...xNx...x1> (all 1's except for one dim), and the output shape is <1x1x...x1> (all 1's), then there is a single index, so extract the single element value and return a tensor with that value. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-02-07 16:17:15 -08:00
mmakevic	32dbf99ce2	Implement lowering of torch.aten.all.dim (#2873 ) Lowering of torch.aten.all.dim to linalg. Per PyTorch documentation: > This function matches the behaviour of NumPy in returning output of dtype bool for all supported dtypes except uint8. For uint8 the dtype of output is uint8 itself. Since there is no support for ui8 in torch-mlir currently (https://github.com/llvm/torch-mlir/pull/1384#issuecomment-1260011334) implementation returns failure for that case.	2024-02-07 12:34:52 -08:00
Xida Ren (Cedar)	fc04bc7ee9	[torch] AtenSliceOp folder that produces splat results (#2869 ) Includes `slice` folder and lit tests --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-07 19:00:46 +00:00
James Newling	723b8b1d28	Fix dev docs error/typo (#2880 ) Just a one line change in a .md file	2024-02-07 03:55:38 -08:00
saienduri	bfcf93ea21	Rename torch_mlir.compile APIs and introduce FX based analogs (#2842 ) Link to related RFC: https://discourse.llvm.org/t/rfc-rename-torch-mlir-compile-apis-and-introduce-fx-based-analogs/76646 This commit updates the documentation, tests, CMake files, and API for the proposed changes in the RFC. There is a new torch_mlir/fx.py for user level APIs related to importing modules and a corresponding test for this path can be found at test/python/fx_importer/basic_test.py. --------- Co-authored-by: MaheshRavishankar <mravisha@amd.com>	2024-02-06 19:07:59 -08:00
Xida Ren (Cedar)	cc06391630	AtenSortOp Folder (#2864 ) A chunk off https://github.com/llvm/torch-mlir/pull/2856 https://github.com/llvm/torch-mlir/pull/2860 --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com> Co-authored-by: Rob Suderman <rob.suderman@gmail.com>	2024-02-06 21:12:12 +00:00
Daniel Garvey	faf7d4aaa5	[fx_importer] Add support for 0D tensors (#2870 ) Adds an escape hatch from creating a DenseResourceElementsAttr for single value tensors into DenseElementsAttr. For 0d or 1element, splats are better as DenseElementsAttr. Don't use DenseResourceElementsAttr for it	2024-02-06 00:19:31 -06:00
Dave Liddell	1cb14f6879	Rob's atenTensor folder (#2867 ) If a tensor is initialized by a list with a single constant integer, this folder turns it into a torch.vtensor.literal --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-02-05 17:10:42 -08:00
Rob Suderman	041a54ae0c	[torch] Supporting `torch.aten.mul.float` lowering to `arith` (#2833 ) Simple missing scalar operation for multiply floats was missing.	2024-02-05 16:23:04 -08:00
Rob Suderman	e3faef5224	[onnx] Convert `onnx.QLinearConv` to `torch` (#2851 ) Leaning on the QDQ functionality in torch we can support the QLinearConv operation by piggybacking through `torch.Convolution`. This includes some changes such as allowing the `onnx` rewriter to run recursively. Doing so allows `QLinearConv` to decopmose to `onnx.Convolution` which is then lowered to `torch`.	2024-02-05 16:09:41 -08:00
Rob Suderman	cb52c4b3cc	[onnx] Fix `onnx-to-torch` lowering for flatten shape (#2834 ) The existing `flatten` lowering did not define what the intermediate shape was. This could result in failures to lower further to linalg as the intermediate shape was unknown. Added a shape refinement section.	2024-02-05 14:23:46 -08:00
Xida Ren (Cedar)	b3a56c0711	Update add_ops to mention llvm-project/mlir/utils/generate-test-checks.py (#2862 )	2024-02-05 12:13:43 -08:00
Gaurav Shukla	f4562a8eaa	[ONNX] Fix the lowering of onnx.expand op (#2861 ) Signed-off-by: Gaurav Shukla <gauravshukla789@gmail.com>	2024-02-05 23:46:58 +05:30
Aart Bik	d1cd117998	[torch-mlir] remove trailing whitespace from md documentation (#2853 )	2024-02-02 11:02:53 -08:00
Xida Ren (Cedar)	24b8c8672a	[torch] Add folders for `torch.fill`, `torch.ones`, `torch.zeros` and `aten.getItem` (#2849 ) So that the CumSum Op in OPT can get the constant that it requires to be lowered to TMTensor --------- Co-authored-by: Rob Suderman <rob.suderman@gmail.com> Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>	2024-02-02 10:46:33 -08:00
Ben Vanik	962d514308	Fixing implicit double->float conversion warning. (#2850 ) `[build] D:\Dev\iree\third_party\torch-mlir\lib\Conversion\TorchOnnxToTorch\DefaultDomainGtoP.cpp(734): warning C4305: 'argument': truncation from 'double' to 'float'`	2024-02-01 22:02:44 -08:00
Rob Suderman	29baa813bd	[onnx] Fix `pool` lowering for non-symmetric padding (#2837 ) `torch` requires that padding be symmetric for pooling operations. To support non-symmetric pad we need to separately materialize out the padding operation. --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-02-01 14:35:21 -08:00
Sambhav Jain	c7d7d7f004	[Bazel] Add TorchToTensor dep to TorchMLIRTorchConversionPasses (#2847 ) Fixes bazel build error: ``` ERROR: /root/.cache/bazel/_bazel_root/b89349c08f7224396763d14fe35cba11/external/torch-mlir/BUILD.bazel:547:11: Compiling lib/Dialect/TorchConversion/Transforms/Passes.cpp failed: (Exit 1): clang failed: error executing command /usr/lib/llvm-16/bin/clang -U_FORTIFY_SOURCE -fstack-protector -Wall -Wthread-safety -Wself-assign -Wunused-but-set-parameter -Wno-free-nonheap-object -fcolor-diagnostics -fno-omit-frame-pointer ... (remaining 224 arguments skipped) Use --sandbox_debug to see verbose messages from the sandbox and retain the sandbox build root for debugging external/torch-mlir/lib/Dialect/TorchConversion/Transforms/Passes.cpp:23:10: fatal error: 'torch-mlir/Conversion/TorchToTensor/TorchToTensor.h' file not found #include "torch-mlir/Conversion/TorchToTensor/TorchToTensor.h" ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1 error generated. Target @torch-mlir//:torch-mlir-opt failed to build ``` Bazel CI: https://github.com/sjain-stanford/torch-mlir/actions/runs/7735724133/job/21091865352	2024-01-31 22:07:06 -08:00
Dave Liddell	04be6ba773	Make the onnx importer more robust for internal/external and large models (#2794 ) Fix for https://github.com/llvm/torch-mlir/issues/2765 The onnx docs say that you can't do shape inference using the in-memory API for models > 2 GB. This fix replaces that API with the file-based API. Since the new API generates an intermediate file, also added a --keep switch to keep that file, which I delete by default. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-01-31 21:58:43 -08:00
Rob Suderman	34f6948533	[torch] Support `!countIncludePad` when unpadded for average pool (#2836 ) We do not support average pool when `countIncludePad is set to false. However if the input is unpadded then the setting of the boolean is unneeded. Extended use by checking if padding is zero before rejecting the lowering.	2024-01-31 15:09:36 -08:00
Rob Suderman	0114a570e3	[torch] Support lowering `torch.item` to `tensor.extract` (#2835 ) Extracting scalar values from tensors can be implemented via a lowering to tensor.extract.	2024-01-31 15:09:12 -08:00
Sambhav Jain	8a17c98b74	Bump stablehlo to openxla/stablehlo@fd52182f76 (#2821 ) With the recent LLVM integrate and changes from https://github.com/llvm/llvm-project/pull/78260, we hit this build error in Stablehlo (which is quite old). ``` external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1020:14: error: no member named 'startRootUpdate' in 'mlir::PatternRewriter' rewriter.startRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1026:16: error: no member named 'finalizeRootUpdate' in 'mlir::PatternRewriter' rewriter.finalizeRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1029:16: error: no member named 'cancelRootUpdate' in 'mlir::PatternRewriter' rewriter.cancelRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1108:14: error: no member named 'updateRootInPlace' in 'mlir::PatternRewriter' rewriter.updateRootInPlace(op->getParentOp(), [&]() { return; }); ~~~~~~~~ ^ 4 errors generated. Target @torch-mlir//:torch-mlir-opt failed to build ``` I'm still puzzled as to how this didn't fail with the CMake merge gating CI (do we not test Stablehlo builds/tests?). In any case, bumping our submodule to https://github.com/openxla/stablehlo/pull/1918 fixes it. It exposes a new failing lit test in TorchToStablehlo though, that I have looped stablehlo developers into ([here](https://discord.com/channels/999073994483433573/999074539138990131/1201235845391331419)). ``` bazel run @torch-mlir//test/Conversion:TorchToStablehlo/scatter.mlir.test ...external/torch-mlir/test/Conversion/TorchToStablehlo/scatter.mlir within split at <stdin>:1 offset :33:8: error: unexpected error: Expects non-empty reduction block for type inference %0 = torch.aten.scatter.src %arg0, %int0, %arg1, %arg2 : !torch.vtensor<[?,?],si64>, !torch.int, !torch.vtensor<[?,?],si64>, !torch.vtensor<[?,?],si64> -> !torch.vtensor<[?,?],si64> ^ LLVM ERROR: Failed to infer result type(s). ``` Bazel CI: https://github.com/sjain-stanford/torch-mlir/actions/runs/7732673480/job/21083102228	2024-01-31 14:21:17 -08:00
Rob Suderman	54e258792c	[onnx] Import `onnx` constants as `onnx.Constant` instead of literals (#2831 ) To handle the conversion from raw bytes to `DenseElementsAttr` we need to handle the endianness conversion during `torch-onnx-to-torch`. Therefore when importing `onnx.Constant` it is better to represent using the `onnx` constant operation so that only one location requires the endianness correction.	2024-01-31 11:41:06 -08:00
Rob Suderman	3500523f75	[onnx] Convert resources to denseattr for `onnx.constant` to `torch` (#2830 ) `onnx` explicitly specifies that `raw_data` is stored in `little-endian` layout. While converting to `torch` we need to convert from a known endian format to an internal format of consistent layout. This means endianness must be correct during the import of `onnx.Constant`. --------- Co-authored-by: Xida Ren (Cedar) <cedar.ren@gmail.com>	2024-01-31 11:40:53 -08:00
Ilija Kalinić	54ef18c556	Implement lowering of torch.aten.lerp.Scalar (#2773 ) Closes nod-ai/SHARK-Turbine#356	2024-01-31 09:39:38 -08:00
Stella Laurenzo	7301aa80fd	Enable -Werror in lib/ and LTC. (#2841 ) Required some massaging of LTC to make it warning clean, and I had to manually disable some warnings on the generated source files (which we don't control). The project is warning clean now. The `-Werror` flag is disabled by default as we can't control everywhere people will try to build/install. The CI enables it via -DTORCH_MLIR_ENABLE_WERROR_FLAG=ON.	2024-01-30 23:33:21 -08:00
Stella Laurenzo	943164d797	Fix some spurious `None` values in tests (broken at head). (#2840 )	2024-01-30 22:39:22 -08:00
Stella Laurenzo	26c0ecd09c	[nfc] Remove unused var causing error downstream	2024-01-30 22:18:13 -08:00
Aart Bik	105aad6f57	[torch-mlir] provide FX traced graph importer for sparse tensors (#2817 ) Note that we are waiting for actual FX traced graph support for sparse tensors. For details see https://github.com/pytorch/pytorch/issues/117188 Until then, however, we provide this clever importer that builds the FX traced graph for for the dense case and then puts a sparse annotation back on the parameters. With import test.	2024-01-30 21:22:12 -08:00
Ramiro Leal-Cavazos	1a7442e0aa	Add clang-format check to CI (#2816 ) This PR adds a check to the CI right after checking out the Torch-MLIR repository to make sure that the changes in the PR don't require any `git clang-format` modifications.	2024-01-30 19:59:46 -08:00
Yuanqiang Liu	d778950f45	[Torch Dialect] add fold pattern for aten.clone (#2804 )	2024-01-31 09:43:21 +08:00
Rob Suderman	25a5a22cbd	[torch] Support `torch.convolution` quantized lowering to `linalg` (#2811 ) Linalg has quantized specific operations. We can lower to these operations when there is a known zeropoint and scale operations. This allows the `convolution` to occur with lower bitwidth's, improving the overall performance.	2024-01-30 13:46:47 -08:00
Aaron St George	4c557847bd	Don't fold `aten.detach` if result isn't same type as input. (#2824 ) We were seeing some assertion failures after some checks around folders were tightened up in LLVM: https://github.com/llvm/llvm-project/pull/75887 . This PR essentially moves the logic that used to be applied at the LLVM level into the folder, which seems to be the suggested fix. I'm not sure if the IR that caused issues for us _should_ be valid? ``` %1 = torch.aten.detach %arg0 : !torch.tensor<[1],f32> -> !torch.tensor ``` A better fix might be to create a verifier ensuring the result of `aten.detach` has the same type as its operand. --------- Co-authored-by: aaron-stgeorge <aaron.stgeorge@getcruise.com>	2024-01-30 09:45:51 -08:00
Rob Suderman	db67bc555a	Bump LLVM to llvm/llvm-project@70eb0e3 (#2827 )	2024-01-30 09:01:42 -08:00

1 2 3 4 5 ...

2529 Commits (3e836d8dad551b6e5302de1b84840b90ee039c83) All Branches Search

2529 Commits (3e836d8dad551b6e5302de1b84840b90ee039c83)

All Branches