torch-mlir

Commit Graph

Author	SHA1	Message	Date
Rob Suderman	29baa813bd	[onnx] Fix `pool` lowering for non-symmetric padding (#2837 ) `torch` requires that padding be symmetric for pooling operations. To support non-symmetric pad we need to separately materialize out the padding operation. --------- Co-authored-by: James Newling <james.newling@gmail.com>	2024-02-01 14:35:21 -08:00
Dave Liddell	04be6ba773	Make the onnx importer more robust for internal/external and large models (#2794 ) Fix for https://github.com/llvm/torch-mlir/issues/2765 The onnx docs say that you can't do shape inference using the in-memory API for models > 2 GB. This fix replaces that API with the file-based API. Since the new API generates an intermediate file, also added a --keep switch to keep that file, which I delete by default. --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-01-31 21:58:43 -08:00
Sambhav Jain	8a17c98b74	Bump stablehlo to openxla/stablehlo@fd52182f76 (#2821 ) With the recent LLVM integrate and changes from https://github.com/llvm/llvm-project/pull/78260, we hit this build error in Stablehlo (which is quite old). ``` external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1020:14: error: no member named 'startRootUpdate' in 'mlir::PatternRewriter' rewriter.startRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1026:16: error: no member named 'finalizeRootUpdate' in 'mlir::PatternRewriter' rewriter.finalizeRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1029:16: error: no member named 'cancelRootUpdate' in 'mlir::PatternRewriter' rewriter.cancelRootUpdate(op); ~~~~~~~~ ^ external/stablehlo/stablehlo/transforms/StablehloRefineShapes.cpp:1108:14: error: no member named 'updateRootInPlace' in 'mlir::PatternRewriter' rewriter.updateRootInPlace(op->getParentOp(), [&]() { return; }); ~~~~~~~~ ^ 4 errors generated. Target @torch-mlir//:torch-mlir-opt failed to build ``` I'm still puzzled as to how this didn't fail with the CMake merge gating CI (do we not test Stablehlo builds/tests?). In any case, bumping our submodule to https://github.com/openxla/stablehlo/pull/1918 fixes it. It exposes a new failing lit test in TorchToStablehlo though, that I have looped stablehlo developers into ([here](https://discord.com/channels/999073994483433573/999074539138990131/1201235845391331419)). ``` bazel run @torch-mlir//test/Conversion:TorchToStablehlo/scatter.mlir.test ...external/torch-mlir/test/Conversion/TorchToStablehlo/scatter.mlir within split at <stdin>:1 offset :33:8: error: unexpected error: Expects non-empty reduction block for type inference %0 = torch.aten.scatter.src %arg0, %int0, %arg1, %arg2 : !torch.vtensor<[?,?],si64>, !torch.int, !torch.vtensor<[?,?],si64>, !torch.vtensor<[?,?],si64> -> !torch.vtensor<[?,?],si64> ^ LLVM ERROR: Failed to infer result type(s). ``` Bazel CI: https://github.com/sjain-stanford/torch-mlir/actions/runs/7732673480/job/21083102228	2024-01-31 14:21:17 -08:00
Rob Suderman	3500523f75	[onnx] Convert resources to denseattr for `onnx.constant` to `torch` (#2830 ) `onnx` explicitly specifies that `raw_data` is stored in `little-endian` layout. While converting to `torch` we need to convert from a known endian format to an internal format of consistent layout. This means endianness must be correct during the import of `onnx.Constant`. --------- Co-authored-by: Xida Ren (Cedar) <cedar.ren@gmail.com>	2024-01-31 11:40:53 -08:00
Stella Laurenzo	943164d797	Fix some spurious `None` values in tests (broken at head). (#2840 )	2024-01-30 22:39:22 -08:00
Aart Bik	105aad6f57	[torch-mlir] provide FX traced graph importer for sparse tensors (#2817 ) Note that we are waiting for actual FX traced graph support for sparse tensors. For details see https://github.com/pytorch/pytorch/issues/117188 Until then, however, we provide this clever importer that builds the FX traced graph for for the dense case and then puts a sparse annotation back on the parameters. With import test.	2024-01-30 21:22:12 -08:00
Yuanqiang Liu	d778950f45	[Torch Dialect] add fold pattern for aten.clone (#2804 )	2024-01-31 09:43:21 +08:00
Rob Suderman	25a5a22cbd	[torch] Support `torch.convolution` quantized lowering to `linalg` (#2811 ) Linalg has quantized specific operations. We can lower to these operations when there is a known zeropoint and scale operations. This allows the `convolution` to occur with lower bitwidth's, improving the overall performance.	2024-01-30 13:46:47 -08:00
Aaron St George	4c557847bd	Don't fold `aten.detach` if result isn't same type as input. (#2824 ) We were seeing some assertion failures after some checks around folders were tightened up in LLVM: https://github.com/llvm/llvm-project/pull/75887 . This PR essentially moves the logic that used to be applied at the LLVM level into the folder, which seems to be the suggested fix. I'm not sure if the IR that caused issues for us _should_ be valid? ``` %1 = torch.aten.detach %arg0 : !torch.tensor<[1],f32> -> !torch.tensor ``` A better fix might be to create a verifier ensuring the result of `aten.detach` has the same type as its operand. --------- Co-authored-by: aaron-stgeorge <aaron.stgeorge@getcruise.com>	2024-01-30 09:45:51 -08:00
aldesilv	eff325abc3	OnnxToTorch ReduceMax lowering (#2768 ) Fixes https://github.com/nod-ai/SHARK-Turbine/issues/352	2024-01-30 11:44:48 +05:30
Rob Suderman	d3fd754b93	[onnx] `onnx.MatMulInteger` lowering to `torch.mm` and `quint*` types (#2761 ) Torch does not have an equivalent matmul operation for integers. Instead it sidechannels the information via its quantized types. For this lowering we setup these sidechannels then invoke `torch.mm`.	2024-01-29 09:40:21 -08:00
Aart Bik	46a25d7241	[torch-mlir][sparse] preserve sparsity during lowering torch to linalg (#2809 ) This preserves sparsity at the most obvious places of lowering TORCH tensors to MLIR RankedTensorType tensors. Other places are marked for audit. With some initial lowering tests.	2024-01-26 10:54:59 -08:00
Vivek Khandelwal	da7c6d2c16	[MLIR][TORCH] Add support for dynamic shape for Onnx.Transpose op (#2803 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-26 09:46:54 -08:00
Phaneesh Barwaria	4964977e85	[ONNX][MLIR] support constantOfShape op (#2747 )	2024-01-26 09:36:39 -08:00
Aart Bik	0aed231e21	[torch-mlir][conversion-test] cleanup trailing whitespace in mlir files (#2807 )	2024-01-25 14:24:28 -08:00
Aart Bik	fe836ceebf	[torch-mlir][test] cleanup trailing whitespace in mlir files (#2806 )	2024-01-25 14:24:13 -08:00
Aart Bik	e824fbc65c	[torch-mlir][torch] add encoding field to torch type (#2799 ) This adds an encoding field to the torch type, using the interfaces for printing, parsing, and verification. Note that although this change prepares adding sparsity to the torch type (as illustrated by the round trip and invalid tests), nothing in this change depends on the actual contents of the encoding field!	2024-01-25 10:04:04 -08:00
Rob Suderman	f6f890520b	[torch][quant] Quantized `torch.mm` for linalg with end-to-end test (#2750 ) This includes custom op matching for decomposed operations and fusing dequantization into dense operations. As a validation we compare to the dequant+mm torch implementation.	2024-01-24 14:02:50 -08:00
Rob Suderman	60bf6c25af	[onnx] Lower `onnx.QLinearMatMul` lowering to `torch` operators (#2776 ) We can plumb the linear matmul into pytorch using its quantized types with side channel information. To handle the final int8 operation we dequantize and requantize.	2024-01-24 12:28:48 -08:00
Vivek Khandelwal	894805dd5e	[MLIR][TORCH] Support for `onnx.LayerNormalization` (#2789 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-24 11:08:20 -08:00
Gaurav Shukla	12f123eff8	[ONNX][MLIR] Add support for pad op in the onnx pipeline (#2738 ) This commit adds mapping from `onnx.pad` op to `torch.pad` op. Currently it does not support `axes` parameter of `onnx.pad` op. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-01-25 00:33:37 +05:30
Phaneesh Barwaria	ac8975ea12	[MLIR] [ONNX] lowering for onnx tile op and sign op (#2725 )	2024-01-24 22:56:21 +05:30
Chi_Liu	77ae56337d	[ONNX][MLIR] Add support for onnx.Exp op (#2792 ) https://github.com/nod-ai/SHARK-Turbine/issues/312	2024-01-23 13:45:00 -08:00
James Newling	dc056e58e6	[MLIR][TORCH] Add onnx.cast cases used by OPT-1.25M (#2787 )	2024-01-23 21:06:25 +05:30
Gaurav Shukla	b7a0329676	[ONNX][MLIR] Fix padding size constraint for onnx.maxpool op (#2782 ) Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2024-01-23 19:23:01 +05:30
Chi_Liu	cad98e8113	[ONNX][TORCH-MLIR] Add TopK support (#2774 ) https://github.com/nod-ai/SHARK-Turbine/issues/331	2024-01-22 12:56:39 -08:00
Srinath Avadhanula	73b30604da	Do not try to legalize transposed convolution (#2721 ) Currently transposed convolution is not handled correctly by `TorchToTosa`. This PR allows transposed convolutions to pass through the conversion so that they can be handled by other conversion passes later in a pipeline. An example input which produces a compilation error is: ``` func.func @forward(%input: !torch.vtensor<[1,64,1,100],f32>) -> !torch.vtensor<[1,64,2,200],f32> { %true = torch.constant.bool true %int1 = torch.constant.int 1 %int2 = torch.constant.int 2 %weight = torch.vtensor.literal(dense<0.0> : tensor<64x64x3x3xf32>) : !torch.vtensor<[64,64,3,3],f32> %bias = torch.vtensor.literal(dense<0.0> : tensor<64xf32>) : !torch.vtensor<[64],f32> %stride = torch.prim.ListConstruct %int2, %int2 : (!torch.int, !torch.int) -> !torch.list<int> %int1x1 = torch.prim.ListConstruct %int1, %int1 : (!torch.int, !torch.int) -> !torch.list<int> %output = torch.aten.convolution %input, %weight, %bias, %stride, %int1x1, %int1x1, %true, %int1x1, %int1 : !torch.vtensor<[1,64,1,100],f32>, !torch.vtensor<[64,64,3,3],f32>, !torch.vtensor<[64],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int -> !torch.vtensor<[1,64,2,200],f32> return %output : !torch.vtensor<[1,64,2,200],f32> } ``` This MLIR produces an error about a cast operation with a size mismatch when passed through `torch-to-tosa`: ``` error: 'tensor.cast' op operand type 'tensor<1x64x1x50xf32>' and result type 'tensor<1x64x2x200xf32>' are cast incompatible ``` --------- Co-authored-by: Srinath Avadhanula <srinath.avadhanula@getcruise.com>	2024-01-22 10:57:56 -08:00
Dave Liddell	2f4924015d	[onnx] Added flatten (#2760 ) [https://github.com/nod-ai/SHARK-Turbine/issues/328](url) --------- Co-authored-by: Dave Liddell <dliddell@xilinx.com>	2024-01-19 16:18:16 -08:00
Gaurav Shukla	3b85c70748	[ONNX][MLIR] Add support for onnx.gather op (#2726 ) This commit adds support for gather op in the onnx pipeline. https://github.com/nod-ai/SHARK-Turbine/issues/242 Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-01-19 21:58:29 +05:30
John Wu	704cfdaf08	Add aten.pool_max3d support to torch-to-linalg (#2735 ) Added verification logic to the abstract_interpreter_lib_gen.py Also made some unit tests Initially, I thought we can use `linalg::pooling_ndhwc_max` to help implement this problem. However, on a 5-dimensional matrix it does the pooling on dimensions (2, 3, 4) which is not what we want. We want pooling on dimensions (3, 4, 5). To achieve this, we would need to lower our code using the `linalg` dialect. Turns out the pooling code in `linalg` looks like this. ``` func @max_pooling_ncdhw(%I: memref<?x?x?x?x?xf32>, %K: memref<3xindex>, %O: memref<?x?x?x?x?xf32>, %strides: memref<3xindex>, %dilations: memref<3xindex>) { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %N = memref.dim %I, %c0 : memref<?x?x?x?x?xf32> %C = memref.dim %I, %c1 : memref<?x?x?x?x?xf32> %D = memref.dim %I, 2 : memref<?x?x?x?x?xf32> %H = memref.dim %I, 3 : memref<?x?x?x?x?xf32> %W = memref.dim %I, 4 : memref<?x?x?x?x?xf32> %kernel_d = memref.load %K[%c0] : memref<3xindex> %kernel_h = memref.load %K[%c1] : memref<3xindex> %kernel_w = memref.load %K[2] : memref<3xindex> %stride_d = memref.load %strides[%c0] : memref<3xindex> %stride_h = memref.load %strides[%c1] : memref<3xindex> %stride_w = memref.load %strides[2] : memref<3xindex> %dilation_d = memref.load %dilations[%c0] : memref<3xindex> %dilation_h = memref.load %dilations[%c1] : memref<3xindex> %dilation_w = memref.load %dilations[2] : memref<3xindex> linalg.generic { indexing_maps = [ affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d * %stride_d + kd * %dilation_d, h * %stride_h + kh * %dilation_h, w * %stride_w + kw * %dilation_w)>, // Map for input tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (kd, kh, kw)>, // Map for kernel tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d, h, w)> // Map for output tensor ], iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"], doc = "3D Max Pooling NCDHW with Strides, Dilations, and Kernel Size" } ins(%I, %K : memref<?x?x?x?x?xf32>, memref<3xindex>) outs(%O : memref<?x?x?x?x?xf32>) { ^bb0(%input_elem: f32, %kernel_elem: index, %output_elem: f32): %max_val = arith.maxf %input_elem, %output_elem : f32 linalg.yield %max_val : f32 } return } ``` This was implemented based on it's source code with the adjustments mentioned above: `4ca1b5e094/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml (L5647)` Issues related to this can be found here https://github.com/nod-ai/SHARK-Turbine/issues/324	2024-01-19 21:09:46 +05:30
Andreas Falkenberg	4de4d38b87	Initial commit of NonZero op (#2766 )	2024-01-18 15:23:13 -10:00
Rob Suderman	b5387c0f29	[onnx] Lowering `onnx.dequantize_linear` to `torch` (#2759 ) We can make the per-tensor version of the operation to the dequantize operation via marking with the make quantized tensor component. This introductions the `qint` and `quint` tensor type that can be lowered to teh appropriate dequantization behavior during the torch-to-linalg conversion.	2024-01-18 16:47:21 -08:00
Rob Suderman	bd11877f6f	[onnx] Support lowering quantize linear to `torch` (#2751 ) We can map the per_tensor case to the `torch.aten.quantize_per_linear` operation. In this case we extract the `scale` and `zeropoint` values and directly invoke the quantization, then return the integer representation value.	2024-01-18 16:33:10 -08:00
Ze Zhang	77a03f2069	torch-to-tosa lowering support for AtenLinalgVectorNormOp (#2734 ) This PR add torch-to-tosa lowering support for AtenLinalgVectorNormOp e2e test: python -m e2e_testing.main --config=tosa LIT tests: cmake --build build --target tools/torch-mlir/all --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-01-18 12:32:23 -08:00
Phaneesh Barwaria	eed144bfbc	[ONNX][MLIR] add Identity op support (#2754 )	2024-01-16 19:06:54 +05:30
kumardeepakamd	87389f0762	[ONNXToTorch] Add conversion for Onnx range (#2752 ) Implemented ONNX.Range. The spec says the data type for start, limit, delta are 0-D can be double, float, int16, int32, int64, All int types mapped to !torch.int and all float types mapped to !torch.float --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-15 14:26:46 -05:00
Rob Suderman	197b3b475c	[onnx] Convert `onnx.constant` to `torch` literal tensor (#2748 ) Handles the multiple cases of `onnx` constant values and converts them to `torch` literal tensors. This can include splats with a single integer or floating point value, a set of explicit integer values, or an elements array attr of values.	2024-01-15 09:31:22 -08:00
Han-Chung Wang	10acea71be	Bump LLVM to llvm/llvm-project@0cb024b (#2753 ) - Add fixes for `af78e5daf0` - Add fixes for `bb6d5c2200`	2024-01-15 07:12:12 -08:00
Chi_Liu	c7452af4fa	[MLIR][ONNX] Add OnnxToTorch support for Maxpool Op (#2695 ) Add Maxpool ONNX op support. Add Utils.h/cpp files to create a constant int list for ONNX.	2024-01-12 14:54:38 -08:00
Ze Zhang	670a99ae19	Handle torch.none type in tosa.clamp op (#2739 ) This PR updates the torch-to-tosa conversion with following changes: - Support torch.none as min/max input argument for tosa.clamp op - Support negative value as start index for tosa.slice op - Add tosa.logical_or lowering support e2e test: python -m e2e_testing.main --config=tosa LIT tests: cmake --build build --target tools/torch-mlir/all --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-01-11 10:36:48 -08:00
Andreas Falkenberg	5862854bc8	[ONNX][TORCH-MLIR] LayerNorm (#2716 ) Layer Normalization using the torch.aten.native_layer_norm https://github.com/nod-ai/SHARK-Turbine/issues/325	2024-01-11 14:27:04 +05:30
kumardeepakamd	29569713f3	support for onnx.expand operator (#2729 ) maps onnx.expand to torch aten broadcast_to, three tests added --------- Co-authored-by: Kumar Deepak <kumar@xilinx.com>	2024-01-10 13:05:37 -08:00
Vivek Khandelwal	208ae35583	[MLIR][ONNX] Add TorchToOnnx Support for DepthToSpace op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 17:50:47 +05:30
Vivek Khandelwal	4707d3bdc6	[MLIR][ONNX] Add OnnxToTorch support for Bernoulli and CastLike op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 16:24:06 +05:30
Vivek Khandelwal	35e8f86792	[MLIR][ONNX] Add OnnxToTorch support for Dropout and Elu op Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 16:23:55 +05:30
John Wu	4e5e34d215	[MLIR][ONNX] Add OnnxToTorch support for Slice Op (#2696 )	2024-01-03 19:41:10 -08:00
Xida Ren (Cedar)	1778314620	add basic cumsum. this doesn't support the exclusive and reverse attrs (#2717 ) fixes #2711	2024-01-03 09:52:59 -08:00
Xida Ren (Cedar)	9fc212ea9a	support Onnx opset 1-13 ReduceMean where axes is supplied as an attr (#2703 ) (instead of an input) Addresses part of #2689. fixes #2702	2023-12-28 09:31:41 -08:00
Xida Ren (Cedar)	d560698e3d	Lower `onnx.split` to `torch.aten` (#2686 )	2023-12-27 17:53:07 -08:00
aldesilv	2d796b7502	lower onnx max op to torch aten maximum op (#2618 ) lower onnx min op to torch aten minimum op	2023-12-27 11:07:35 -08:00

1 2 3 4 5 ...

628 Commits (962d5143085b2bea7db0c0e9bdc26bf5ea8db2b5)