torch-mlir

Commit Graph

Author	SHA1	Message	Date
Giacomo Serafini	ac4cb971e7	[Torch Dialect] Add `torch.aten.mul.int_float` (required to simplify shape calculation of `upsample_nearest2d`) (#3764 ) As per title. See also [PR](https://github.com/llvm/torch-mlir/pull/3750) for `torch.aten.mul.float_int`. --------- Co-authored-by: zjgarvey <47986913+zjgarvey@users.noreply.github.com>	2024-11-21 14:24:33 +08:00
Ze Zhang	abb9282524	Add canonicalize pattern for aten.mul.int and aten.floordiv.int (#3680 ) This PR add `floordiv` to the `PY_BUILTIN_TO_TORCH_OP`. For `aten.mul.int` and `aten.floordiv.int` ops, we add new Canonicalization Patterns as follow: ``` %1 = torch.aten.mul.int %input, %const-5 %2 = torch.aten.mul.int %1, %const-6 ``` Will be replaced by `torch.aten.mul.int %input, %const-30` And ``` %1 = torch.aten.mul.int %input, %const-5 %2 = torch.aten.floordiv.int %1, %const-5 ``` Will directly return `%input` This PR also relaxes the `float` type constraint in TorchToTosa for the `AtenRsubScalarOp` conversion. To test: `cmake --build build --target check-torch-mlir-all`	2024-11-21 12:58:10 +08:00
yyp0	367d13203e	[Torch] add fold logic for some ops (#3794 )	2024-10-16 16:43:31 +08:00
Vivek Khandelwal	4a0bed0ce0	[ONNX] Add training mode support for BatchNormalization op (#3597 ) This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-14 10:46:38 +05:30
Rob Suderman	af67f9efb0	[onnx] Support integer types for `onnx.Pow` (#3626 ) Pow is not support for the `torch` operator. Add casting for integer types.	2024-08-13 09:39:04 -07:00
Rob Suderman	39307f0462	[onnx] Fix `onnx.Gather` for bad expansion (#3625 ) A case where unsqueeze was require was missed causing compilation failures.	2024-08-13 09:38:55 -07:00
aldesilv	a4ba02eef5	[ONNX] add support for tfidfvectorizer (#3553 ) 1-d/2-d input and output implemented based on the description and example test cases in https://github.com/onnx/onnx/blob/main/docs/Operators.md#TfIdfVectorizer and some notes from https://github.com/onnx/onnx/blob/main/onnx/reference/ops/op_tfidf_vectorizer.py#L128 --------- Co-authored-by: zjgarvey <zjgarvey@gmail.com>	2024-08-12 18:10:11 -05:00
Rob Suderman	d3695a97a0	[onnx] Fix `onnx.Hardmax` lowering to torch (#3624 ) The lowering to torch makes assumption about the dimensions / types of reduce max and onehot. We need to correct for expected torch behavior.	2024-08-12 11:19:02 -07:00
Phaneesh Barwaria	026dfade64	onnx.MelWeightMatrix TorchOnnxToTorch (#3503 ) Just uploading what I have till now [Gist](https://gist.github.com/PhaneeshB/761f75f5522d9f4a40ef949a328e93fe) of pytorch impl that I'm following to implement the OnnxToTorch lowering Additional Details - (also pasted as comment in gist) [Op Description](https://github.com/onnx/onnx/blob/main/docs/Operators.md#melweightmatrix) in Onnx Documentation [Example](https://github.com/onnx/onnx/blob/main/docs/Operators.md#examples-93) Used the same example in this file. the Expected output is shown in the example [Reference Onnx Impl](`4c3ed5e08b/onnx/reference/ops/op_mel_weight_matrix.py (L13)`) - This is the base for the above code.	2024-08-12 21:18:29 +05:30
Matthias Gehre	334633b738	e2e: Enable generate-runtime-verification pass (#3615 ) This adds the `generate-runtime-verification` pass into the linalg refbackend, and moves all tests that now abort at runtime into the crash set, sorted by their respective errors. I have fixed on set of errors found that way, which are mismatches between the static dimensions we cast to and the actual dynamic dimensions. This was caused by wrong annotations on the test cases, like in https://github.com/llvm/torch-mlir/pull/3615/files#diff-48bfbf41fcad5fa01b49197d251114f84a2b8de4f1d87ab938a061aedd1419b1R1931	2024-08-12 14:15:12 +02:00
Felix Schneider	0314188dbe	[torch] Basic support for per-channel quantized graphs (#3623 ) This patch adds basic support for lowering graphs with per-channel quantization. Per-channel quantized ops have to be excluded from `FuseQuantizedOps` for now but can be used in QDQ quantized form. Using this patch, we're able to import and execute (on the linalg backend) graphs with per-channel quantization applied using the "new" PyTorch 2.0 Export Quantization.	2024-08-10 15:51:09 +02:00
Rob Suderman	44266ab0c4	[onnx] Support `fp8` for `onnx.QuantizeLinear` (#3619 ) We need to directly decompose quantize linear for `fp8` types as the equivalent torch operations do not support the operation.	2024-08-09 12:32:46 -07:00
Rob Suderman	8358e8c255	[onnx] Add support for `fp8` `onnx.DequantizeLinear` (#3617 ) Fp8 needs a slightly different path for dequantization as the `torch` dequantize operation does not support `fp8` types.	2024-08-08 16:20:53 -07:00
Rob Suderman	880e64bbbb	[onnx] `onnx.Split` may not have `num_outputs` which can be inferred (#3608 ) The attribute does not exist in all variants of the operation. It can be inferred from the number of results so we should just do that.	2024-08-08 16:17:38 -07:00
Rob Suderman	fd98476f77	[torch] Unpacking sometimes misses shape inference (#3609 ) It is possible that the unpacked tensor does not match the same inferred shapes. This is pretty common when ingesting form the `onnx` frontend.	2024-08-08 16:17:31 -07:00
zjgarvey	7f2a17e757	[ONNX] fix padding for `onnx.MaxPool` (#3611 ) The saga of aligning onnx and torch padding conventions continues. ```python onnx_pads = [low_x, low_y, low_z, high_x, high_y, high_z] torch_pads = [low_z, high_z, low_y, high_y, low_x, high_x] ``` So not only is the lexicographical ordering hierarchy swapped (low/high x spatial-dim -> spatial-dim x low/high) but the ordering in the the spatial-dim specification is also reversed. This patch properly reverses the pad ordering (and actually uses the `shuffledPadding` to pad).	2024-08-07 20:34:00 -07:00
Rob Suderman	6c33ab024e	[onnx] `onnx.CenterCropPad` used an incorrect type for toScalar (#3605 ) To scalar should have a rank-0 tensor type not rank-1 with length 1. Changing allows proper compilation.	2024-08-07 20:33:33 -07:00
Rob Suderman	59a4c6fda4	[onnx] Fix transposition code for `onnx.OneHot` (#3606 ) The post onehot transposition code was unexercised. Fixed the test and transformation to check use.	2024-08-07 18:20:26 -07:00
zjgarvey	c8efc201f4	[Onnx] expand support for constant matching (#3607 ) The pattern `m_OnnxListOfConstantInts` previously only checked if the attr inside an `onnx.Constant` op is a `DenseResourceElementsAttr`, but didn't handle `ElementsAttr`'s. This patch adds support for `ElementsAttr` and provides an example of it's use via a lit test for `onnx.Unsqueeze`.	2024-08-07 19:35:34 -05:00
Marius Brehler	341f415b1e	[onnx] Fix lowering `onnx.Shrink` to Torch (#3603 ) This fixes the result type of the `torch.aten.lt.Scalar` and `torch.aten.ge.Scalar` ops created during the lowering of `onnx.Shrink` to Torch.	2024-08-07 21:25:14 +02:00
Rob Suderman	18139994e8	[onnx] Fix edge condition for `onnx.ReduceMax` (#3598 ) For length-0 on `onnx.ReduceMax` the length 0 case was incorrect due to a copy paste error.	2024-08-07 10:32:28 -07:00
Rob Suderman	b48e55c2f7	[onnx] Handle negative indices for `onnx.GatherElements` (#3599 ) Add a check for negative indices and offset appropriately for `onnx.GatherElements`.	2024-08-06 18:54:01 -07:00
Rob Suderman	b1a232222f	[onnx] Fix `onnx.Shape` to include `start` and `end` processing (#3580 ) `onnx.Shape` can select only a subset of indices using attributes. Add support for these attributes. --------- Co-authored-by: zjgarvey <47986913+zjgarvey@users.noreply.github.com>	2024-08-05 13:56:07 -07:00
Gaurav Shukla	839fe90f86	[MLIR][ONNX] Add support for onnx.scan op (#3516 ) This commit lowers onnx.scan op to torch.prim.Loop op and adds the lowering in the onnx pipeline. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-08-05 15:37:26 +05:30
zjgarvey	d0933b0eb6	[TorchToLinalg] Fix possible OOB access in Interpolate lowering (#3570 ) Following up from the discussion in <https://github.com/llvm/torch-mlir/pull/3550>, I've edited the lowering to prevent OOB extracts in a more direct fashion (i.e., just clamping directly). I don't think this affects the lit tests at all, but I've tested the changes in our external test suite at <https://github.com/nod-ai/SHARK-TestSuite/tree/main/>. I found the issue when I was unexpectedly getting `nan`'s along the output image border for a resize test there.	2024-08-02 13:55:37 -05:00
Rob Suderman	f7b5c13870	Change linalg.matmul_unsigned to linalg.matmul with unsigned type_fn (#3587 ) Change linalg.matmul_unsigned to linalg.matmul with unsigned type_fn Signed-off-by: Max Dawkins <max.dawkins@gmail.com> Co-authored-by: Max Dawkins <max.dawkins@gmail.com>	2024-08-02 11:32:24 -07:00
Rob Suderman	d273bdfabf	[onnx] Fix default `alpha` for `onnx.Elu` (#3583 ) We were defaulting to `0.0` for `onnx.Elu` when it is supposed to be `1.0`.	2024-08-02 09:29:17 -07:00
Rob Suderman	3d33c5a206	[onnx] Fix `onnx.ScatterElements` for negative indices (#3582 ) We need to adjust for negative scatter indice values. Added materializing out the inbounds adjustment.	2024-08-02 09:01:10 -07:00
yyp0	22cd4441e7	[Torch] Add support for static uneven divisible AdaptiveAvgPool2d (#3566 ) The static uneven divisible AdaptiveAvgPool2d means that although the input size is not an integer multiple of ouput size, but the kernel and stride size can also be fixed (not dynamic). The derivation logic of kernel and stride size is consistent with torch/_decomp/decomposations.py:adaptive_avg_pool2d as described in the following: 1. Stride Size Firstly , derive the start index in each reduce operation according to the output size (`n`), `start_index = ([0, 1, ..., n - 1] * input_size) // output_size`. For each index `k`, if `k * (input_size % output_size) < output_size`, then the current and previous stride keeps the same as `input_size // output_size`. So suppose `(n-1) * (input_size % output_size) < output_size`, the stride in the whole AdaptiveAvgPool2d process keeps static, as `input_size // output_size`. 2. Kernel Size torch/_decomp/decomposations.py:adaptive_avg_pool2d calculates a static kernel size when the input/output sizes satisfy either of the two conditions, `input_size % output_size == 0` or `output_size % (input_size % output_size) == 0`. Here if `input_size % output_size == 0`, then the kernel size equals `input_size // output_size`, otherwise `input_size // output_size + 1.`	2024-08-01 11:37:53 +08:00
Rob Suderman	7f475e174e	Add extf-trunc f32-f64-f32 ellision (#3579 ) Torch has all scalars represented as i64 and f64 types which results in extraneous trunc-extf commands. We can rework this by elliding widen-narrow cases away.	2024-07-31 16:50:00 -07:00
Vinayak Dev	30c4d2f2b8	[torch] Add OnnxToTorch lowering for Onnx.Unique op (#3523 ) Adds OnnxToTorch Lowering for the `Onnx.Unique` op.	2024-07-29 17:32:44 +05:30
pdhirajkumarprasad	a211ccbcff	Implementation of SplitToSequence ops lowering (#3509 ) Added support for splitToSequence ops lowering Added test case with filecheck	2024-07-29 15:44:22 +05:30
Vivek Khandelwal	b6e4725259	[ONNX] Add OnnxToTorch lowering for NonMaxSuppression op (#3501 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-26 21:01:27 +05:30
Yuanqiang Liu	003b06dfa1	[Torch] enhance naryFolderHelper to support mixed dtypes (#3559 ) * so that it could support like `i64 + f64 => f64`. * also unify `aten.log`'s folder code to use `naryFolderHelper`.	2024-07-24 17:54:59 +08:00
Yuanqiang Liu	aad1604046	[Torch] enhance fold of aten.squeeze.dim (#3558 )	2024-07-24 14:13:48 +08:00
Ze Zhang	d1e172f418	Register fake_quantize_cachemask ops and add their decompose patterns (#3556 ) Test: `cmake --build build --target check-torch-mlir-all`	2024-07-23 11:33:12 -07:00
Yuanqiang Liu	21ad890009	[Torch] enhance fold of aten.slice.Tensor (#3557 ) so that it could support folding slice with any static shape.	2024-07-23 22:53:03 +08:00
Vivek Khandelwal	22c9008bb9	build: Update Roll PyTorch version (#3548 ) This commit also updates the PyTorch and Torchvision nightly links since they are now moved to a different location. PyTorch Nightly: https://download.pytorch.org/whl/nightly/cpu/torch/ Torchvision Nightly: https://download.pytorch.org/whl/nightly/cpu/torchvision/ Disables dtype checks for some ops, tracked by https://github.com/llvm/torch-mlir/issues/3552 Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-19 21:38:57 +05:30
jinchen	f0ce1e94ce	[ONNX] Add OnnxToTorch support for SequenceMap (#3535 )	2024-07-17 14:25:09 -07:00
Arham Khan	574143448b	[E2E][ONNX] torch.multinomial (#3404 ) This PR adds a conversion in the TorchOnnxToTorch pass for the ONNX Multinomial operation. It also adds a TorchToLinalg lowering for the `aten.Multinomial` op and does a light refactor of some repeated code that generates random floating point numbers in `TorchToLinalg/Random.cpp`.	2024-07-16 23:09:39 +05:30
Matthew Francis-Landau	fe9db78120	Allow custom ops to return an array of tensors (#3531 ) This PR adds support to `fx_importer.py` for handling custom ops that return an array of tensors. As long as the length of the array is consistent across runs (determined statically), then this patch will work. This does not require that the number of tensors returned is determined by the op's definition. CC @sjain-stanford	2024-07-14 11:54:23 -07:00
Sambhav Jain	7411ff2f69	[Symbolic Shapes] Test coverage for unbacked symint from data dependent ops (#3542 ) We do have support for translating unbacked symbolic_ints that arise from data-dependent ops like `aten.nonzero`. This PR adds the python lit test coverage for the same.	2024-07-14 11:52:03 -07:00
zjgarvey	0fb8b017d8	Adds misc fixes for some padding related issues (#3528 ) This patch adds a few misc pad op related changes: 1. Addresses issue <https://github.com/llvm/torch-mlir/issues/3457> 2. Addresses issue <https://github.com/llvm/torch-mlir/issues/3442> 3. Fixes the padding order for asymmetrically padded onnx.Conv ops 4. Enables passing quantization through those onnx.Conv op pre-paddings 5. Modifies the torch-to-linalg lowering of AtenReplicationPad2d op to enable support for input rank != 4 Unfortunately, even with all of these changes, the e2e tests for the ReplicationPad2d still fail the onnx config, since the torch export procedure for rearranging the pad order is complicated enough that the padding ints end up not being able to fold back to constants.	2024-07-11 20:01:45 -05:00
zjgarvey	dcb48dd46c	[ONNX] Fix LpNormalization Lowering (#3521 ) The LpNormalization lowering was previously just computing the norm, which is incorrect. This computes the norm then divides the input tensor by it's norm. I've tested this against some simple onnx models locally. I'll look into adding a test case for this in an external test suite.	2024-07-09 15:42:26 -05:00
Gaurav Shukla	0b46d1110a	[MLIR][ONNX] Add support for onnx.ScatterND (#3479 ) This commit adds support for onnx.ScatterND op in the onnx pipeline. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-07-08 13:27:14 +05:30
Yuanqiang Liu	3225f20ab1	[Stablehlo] use index type as dim size, avoid to generate index_cast (#3526 ) For example, the original IR is: ``` module attributes {torch.debug_module_name = "Matmul3D"} { func.func @forward(%arg0: tensor<?x?x?xf32>, %arg1: tensor<?x?x?xf32>) -> tensor<?x?x?xf32> { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %c2 = arith.constant 2 : index %dim = tensor.dim %arg1, %c0 : tensor<?x?x?xf32> %0 = arith.index_cast %dim : index to i64 %dim_0 = tensor.dim %arg1, %c1 : tensor<?x?x?xf32> %1 = arith.index_cast %dim_0 : index to i64 %dim_1 = tensor.dim %arg1, %c2 : tensor<?x?x?xf32> %2 = arith.index_cast %dim_1 : index to i64 %from_elements = tensor.from_elements %0, %1, %2 : tensor<3xi64> %3 = stablehlo.dynamic_broadcast_in_dim %arg1, %from_elements, dims = [0, 1, 2] : (tensor<?x?x?xf32>, tensor<3xi64>) -> tensor<?x?x?xf32> %4 = stablehlo.dot_general %arg0, %3, batching_dims = [0] x [0], contracting_dims = [2] x [1] : (tensor<?x?x?xf32>, tensor<?x?x?xf32>) -> tensor<?x?x?xf32> return %4 : tensor<?x?x?xf32> } } ``` After using IndexType, the IR is: ``` module attributes {torch.debug_module_name = "Matmul3D"} { func.func @forward(%arg0: tensor<?x?x?xf32>, %arg1: tensor<?x?x?xf32>) -> tensor<?x?x?xf32> { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %c2 = arith.constant 2 : index %dim = tensor.dim %arg1, %c0 : tensor<?x?x?xf32> %dim_0 = tensor.dim %arg1, %c1 : tensor<?x?x?xf32> %dim_1 = tensor.dim %arg1, %c2 : tensor<?x?x?xf32> %from_elements = tensor.from_elements %dim, %dim_0, %dim_1 : tensor<3xindex> %0 = stablehlo.dynamic_broadcast_in_dim %arg1, %from_elements, dims = [0, 1, 2] : (tensor<?x?x?xf32>, tensor<3xindex>) -> tensor<?x?x?xf32> %1 = stablehlo.dot_general %arg0, %0, batching_dims = [0] x [0], contracting_dims = [2] x [1] : (tensor<?x?x?xf32>, tensor<?x?x?xf32>) -> tensor<?x?x?xf32> return %1 : tensor<?x?x?xf32> } } ``` The benefits of using IndexType on shape tensor: * simplify the IR, avoid to generate `arith.index_cast` * let backend compiler have a chance to decide the index width of shape tensor * let stablehlo backend have a chance to serialize dynamic shape IR by [shape_legalize_to_stablehlo](https://github.com/openxla/stablehlo/blob/main/stablehlo/tests/shape_legalize_to_stablehlo.mlir)	2024-07-07 18:03:03 +08:00
Ze Zhang	d466d5b809	Register fake_quantize related ops (#3522 ) Register `aten.fake_quantize_per_channel_affine` and `aten.fake_quantize_per_tensor_affine.tensor_qparams` ops --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-07-05 11:02:03 -07:00
Sagar Kulkarni	0fe74845da	[ONNX] Fix bug in ONNXToTorch PadOp's pads tensor rearrangement (#3485 ) Fix the pad tensor rearrangement such that we change the representation from [x1_begin, x2_begin, ..., x1_end, x2_end,...] to [xn_begin, xn_end, ...., x2_begin, x2_end, x1_begin, x1_end] where x1, x2 .. xn are the dimensions of the pads tensor argument. --------- Co-authored-by: zjgarvey <zjgarvey@gmail.com> Co-authored-by: zjgarvey <47986913+zjgarvey@users.noreply.github.com>	2024-07-03 15:02:49 -05:00
jinchen	3915db0a86	[ONNX] Add OnnxToTorch support for CenterCropPad (#3496 )	2024-06-28 12:47:29 -07:00
Aart Bik	6fece25ff3	[torch-mlir][sparse] add decomposition features to sparse compiler (#3505 ) Fixes https://github.com/llvm/torch-mlir/issues/3499	2024-06-28 10:18:36 -07:00

1 2 3 4 5 ...

901 Commits (byteir)