torch-mlir

Commit Graph

Author	SHA1	Message	Date
Rob Suderman	f9766c89f6	[onnx] Handle `torch.aten` for inner product case (#3634 ) The following case was failing to lower for einsum. This fixes up the inner product issue.	2024-08-24 11:41:25 -07:00
Vivek Khandelwal	fcc5f444cd	MLIR][TORCH] Fix GroupNorm decomposition by adding shape info (#3658 ) This commit adds the shape info for the tensors created during the decomposition of GroupNorm op. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-22 21:20:40 +05:30
Vivek Khandelwal	0a86deb59a	build: manually update PyTorch version (#3627 ) Set PyTorch and TorchVision version to nightly release 2024-08-18. This commit also updates the `scaled_dot_product_attention` op. A new attribute `enable_gqa` has been added. As of now, only the default value for the same is supported. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-19 12:03:56 +05:30
pkapris-syrmia	23ec5399e5	Implement lowering of aten.atleast_2d (#3546 ) This operator is needed to implement aten.vstack, which will be submitted in a subsequent PR	2024-08-14 18:52:31 +05:30
pkapris-syrmia	10fe5d08d1	Implement lowering for torch.aten.rad2deg (#3586 )	2024-08-14 16:37:28 +05:30
Rob Suderman	9ab93436c4	[torch] Support diagonal `einsum.Diagonal` (#3618 ) The einsum lowering was missing the behavior for duplicate indices in the equation. This amounts to a diagonalization along duplicate pairs of indices in the equation.	2024-08-13 09:38:43 -07:00
Yuanqiang Liu	c5b3cf299a	[Torch] emit upsample_nearest1d/2d/vec, and add shape/dtype functions (#3629 )	2024-08-13 19:14:24 +08:00
Felix Schneider	0314188dbe	[torch] Basic support for per-channel quantized graphs (#3623 ) This patch adds basic support for lowering graphs with per-channel quantization. Per-channel quantized ops have to be excluded from `FuseQuantizedOps` for now but can be used in QDQ quantized form. Using this patch, we're able to import and execute (on the linalg backend) graphs with per-channel quantization applied using the "new" PyTorch 2.0 Export Quantization.	2024-08-10 15:51:09 +02:00
Rob Suderman	fd98476f77	[torch] Unpacking sometimes misses shape inference (#3609 ) It is possible that the unpacked tensor does not match the same inferred shapes. This is pretty common when ingesting form the `onnx` frontend.	2024-08-08 16:17:31 -07:00
Rob Suderman	59a4c6fda4	[onnx] Fix transposition code for `onnx.OneHot` (#3606 ) The post onehot transposition code was unexercised. Fixed the test and transformation to check use.	2024-08-07 18:20:26 -07:00
Chi_Liu	a51b4e014a	[Torch] Disable 1-d quantized convolution (#3601 ) To fix https://github.com/nod-ai/SHARK-Turbine/issues/253#issuecomment-2271815640 Prevent fusion for 1d convolution ops and just do it as an f32 conv since there isn't a linalg named op for quantized 1-d convolution yet. Get 24 onnx eca* models passed in iree-comiple.	2024-08-07 09:01:16 -07:00
Rob Suderman	7e7af67080	Avoid warnings-as-errors build failure (#3588 ) Lambda needs a return value to avoid a build failure.	2024-08-02 12:27:31 -07:00
yyp0	22cd4441e7	[Torch] Add support for static uneven divisible AdaptiveAvgPool2d (#3566 ) The static uneven divisible AdaptiveAvgPool2d means that although the input size is not an integer multiple of ouput size, but the kernel and stride size can also be fixed (not dynamic). The derivation logic of kernel and stride size is consistent with torch/_decomp/decomposations.py:adaptive_avg_pool2d as described in the following: 1. Stride Size Firstly , derive the start index in each reduce operation according to the output size (`n`), `start_index = ([0, 1, ..., n - 1] * input_size) // output_size`. For each index `k`, if `k * (input_size % output_size) < output_size`, then the current and previous stride keeps the same as `input_size // output_size`. So suppose `(n-1) * (input_size % output_size) < output_size`, the stride in the whole AdaptiveAvgPool2d process keeps static, as `input_size // output_size`. 2. Kernel Size torch/_decomp/decomposations.py:adaptive_avg_pool2d calculates a static kernel size when the input/output sizes satisfy either of the two conditions, `input_size % output_size == 0` or `output_size % (input_size % output_size) == 0`. Here if `input_size % output_size == 0`, then the kernel size equals `input_size // output_size`, otherwise `input_size // output_size + 1.`	2024-08-01 11:37:53 +08:00
Rob Suderman	7f475e174e	Add extf-trunc f32-f64-f32 ellision (#3579 ) Torch has all scalars represented as i64 and f64 types which results in extraneous trunc-extf commands. We can rework this by elliding widen-narrow cases away.	2024-07-31 16:50:00 -07:00
yyp0	f49b9c14f1	[Torch] Add support for Aten__Or__BoolOp (#3574 )	2024-07-31 17:23:53 +08:00
Ivan Butygin	8bd1b9751f	`max_unpool3d` linalg lowering (#3536 ) An attempt of `aten.max_unpool3d` to linalg lowering. There are known issues with this implementation (see comment in code).	2024-07-30 20:59:17 +03:00
Vivek Khandelwal	b6e4725259	[ONNX] Add OnnxToTorch lowering for NonMaxSuppression op (#3501 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-26 21:01:27 +05:30
yyp0	ea60d72489	[Torch] Add AtenMaskedFillTensorOp support (#3561 )	2024-07-26 15:32:13 +08:00
Yuanqiang Liu	003b06dfa1	[Torch] enhance naryFolderHelper to support mixed dtypes (#3559 ) * so that it could support like `i64 + f64 => f64`. * also unify `aten.log`'s folder code to use `naryFolderHelper`.	2024-07-24 17:54:59 +08:00
Yuanqiang Liu	aad1604046	[Torch] enhance fold of aten.squeeze.dim (#3558 )	2024-07-24 14:13:48 +08:00
Ze Zhang	d1e172f418	Register fake_quantize_cachemask ops and add their decompose patterns (#3556 ) Test: `cmake --build build --target check-torch-mlir-all`	2024-07-23 11:33:12 -07:00
Yuanqiang Liu	21ad890009	[Torch] enhance fold of aten.slice.Tensor (#3557 ) so that it could support folding slice with any static shape.	2024-07-23 22:53:03 +08:00
Yuanqiang Liu	78846425e2	[Torch] add constriants when decompose aten.split_with_sizes (#3555 )	2024-07-23 10:34:29 +08:00
Vivek Khandelwal	22c9008bb9	build: Update Roll PyTorch version (#3548 ) This commit also updates the PyTorch and Torchvision nightly links since they are now moved to a different location. PyTorch Nightly: https://download.pytorch.org/whl/nightly/cpu/torch/ Torchvision Nightly: https://download.pytorch.org/whl/nightly/cpu/torchvision/ Disables dtype checks for some ops, tracked by https://github.com/llvm/torch-mlir/issues/3552 Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-19 21:38:57 +05:30
bosko-syrmia	2cdf3deae3	implement lowering of torch.aten._linalg_slogdet (#3524 )	2024-07-19 11:24:43 +05:30
Branko Trifkovic	c7d972ed58	Implement lowering of torch.aten.tril_indices (#3517 )	2024-07-18 18:38:12 +05:30
pkapris-syrmia	fde286f491	Implement lowering for torch.aten.hann_window.periodic (#3502 )	2024-07-17 18:21:23 +05:30
pkapris-syrmia	b59efc75f3	Implement lowering of torch.aten.atleast_1d (#3498 ) This operator is necessary in order to implement torch.aten.vstack. Which will be added in a future PR.	2024-07-17 18:20:30 +05:30
Arham Khan	574143448b	[E2E][ONNX] torch.multinomial (#3404 ) This PR adds a conversion in the TorchOnnxToTorch pass for the ONNX Multinomial operation. It also adds a TorchToLinalg lowering for the `aten.Multinomial` op and does a light refactor of some repeated code that generates random floating point numbers in `TorchToLinalg/Random.cpp`.	2024-07-16 23:09:39 +05:30
rohan-tan-bhowmik	0791a8860c	[Torch] Implements TorchToLinalg lowering of torch.ops.aten._weight_norm_interface (#3538 ) Resolves https://github.com/nod-ai/SHARK-Turbine/issues/757. Adds TorchToLinalg lowering for `Aten_WeightNormInterfaceOp`. --------- Co-authored-by: Ubuntu <rbhowmik@RohanBhowmikVM.judsoscro3wupi0qm4bjlj5m3b.bx.internal.cloudapp.net>	2024-07-16 23:09:12 +05:30
Yuanqiang Liu	714270a922	[Stablehlo] legalize deprecated ops to stablehlo ops (#3543 )	2024-07-17 00:05:11 +08:00
Xinyu Yang	e5d1677894	[Torch] Eliminate getWithLeastStaticInformation in DecomposeAtenLinspaceOp and DecomposeAtenFakeQuantizePerTensorAffineOp (#3539 ) as title	2024-07-15 10:02:36 +08:00
Yuanqiang Liu	5e4f00acb1	[Torch] add support for aten.scatter_add (#3534 )	2024-07-12 09:15:42 +08:00
zjgarvey	0fb8b017d8	Adds misc fixes for some padding related issues (#3528 ) This patch adds a few misc pad op related changes: 1. Addresses issue <https://github.com/llvm/torch-mlir/issues/3457> 2. Addresses issue <https://github.com/llvm/torch-mlir/issues/3442> 3. Fixes the padding order for asymmetrically padded onnx.Conv ops 4. Enables passing quantization through those onnx.Conv op pre-paddings 5. Modifies the torch-to-linalg lowering of AtenReplicationPad2d op to enable support for input rank != 4 Unfortunately, even with all of these changes, the e2e tests for the ReplicationPad2d still fail the onnx config, since the torch export procedure for rearranging the pad order is complicated enough that the padding ints end up not being able to fold back to constants.	2024-07-11 20:01:45 -05:00
Yuanqiang Liu	b38585e077	[Torch Dialect] fix aten.nan_to_num's decomposition when inf=None (#3530 ) also add shape infer in decomposition, see https://github.com/llvm/torch-mlir/issues/3312	2024-07-11 08:46:40 +08:00
Ze Zhang	d466d5b809	Register fake_quantize related ops (#3522 ) Register `aten.fake_quantize_per_channel_affine` and `aten.fake_quantize_per_tensor_affine.tensor_qparams` ops --------- Co-authored-by: Ze Zhang <ze.zhang@getcruise.com>	2024-07-05 11:02:03 -07:00
Sagar Kulkarni	0fe74845da	[ONNX] Fix bug in ONNXToTorch PadOp's pads tensor rearrangement (#3485 ) Fix the pad tensor rearrangement such that we change the representation from [x1_begin, x2_begin, ..., x1_end, x2_end,...] to [xn_begin, xn_end, ...., x2_begin, x2_end, x1_begin, x1_end] where x1, x2 .. xn are the dimensions of the pads tensor argument. --------- Co-authored-by: zjgarvey <zjgarvey@gmail.com> Co-authored-by: zjgarvey <47986913+zjgarvey@users.noreply.github.com>	2024-07-03 15:02:49 -05:00
Scott Todd	ca0e906675	Fix `uint64_t` type. (#3519 ) `u_int64_t` is nonstandard and does not exist in MSVC.	2024-07-02 16:06:20 +00:00
Yuanqiang Liu	e2fbded49c	[Torch Dialect] improve argmax/argmin's decomposition to support keep… (#3514 ) …dim=True when dim=None	2024-07-02 09:08:57 +08:00
Yuanqiang Liu	0e71a192d8	[Torch] support decomposition of aten.aminmax (#3513 ) * unify decompisition of `aten.amax` and `aten.amin` * support `aten.amax` with `dim=()`	2024-06-29 21:44:05 +08:00
Jiawei Wu	f75cbb4df9	[torch dialect] emit aten.fmax/fmin and add decomposition patterns (#3510 )	2024-06-29 00:07:55 +08:00
Phaneesh Barwaria	5a627c46b7	onnx.DFT basic support (#3463 ) - adds support for DFT v20 on the FFT and IFFT path - adds required skeleton code for IFFT ops to be recognised in TMlir	2024-06-28 20:08:43 +05:30
Christopher McGirr	7e6d76e997	[Torch] Fix torch.constant.int operation parsing (#3476 ) Due to the custom operation parser, the print and parser were expecting two different forms. One having the dictionary before the value and the other after. Following the format of the other constants ops, the constant.int will follow the `value attr-dict` format. Updated the parser accordingly.	2024-06-28 16:06:52 +02:00
Aart Bik	1f73895f93	[torch-mlir] bump to llvm/llvm-project@9b78ddf3b2 (#3491 ) This bump triggered an upstream assert. Includes a WAR for #3506. Also includes several things I needed to do to repro: * When TORCH_MLIR_TEST_CONCURRENCY=1, test runs will be printed. * Added TORCH_MLIR_TEST_VERBOSE=1 handling to enable verbose mode (useful on CI). --------- Co-authored-by: Stella Laurenzo <stellaraccident@gmail.com>	2024-06-27 19:28:02 -07:00
Matthias Gehre	6678e1a256	TorchToLinalg: Try folding shape computations to keep static shapes when possible (#3475 ) Before this PR, a statically shaped aten.convolution would generate dynamically shaped linalg IR, and even `-canonicalize` would not be able to fold it back into static shapes. This PR ensure that shape calculations are folded on construction to directly generate statically shaped linalg IR. We achieve that by ensuring that `arith` ops involved in computing shapes are created via `createOrFold`, so that later uses of `getAsOpFoldResult` see constants instead of those ops. For example ``` module { func.func @forward(%arg0: !torch.vtensor<[32,336,112,112],f32>, %arg1: !torch.vtensor<[336,168,3,3],f32>, %arg2: !torch.vtensor<[336],f32>) -> !torch.vtensor<[32,336,56,56],f32> { %false = torch.constant.bool false %int2 = torch.constant.int 2 %int1 = torch.constant.int 1 %0 = torch.prim.ListConstruct %int1, %int1 : (!torch.int, !torch.int) -> !torch.list<int> %1 = torch.prim.ListConstruct %int2, %int2 : (!torch.int, !torch.int) -> !torch.list<int> %2 = torch.prim.ListConstruct : () -> !torch.list<int> %3 = torch.aten.convolution %arg0, %arg1, %arg2, %1, %0, %0, %false, %2, %int2 : !torch.vtensor<[32,336,112,112],f32>, !torch.vtensor<[336,168,3,3],f32>, !torch.vtensor<[336],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int -> !torch.vtensor<[32,336,56,56],f32> return %3 : !torch.vtensor<[32,336,56,56],f32> } } ``` would result in ``` [...] %padded = tensor.pad %2 low[%14, %15, %16, %17] high[%14, %15, %16, %17] { ^bb0(%arg3: index, %arg4: index, %arg5: index, %arg6: index): tensor.yield %cst : f32 } : tensor<32x336x112x112xf32> to tensor<?x?x?x?xf32> [...] %45 = linalg.conv_2d_ngchw_gfchw {dilations = dense<1> : vector<2xi64>, strides = dense<2> : vector<2xi64>} ins(%expanded, %expanded_37 : tensor<?x2x?x?x?xf32>, tensor<2x168x168x3x3xf32>) outs(%expanded_44 : tensor<32x2x168x?x?xf32>) -> tensor<32x2x168x?x?xf32> [...] ``` and with this PR all shapes are static.	2024-06-27 08:43:10 +02:00
zjgarvey	d2bc70f188	[TorchToLinalg][ONNX] Add Basic Determinant Support (#3481 ) This adds support for a few ops: - torch.linalg_det - torch._linalg_det (if the LU and pivot returns are unused) - onnx.Det An scf loop is used, since the row reduction algorithm applied here has some loop-carried dependencies. The current support being added here is very basic, and only works if no permutations are required during row reduction, and assumes the matrices are non-singular.	2024-06-25 13:34:19 -05:00
zjgarvey	368fabf0c1	[ONNX] Basic Support for DeformConv (#3469 ) This adds a torchvision op to torch-mlir and a path from onnx.DeformConv to torchvision.deform_conv2d. I'm not implementing the torch->linalg lowering for the torchvision op yet, but posting this PR to get feedback on some of the choices being made here and to flesh out the onnx frontend a bit.	2024-06-25 12:16:51 -05:00
zjgarvey	e346c911f7	[ONNX] Add basic support for RoiAlign (#3493 ) This adds an onnx->torch conversion for onnx.RoiAlign into torchvision.roi_align or torchvision.roi_pool, and adds those two torchvision ops to torch-mlir.	2024-06-25 11:02:45 -05:00
Vinayak Dev	02340408b7	[torch] Add OnnxToTorch lowering for Onnx.STFT op (#3492 ) Adds OnnxToTorch lowering for `Onnx.STFT` op.	2024-06-25 19:00:45 +05:30
Branko Trifkovic	98c6971a01	Implement lowering of torch.aten.triu_indices (#3451 ) Closes [nod-ai/SHARK-Turbine/issues/709](https://github.com/nod-ai/SHARK-Turbine/issues/709) --------- Co-authored-by: Branko Trifkovic <branko.trifkovic@syrmia.com>	2024-06-21 16:16:38 -07:00

1 2 3 4 5 ...

975 Commits (638ef1451290d471830e9ad594c0a037dc861811)