torch-mlir

Commit Graph

Author	SHA1	Message	Date
Branko Trifkovic	da877a781e	Added support for integer to complex conversion (#3604 )	2024-08-14 18:13:00 +05:30
Vivek Khandelwal	4a0bed0ce0	[ONNX] Add training mode support for BatchNormalization op (#3597 ) This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-14 10:46:38 +05:30
Rob Suderman	2511cf46b4	[onnx] Fix `onnx.RNN` for layout attribute (#3620 ) The `layout` attribute was not considered for the `onnx.RNN` operation. Added support for the attribute to transpose the inputs / outputs of the RNN when valid.	2024-08-13 14:34:25 -07:00
Rob Suderman	af67f9efb0	[onnx] Support integer types for `onnx.Pow` (#3626 ) Pow is not supported for the `torch` operator. Add casting for integer types.	2024-08-13 09:39:04 -07:00
Rob Suderman	39307f0462	[onnx] Fix `onnx.Gather` for bad expansion (#3625 ) A case where unsqueeze was required was missed, causing compilation failures.	2024-08-13 09:38:55 -07:00
pkapris-syrmia	d11d6f6fea	[TorchToLinalg] Fix torch.aten.remainder for negative operands (#3581 ) Closes #3575 The PyTorch remainder operator is meant to compute the Python modulus operator entrywise: https://pytorch.org/docs/stable/generated/torch.remainder.html#torch.remainder In python the modulus operator is meant to always return a result with the same sign as the divisor: https://docs.python.org/3/reference/expressions.html#binary-arithmetic-operations In other words, torch.aten.remainder should return a Python-style modulus instead of a C-style modulus. However the remainder operator was simply translated into arith.ModSI or arith.ModF, which both effectively compute the C-style modulus. Now the lowering has been modified so that the modulus operator works properly with negative numbers, both in the dividend, and the divisor.	2024-08-13 21:17:21 +05:30
aldesilv	a4ba02eef5	[ONNX] add support for tfidfvectorizer (#3553 ) 1-d/2-d input and output implemented based on the description and example test cases in https://github.com/onnx/onnx/blob/main/docs/Operators.md#TfIdfVectorizer and some notes from https://github.com/onnx/onnx/blob/main/onnx/reference/ops/op_tfidf_vectorizer.py#L128 --------- Co-authored-by: zjgarvey <zjgarvey@gmail.com>	2024-08-12 18:10:11 -05:00
Rob Suderman	d3695a97a0	[onnx] Fix `onnx.Hardmax` lowering to torch (#3624 ) The lowering to torch makes assumption about the dimensions / types of reduce max and onehot. We need to correct for expected torch behavior.	2024-08-12 11:19:02 -07:00
Phaneesh Barwaria	026dfade64	onnx.MelWeightMatrix TorchOnnxToTorch (#3503 ) Just uploading what I have till now [Gist](https://gist.github.com/PhaneeshB/761f75f5522d9f4a40ef949a328e93fe) of pytorch impl that I'm following to implement the OnnxToTorch lowering Additional Details - (also pasted as comment in gist) [Op Description](https://github.com/onnx/onnx/blob/main/docs/Operators.md#melweightmatrix) in Onnx Documentation [Example](https://github.com/onnx/onnx/blob/main/docs/Operators.md#examples-93) Used the same example in this file. the Expected output is shown in the example [Reference Onnx Impl](`4c3ed5e08b/onnx/reference/ops/op_mel_weight_matrix.py (L13)`) - This is the base for the above code.	2024-08-12 21:18:29 +05:30
Felix Schneider	0314188dbe	[torch] Basic support for per-channel quantized graphs (#3623 ) This patch adds basic support for lowering graphs with per-channel quantization. Per-channel quantized ops have to be excluded from `FuseQuantizedOps` for now but can be used in QDQ quantized form. Using this patch, we're able to import and execute (on the linalg backend) graphs with per-channel quantization applied using the "new" PyTorch 2.0 Export Quantization.	2024-08-10 15:51:09 +02:00
Rob Suderman	44266ab0c4	[onnx] Support `fp8` for `onnx.QuantizeLinear` (#3619 ) We need to directly decompose quantize linear for `fp8` types as the equivalent torch operations do not support the operation.	2024-08-09 12:32:46 -07:00
Rob Suderman	8358e8c255	[onnx] Add support for `fp8` `onnx.DequantizeLinear` (#3617 ) Fp8 needs a slightly different path for dequantization as the `torch` dequantize operation does not support `fp8` types.	2024-08-08 16:20:53 -07:00
Rob Suderman	880e64bbbb	[onnx] `onnx.Split` may not have `num_outputs` which can be inferred (#3608 ) The attribute does not exist in all variants of the operation. It can be inferred from the number of results so we should just do that.	2024-08-08 16:17:38 -07:00
Rob Suderman	4350672685	[torch] Add integer support for pooling operations (#3610 ) If we pass an integer type to the pooling operation we incorrectly pad with an integer value, which causes downstream compilation failures.	2024-08-07 21:42:10 -07:00
zjgarvey	7f2a17e757	[ONNX] fix padding for `onnx.MaxPool` (#3611 ) The saga of aligning onnx and torch padding conventions continues. ```python onnx_pads = [low_x, low_y, low_z, high_x, high_y, high_z] torch_pads = [low_z, high_z, low_y, high_y, low_x, high_x] ``` So not only is the lexicographical ordering hierarchy swapped (low/high x spatial-dim -> spatial-dim x low/high) but the ordering in the spatial-dim specification is also reversed. This patch properly reverses the pad ordering (and actually uses the `shuffledPadding` to pad).	2024-08-07 20:34:00 -07:00
Rob Suderman	6c33ab024e	[onnx] `onnx.CenterCropPad` used an incorrect type for toScalar (#3605 ) To scalar should have a rank-0 tensor type not rank-1 with length 1. Changing allows proper compilation.	2024-08-07 20:33:33 -07:00
Rob Suderman	59a4c6fda4	[onnx] Fix transposition code for `onnx.OneHot` (#3606 ) The post onehot transposition code was unexercised. Fixed the test and transformation to check use.	2024-08-07 18:20:26 -07:00
Marius Brehler	341f415b1e	[onnx] Fix lowering `onnx.Shrink` to Torch (#3603 ) This fixes the result type of the `torch.aten.lt.Scalar` and `torch.aten.ge.Scalar` ops created during the lowering of `onnx.Shrink` to Torch.	2024-08-07 21:25:14 +02:00
Rob Suderman	18139994e8	[onnx] Fix edge condition for `onnx.ReduceMax` (#3598 ) The length-0 case for `onnx.ReduceMax` was incorrect due to a copy-paste error.	2024-08-07 10:32:28 -07:00
zjgarvey	8d95fe9eeb	[TorchToArith] Add a lowering for `torch.add.float_int` (#3594 )	2024-08-07 11:55:27 -05:00
Branko Trifkovic	2d6bfb2dec	[LINALG] Added support for conversion from float to complex. (#3595 )	2024-08-07 12:36:48 +05:30
Rob Suderman	b48e55c2f7	[onnx] Handle negative indices for `onnx.GatherElements` (#3599 ) Add a check for negative indices and offset appropriately for `onnx.GatherElements`.	2024-08-06 18:54:01 -07:00
Rob Suderman	b1a232222f	[onnx] Fix `onnx.Shape` to include `start` and `end` processing (#3580 ) `onnx.Shape` can select only a subset of indices using attributes. Add support for these attributes. --------- Co-authored-by: zjgarvey <47986913+zjgarvey@users.noreply.github.com>	2024-08-05 13:56:07 -07:00
Gaurav Shukla	839fe90f86	[MLIR][ONNX] Add support for onnx.scan op (#3516 ) This commit lowers onnx.scan op to torch.prim.Loop op and adds the lowering in the onnx pipeline. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-08-05 15:37:26 +05:30
zjgarvey	d0933b0eb6	[TorchToLinalg] Fix possible OOB access in Interpolate lowering (#3570 ) Following up from the discussion in <https://github.com/llvm/torch-mlir/pull/3550>, I've edited the lowering to prevent OOB extracts in a more direct fashion (i.e., just clamping directly). I don't think this affects the lit tests at all, but I've tested the changes in our external test suite at <https://github.com/nod-ai/SHARK-TestSuite/tree/main/>. I found the issue when I was unexpectedly getting `nan`'s along the output image border for a resize test there.	2024-08-02 13:55:37 -05:00
zjgarvey	79ae0afc2f	[TorchToLinalg] Simplify QuantizePerTensor lowering (#3576 ) Uses arith::MaximumFOp and arith::MinimumFOp instead of comparison and select ops to improve readability of IR.	2024-08-02 13:40:52 -05:00
Rob Suderman	f7b5c13870	Change linalg.matmul_unsigned to linalg.matmul with unsigned type_fn (#3587 ) Change linalg.matmul_unsigned to linalg.matmul with unsigned type_fn Signed-off-by: Max Dawkins <max.dawkins@gmail.com> Co-authored-by: Max Dawkins <max.dawkins@gmail.com>	2024-08-02 11:32:24 -07:00
Rob Suderman	d273bdfabf	[onnx] Fix default `alpha` for `onnx.Elu` (#3583 ) We were defaulting to `0.0` for `onnx.Elu` when it is supposed to be `1.0`.	2024-08-02 09:29:17 -07:00
Rob Suderman	3d33c5a206	[onnx] Fix `onnx.ScatterElements` for negative indices (#3582 ) We need to adjust for negative scatter index values. Added materializing out the inbounds adjustment.	2024-08-02 09:01:10 -07:00
Rob Suderman	306ed62edd	[onnx][torch] Fix `onnx.SoftmaxCrossEntropyLoss` for ignore index (#3585 ) There were two issues related to `ignore_index` being set (1) the onnx-to-linalg pass was not reading the value correctly (2) the mean pass was not considering the `ignore_index` value For (2) when taking the mean we need to know how many of the values were considered in the sum and therefore we cannot divide by the total number of elements. Adding a summation across the total number should correct this issue.	2024-08-02 09:00:56 -07:00
Jiawei Wu	edc87fc577	[stablehlo] support dynamic-shaped index in stablehlo conversion for aten.index-like ops (#3322 ) For now, at most one dynamic dim of index tensors in aten.index/aten.index_put-like op is supported.	2024-08-01 10:41:09 +08:00
Jiawei Wu	7b2902f6e2	[stablehlo]: fix aten.index_put_hacked_twin lowering to StableHlo (#3572 ) Current StableHlo lowering strategy works well when `src` tensor's rank is no bigger than `dst` tensor's. The new patch makes it succeed in other cases. The following is an example. ``` %190 = torch.prim.ListConstruct %arg4 : (!torch.vtensor<[1,1024],si64>) -> !torch.list<vtensor> %191 = torch.aten.index_put.hacked_twin %189, %190, %186, %true : !torch.vtensor<[1024,768],f32>, !torch.list<vtensor>, !torch.vtensor<[1,1024,768],f32>, !torch.bool -> !torch.vtensor<[1024,768],f32> ```	2024-07-31 22:33:57 +08:00
Suraj Sudhir	d3efab984b	[TOSA] Fix Tensor.hacked_twin to support diff size indexes (#3547 ) - Broadcasts index list tensors - Adds torch.nn.Unfold test Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2024-07-30 14:32:05 -07:00
Ivan Butygin	8bd1b9751f	`max_unpool3d` linalg lowering (#3536 ) An attempt of `aten.max_unpool3d` to linalg lowering. There are known issues with this implementation (see comment in code).	2024-07-30 20:59:17 +03:00
zjgarvey	f1c74e1431	[TorchToLinalg] add support for depthwise qconv (#3564 ) - Adds support for lowering depthwise + quantized convolution ops to linalg::DepthwiseConv2DNhwcHwcQOp - Changed the variable name for groupSize (which is really C/G) to the more appropriate numGroups (G). - Discovered in e2e testing that linalg does not accept (Cin = groups && Cout = K*groups for K>1) as a "depthwise" conv, so this also updates the case-checking to reflect this issue.	2024-07-29 12:25:07 -07:00
zjgarvey	50d6ce225f	Align Quantization Rounding Scheme with ONNX/Pytorch (#3569 ) Pytorch and ONNX apparently round to nearest, ties go to nearest even, but we were using `math::round` for the torch-to-linalg conversion of `quantize_per_tensor`, which rounds away from zero on ties.	2024-07-29 12:24:46 -07:00
Vinayak Dev	30c4d2f2b8	[torch] Add OnnxToTorch lowering for Onnx.Unique op (#3523 ) Adds OnnxToTorch Lowering for the `Onnx.Unique` op.	2024-07-29 17:32:44 +05:30
pdhirajkumarprasad	a211ccbcff	Implementation of SplitToSequence ops lowering (#3509 ) Added support for splitToSequence ops lowering Added test case with filecheck	2024-07-29 15:44:22 +05:30
Vivek Khandelwal	b6e4725259	[ONNX] Add OnnxToTorch lowering for NonMaxSuppression op (#3501 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-26 21:01:27 +05:30
Vivek Khandelwal	15cf7106c4	[ONNX] Reduce Onnx.Flatten op version (#3560 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-07-24 21:27:20 +05:30
jinchen	f0ce1e94ce	[ONNX] Add OnnxToTorch support for SequenceMap (#3535 )	2024-07-17 14:25:09 -07:00
Arham Khan	574143448b	[E2E][ONNX] torch.multinomial (#3404 ) This PR adds a conversion in the TorchOnnxToTorch pass for the ONNX Multinomial operation. It also adds a TorchToLinalg lowering for the `aten.Multinomial` op and does a light refactor of some repeated code that generates random floating point numbers in `TorchToLinalg/Random.cpp`.	2024-07-16 23:09:39 +05:30
Yuanqiang Liu	5e4f00acb1	[Torch] add support for aten.scatter_add (#3534 )	2024-07-12 09:15:42 +08:00
zjgarvey	0fb8b017d8	Adds misc fixes for some padding related issues (#3528 ) This patch adds a few misc pad op related changes: 1. Addresses issue <https://github.com/llvm/torch-mlir/issues/3457> 2. Addresses issue <https://github.com/llvm/torch-mlir/issues/3442> 3. Fixes the padding order for asymmetrically padded onnx.Conv ops 4. Enables passing quantization through those onnx.Conv op pre-paddings 5. Modifies the torch-to-linalg lowering of AtenReplicationPad2d op to enable support for input rank != 4 Unfortunately, even with all of these changes, the e2e tests for the ReplicationPad2d still fail the onnx config, since the torch export procedure for rearranging the pad order is complicated enough that the padding ints end up not being able to fold back to constants.	2024-07-11 20:01:45 -05:00
Xida Ren (Cedar)	5342aa70cf	Support onnx.GRU and onnx.RNN (#3447 )	2024-07-10 14:04:17 -04:00
Yuanqiang Liu	5bee9aac63	[Stablehlo] simplify promoteType (#3525 ) only provide `outElementType` when promoteType	2024-07-10 10:52:19 +08:00
zjgarvey	dcb48dd46c	[ONNX] Fix LpNormalization Lowering (#3521 ) The LpNormalization lowering was previously just computing the norm, which is incorrect. This computes the norm then divides the input tensor by its norm. I've tested this against some simple onnx models locally. I'll look into adding a test case for this in an external test suite.	2024-07-09 15:42:26 -05:00
Gaurav Shukla	0b46d1110a	[MLIR][ONNX] Add support for onnx.ScatterND (#3479 ) This commit adds support for onnx.ScatterND op in the onnx pipeline. Signed-off-by: Gaurav Shukla <gaurav.shukla@amd.com>	2024-07-08 13:27:14 +05:30
Matthias Gehre	6ea6a6c2fe	TorchOnnxToTorch: Fix stack-use-after-free (#3480 ) We used to move the SmallVector into an ArrayRef and then the SmallVector left the scope. Found by asan.	2024-07-08 09:20:09 +02:00
Yuanqiang Liu	3225f20ab1	[Stablehlo] use index type as dim size, avoid to generate index_cast (#3526 ) For example, the original IR is: ``` module attributes {torch.debug_module_name = "Matmul3D"} { func.func @forward(%arg0: tensor<?x?x?xf32>, %arg1: tensor<?x?x?xf32>) -> tensor<?x?x?xf32> { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %c2 = arith.constant 2 : index %dim = tensor.dim %arg1, %c0 : tensor<?x?x?xf32> %0 = arith.index_cast %dim : index to i64 %dim_0 = tensor.dim %arg1, %c1 : tensor<?x?x?xf32> %1 = arith.index_cast %dim_0 : index to i64 %dim_1 = tensor.dim %arg1, %c2 : tensor<?x?x?xf32> %2 = arith.index_cast %dim_1 : index to i64 %from_elements = tensor.from_elements %0, %1, %2 : tensor<3xi64> %3 = stablehlo.dynamic_broadcast_in_dim %arg1, %from_elements, dims = [0, 1, 2] : (tensor<?x?x?xf32>, tensor<3xi64>) -> tensor<?x?x?xf32> %4 = stablehlo.dot_general %arg0, %3, batching_dims = [0] x [0], contracting_dims = [2] x [1] : (tensor<?x?x?xf32>, tensor<?x?x?xf32>) -> tensor<?x?x?xf32> return %4 : tensor<?x?x?xf32> } } ``` After using IndexType, the IR is: ``` module attributes {torch.debug_module_name = "Matmul3D"} { func.func @forward(%arg0: tensor<?x?x?xf32>, %arg1: tensor<?x?x?xf32>) -> tensor<?x?x?xf32> { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %c2 = arith.constant 2 : index %dim = tensor.dim %arg1, %c0 : tensor<?x?x?xf32> %dim_0 = tensor.dim %arg1, %c1 : tensor<?x?x?xf32> %dim_1 = tensor.dim %arg1, %c2 : tensor<?x?x?xf32> %from_elements = tensor.from_elements %dim, %dim_0, %dim_1 : tensor<3xindex> %0 = stablehlo.dynamic_broadcast_in_dim %arg1, %from_elements, dims = [0, 1, 2] : (tensor<?x?x?xf32>, tensor<3xindex>) -> tensor<?x?x?xf32> %1 = stablehlo.dot_general %arg0, %0, batching_dims = [0] x [0], contracting_dims = [2] x [1] : (tensor<?x?x?xf32>, tensor<?x?x?xf32>) -> tensor<?x?x?xf32> return %1 : tensor<?x?x?xf32> } } ``` The benefits of using IndexType on shape tensor: * simplify the IR, avoid to generate `arith.index_cast` * let backend compiler have a chance to decide the index width of shape tensor * let stablehlo backend have a chance to serialize dynamic shape IR by [shape_legalize_to_stablehlo](https://github.com/openxla/stablehlo/blob/main/stablehlo/tests/shape_legalize_to_stablehlo.mlir)	2024-07-07 18:03:03 +08:00
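
The message for d11d6f6fea (#3581) contrasts a C-style modulus, whose result follows the sign of the dividend, with the Python-style modulus that `torch.remainder` is meant to compute, whose result follows the sign of the divisor. The sketch below illustrates only that semantic difference in plain Python. The function names are hypothetical, and the sign-correction step is one common way to derive one from the other; it is not taken from the actual lowering.

```python
import math


def c_style_remainder(a: int, b: int) -> int:
    """Truncating remainder: the result takes the sign of the dividend `a`."""
    # math.fmod follows C's fmod; exact for small integers.
    return int(math.fmod(a, b))


def python_style_remainder(a: int, b: int) -> int:
    """Floored remainder: the result takes the sign of the divisor `b`."""
    r = c_style_remainder(a, b)
    # If the truncating result is non-zero and its sign differs from the
    # divisor's sign, shift it by one divisor to land in the right range.
    if r != 0 and (r < 0) != (b < 0):
        r += b
    return r


if __name__ == "__main__":
    for a, b in [(-7, 3), (7, -3), (-7, -3), (7, 3)]:
        print(a, b, c_style_remainder(a, b), python_style_remainder(a, b))
```

The two functions agree whenever both operands have the same sign and differ only when the signs are mixed, which is the case the commit describes as previously incorrect.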
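
The message for 7f2a17e757 (#3611) describes two differences between the ONNX and torch pad layouts: ONNX lists all low pads and then all high pads, while torch interleaves low/high per dimension, and torch also walks the spatial dimensions in reverse order. A minimal Python sketch of that reordering follows. The helper name `onnx_pads_to_torch_pads` is hypothetical and this is not the C++ code in the patch.

```python
def onnx_pads_to_torch_pads(onnx_pads):
    """Reorder [low_x, low_y, ..., high_x, high_y, ...] into
    [low_last, high_last, ..., low_x, high_x]."""
    if len(onnx_pads) % 2 != 0:
        raise ValueError("expected one low and one high pad per spatial dim")
    num_dims = len(onnx_pads) // 2
    lows, highs = onnx_pads[:num_dims], onnx_pads[num_dims:]
    torch_pads = []
    # Walk the spatial dims from last to first, emitting (low, high) pairs.
    for low, high in zip(reversed(lows), reversed(highs)):
        torch_pads.extend([low, high])
    return torch_pads


if __name__ == "__main__":
    # low_x=1, low_y=2, low_z=3, high_x=4, high_y=5, high_z=6
    print(onnx_pads_to_torch_pads([1, 2, 3, 4, 5, 6]))
```

With the labelled input above, the output order is `low_z, high_z, low_y, high_y, low_x, high_x`, matching the `torch_pads` layout quoted in the commit message.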
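
The message for 50d6ce225f (#3569) turns on the difference between rounding ties to the nearest even value and rounding ties away from zero. The plain-Python sketch below shows where the two schemes disagree. It relies on Python's built-in `round`, which rounds ties to even; the helper name `round_half_away_from_zero` is hypothetical and only stands in for the behaviour the commit attributes to `math::round`.

```python
import math


def round_half_away_from_zero(x: float) -> float:
    """Round to nearest; exact .5 ties move away from zero."""
    return math.copysign(math.floor(abs(x) + 0.5), x)


def round_half_to_even(x: float) -> float:
    """Round to nearest; exact .5 ties go to the nearest even integer."""
    # Python's built-in round() implements ties-to-even for floats.
    return float(round(x))


if __name__ == "__main__":
    for x in [0.5, 1.5, 2.5, -0.5, -2.5, 2.4, 2.6]:
        print(x, round_half_away_from_zero(x), round_half_to_even(x))
```

The schemes differ only on exact ties such as 0.5 or 2.5, so a quantized value can be off by one step at those inputs if the wrong scheme is used.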

973 Commits (da877a781e5a7f024d9501be35d98859be08f3f4)