torch-mlir

Commit Graph

Author	SHA1	Message	Date
rohan-tan-bhowmik	e86f56bc76	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 ) Enabled mask and is_causal parameters for torch.aten.scaled_dot_product attention + relevant comments + tests. The tests added highlight the new capabilities introduced in this PR, including: Attention with F16 mask Attention with Boolean mask Causal attention with same Q K V shapes Causal attention without Q K V shapes Made sure that one cannot input both mask and is_causal.	2024-09-09 15:51:41 -07:00
Srinath Avadhanula	0a788e0467	Decompose aten.fmod into aten.mul,sub,div etc. (#3689 ) As titled, create a new decomposition for `aten.fmod.Tensor` to `aten.div`, `aten.trunc`, `aten.mul` and `aten.sub`. Note that we only use `aten.trunc` for floating point operations. This further gets decomposed to `aten.where` etc. by other existing decompositions. This decomposition now makes TOSA pass for a simple model with `aten.fmod` while it makes `stablehlo` fail. For now, we disallow this decomposition for `stablehlo` --------- Co-authored-by: Srinath Avadhanula <srinath.avadhanula@getcruise.com>	2024-09-09 09:00:11 -07:00
Felix Schneider	df6098e43d	[TorchToLinalg] Use `linalg.transpose` instead of `generic` when lowering `aten.T` (#3660 ) The lowering pattern for `aten.T` uses transposition implemented via `linalg.generic`. For downstream passes it is advantageous to use named ops wherever possible, so this patch changes the lowering to use `linalg.transpose` instead.	2024-09-07 08:09:10 +02:00
Branko Trifkovic	70d5730c87	[LINALG] Implement lowering of torch.aten.rot90 (#3551 )	2024-09-06 10:36:17 +05:30
justin-ngo-arm	d4b5e05ac1	[TOSA] Add Torch to Tosa Legalization for torch.tril (#3678 ) Change-Id: Ie5ba31a27394c3adcea00266a9d562862dbd8b08 Signed-off-by: Justin Ngo <justin.ngo@arm.com>	2024-09-05 11:27:29 -07:00
Christopher McGirr	b790061b69	[FxImporter] Add InputInfo to Resolve Literal Hook (#3688 )	2024-09-06 00:53:11 +08:00
zjgarvey	295bf418a4	Add a canonicalization pattern for `aten.unflatten.int` (#3656 ) Addresses an issue in <https://github.com/llvm/torch-mlir/issues/3651> where some unflatten ops generated from onnx models weren't propagating static shape information. It may be necessary to add further optimizations for the more general case when some static information is present in the unflatten (or possibly reshape/view) op's `sizes` list, but not reflected in the output shape. These ops will only successfully infer shapes if the `sizes` list is gotten from a list of constant ints (with possibly one -1). A common example where this fails is when some of the `sizes` are determined from `aten.size.int` ops on dynamic tensors, and other `sizes` are known statically. This PR includes: - a canonicalizer for `aten.unflatten.int` which converts to `aten.unsqueeze` when it is expanding one dim to two, and one of the new dims is statically 1. - an improvement to the folder for `aten.__or__.bool` which does not rely on both operands being static.	2024-09-03 16:38:20 -07:00
zjgarvey	2960538c6d	[fximporter] Avoid importing from `_torchMlir` (#3685 ) Downstream projects don't necessarily register this C++ module. This package removes the dependency and uses `torch.iinfo` to access the max and min values instead.	2024-09-03 13:52:06 -05:00
Ze Zhang	b3942ff984	Add canonicalize pattern for aten.mul.int and aten.floordiv.int (#3680 ) This PR add `floordiv` to the `PY_BUILTIN_TO_TORCH_OP`. For `aten.mul.int` and `aten.floordiv.int` ops, we add new Canonicalization Patterns as follow: ``` %1 = torch.aten.mul.int %input, %const-5 %2 = torch.aten.mul.int %1, %const-6 ``` Will be replaced by `torch.aten.mul.int %input, %const-30` And ``` %1 = torch.aten.mul.int %input, %const-5 %2 = torch.aten.floordiv.int %1, %const-5 ``` Will directly return `%input` This PR also relaxes the `float` type constraint in TorchToTosa for the `AtenRsubScalarOp` conversion. To test: `cmake --build build --target check-torch-mlir-all`	2024-09-03 09:13:59 -07:00
Vivek Khandelwal	70de04a873	build: manually update PyTorch version (#3683 ) Set PyTorch and TorchVision version to nightly release 2024-09-02. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-09-03 21:25:00 +05:30
Vivek Khandelwal	567ed44fd0	[MLIR][TORCH] Add E2E support for aten.polar op (#3671 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-09-03 10:51:03 +05:30
Longsheng Mou	3180704b14	[TorchToLinalg][test] Add test for ConvertAtenConvolutionOp (#3679 ) This patch add a test for `638ef14`, which use `linalg.broadcast` instead of `generic` for convolution bias. Co-authored-by: Rongsheng Gao <gaorongsheng@huawei.com>	2024-08-30 09:51:50 +00:00
jinchen	fd759e4b1f	Fix onnx.Gather lowering with dynamic shapes (#3675 ) Supports the result with dynamic shape and scalar indices like ``` func.func @test_gather_scalar(%arg0: !torch.vtensor<[3,4,5],f32>, %arg1: !torch.vtensor<[], si64>) -> !torch.vtensor<[?,?],f32> attributes {torch.onnx_meta.opset_version = 13 : si64} { %0 = torch.operator "onnx.Gather"(%arg0, %arg1) {torch.onnx.axis = 0 : si64} : (!torch.vtensor<[3,4,5],f32>, !torch.vtensor<[], si64>) -> !torch.vtensor<[?,?],f32> return %0 : !torch.vtensor<[?,?],f32> } ``` `Torch::AtenSqueezeOp` is referring to the result shape, so it will failed on lowering if the result shape is dynamic.	2024-08-29 17:02:16 -07:00
Muhammad Abubakar	98e08023bb	Bump llvm to f9031f00f2c9 (#3672 ) As title --------- Co-authored-by: Muhammad Abubakar <jane.doe@getcruise.com>	2024-08-28 11:29:10 -07:00
lingzhiz1998	5bc59ce1fa	[TorchToLinalg] Support lowering MaxPool3dWithIndices (#3652 ) Support torch.MaxPool3dWithIndices lowering to linalg backend.	2024-08-27 14:14:25 -05:00
Vivek Khandelwal	b92e61832f	build: manually update PyTorch version (#3666 ) Set PyTorch and TorchVision version to nightly release 2024-08-25. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-27 09:28:30 -07:00
penguin_wwy	6eba5bc9ee	[Torch] Extract TensorPlaceholder to a common interface (#3668 )	2024-08-27 23:31:28 +08:00
Xida Ren (Cedar)	eb7bf78a9c	Add RestructureNonConstantAxes pass to address reduce op tests failing on non constant axes (#3600 )	2024-08-26 14:06:06 -07:00
Felix Schneider	638ef14512	[TorchToLinalg] Use `linalg.broadcast` instead of `generic` for conv bias (#3661 ) The current implementation uses a `linalg.generic` to broadcast the bias tensor for the lowering of convolutions. This is suboptimal for later pattern matching. This patch changes it to use the respective named op, `linalg.broadcast`, instead.	2024-08-26 20:29:11 +02:00
Vimal	fa39d91357	[FxImporter] Fix sympy_int_to_int utility (#3657 ) New sympy type is introduced to represent integer infinity in upstream PyTorch repo. Subsequently, sympy.oo is no longer used to represent infinity upper bound for dynamic dimensions where the upper bound is unknown. Instead `int_oo` is used to represent integer infinity. This commit updates the `_sympy_int_to_int` utility in light of this change.	2024-08-26 09:31:17 -07:00
Rob Suderman	f9766c89f6	[onnx] Handle `torch.aten` for inner product case (#3634 ) The following case was failing to lower for einsum. This fixes up the inner product issue.	2024-08-24 11:41:25 -07:00
Rob Suderman	6cf139687d	[onnx] Support for optional `axis` attribute for `onnx.Pad` (#3635 ) The `axis` attribute is optionally available. Added support by computing the pad based on the axis values. --------- Signed-off-by: Rob Suderman <rob.suderman@gmail.com>	2024-08-24 11:41:08 -07:00
Rob Suderman	b3b8e2e96a	[torch] Fix lowerings of rshift and lshift (#3665 ) I missed adding second operand conversion and adding them to the set of rewrite patterns.	2024-08-24 03:27:18 +00:00
Rob Suderman	9a4c8c606c	[torch] Add `torch.aten.view.dtype` to op list (#3664 ) Support dtype conversion between types. This is useful for bitcasting buffers between differing bit depths.	2024-08-23 19:02:53 -07:00
Phaneesh Barwaria	9a6fe58a02	onnx.MelWeightMatrix Onnx to Torch to Linalg (#3659 ) - This PR adds new (and equivalent) more tensorized impl of MelWeightMatrix which lowers all the way to linalg. - [Ref Pytorch Impl](https://gist.github.com/PhaneeshB/4e6dfcded3007b1b686fbe28f07a67cd) - Thanks to @rsuderman for pointing out the difficulties [earlier impl](#3503) posed during lowering to linalg and also for providing a better numpy impl 🙏	2024-08-22 08:55:03 -07:00
Vivek Khandelwal	fcc5f444cd	MLIR][TORCH] Fix GroupNorm decomposition by adding shape info (#3658 ) This commit adds the shape info for the tensors created during the decomposition of GroupNorm op. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-22 21:20:40 +05:30
Dmitry Babokin	a980130676	Fix macOS package build (#3562 ) Without `--no-build-isolation` pip invokes `setup.py` in fresh environment, which doesn't have `torch` installed. But `setup.py` does `import torch` to check PyTorch version, so the build crashes. At the same time the script creates a disposable virtual environment with all required dependencies specifically to run wheel build. Note that Linux package build also runs with this option. `15cf7106c4/setup.py (L230)` This was introduced by this commit: `74f7a0c9d6` And looks like macOS builds were not running in CI ever since. I also updated Python versions in `install_macos_deps.sh`.	2024-08-22 19:13:20 +05:30
Xida Ren (Cedar)	4358aaccd6	Add per-test timeouts to catch infinite loops (#3650 ) Previously we only had full suite timeouts, making it impossible to identify which specific tests were hanging. This patch adds: 1. Per-test timeout support in the test framework 2. A default 600s timeout for all tests 3. A deliberately slow test to verify the timeout mechanism works The timeout is implemented using Python's signal module. Tests that exceed their timeout are marked as failures with an appropriate error message. This should help catch and isolate problematic tests that enter infinite loops, without needing to re-run the entire suite multiple times.	2024-08-21 11:37:31 -07:00
lingzhiz1998	7f886cc270	[TorchToLinalg] Support torch.isclose lower to linalg (#3631 )	2024-08-21 11:55:54 +08:00
Ian Wood	a24114efa3	[TorchToLinalg] remove `extract_slice` grid_sample lowering (#3483 ) Instead of using extract_slice for grid sampler, use affine constants to access the X and Y values in the generic op's region.	2024-08-20 14:23:43 -07:00
zjgarvey	f66908f190	[TorchToLinalg] address a dtype mismatch in `aten.multinomial` lowering (#3630 ) Resolves <https://github.com/llvm/torch-mlir/issues/3628> Unblocks a compile failure for one of the MiGraphx models (`AgentModel`).	2024-08-20 15:14:48 -05:00
Aart Bik	f72770a725	[torch-mlir][sparse] replace ad-hoc mechanism with proper FX export (#3648 ) Now that the PyDev feature request pytorch/pytorch#117188 has been completed, we can remove all the ad-hoc code that propagates sparsity metadata and replace it with the built-int PyDev metadata for sparse tensors. This removes a lot of code and also ensures sparsity is consistent with the torch.sparse package for all cases.	2024-08-20 09:56:21 -07:00
Vivek Khandelwal	0a86deb59a	build: manually update PyTorch version (#3627 ) Set PyTorch and TorchVision version to nightly release 2024-08-18. This commit also updates the `scaled_dot_product_attention` op. A new attribute `enable_gqa` has been added. As of now, only the default value for the same is supported. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-19 12:03:56 +05:30
Marius Brehler	56a663690c	Update links to examples (#3641 ) Closes #3440	2024-08-16 18:59:44 +02:00
Rob Suderman	78deb175b3	[onnx] Fix shortcircuit path (#3633 ) The implementation was short circuiting the second result. Updated to guarantee we do not short circuit.	2024-08-16 09:23:47 -07:00
Rob Suderman	3a599bec80	[onnx] Fix onnx.ThresholdedRelu crash (#3638 ) Result type was not fetched causing a crash on construction	2024-08-16 09:23:38 -07:00
Hacker1337	5b19ab93dc	Fixed installation command in README.md (#3466 ) Current pip installation command raises error ``` ERROR: Could not find a version that satisfies the requirement torch-mlir (from versions: none) ERROR: No matching distribution found for torch-mlir ``` (checked on Ubuntu 22.04.2 LTS with `venv` and with `conda`) Because it is trying to install torch-mlir from pytorch repository. The installation command was wrongly split into 2 in #3073. I just merged them back to 1 installation command with both pytorch and llvm/torch-mlir channels.	2024-08-16 09:07:35 -07:00
penguin_wwy	37e89828a1	[FxImporter] refactor canonicalize using table driven (#3402 )	2024-08-16 22:57:18 +08:00
Rob Suderman	f09cb766dc	[onnx] Fix `torch` lowering for determinant (#3639 ) The determinant lowering had some extract / insert shape mismatches. Replumbed shape manipulations to correctly implement the determinant operation.	2024-08-15 15:41:50 -07:00
yyp0	43e3118eb9	[Stablehlo] use stablehlo specs lowering AtenSliceScatterOp (#3592 )	2024-08-15 20:06:29 +08:00
Yevhenii Havrylko	64b0d4aed3	Add missing dependency to TorchMLIRRefBackend target (#3107 ) Discovered in https://github.com/llvm/torch-mlir/issues/3104 Most likely when building with stablehlo, while waiting for it missing dependency was generated to location shared with another dependency.	2024-08-14 23:41:51 +08:00
pkapris-syrmia	23ec5399e5	Implement lowering of aten.atleast_2d (#3546 ) This operator is needed to implement aten.vstack, which will be submitted in a subsequent PR	2024-08-14 18:52:31 +05:30
Branko Trifkovic	da877a781e	Added support for integer to complex conversion (#3604 )	2024-08-14 18:13:00 +05:30
Hacker1337	cb6a499460	Update architecture.md. Fixed brocken link (#3565 )	2024-08-14 16:38:51 +05:30
pkapris-syrmia	10fe5d08d1	Implement lowering for torch.aten.rad2deg (#3586 )	2024-08-14 16:37:28 +05:30
rohan-tan-bhowmik	1c16de147a	Minor change in TMTensorOps.td (#3602 ) Fixed a little programming choice style that bothered me.	2024-08-14 16:33:49 +05:30
Vivek Khandelwal	4a0bed0ce0	[ONNX] Add training mode support for BatchNormalization op (#3597 ) This commit extends the OnnxToTorch lowering for BatchNormalization op for supporting the case when training=True. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-14 10:46:38 +05:30
Rob Suderman	2511cf46b4	[onnx] Fix `onnx.RNN` for layout attribute (#3620 ) The `layout` attribute was not considered for the `onnx.RNN` operation. Added support for the attribute to transpose the inputs / outputs of the RNN when valid.	2024-08-13 14:34:25 -07:00
Rob Suderman	af67f9efb0	[onnx] Support integer types for `onnx.Pow` (#3626 ) Pow is not support for the `torch` operator. Add casting for integer types.	2024-08-13 09:39:04 -07:00
Rob Suderman	39307f0462	[onnx] Fix `onnx.Gather` for bad expansion (#3625 ) A case where unsqueeze was require was missed causing compilation failures.	2024-08-13 09:38:55 -07:00

1 2 3 4 5 ...

3125 Commits (e86f56bc763fc1d0ce29806845a7b3d452054831) All Branches Search

3125 Commits (e86f56bc763fc1d0ce29806845a7b3d452054831)

All Branches