torch-mlir

Commit Graph

Author	SHA1	Message	Date
Ian Wood	e88faf08ff	Create scatter op with unique indicies (#3853 ) For the op `index_put_`, if accumulate == false, the behavior is undefined if the indicies aren't unique (https://pytorch.org/docs/stable/generated/torch.Tensor.index_put_.html). So, when converting `AtenIndexPutHackedTwinOp` to a TMTensor scatter op, mark the indices as unique if when `accumulate == false`. This should have no functional effect (unless users are relying on UB) and assuming unique indices has the benefit of unlocking better optimizations in further compiler stages. Signed-off-by: Ian Wood <ianwood2024@u.northwestern.edu>	2024-11-05 12:48:34 -08:00
Rob Suderman	25738b8c19	[linalg] Broadcast batch for mask on sdpa lowering (#3824 ) Attention often broadcasts a mask across the batch dimension as masking is usually performed the same across attention heads. Added this materialization to the mask dimensions optionally.	2024-10-31 17:59:24 -07:00
Xida Ren (Cedar)	9938abf25e	AtenCumprodOp (#3737 )	2024-09-26 18:17:22 -04:00
Rob Suderman	5ce48dfacd	[torch] Fix attention on linalg for dynamic shapes (#3714 ) Current version does not work for a mixture of dynamic and static shaped batch dimensions. Rework to grab the correct dynamic shapes. --------- Co-authored-by: dan <danimal197@gmail.com>	2024-09-18 14:52:54 -05:00
rohan-tan-bhowmik	e86f56bc76	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 ) Enabled mask and is_causal parameters for torch.aten.scaled_dot_product attention + relevant comments + tests. The tests added highlight the new capabilities introduced in this PR, including: Attention with F16 mask Attention with Boolean mask Causal attention with same Q K V shapes Causal attention without Q K V shapes Made sure that one cannot input both mask and is_causal.	2024-09-09 15:51:41 -07:00
Vivek Khandelwal	0a86deb59a	build: manually update PyTorch version (#3627 ) Set PyTorch and TorchVision version to nightly release 2024-08-18. This commit also updates the `scaled_dot_product_attention` op. A new attribute `enable_gqa` has been added. As of now, only the default value for the same is supported. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-08-19 12:03:56 +05:30
Yuanqiang Liu	5e4f00acb1	[Torch] add support for aten.scatter_add (#3534 )	2024-07-12 09:15:42 +08:00
Ramiro Leal-Cavazos	e29191bd08	[LINALG] Broadcast `values` to shape of slize in `index_put` (#3487 ) The `index_put` operation, `input[indices] = values`, allows for the values to be any shape that is broadcastable to the slice `input[indices]`. This commit adds broadcasting support to the Linalg lowering of `IndexPutHackedTwinOp`. Fixes: #3465	2024-06-26 08:59:49 +00:00
ptrifunovic98	4555629246	Implement lowering of torch.aten.kthvalue (#3360 ) Closes [nod-ai/SHARK-Turbine#620](https://github.com/nod-ai/SHARK-Turbine/issues/620)	2024-06-15 11:18:39 +05:30
Rob Suderman	afca88a058	[NFC] Change to cast instead of .cast variants (#3405 ) Member casts have been deprecated. Changing over a bunch of the member cast calls to the global templated variants to remove deprecation warnings.	2024-05-30 23:45:13 -07:00
penguin_wwy	1f544c37d0	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
Jiawei Wu	346a536c9f	[Torch Dialect] decompose all index_put-like op to aten.index_put.hacked_twin for stricter semantics (#3071 ) This PR decomposes all index_put-like op to aten.index_put.hacked_twin for stricter semantics, i.e., no None index in indices argument.	2024-05-08 22:44:57 +08:00
penguin_wwy	6679728c56	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 ) Like #3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated	2024-04-27 14:00:56 -07:00
Rob Suderman	a1fe307a76	[torch] Support implicit batch for index_put (#3128 ) If there is only a single value scattered there can be an implicit batch dimension. This includes a check for the implicit batch dimension when reshaping the update tensor. It includes an e2e test to verify correctness.	2024-04-11 10:18:03 -07:00
Rob Suderman	e30a083aff	[torch] Rework lowering to tm_tensor.scatter to stop serialization (#2940 ) We collapsed and broadcasted scatter indices to a single element version. We should instead upport `tm_tensor.scatter`s support for multiple indices and the implicitly broadcasted behavior. This avoids the serialization and materializing a needlessly large indices tensor.	2024-02-27 11:46:57 -08:00
Rob Suderman	e9cdd6cbc5	[torch] Fix tm_tensor.attention for end-to-end (#2907 ) Some operations include a backend matcher for specialized operations. We map these back to generics so they appropriately match to the high performance versions. This is done for the attention operation.	2024-02-13 21:18:01 -08:00
Rob Suderman	d83b576c6e	Bump LLVM to llvm/llvm-project@bb180856ec (#2895 ) Includes some minor first for `AffineMap::inferFromExprList`	2024-02-09 14:07:49 -08:00
Stella Laurenzo	278c41e938	Bump llvm-project to f66cd9e9556a53142a26a5c21a72e21f1579217c. (#2466 ) Picks up DenseResourceElementsAttr python support and fixes minf/maxf C++ rename.	2023-09-19 10:50:53 -07:00
Stella Laurenzo	a8fd275a00	Fix build issue on MSVC by not having a conditional on disjoint types.	2023-09-06 20:05:31 -07:00
Ramiro Leal-Cavazos	41bafe13cc	[build] Update llvm tag to a3f2751f (#2397 ) This commit updates the `llvm-project` and `mlir-hlo` submodules to commits: llvm-project: a3f2751f782f3cdc6ba4790488ec20163a40ac37 mlir-hlo: 97c7e4b4506c3a2441c923e592833f45da439009 Changes made: - Rename `getSuccessorEntryOperands` with `getEntrySuccessorOperands` and remove `operands` from `getSuccessorRegions` (https://reviews.llvm.org/D157506) - Make `TypeConverter` a `const` (https://reviews.llvm.org/D157601)	2023-08-15 09:53:28 -07:00
Vivek Khandelwal	0109bf705b	[MLIR][TORCH] Fix aten.cumsum lowering for int32 input (#2351 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-07-28 09:45:12 -07:00
Gaurav Shukla	552887783a	[TM_TENSOR] Add `aten.scatter.[src\|value]` op This commit adds support of `aten.scatter.src` and `aten.scatter.value` ops. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2023-05-29 12:35:53 +05:30
gpetters94	0302cf1d92	Add TMTensor::Attention and lower ScaledDotProductAttentionOp to it (#2027 )	2023-05-16 15:17:45 -04:00
Eric Kunze	6a833e1922	Update to LLVM 3157f03a349cfc852cdd994675eaa9652caa2e3a (#2060 ) New requirement to explicitly cast for interfaces https://reviews.llvm.org/D148493	2023-04-25 08:52:46 -07:00
Abhishek Varma	a13d301356	[MLIR][TORCH] Add e2e support for aten.sort op -- This commit adds e2e support for atend.sort op. -- 1. Adds aten.sort op in torch dialect. -- 2. Adds tm_tensor.sort op in TMTensor dialect. -- 3. Adds lowering of aten.sort -> tm_tensor.sort. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2023-04-13 12:59:43 +05:30
Vivek Khandelwal	e90ea3d7ab	[MLIR][TORCH] Extend implementation of aten._index_put_impl op. This commits adds the support for cases for index_put_op: 1.) where index is a 2-d tensor. 2.) where indices is a list of tensors and none, with exactly 2 non none tensors along the consecutive dimensions. This commit also adds a utility to compute the broadcast shape given the two input tensors. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2023-04-05 14:04:30 +05:30
Zachary Cetinic	e7111d473b	[Torch Dialect] Scatter reduce lowering (#1884 ) - Lowers the torch.scatter_reduce to linalg_on_tensors dialect. - Includes support for "sum", "prod", "amax", "amin" and "mean".	2023-02-21 23:05:55 +00:00
Ramiro Leal-Cavazos	6c86bec04f	build: update llvm tag to 9acc2f37 (#1828 ) This commit makes the following changes: - Update dialects to use fold API `kEmitFoldAdaptorFolder` and update signature of `fold` methods (see PSA https://discourse.llvm.org/t/psa-new-improved-fold-method-signature-has-landed-please-update-your-downstream-projects/67618) - Replace `makeArrayRef` with `ArrayRef` (see https://reviews.llvm.org/D140896) - Remove `TypeRange{}` arg from `b.create<scf::IfOp>` since builder no longer takes that argument - Make `func`s in `Torch/invalid.mlir` private, since symbol declarations cannot be public. (see https://discourse.llvm.org/t/rfc-symbol-definition-declaration-x-visibility-checks/2140)	2023-01-25 01:29:42 +00:00
Tanyo Kwok	577e38da58	build: update llvm tag to 7ccbb4df (#1736 ) Summary of changes: - LLVM now includes <optional> instead of "llvm/ADT/Optional.h" in most (although not all) places (https://reviews.llvm.org/rG541ef3d61e9341cd38420c0dbca9250c4d0ea04c). This patch replaces the affected instances of `llvm::Optional` with `std::optional`. - In the usages of llvm::Optional that remain, llvm::Optional::value() is deprecated, so this patch replaces them with a dereference.	2022-12-20 18:17:27 +08:00
Ramiro Leal-Cavazos	73bd32d06c	Make `getTensorRank` safer by changing return to `Optional<unsigned>` (#1707 ) Currently `getTensorRank` returns -1 if it was unable to get the rank of the tensor. However, not every use in the codebase was checking the return value, and in some cases, the return value was casted to unsigned leading to some infinte loops when an unranked tensor reached a decomposition. This commit changes the return of `getTensorRank` to `Optional<unsigned>` to make it clear to the user that the function can fail. This commit also changes a couple of for loops that iterate a vector in reverse order that can potentially become infinite loops into range-based for loops.	2022-12-12 08:56:28 -08:00
Ramiro Leal-Cavazos	dd35488da5	build: update llvm tag to 798fa4b4 (#1684 ) - Support for non-prefixed accessors has been removed. See: https://reviews.llvm.org/D136727 - Rename `operands` to `methodOperands` in `prim.CallMethod` since the name `operands` overlaps with a builtin method name. See: https://reviews.llvm.org/D136727 - Add passes in refbackend to lower memref.subview. See: https://reviews.llvm.org/D136377 - Replace `CopyToValueTensorOps` first in `RewriteViewLikeSubgraph` in maximize-value-semantics. The current implementation of the `RewriteViewLikeSubgraph` pass in maximize-value-semantics creates temporarily invalid IR. In particular, given a forward slice starting from a `CopyToNonValueTensorOp` and ending in `CopyToValueTensorOp`s, the pass first replaces all uses of the `CopyToNonValueTensorOp` with its operand, which results in all the `CopyToValueTensorOp` users having their operand have type `!torch.vtensor`, which is invalid. The correct way to do things is to first replace all the `CopyToValueTensorOp`s with their operand, and then replace all uses of the `CopyToNonValueTensorOp` with its operand. This only started failing now because the generated accessor `getOperand` for the `CopyToValueTensorOp` now returns a `TypedValue<NonValueTensorType>`, which has an assert checking that the value returned is of the expected type.	2022-12-07 12:20:41 -08:00
Vivek Khandelwal	e7edcc62fd	build: update llvm tag to 147fe9de Summary of changes: - Replace call to `MemoryEffectOpInterface::hasNoEffect` with `isMemoryEffectFree`. - Make fix for the dynamic dims, since `kDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` in llvm - `makeShapeLLVMCompatible` and `makeShapeTorchCompatible` utilities convert shapes in order to remain consistent with the Torch and MLIR semantics. - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-01 13:36:50 +05:30
Gaurav Shukla	0d209998d1	llvm: update tag to e864ac6945 (#1600 ) Summary of changes: 1. Replace `string` iterator types by `IteratorType` enum. (`e6598b053d`) 2. Update `includes` wrt new directory layout of MLIR HLO codebase. (`9fd8d251a8`) 3. Update tags llvm: e864ac694540342d5e59f59c525c5082f2594fb8 MHLO: eab364ba2a66bd0613efb94f8a738c1c97aaee92 Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com> Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-11-16 14:40:36 -08:00
Ramiro Leal-Cavazos	b723186983	Remove all but one of valsem ops + move fill.Scalar to elementwise (#1531 ) This commit removes almost all of the valsem ops, since the value semantics version of the ops now exist in PyTorch. The only op missing is `aten.bernoulli_.float`. In addition, this commit also simplifies the implementation of `aten.fill.Scalar` by moving it to the pattern that converts elementwise ops.	2022-10-28 15:06:11 +00:00
Ramiro Leal-Cavazos	82a3860e25	build: update llvm tag to 4546397e (#1502 ) This commit makes the following changes needed to update bump LLVM: - Replace `linalg.init_tensor` with `tensor.empty` (see: https://reviews.llvm.org/D135129) - Replace `NoSideEffect` with `Pure` (see https://reviews.llvm.org/D135505) - Replace `body` region accessor for `ReduceOp` and `ReduceWindowOp` with `getBody` - Fix incorrect use of `tosa::ReduceSumOp` in `AtenNativeLayerNormOp` conversion pattern. The result type of `tosa::ReduceSumOp` must have the same rank as the input type. (see: https://www.mlplatform.org/tosa/tosa_spec.html#_reduce_sum) Co-authored-by: Ashay Rane <ashay@users.noreply.github.com> Co-authored-by: Ashay Rane <ashay@users.noreply.github.com>	2022-10-18 04:22:53 +00:00
Ashay Rane	faa9a78e38	build: update llvm tag to 6f46ff37 (#1448 ) Summary of changes: - Updated references to the Arith dialect (https://reviews.llvm.org/D134762) - Switched to prefixed accessors for MemRef dialect (https://reviews.llvm.org/D134995) - Fixed warnings about signed/unsigned comparisons, ignored return values, and unused variables	2022-10-05 08:28:06 -05:00
George Petterson	a12b9c4492	Add lowering for aten::cumsum	2022-09-12 09:28:07 +05:30
Ashay Rane	9208bf0eb6	llvm: bump tag to e1318078 (#781 ) The updated LLVM code includes a patch to create bfloat16 array attributes, thus enabling a different patch to torch-mlir to flesh out support for the bfloat16 type.	2022-04-26 12:27:51 -07:00
Vivek Khandelwal	1bccb4fc8a	[MLIR][TORCH] Add E2E support for aten::max_pool2d_with_indices_backward op This commit adds lowering of `aten::max_pool2d_with_indices_backward` op. This commit also fixes formatting issues in basic.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-14 21:46:47 +05:30
Ramiro Leal-Cavazos	51d4d55f8a	Add support for multi-dim input to `index_put_impl` (#722 ) This commit adds support for multi-dimensional tensors as input to the `_index_put_impl_` op. The support was to some degree already there, since `ScatterOp` already supports multi-dimensional tensors. This commit also adds a bit more error checking to `index_put` and refactors the code for creating `ScatterOp`s to mimic the way one would make a `Linalg::GenericOp`.	2022-03-31 09:27:21 -07:00
Vigilans	63fb1e5aad	Bump LLVM at 8361c5da30588d3d4a48eae648f53be1feb5cfad	2022-03-18 13:16:14 -04:00
Vivek Khandelwal	3d95c3d6c9	[MLIR][TORCH] Add value tensor variant to aten::_index_put_impl_ This commit adds the op `ValsemVariantAtenIndexPutImplOp` that represents `Aten_IndexPutImpl_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenIndexPutImplOp` op. This commit also updates the `torch.bincount` op test cases.	2022-03-16 22:02:02 +05:30
Vivek Khandelwal	1a2a9e066f	[MLIR][TORCH] Add TorchToTMTensor pass This pass is added to lower ops, which can not be lowered via the TorchToLinalg pass, such as `torch.bincount` op. This pass also uses torch-mlir's TMTensor Dialect to lower the complex ops. Also add torch.bincount op lowering with the help of TMTensor dialect Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30

43 Commits (17c1985c4db326b8773a3e76614af26e14134c8a)