torch-mlir

Commit Graph

Author	SHA1	Message	Date
rohan-tan-bhowmik	e86f56bc76	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 ) Enabled mask and is_causal parameters for torch.aten.scaled_dot_product attention + relevant comments + tests. The tests added highlight the new capabilities introduced in this PR, including: Attention with F16 mask Attention with Boolean mask Causal attention with same Q K V shapes Causal attention without Q K V shapes Made sure that one cannot input both mask and is_causal.	2024-09-09 15:51:41 -07:00
rohan-tan-bhowmik	1c16de147a	Minor change in TMTensorOps.td (#3602 ) Fixed a little programming choice style that bothered me.	2024-08-14 16:33:49 +05:30
ptrifunovic98	4555629246	Implement lowering of torch.aten.kthvalue (#3360 ) Closes [nod-ai/SHARK-Turbine#620](https://github.com/nod-ai/SHARK-Turbine/issues/620)	2024-06-15 11:18:39 +05:30
Rob Suderman	afca88a058	[NFC] Change to cast instead of .cast variants (#3405 ) Member casts have been deprecated. Changing over a bunch of the member cast calls to the global templated variants to remove deprecation warnings.	2024-05-30 23:45:13 -07:00
Rob Suderman	db6721084a	Integrate LLVM at llvm/llvm-project@593f6fdcb4 (#3260 )	2024-04-29 12:01:40 -07:00
Rob Suderman	e30a083aff	[torch] Rework lowering to tm_tensor.scatter to stop serialization (#2940 ) We collapsed and broadcasted scatter indices to a single element version. We should instead upport `tm_tensor.scatter`s support for multiple indices and the implicitly broadcasted behavior. This avoids the serialization and materializing a needlessly large indices tensor.	2024-02-27 11:46:57 -08:00
Rob Suderman	e9cdd6cbc5	[torch] Fix tm_tensor.attention for end-to-end (#2907 ) Some operations include a backend matcher for specialized operations. We map these back to generics so they appropriately match to the high performance versions. This is done for the attention operation.	2024-02-13 21:18:01 -08:00
Quinn Dawkins	494089d53d	Clang format refresh (#2812 ) After noticing a number of commits with unrelated formatting changes, I think something was changed with clang-format at one point and we're seeing a number of unrelated changes. Doing a refresh can help avoid this. The changes made here came from ``` find lib -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find include -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm find projects -iname .h -o -iname .cpp \| xargs clang-format -i --style=llvm ```	2024-01-29 12:59:33 -05:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 ) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00

9 Commits (main)