torch-mlir

History

rohan-tan-bhowmik e86f56bc76 [Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 ) Enabled mask and is_causal parameters for torch.aten.scaled_dot_product attention + relevant comments + tests. The tests added highlight the new capabilities introduced in this PR, including: Attention with F16 mask Attention with Boolean mask Causal attention with same Q K V shapes Causal attention without Q K V shapes Made sure that one cannot input both mask and is_causal.		2024-09-09 15:51:41 -07:00
..
torch-mlir	[LINALG] Implement lowering of torch.aten.rot90 (#3551 )	2024-09-06 10:36:17 +05:30
torch-mlir-c	[ONNX] add int16 quantization support (#3446 )	2024-06-12 10:37:22 +05:30
torch-mlir-dialects	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 )	2024-09-09 15:51:41 -07:00
CMakeLists.txt	[NFC reformat] Run pre-commit on all files and format misc.	2024-04-27 14:08:09 -07:00