torch-mlir

History

rohan-tan-bhowmik e86f56bc76 [Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 ) Enabled mask and is_causal parameters for torch.aten.scaled_dot_product attention + relevant comments + tests. The tests added highlight the new capabilities introduced in this PR, including: Attention with F16 mask Attention with Boolean mask Causal attention with same Q K V shapes Causal attention without Q K V shapes Made sure that one cannot input both mask and is_causal.		2024-09-09 15:51:41 -07:00
..
CAPI	[ONNX] add int16 quantization support (#3446 )	2024-06-12 10:37:22 +05:30
Conversion	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 )	2024-09-09 15:51:41 -07:00
Dialect	[Torch] [TMTensor] Added mask and is_causal support for torch.aten.scaled_dot_product_attention (#3690 )	2024-09-09 15:51:41 -07:00
RefBackend	Add missing dependency to TorchMLIRRefBackend target (#3107 )	2024-08-14 23:41:51 +08:00
CMakeLists.txt	Link necessary op interface implementations (#3364 )	2024-06-03 19:43:28 -05:00
InitAll.cpp	[Stablehlo] legalize deprecated ops to stablehlo ops (#3543 )	2024-07-17 00:05:11 +08:00