mirror of https://github.com/llvm/torch-mlir
Attention often broadcasts a mask across the batch dimension, since masking is usually performed the same way across attention heads. This change adds optional materialization of that broadcast along the mask dimensions.
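A minimal sketch of what such an optional materialization can look like at the PyTorch level, not the torch-mlir implementation itself; the helper name `materialize_mask` and the `materialize` flag are hypothetical:

```python
import torch

def materialize_mask(mask: torch.Tensor,
                     batch: int,
                     num_heads: int,
                     materialize: bool = False) -> torch.Tensor:
    """Broadcast a [1, 1, seq_q, seq_k] mask to [batch, num_heads, seq_q, seq_k].

    With materialize=False the result keeps stride-0 broadcast dimensions
    (a view, no copy); with materialize=True the broadcast is written out
    to real memory, which some lowerings require.
    """
    seq_q, seq_k = mask.shape[-2], mask.shape[-1]
    expanded = mask.expand(batch, num_heads, seq_q, seq_k)  # view, no copy
    return expanded.contiguous() if materialize else expanded

# Usage: a causal mask shared across the batch and all attention heads.
causal = torch.tril(torch.ones(1, 1, 4, 4, dtype=torch.bool))
m = materialize_mask(causal, batch=2, num_heads=8, materialize=True)
print(m.shape)  # torch.Size([2, 8, 4, 4])
```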
Top-level contents:

- e2e_testing
- examples
- python
- test
- tools
- CMakeLists.txt