torch-mlir/projects/pt1
Rob Suderman 25738b8c19
[linalg] Broadcast batch for mask on sdpa lowering (#3824)
Attention often broadcasts a mask across the batch dimension as masking
is usually performed the same across attention heads. Added this
materialization to the mask dimensions optionally.
2024-10-31 17:59:24 -07:00
..
e2e_testing support `aten._trilinear` and improve `einsum` decomposition (#3784) 2024-10-31 14:30:40 -05:00
examples [FxImporter] Add an e2e test example for FxImporter (#3331) 2024-05-14 00:45:19 +08:00
python [linalg] Broadcast batch for mask on sdpa lowering (#3824) 2024-10-31 17:59:24 -07:00
test [NFC] Update black version (#3256) 2024-04-29 11:06:01 +08:00
tools Re-organize project structure to separate PyTorch dependencies from core project. (#2542) 2023-11-02 19:45:55 -07:00
CMakeLists.txt Re-organize project structure to separate PyTorch dependencies from core project. (#2542) 2023-11-02 19:45:55 -07:00