torch-mlir

History

zjgarvey 5e564b5864 Adds Some Quantization Support for AtenMatmulOp (#3147 ) 1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).		2024-04-15 16:06:47 -07:00
..
CAPI	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 )	2023-11-02 19:45:55 -07:00
Conversion	Adds Some Quantization Support for AtenMatmulOp (#3147 )	2024-04-15 16:06:47 -07:00
Dialect	[Torch] Add general AdaptiveAvgPool2dOp decompose support (#3111 )	2024-04-11 17:02:59 +08:00
RefBackend	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 )	2023-11-02 19:45:55 -07:00
python	[torch-mlir][sparse] support sparse tensor output (#3152 )	2024-04-12 09:56:32 -07:00
CMakeLists.txt	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 )	2023-11-02 19:45:55 -07:00
lit.cfg.py	[onnx] Add torch-mlir-import-onnx tool. (#2637 )	2023-12-12 22:01:30 -08:00
lit.site.cfg.py.in	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 )	2023-11-02 19:45:55 -07:00