torch-mlir/lib/Conversion
Felix Schneider 0314188dbe
[torch] Basic support for per-channel quantized graphs (#3623)
This patch adds basic support for lowering graphs with per-channel
quantization. Per-channel quantized ops have to be excluded from
`FuseQuantizedOps` for now but can be used in QDQ quantized form.

Using this patch, we're able to import and execute (on the linalg
backend) graphs with per-channel quantization applied using the "new"
PyTorch 2.0 Export Quantization.
2024-08-10 15:51:09 +02:00
..
TorchConversionToMLProgram [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
TorchOnnxToTorch [onnx] Support `fp8` for `onnx.QuantizeLinear` (#3619) 2024-08-09 12:32:46 -07:00
TorchToArith [TorchToArith] Add a lowering for `torch.add.float_int` (#3594) 2024-08-07 11:55:27 -05:00
TorchToLinalg [torch] Basic support for per-channel quantized graphs (#3623) 2024-08-10 15:51:09 +02:00
TorchToSCF [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
TorchToStablehlo [stablehlo] support dynamic-shaped index in stablehlo conversion for aten.index-like ops (#3322) 2024-08-01 10:41:09 +08:00
TorchToTMTensor [Torch] add support for aten.scatter_add (#3534) 2024-07-12 09:15:42 +08:00
TorchToTensor [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
TorchToTosa [TOSA] Fix Tensor.hacked_twin to support diff size indexes (#3547) 2024-07-30 14:32:05 -07:00
Utils [LINALG] Added support for conversion from float to complex. (#3595) 2024-08-07 12:36:48 +05:30
CMakeLists.txt [torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055) 2024-03-26 12:41:40 -07:00
PassDetail.h Minor fixes for `ConvertTorchConversionToMLProgram`. (#1991) 2023-04-04 09:09:58 -07:00
Passes.cpp Clang format refresh (#2812) 2024-01-29 12:59:33 -05:00