torch-mlir/test/Dialect
Felix Schneider 0314188dbe
[torch] Basic support for per-channel quantized graphs (#3623)
This patch adds basic support for lowering graphs with per-channel
quantization. Per-channel quantized ops have to be excluded from
`FuseQuantizedOps` for now but can be used in QDQ quantized form.

Using this patch, we're able to import and execute (on the linalg
backend) graphs with per-channel quantization applied using the "new"
PyTorch 2.0 Export Quantization.
2024-08-10 15:51:09 +02:00
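For readers unfamiliar with the term, the per-channel QDQ (quantize, then dequantize) form mentioned in the commit message can be sketched in plain Python. This is only an illustration of the usual affine scheme, where each output channel has its own scale and zero point. It is not code from this patch and not a torch-mlir API. The function names, the list-of-rows layout, and the int8 range are assumptions made for the example.

```python
# Illustrative sketch only: per-channel affine quantization in QDQ form.
# All names here are hypothetical and are not part of torch-mlir or PyTorch.

def quantize_per_channel(weights, scales, zero_points, qmin=-128, qmax=127):
    """Quantize each row (one row per channel) with its own scale/zero point."""
    quantized = []
    for row, scale, zero_point in zip(weights, scales, zero_points):
        q_row = []
        for x in row:
            q = round(x / scale) + zero_point   # affine mapping to the integer grid
            q_row.append(max(qmin, min(qmax, q)))  # clamp to the assumed int8 range
        quantized.append(q_row)
    return quantized


def dequantize_per_channel(quantized, scales, zero_points):
    """Map integers back to floats using the same per-channel parameters."""
    return [
        [(q - zero_point) * scale for q in row]
        for row, scale, zero_point in zip(quantized, scales, zero_points)
    ]


def qdq_per_channel(weights, scales, zero_points):
    """Quantize followed by dequantize: the 'QDQ' representation of a tensor."""
    q = quantize_per_channel(weights, scales, zero_points)
    return dequantize_per_channel(q, scales, zero_points)
```

In a QDQ graph the quantize and dequantize steps stay as explicit ops around a float computation rather than being folded into an integer kernel, which is the form the commit message says per-channel quantized ops can currently be used in.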
TMTensor [torch-mlir] bump stablehlo/llvm version (#3471) 2024-06-18 16:59:53 -07:00
Torch [torch] Basic support for per-channel quantized graphs (#3623) 2024-08-10 15:51:09 +02:00
TorchConversion Add extf-trunc f32-f64-f32 ellision (#3579) 2024-07-31 16:50:00 -07:00