mirror of https://github.com/llvm/torch-mlir
Commit 0314188dbe
This patch adds basic support for lowering graphs with per-channel quantization. Per-channel quantized ops have to be excluded from `FuseQuantizedOps` for now, but they can still be used in QDQ (quantize-dequantize) form. With this patch we are able to import and execute (on the linalg backend) graphs with per-channel quantization applied via the "new" PyTorch 2.0 Export Quantization flow.
Directories: TMTensor, Torch, TorchConversion
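For context, here is a minimal sketch of the PyTorch 2.0 Export (PT2E) per-channel quantization flow the commit message refers to, assuming the XNNPACKQuantizer with its symmetric per-channel config as the example quantizer; the toy model, names, and calibration input are hypothetical, and the capture API has shifted across PyTorch versions. The converted module carries quantize/dequantize ops with per-channel parameters, which is the QDQ form this patch can lower.

```python
import torch
from torch._export import capture_pre_autograd_graph
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)

# Hypothetical toy model; any conv/linear model goes through the same flow.
class SmallConv(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(3, 8, kernel_size=3)

    def forward(self, x):
        return self.conv(x)

model = SmallConv().eval()
example_inputs = (torch.randn(1, 3, 32, 32),)

# PT2E flow: capture the graph before autograd/quantization transforms.
captured = capture_pre_autograd_graph(model, example_inputs)

# Ask the quantizer for per-channel weight quantization.
quantizer = XNNPACKQuantizer().set_global(
    get_symmetric_quantization_config(is_per_channel=True)
)
prepared = prepare_pt2e(captured, quantizer)
prepared(*example_inputs)            # calibration pass(es)
quantized = convert_pt2e(prepared)   # QDQ graph with per-channel q/dq ops

# The QDQ module can then be imported into torch-mlir (e.g. via the FX
# importer, torch_mlir.fx.export_and_import) and lowered through the
# linalg backend; the exact entry point may vary by torch-mlir version.
```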