torch-mlir/lib
zjgarvey 694210f429
[TorchToLinalg] Fix Quantized Convolution Accumulator Type (#3459)
1. truncates zero-points to i32.
2. modifies the default accumulator type for i8 from i64 to i32.
3. now uses the input dtype to infer the accumulator dtype.
2024-06-20 13:54:20 -07:00
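
The three changes listed in the commit message can be illustrated with a small sketch. This is not the torch-mlir implementation. The function names (`infer_accumulator_dtype`, `truncate_to_i32`, `quantized_dot`) are hypothetical, and the fallback to i64 for non-8-bit inputs is an assumption made only for this example. It shows zero-points being wrapped to 32 bits, an accumulator type being chosen from the input dtype, and an i8 multiply-accumulate staying within i32 range.

```python
# Hypothetical sketch; none of these names come from torch-mlir.
I32_MIN, I32_MAX = -(2**31), 2**31 - 1


def infer_accumulator_dtype(input_dtype: str) -> str:
    """Pick an accumulator type from the input dtype."""
    if input_dtype in ("i8", "ui8"):
        return "i32"
    # Assumption for this sketch only: wider inputs fall back to i64.
    return "i64"


def truncate_to_i32(value: int) -> int:
    """Wrap an integer into the signed 32-bit two's-complement range."""
    value &= 0xFFFFFFFF
    return value - 2**32 if value >= 2**31 else value


def quantized_dot(inputs, weights, input_zp, weight_zp):
    """Accumulate (x - zp_x) * (w - zp_w), checking the sum fits in i32."""
    input_zp = truncate_to_i32(input_zp)
    weight_zp = truncate_to_i32(weight_zp)
    acc = 0
    for x, w in zip(inputs, weights):
        acc += (x - input_zp) * (w - weight_zp)
        if not I32_MIN <= acc <= I32_MAX:
            raise OverflowError("accumulator does not fit in i32")
    return acc


if __name__ == "__main__":
    print(infer_accumulator_dtype("i8"))
    print(quantized_dot([10, -20, 30], [1, 2, 3], input_zp=5, weight_zp=-1))
```

Each i8 product after zero-point subtraction is bounded by roughly 2^16 in magnitude, so a 32-bit accumulator can absorb many thousands of terms before overflow becomes possible, which is the usual motivation for preferring i32 over i64 here.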
..
CAPI [ONNX] add int16 quantization support (#3446) 2024-06-12 10:37:22 +05:30
Conversion [TorchToLinalg] Fix Quantized Convolution Accumulator Type (#3459) 2024-06-20 13:54:20 -07:00
Dialect [TorchToLinalg] Fix Quantized Convolution Accumulator Type (#3459) 2024-06-20 13:54:20 -07:00
RefBackend [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
CMakeLists.txt Link necessary op interface implementations (#3364) 2024-06-03 19:43:28 -05:00
InitAll.cpp [Stablehlo] support uint8 (#3367) 2024-06-04 09:04:59 +08:00