torch-mlir/include
zjgarvey de28c8540b
[ONNX] add int16 quantization support (#3446)
There is currently no int16 quantization support in PyTorch. This patch
adds a new MLIR type corresponding to the missing "torch.qint16" type
and enables lowering of quantization-related ONNX ops that use int16 types.
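
A rough sketch of what this enables, using op spellings that mirror the
existing int8 lowerings (assumed here, not copied from the patch): a lowered
onnx.DequantizeLinear over 16-bit data can now carry the new !torch.qint16
element type.

```mlir
// Sketch of a lowered 16-bit onnx.DequantizeLinear. The si16 -> !torch.qint16
// step is what this patch makes possible; the ops mirror the int8 path.
func.func @dequant_i16(%data: !torch.vtensor<[4],si16>) -> !torch.vtensor<[4],f32> {
  %scale = torch.constant.float 0.25
  %zp = torch.constant.int 0
  // Reinterpret the raw si16 tensor as a per-tensor quantized value with the
  // new quantized element type.
  %q = torch.aten._make_per_tensor_quantized_tensor %data, %scale, %zp
      : !torch.vtensor<[4],si16>, !torch.float, !torch.int
      -> !torch.vtensor<[4],!torch.qint16>
  // Dequantize back to float, as the existing int8 path does.
  %f = torch.aten.dequantize.self %q
      : !torch.vtensor<[4],!torch.qint16> -> !torch.vtensor<[4],f32>
  return %f : !torch.vtensor<[4],f32>
}
```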

In follow-up patches, the custom quantization logic for ops like
aten.matmul/aten.mm/aten.convolution may need to be revisited to support
qint16. The passes in FuseQuantizedOps.cpp may also need slight
modifications; a sketch of the kind of pattern they rewrite follows.
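
For reference, a rough sketch (with assumed op spellings, following the
existing int8 test patterns) of a pre-fusion quantized matmul, the shape of IR
that fusion passes rewrite into quantized compute:

```mlir
// Sketch of a matmul on dequantized operands: FuseQuantizedOps-style rewrites
// pull the dequantize ops into the matmul so it runs on quantized values.
// With this patch, the element type may now be !torch.qint16.
func.func @qmatmul(%lhs: !torch.vtensor<[2,3],!torch.qint16>,
                   %rhs: !torch.vtensor<[3,4],!torch.qint16>) -> !torch.vtensor<[2,4],f32> {
  %lhs_f = torch.aten.dequantize.self %lhs
      : !torch.vtensor<[2,3],!torch.qint16> -> !torch.vtensor<[2,3],f32>
  %rhs_f = torch.aten.dequantize.self %rhs
      : !torch.vtensor<[3,4],!torch.qint16> -> !torch.vtensor<[3,4],f32>
  %out = torch.aten.matmul %lhs_f, %rhs_f
      : !torch.vtensor<[2,3],f32>, !torch.vtensor<[3,4],f32> -> !torch.vtensor<[2,4],f32>
  return %out : !torch.vtensor<[2,4],f32>
}
```
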
2024-06-12 10:37:22 +05:30
torch-mlir           [ONNX] add int16 quantization support (#3446)                2024-06-12 10:37:22 +05:30
torch-mlir-c         [ONNX] add int16 quantization support (#3446)                2024-06-12 10:37:22 +05:30
torch-mlir-dialects  [NFC] Change to *cast instead of .*cast variants (#3405)     2024-05-30 23:45:13 -07:00
CMakeLists.txt       [NFC reformat] Run pre-commit on all files and format misc.  2024-04-27 14:08:09 -07:00