torch-mlir

History

zjgarvey de28c8540b [ONNX] add int16 quantization support (#3446 ) There is currently no int16 quantization support in torch. This patch adds a new mlir type to correspond to the missing "torch.qint16" type, and enables lowering of quantization-related onnx ops using int16 types. In follow-up patches, custom quantization logic for ops like aten.matmul/aten.mm/aten.convolution may need to be revisited to allow support for qint16. The passes in FuseQuantizedOps.cpp may also need slight modifications.		2024-06-12 10:37:22 +05:30
..
CMakeLists.txt	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 )	2023-11-02 19:45:55 -07:00
TorchDialect.cpp	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 )	2024-04-27 14:00:56 -07:00
TorchOps.cpp	[Torch] fix toBuiltinTensor() (#3415 )	2024-06-08 09:36:32 +08:00
TorchOpsODSGenerated.cpp	Reduce compilation time for TorchOps.cpp.inc	2022-03-21 14:42:26 -07:00
TorchTypes.cpp	[ONNX] add int16 quantization support (#3446 )	2024-06-12 10:37:22 +05:30
UtilsForODSGenerated.cpp	llvm: bump tag to e1318078 (#781 )	2022-04-26 12:27:51 -07:00
UtilsForODSGenerated.h	Reduce compilation time for TorchOps.cpp.inc	2022-03-21 14:42:26 -07:00