torch-mlir/lib/Dialect/Torch/Transforms
zjgarvey de28c8540b
[ONNX] add int16 quantization support (#3446)
There is currently no int16 quantization support in torch. This patch
adds a new mlir type to correspond to the missing "torch.qint16" type,
and enables lowering of quantization-related onnx ops using int16 types.

In follow-up patches, custom quantization logic for ops like
aten.matmul/aten.mm/aten.convolution may need to be revisited to allow
support for qint16. The passes in FuseQuantizedOps.cpp may also need
slight modifications.
2024-06-12 10:37:22 +05:30
..
AbstractInterpLibrary.cpp build: manually update PyTorch version (#3340) 2024-06-06 22:23:40 +05:30
AdjustCallingConventions.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
CMakeLists.txt [torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055) 2024-03-26 12:41:40 -07:00
DecomposeComplexOps.cpp [Torch] Emit rrelu and decompose it (#3250) 2024-06-03 19:25:52 +08:00
DropAbstractInterpCalculations.cpp Update to LLVM 029313cc979ae71877b65794b1063d4e51184cc8 2023-03-21 04:16:20 -07:00
EraseModuleInitializer.cpp [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
FuseQuantizedOps.cpp [TorchToLinalg] add support for quantized group conv (#3341) 2024-06-03 21:57:44 +05:30
GlobalizeObjectGraph.cpp [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
InlineGlobalSlots.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
LowerToBackendContract.cpp [Torch] Emit rrelu and decompose it (#3250) 2024-06-03 19:25:52 +08:00
MatchQuantizedOps.cpp [ONNX] add int16 quantization support (#3446) 2024-06-12 10:37:22 +05:30
MaximizeValueSemantics.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
PassDetail.h llvm: bump tag to e1318078 (#781) 2022-04-26 12:27:51 -07:00
Passes.cpp [Pipeline] Use dedicated simplification pipeline for TorchDynamo frontend (#3376) 2024-05-22 05:23:18 -07:00
PrepareForGlobalizeObjectGraph.cpp [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
RecomposeComplexOps.cpp [Torch] support recompose of aten.split.with_sizes and aten.tensor_sp… (#3401) 2024-05-31 09:56:47 +08:00
ReduceOpVariants.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
RefinePublicReturn.cpp [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
ReifyAbstractInterpCalculationsUtils.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
ReifyAbstractInterpCalculationsUtils.h handles 2,3,4 from https://github.com/llvm/torch-mlir/issues/1963 (#1964) 2023-03-24 21:50:01 -05:00
ReifyDtypeCalculations.cpp Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243) 2024-04-27 14:00:56 -07:00
ReifyShapeCalculations.cpp Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243) 2024-04-27 14:00:56 -07:00
ScalarizeShapes.cpp [NFC] Remove unused header files (#3386) 2024-05-30 14:30:36 +08:00
SimplifyAbstractInterpCalculationsUtils.cpp Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243) 2024-04-27 14:00:56 -07:00
SimplifyAbstractInterpCalculationsUtils.h Replace RefineTypes with dtype functions (#2105) 2023-05-12 13:40:45 -07:00
SimplifyDtypeCalculations.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00
SimplifyShapeCalculations.cpp [NFC] Change to *cast instead of .*cast variants (#3405) 2024-05-30 23:45:13 -07:00