torch-mlir

History

zjgarvey de28c8540b [ONNX] add int16 quantization support (#3446 ) There is currently no int16 quantization support in torch. This patch adds a new mlir type to correspond to the missing "torch.qint16" type, and enables lowering of quantization-related onnx ops using int16 types. In follow-up patches, custom quantization logic for ops like aten.matmul/aten.mm/aten.convolution may need to be revisited to allow support for qint16. The passes in FuseQuantizedOps.cpp may also need slight modifications.		2024-06-12 10:37:22 +05:30
..
AbstractInterpLibrary.cpp	build: manually update PyTorch version (#3340 )	2024-06-06 22:23:40 +05:30
AdjustCallingConventions.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00
CMakeLists.txt	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055 )	2024-03-26 12:41:40 -07:00
DecomposeComplexOps.cpp	[Torch] Emit rrelu and decompose it (#3250 )	2024-06-03 19:25:52 +08:00
DropAbstractInterpCalculations.cpp	Update to LLVM 029313cc979ae71877b65794b1063d4e51184cc8	2023-03-21 04:16:20 -07:00
EraseModuleInitializer.cpp	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
FuseQuantizedOps.cpp	[TorchToLinalg] add support for quantized group conv (#3341 )	2024-06-03 21:57:44 +05:30
GlobalizeObjectGraph.cpp	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
InlineGlobalSlots.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00
LowerToBackendContract.cpp	[Torch] Emit rrelu and decompose it (#3250 )	2024-06-03 19:25:52 +08:00
MatchQuantizedOps.cpp	[ONNX] add int16 quantization support (#3446 )	2024-06-12 10:37:22 +05:30
MaximizeValueSemantics.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00
PassDetail.h	llvm: bump tag to e1318078 (#781 )	2022-04-26 12:27:51 -07:00
Passes.cpp	[Pipeline] Use dedicated simplification pipeline for TorchDynamo frontend (#3376 )	2024-05-22 05:23:18 -07:00
PrepareForGlobalizeObjectGraph.cpp	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
RecomposeComplexOps.cpp	[Torch] support recompose of aten.split.with_sizes and aten.tensor_sp… (#3401 )	2024-05-31 09:56:47 +08:00
ReduceOpVariants.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00
RefinePublicReturn.cpp	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
ReifyAbstractInterpCalculationsUtils.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00
ReifyAbstractInterpCalculationsUtils.h	handles 2,3,4 from https://github.com/llvm/torch-mlir/issues/1963 (#1964 )	2023-03-24 21:50:01 -05:00
ReifyDtypeCalculations.cpp	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 )	2024-04-27 14:00:56 -07:00
ReifyShapeCalculations.cpp	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 )	2024-04-27 14:00:56 -07:00
ScalarizeShapes.cpp	[NFC] Remove unused header files (#3386 )	2024-05-30 14:30:36 +08:00
SimplifyAbstractInterpCalculationsUtils.cpp	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 )	2024-04-27 14:00:56 -07:00
SimplifyAbstractInterpCalculationsUtils.h	Replace RefineTypes with dtype functions (#2105 )	2023-05-12 13:40:45 -07:00
SimplifyDtypeCalculations.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00
SimplifyShapeCalculations.cpp	[NFC] Change to cast instead of .cast variants (#3405 )	2024-05-30 23:45:13 -07:00