torch-mlir

History

zjgarvey 75d1d72059 Generalize Operand Quantization in FuseQuantizeOps (#3327 ) This change enables more customization with operand quantization, and generalizes the patterns QuantizeOperands and QuantizeTransposeOperands to QuantizeOperandsPastCommutingOps. This allows for passing quantization through operations which are functionally unaffected by quantization, such as view-like ops. The purpose of this change is to address a myriad of quantization issues seen in quantized onnx models that have some reshape-like operations sandwiched in between a dequant and something like a matmul (whose other operand is immediately quantizable).		2024-05-12 20:49:59 -07:00
..
CAPI	Clang format refresh (#2812 )	2024-01-29 12:59:33 -05:00
Conversion	[Stablehlo] fix aten.randn's lowering with f32 element type (#3329 )	2024-05-11 17:40:04 +08:00
Dialect	Generalize Operand Quantization in FuseQuantizeOps (#3327 )	2024-05-12 20:49:59 -07:00
RefBackend	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243 )	2024-04-27 14:00:56 -07:00
CMakeLists.txt	[Stablehlo] add stablehlo-aggressive-simplification in e2e test (#3109 )	2024-04-07 10:48:11 +08:00
InitAll.cpp	[Stablehlo] add stablehlo-aggressive-simplification in e2e test (#3109 )	2024-04-07 10:48:11 +08:00