I've upstreamed the necessary quantized linalg Op for 2d convolution
with the "channel-first" ordering used by torch
(https://github.com/llvm/llvm-project/pull/107740).
This patch changes the lowering of the quantized 2d case of
`aten.convolution` accordingly, which saves three transpositions per
convolution (input, weights, result) and therefore removes the need to
optimize them away in downstream passes.
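A minimal sketch of the resulting IR, with illustrative shapes and SSA
names (not taken from the actual tests); the quantized op consumes the
NCHW input and FCHW weights directly, so no transposes are materialized:

```mlir
// Quantized 2d convolution in torch's native layout: NCHW input,
// FCHW weights, i32 zero-points, i32 accumulator.
%conv = linalg.conv_2d_nchw_fchw_q
          {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>}
          ins(%input, %weights, %inputZp, %weightZp
              : tensor<1x3x32x32xi8>, tensor<8x3x3x3xi8>, i32, i32)
          outs(%acc : tensor<1x8x30x30xi32>) -> tensor<1x8x30x30xi32>
```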
This patch adds a test for 638ef14, which uses `linalg.broadcast`
instead of `linalg.generic` for the convolution bias.
Co-authored-by: Rongsheng Gao <gaorongsheng@huawei.com>
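As an illustration of that bias handling (shapes, element type, and SSA
names are assumed for the example), the bias is broadcast into the
convolution accumulator with a dedicated named op rather than a
`linalg.generic`:

```mlir
// Broadcast a [8] bias across the batch and spatial dimensions of an
// NCHW accumulator; `dimensions` lists the dims added by the broadcast.
%biased = linalg.broadcast
            ins(%bias : tensor<8xf32>)
            outs(%init : tensor<1x8x30x30xf32>)
            dimensions = [0, 2, 3]
```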
This patch makes three changes (sketched below):
1. Truncates zero-points to i32.
2. Changes the default accumulator type for i8 inputs from i64 to i32.
3. Uses the input dtype to infer the accumulator dtype.
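A minimal sketch of changes 1 and 2 at the IR level (SSA names and
static shapes are assumed for the example):

```mlir
// 1. Zero-points arrive as i64 scalars and are truncated to i32.
%izp = arith.trunci %inputZp64 : i64 to i32
%wzp = arith.trunci %weightZp64 : i64 to i32
// 2. i8 operands now accumulate into an i32 (previously i64) init tensor.
%c0 = arith.constant 0 : i32
%empty = tensor.empty() : tensor<1x8x30x30xi32>
%acc = linalg.fill ins(%c0 : i32) outs(%empty : tensor<1x8x30x30xi32>)
         -> tensor<1x8x30x30xi32>
```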
The `convertTensorToElementType` function expects its argument to have
a valid tensor type that is not `Torch::NoneType`. This PR checks that
the bias tensor is not of type `Torch::NoneType` before calling
`convertTensorToElementType` on it in the `matchAndRewrite` member
function of the `ConvertAtenConvolutionOp` class.
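For illustration, input along these lines exercises the new check
(shapes and SSA names are assumed); the bias operand is the
`torch.none` value, so the lowering must skip
`convertTensorToElementType` for it:

```mlir
// Convolution with no bias: the third operand is torch.none.
// %input, %weight and the stride/padding/dilation/output_padding lists
// are assumed to be defined above.
%none = torch.constant.none
%false = torch.constant.bool false
%int1 = torch.constant.int 1
%out = torch.aten.convolution %input, %weight, %none, %stride, %padding,
         %dilation, %false, %outpad, %int1
         : !torch.vtensor<[1,3,32,32],f32>, !torch.vtensor<[8,3,3,3],f32>,
           !torch.none, !torch.list<int>, !torch.list<int>, !torch.list<int>,
           !torch.bool, !torch.list<int>, !torch.int
         -> !torch.vtensor<[1,8,30,30],f32>
```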