torch-mlir

History

John Wu 704cfdaf08 Add aten.pool_max3d support to torch-to-linalg (#2735 ) Added verification logic to the abstract_interpreter_lib_gen.py Also made some unit tests Initially, I thought we can use `linalg::pooling_ndhwc_max` to help implement this problem. However, on a 5-dimensional matrix it does the pooling on dimensions (2, 3, 4) which is not what we want. We want pooling on dimensions (3, 4, 5). To achieve this, we would need to lower our code using the `linalg` dialect. Turns out the pooling code in `linalg` looks like this. ``` func @max_pooling_ncdhw(%I: memref<?x?x?x?x?xf32>, %K: memref<3xindex>, %O: memref<?x?x?x?x?xf32>, %strides: memref<3xindex>, %dilations: memref<3xindex>) { %c0 = arith.constant 0 : index %c1 = arith.constant 1 : index %N = memref.dim %I, %c0 : memref<?x?x?x?x?xf32> %C = memref.dim %I, %c1 : memref<?x?x?x?x?xf32> %D = memref.dim %I, 2 : memref<?x?x?x?x?xf32> %H = memref.dim %I, 3 : memref<?x?x?x?x?xf32> %W = memref.dim %I, 4 : memref<?x?x?x?x?xf32> %kernel_d = memref.load %K[%c0] : memref<3xindex> %kernel_h = memref.load %K[%c1] : memref<3xindex> %kernel_w = memref.load %K[2] : memref<3xindex> %stride_d = memref.load %strides[%c0] : memref<3xindex> %stride_h = memref.load %strides[%c1] : memref<3xindex> %stride_w = memref.load %strides[2] : memref<3xindex> %dilation_d = memref.load %dilations[%c0] : memref<3xindex> %dilation_h = memref.load %dilations[%c1] : memref<3xindex> %dilation_w = memref.load %dilations[2] : memref<3xindex> linalg.generic { indexing_maps = [ affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d * %stride_d + kd * %dilation_d, h * %stride_h + kh * %dilation_h, w * %stride_w + kw * %dilation_w)>, // Map for input tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (kd, kh, kw)>, // Map for kernel tensor affine_map<(n, c, d, h, w, kd, kh, kw) -> (n, c, d, h, w)> // Map for output tensor ], iterator_types = ["parallel", "parallel", "parallel", "parallel", "parallel", "reduction", "reduction", "reduction"], doc = "3D Max Pooling NCDHW with Strides, Dilations, and Kernel Size" } ins(%I, %K : memref<?x?x?x?x?xf32>, memref<3xindex>) outs(%O : memref<?x?x?x?x?xf32>) { ^bb0(%input_elem: f32, %kernel_elem: index, %output_elem: f32): %max_val = arith.maxf %input_elem, %output_elem : f32 linalg.yield %max_val : f32 } return } ``` This was implemented based on it's source code with the adjustments mentioned above: `4ca1b5e094/mlir/include/mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yaml (L5647)` Issues related to this can be found here https://github.com/nod-ai/SHARK-Turbine/issues/324		2024-01-19 21:09:46 +05:30
..
CMakeLists.txt	Re-organize project structure to separate PyTorch dependencies from core project. (#2542 )	2023-11-02 19:45:55 -07:00
DataMovement.cpp	Implement aten.reflection_pad2d lowering to linalg	2024-01-10 21:32:22 -10:00
IndirectDataMovement.cpp	[TorchToLinalg] NFC: Move Utils.h to an externally accessible location (#2603 )	2023-12-01 19:38:21 -05:00
Linear.cpp	[aten] Make `torch.aten.matmul` to `linalg` work for non-broadcasting case (#2659 )	2023-12-20 10:09:10 -08:00
Pooling.cpp	Add aten.pool_max3d support to torch-to-linalg (#2735 )	2024-01-19 21:09:46 +05:30
PopulatePatterns.h	Re-enable custom op support	2022-08-16 22:49:08 +05:30
Random.cpp	[TorchToLinalg] NFC: Move Utils.h to an externally accessible location (#2603 )	2023-12-01 19:38:21 -05:00
Reduction.cpp	[TorchToLinalg] Drop constexpr from ifs in argmin/max.dim (#2617 )	2023-12-07 13:08:17 -05:00
TensorConstructors.cpp	[TorchToLinalg] Add lowering for aten.replication_pad2d (#2715 )	2024-01-15 14:02:27 -05:00
TensorScalarInterop.cpp	Elide dynamic broadcast checks when in strict symbolic shapes mode. (#2496 )	2023-09-29 16:45:48 -07:00
TorchToLinalg.cpp	Add complex types support with basic complex ops.	2023-05-11 21:29:07 +05:30
Uncategorized.cpp	Implement lowering of torch.aten.remainder.Tensor (#2763 )	2024-01-19 18:09:08 +05:30
Utils.cpp	[torch][quant] Support quantize and dequantize for torch (#2731 )	2024-01-12 19:11:14 -08:00