torch-mlir

Author	SHA1	Message	Date
Rob Suderman	7f475e174e	Add extf-trunc f32-f64-f32 elision (#3579) Torch represents all scalars as i64 and f64 types, which results in extraneous trunc-extf operations. We can rework this by eliding widen-narrow cases away.	2024-07-31 16:50:00 -07:00
Yuanqiang Liu	714270a922	[Stablehlo] legalize deprecated ops to stablehlo ops (#3543)	2024-07-17 00:05:11 +08:00
Matthias Gehre	6678e1a256	TorchToLinalg: Try folding shape computations to keep static shapes when possible (#3475) Before this PR, a statically shaped aten.convolution would generate dynamically shaped linalg IR, and even `-canonicalize` would not be able to fold it back into static shapes. This PR ensures that shape calculations are folded on construction to directly generate statically shaped linalg IR. We achieve that by ensuring that `arith` ops involved in computing shapes are created via `createOrFold`, so that later uses of `getAsOpFoldResult` see constants instead of those ops. For example ``` module { func.func @forward(%arg0: !torch.vtensor<[32,336,112,112],f32>, %arg1: !torch.vtensor<[336,168,3,3],f32>, %arg2: !torch.vtensor<[336],f32>) -> !torch.vtensor<[32,336,56,56],f32> { %false = torch.constant.bool false %int2 = torch.constant.int 2 %int1 = torch.constant.int 1 %0 = torch.prim.ListConstruct %int1, %int1 : (!torch.int, !torch.int) -> !torch.list<int> %1 = torch.prim.ListConstruct %int2, %int2 : (!torch.int, !torch.int) -> !torch.list<int> %2 = torch.prim.ListConstruct : () -> !torch.list<int> %3 = torch.aten.convolution %arg0, %arg1, %arg2, %1, %0, %0, %false, %2, %int2 : !torch.vtensor<[32,336,112,112],f32>, !torch.vtensor<[336,168,3,3],f32>, !torch.vtensor<[336],f32>, !torch.list<int>, !torch.list<int>, !torch.list<int>, !torch.bool, !torch.list<int>, !torch.int -> !torch.vtensor<[32,336,56,56],f32> return %3 : !torch.vtensor<[32,336,56,56],f32> } } ``` would result in ``` [...] %padded = tensor.pad %2 low[%14, %15, %16, %17] high[%14, %15, %16, %17] { ^bb0(%arg3: index, %arg4: index, %arg5: index, %arg6: index): tensor.yield %cst : f32 } : tensor<32x336x112x112xf32> to tensor<?x?x?x?xf32> [...] %45 = linalg.conv_2d_ngchw_gfchw {dilations = dense<1> : vector<2xi64>, strides = dense<2> : vector<2xi64>} ins(%expanded, %expanded_37 : tensor<?x2x?x?x?xf32>, tensor<2x168x168x3x3xf32>) outs(%expanded_44 : tensor<32x2x168x?x?xf32>) -> tensor<32x2x168x?x?xf32> [...] ``` and with this PR all shapes are static.	2024-06-27 08:43:10 +02:00
Yuanqiang Liu	689efc8917	[Torch] fix toBuiltinTensor() (#3415) * Let `toBuiltinTensor()` reflect the original dtype of `!torch.vtensor`. * Backends handle dtype conversion themselves.	2024-06-08 09:36:32 +08:00
Yuanqiang Liu	50f7103098	[Stablehlo] support uint8 (#3367) Support lowering unsigned integer type to stablehlo as discussed in https://github.com/llvm/torch-mlir/pull/2184. The things I do in this PR: 1. create `setupBackendTypeConversionForStablehlo()`, `createFuncBackendTypeConversionForStablehloPass` and `createFinalizingBackendTypeConversionForStablehloPass`. 2. remove `InferTypeOpInterface` from `torch_c.to_builtin_tensor`, because its result type differs between the linalg backend and the stablehlo backend: ``` // linalg backend func.func @forward(%arg0: !torch.vtensor<[3],ui8>) -> tensor<3xf32> { %c = torch_c.to_builtin_tensor %arg0 : (!torch.vtensor<[3], ui8> -> tensor<3xi8> %0 = tensor.empty() : tensor<3xf32> %1 = linalg.generic {indexing_maps = [#map, #map], iterator_types = ["parallel"]} ins(%arg0 : tensor<3xi8>) outs(%0 : tensor<3xf32>) { ^bb0(%in: i8, %out: f32): %2 = arith.uitofp %in : i8 to f32 linalg.yield %2 : f32 } -> tensor<3xf32> return %1 : tensor<3xf32> } // stablehlo backend func.func @forward(%arg0: !torch.vtensor<[3],ui8>) -> tensor<3xf32> { %c = torch_c.to_builtin_tensor %arg0 : (!torch.vtensor<[3], ui8> -> tensor<3xui8> %0 = stablehlo.convert %arg0 : (tensor<3xui8> -> tensor<3xf32> return %0 : tensor<3xf32> } ``` 3. fix stablehlo and linalg's conversion	2024-06-04 09:04:59 +08:00
Rob Suderman	afca88a058	[NFC] Change to cast instead of .cast variants (#3405) Member casts have been deprecated. Changing over a bunch of the member cast calls to the global templated variants to remove deprecation warnings.	2024-05-30 23:45:13 -07:00
penguin_wwy	1f544c37d0	[NFC] Remove unused header files (#3386)	2024-05-30 14:30:36 +08:00
penguin_wwy	405f884522	[stablehlo] verify stablehlo backend contract (#3338)	2024-05-16 11:03:43 +08:00
Stella Laurenzo	5d4b803914	[NFC reformat] Run pre-commit on all files and format misc. This is part 1 of ~3, formatting all miscellaneous text files and CPP files matched by a first run of pre-commit. These tend to be low change-traffic and are likely not disruptive. Subsequent patches will format Python files and remaining CPP files.	2024-04-27 14:08:09 -07:00
penguin_wwy	6679728c56	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3243) Like #3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated	2024-04-27 14:00:56 -07:00
Aart Bik	2eac8a992f	[torch-mlir][sparse] sparse tensor dialect is a legal dialect (#3227)	2024-04-26 02:36:42 +08:00
penguin_wwy	d4a30b7e67	Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (#3130) We should prefer functional style as the method style is deprecated https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated (https://mlir.llvm.org/deprecation/)	2024-04-11 06:47:35 -07:00
Yuanqiang Liu	88533b1968	[Stablehlo] fix aten.arange's lowering to stablehlo (#3138) * promote to f64 to do division, avoid division on i64 (floor div) * refactor torch-to-stablehlo-pipeline	2024-04-11 15:55:56 +08:00
Yuanqiang Liu	43d54efd14	[cmake] link TorchMLIRTorchConversionPasses to TorchMLIRConversionPasses (#3113) * because `TorchMLIRTorchConversionPasses` was missing dependencies on `TorchMLIRTorchToStablehlo` and `TorchMLIRTorchToTensor`. * use `TorchMLIRConversionPasses` instead of scattered targets.	2024-04-08 14:44:34 +08:00
Yuanqiang Liu	6cbb2f7ae0	[Stablehlo] add stablehlo-canonicalize-dynamism when lowering (#3097) so that many stablehlo e2e testcases could pass	2024-04-02 22:47:24 +08:00
Rob Suderman	14b548f968	[torch] Improve shape inference for `torch-to-linalg` path for reshapes (#3055) Reshaping tensors depends on directly matching individual dimensions to their corresponding dim in the `torch.view` reshape dimensions. This involves decoupling dynamic dimensions from their static counterparts and supporting cleanup / canonicalization.	2024-03-26 12:41:40 -07:00
Yuanqiang Liu	8b96727d0d	[Stablehlo] lowering chlo to stablehlo in torch-to-stablehlo pipeline (#3037) because stablehlo is a better boundary than chlo between the frontend compiler and the backend compiler.	2024-03-19 21:18:54 +08:00
Rob Suderman	4a7a7d76f8	[onnx] Fix ReduceMean lowering to torch (#2956) Torch lowering only supported the most recent version. Refactored the lowering to more easily handle default values and optional operands / attributes.	2024-02-27 22:48:07 -08:00
Rob Suderman	e30a083aff	[torch] Rework lowering to tm_tensor.scatter to stop serialization (#2940) We collapsed and broadcasted scatter indices to a single element version. We should instead use `tm_tensor.scatter`'s support for multiple indices and its implicitly broadcasted behavior. This avoids the serialization and materializing a needlessly large indices tensor.	2024-02-27 11:46:57 -08:00
Stella Laurenzo	4446fa00d8	Migrate passes in TorchConversion to use FunctionOpInterface. (#2935) This enables better re-use in downstreams which use different func implementations and should have no impact on those that don't except in opt pipelines if using the old form. With interfaces, explicit pipelines via `--pass-pipeline=` must be used.	2024-02-20 08:54:02 -08:00
Scott Todd	d6e1d836ca	Drop torch attributes at the end of backend conversion. (#2876) Fixes https://github.com/llvm/torch-mlir/issues/2866 Some backends / downstream projects expect that a "fully converted" program has no remaining ops or attributes from the original dialect(s).	2024-02-13 14:32:02 -08:00
Rob Suderman	0114a570e3	[torch] Support lowering `torch.item` to `tensor.extract` (#2835) Extracting scalar values from tensors can be implemented via a lowering to tensor.extract.	2024-01-31 15:09:12 -08:00
Quinn Dawkins	494089d53d	Clang format refresh (#2812) After noticing a number of commits with unrelated formatting changes, I think something was changed with clang-format at one point and we're seeing a number of unrelated changes. Doing a refresh can help avoid this. The changes made here came from ``` find lib -iname *.h -o -iname *.cpp | xargs clang-format -i --style=llvm find include -iname *.h -o -iname *.cpp | xargs clang-format -i --style=llvm find projects -iname *.h -o -iname *.cpp | xargs clang-format -i --style=llvm ```	2024-01-29 12:59:33 -05:00
Rob Suderman	f6f890520b	[torch][quant] Quantized `torch.mm` for linalg with end-to-end test (#2750) This includes custom op matching for decomposed operations and fusing dequantization into dense operations. As a validation we compare to the dequant+mm torch implementation.	2024-01-24 14:02:50 -08:00
Stella Laurenzo	6961f0a247	Re-organize project structure to separate PyTorch dependencies from core project. (#2542) This is a first step towards the structure we discussed here: https://gist.github.com/stellaraccident/931b068aaf7fa56f34069426740ebf20 There are two primary goals: 1. Separate the core project (C++ dialects and conversions) from the hard PyTorch dependencies. We move all such things into projects/pt1 as a starting point since they are presently entangled with PT1-era APIs. Additional work can be done to disentangle components from that (specifically LTC is identified as likely ultimately living in a `projects/ltc`). 2. Create space for native PyTorch2 Dynamo-based infra to be upstreamed without needing to co-exist with the original TorchScript path. Very little changes in this path with respect to build layering or options. These can be updated in a followup without commingling directory structure changes. This also takes steps toward a couple of other layering enhancements: * Removes the llvm-external-projects/torch-mlir-dialects sub-project, collapsing it into the main tree. * Audits and fixes up the core C++ build to account for issues found while moving things. This is just an opportunistic pass through but roughly ~halves the number of build actions for the project from the high 4000's to the low 2000's. It deviates from the discussed plan by having a `projects/` tree instead of `compat/`. As I was thinking about it, this will better accommodate the follow-on code movement. Once things are roughly in place and the CI passing, followups will focus on more in-situ fixes and cleanups.	2023-11-02 19:45:55 -07:00
Stella Laurenzo	078d1e1a1d	Remove mlir-hlo (replace with stablehlo). (#2460) We just have to do this: I ran into an issue today where I needed to make a one line patch to stablehlo to work around a compiler issue, and it is completely unapparent how to do so given that the mlir-hlo repo is a read-only export and is at the tail end of a multi-week integration chain from the open-source stablehlo repo. We've discussed this often enough and gotten +1 from everyone that they are ok with taking the e2e testing hit if it becomes necessary: It is necessary as the current situation is unmanageable. Looking at it, I expect it wouldn't actually be very difficult to build a little runner binary out of the stablehlo interpreter and subprocess call that in order to get the testing coverage back. I leave that as an exercise to the users of this part of the stack and recommend following the breadcrumbs from the deleted python/torch_mlir_e2e_test/stablehlo_backends/linalg_on_tensors.py file and the main.py changes. Note that I am pointing us at a stablehlo fork for the moment until it is apparent that we don't need to carry any local patches to it. We can update this in a few days if everything is clear.	2023-09-12 19:10:02 -07:00
Yuanqiang Liu	5895b9f8ca	fix compile warning (#2453)	2023-09-12 09:31:47 +08:00
jinchen62	1682b540bf	Prototype passes for lowering quantized group matmul (#2402) * Support brevitas custom op (#2320) * f16 change for brevitas * Adapt the change of brevitas quant custom op name * Add unit tests * Make brevitas conversions isolated * Address the comments --------- Co-authored-by: dan <danimal197@gmail.com>	2023-08-29 21:25:45 -07:00
Maksim Levental	0caaf8d32a	Bump LLVM (#2176) * Bump LLVM --------- Co-authored-by: Matthias Gehre <matthias.gehre@xilinx.com>	2023-06-13 16:17:23 +02:00
Yuanqiang Liu	5223f990df	[Stablehlo] Enable Stablehlo backend with arith dialect (#2139)	2023-05-26 22:57:57 +08:00
Prashant Kumar	3cd91affbc	Add complex types support with basic complex ops. Add complex types support with basic complex types. Add aten.imag and aten.real op lowering via linalg_backend.	2023-05-11 21:29:07 +05:30
Eric Kunze	6a833e1922	Update to LLVM 3157f03a349cfc852cdd994675eaa9652caa2e3a (#2060) New requirement to explicitly cast for interfaces https://reviews.llvm.org/D148493	2023-04-25 08:52:46 -07:00
Alexandre Rames	224ee27610	Fix a few missing dependencies. (#2014) `TorchToTMTensor` depends on `TorchMLIRTorchUtils` for `mlir::torch::torch_upstream::get_reduction_enum`. `TorchMLIRTorchConversionPasses` depends on multiple libs for both tblgen'd headers and definitions. Test with `ninja TorchMLIRTorchConversionPasses` from a clean build.	2023-04-11 11:18:49 -07:00
Alexandre Rames	d24fa71368	Minor fixes for `ConvertTorchConversionToMLProgram`. (#1991) * Only create the global seed variable if it does not exist already. * Make the pass a module pass. A func pass may not modify its parent op.	2023-04-04 09:09:58 -07:00
Ashay Rane	711646d095	mhlo: migrate conversion to stablehlo (#1840) This patch replaces all MHLO operations with their StableHLO counterparts and adds a validation pass to ensure that no MHLO operations remain before translating all Stablehlo operations to the MHLO dialect for further lowering to the Linalg dialect. This patch also updates all lit tests so that they refer to the `convert-torch-to-stablehlo` pass and so that they check for StableHLO operations.	2023-02-02 07:29:47 -06:00
Eric Kunze	95bdfaa9bf	update llvm to d23516e9ad477527a9db4d06b1fa9566680ac67c (#1812) Rename BlockAndValueMapping to IRMapping Moved PrimTupleConstructOp type validation to its own verifier as the tablegen version does not work for a combination of variadic input and non-variadic output.	2023-01-23 16:34:22 -08:00
Ashay Rane	f63bb9f86c	build: update llvm tag to 3a020527 (#1717) Summary of changes: - Replace `llvm::None` with `std::nullopt`, since the former is deprecated (https://reviews.llvm.org/D139763) - Use setter for symbol visibility instead of passing string attribute when creating FuncOp	2022-12-14 02:06:39 -06:00
Vivek Khandelwal	f416953600	[MLIR][TORCH] Add TorchConversionToMLProgram and MLProgramBufferize pass This commit changes the `InsertRngGlobalsPass` to `TorchConversionToMLProgram` pass. This commit also adds the `MLProgramBufferize` pass for the bufferization of ml_program dialect ops to run on refbackend. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-02 13:20:46 +05:30
Gaurav Shukla	0d209998d1	llvm: update tag to e864ac6945 (#1600) Summary of changes: 1. Replace `string` iterator types by `IteratorType` enum. (`e6598b053d`) 2. Update `includes` wrt new directory layout of MLIR HLO codebase. (`9fd8d251a8`) 3. Update tags llvm: e864ac694540342d5e59f59c525c5082f2594fb8 MHLO: eab364ba2a66bd0613efb94f8a738c1c97aaee92 Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-11-16 14:40:36 -08:00
Ashay Rane	faa9a78e38	build: update llvm tag to 6f46ff37 (#1448) Summary of changes: - Updated references to the Arith dialect (https://reviews.llvm.org/D134762) - Switched to prefixed accessors for MemRef dialect (https://reviews.llvm.org/D134995) - Fixed warnings about signed/unsigned comparisons, ignored return values, and unused variables	2022-10-05 08:28:06 -05:00
Gleb Kazantaev	708fa346a6	Fix Base Lazy Backend Type Conversion (#1412) * Fix c10::prim::Constant conversion; Added CAPI for passes; Added passes to base lazy backend * Update ivalue_importer to use ImportOptions; Added tests for non-value/value tensor types * Added tests for scalar Constant import; Updated MB::importFunction to use ImportOptions * Test updates * Move back module variable name * Remove RefineTypes from TorchMlirLoweringContext::Build() * Rename pass; Remove passes from base lazy backend * Rename pass to VerifyBackendContractPass * Aligned cmd pass name; Fixed TorchConversion passes registration	2022-10-04 15:53:28 -07:00
Tanyo Kwok	72e422b589	Add relu6 and binary broadcasts (#1408) * Add relu6 and binary broadcasts	2022-09-23 20:39:15 +08:00
Sean Silva	851ce0c940	Remove TorchLoweringPipelineOptions from TorchConversion pipelines TorchLoweringPipelineOptions only applies to the frontend lowering pipeline.	2022-09-14 11:20:29 -07:00
Tanyo Kwok	7f63a17a46	[MHLO] add new options to pipeline (#1331)	2022-09-12 10:27:41 -07:00
Tanyo Kwok	57d8ec151f	[MHLO] add VerifyMhloBackendContract (#1321) * [MHLO] add VerifyMhloBackendContract * guard with macro	2022-09-01 17:08:17 +08:00
Sean Silva	0e3ddbac91	Remove VerifyInvariantsBeforeBackendLowering LowerToBackendContract now checks all this consistently.	2022-08-26 10:24:43 -07:00
Sean Silva	57681f7947	Iteratively run the main simplification pipeline. This introduces a new pass LowerToBackendContract (better name very welcome) which performs the bulk of the simplifications that we do, such as - shape refinement - dtype refinement - maximizing value semantics - inlining global slots - decomposing complex ops The key difference from before is that it iterates the set of transformations, which can help to break a number of "catch-22" issues where one simplification depends on another, the latest example being here: https://github.com/llvm/torch-mlir/issues/1131 This also exposed that RefineTypes was sometimes crashing/asserting for certain inputs. This commit hardens it a bit.	2022-08-17 14:54:33 -07:00
Yan Xu	9be8997536	Revert "add native_dropout and related ops pattern (#1211)" (#1230) This reverts commit `c935795086`.	2022-08-17 13:48:10 +08:00
Yan Xu	c935795086	add native_dropout and related ops pattern (#1211)	2022-08-15 09:28:47 +08:00
Ramana Radhakrishnan	738f4fe96a	Rename TorchToStd pass as TorchToArith (#1163) All the converters in this pass appear to create ops from the arith dialect. Hence the full rename. Fix GH Issue #409.	2022-08-10 20:12:51 +01:00
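
As a rough illustration of the widen-narrow case mentioned in commit 7f475e174e: extending an f32 value to f64 is exact, so truncating the result straight back to f32 should reproduce the original bits, which is presumably why such a pair can be dropped. The sketch below checks this in plain Python using `struct`. The helper names (`f32_bits`, `extf`, `truncf`, `widen_narrow_is_identity`) are hypothetical stand-ins for the MLIR ops and are not part of torch-mlir, and NaN inputs are deliberately left out.

```python
import struct


def f32_bits(x: float) -> int:
    """Round a Python float (an f64) to f32 and return its raw 32-bit pattern."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def extf(bits: int) -> float:
    """Widen an f32 bit pattern to f64; every f32 value is exactly representable."""
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def truncf(x: float) -> int:
    """Narrow an f64 value to f32 and return the resulting bit pattern."""
    return f32_bits(x)


def widen_narrow_is_identity(bits: int) -> bool:
    """True if f32 -> f64 -> f32 gives back the same f32 bit pattern."""
    return truncf(extf(bits)) == bits


# A few representative f32 values (hypothetical test inputs, NaN excluded).
samples = [f32_bits(v) for v in (0.0, -0.0, 1.0, 0.1, -2.5, 1e30, float("inf"))]
samples.append(0x00000001)  # smallest positive f32 subnormal

all_identity = all(widen_narrow_is_identity(b) for b in samples)
print(all_identity)
```

The opposite order (f64 to f32 and back to f64) is lossy in general, for example `extf(f32_bits(0.1))` is not equal to `0.1`, so only the widen-then-narrow direction is a candidate for removal in this sketch.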

90 Commits (18139994e807d262f52a13b2c8e1b3edfa45ffa0)