torch-mlir

Commit Graph

Author	SHA1	Message	Date
Ramiro Leal-Cavazos	211cf8fc36	Add `report_fatal_error` to `getTypeForScalarType` (#1722 ) Functions like `getTypeForScalarType` that do a mapping from one set of types to another should not fail, and if they do it should be obvious to the developer that that function has an unhandled case. Instead of silently failing when encountering an unsupported type, this commit adds a `report_fatal_error` at the end, similar to other type translation functions in this file.	2022-12-15 08:33:14 -08:00
Ramiro Leal-Cavazos	60db793feb	Pass op legality info to `verifyBackendContractPass` (#1705 ) In order to verify if a given IR satisfies the backend contract, the verifier needs to know if decompositions took place, and if so, which ops were decomposed and which were not. This commit adds two arguments to `verifyBackendContractPass` to specify if decompositions took place and which ops to consider backend legal, similar to the arguments of `LowerToBackendContractPass`.	2022-12-15 08:32:52 -08:00
Prashant Kumar	564403e3a1	Add float16 support in the refbackend. This will require https://reviews.llvm.org/D139121 patch to go through.	2022-12-15 21:19:52 +05:30
Sean Silva	b60da34f84	[cleanup] Fix a few more llvm::None -> std::nullopt	2022-12-14 05:59:49 -08:00
Ashay Rane	f63bb9f86c	build: update llvm tag to 3a020527 (#1717 ) Summary of changes: - Replace `llvm::None` with `std::nullopt`, since the former is deprecated (https://reviews.llvm.org/D139763) - Use setter for symbol visibility instead of passing string attribute when creating FuncOp	2022-12-14 02:06:39 -06:00
Ahmed S. Taei	b1f6832849	Add aten.slice.Tensor & aten.cat folders (#1691 )	2022-12-13 13:02:47 -08:00
Ramiro Leal-Cavazos	a710237437	[custom op] Generalize shape library logic to work with dtypes (#1594 ) * [custom op] Generalize shape library logic to work with dtypes This commit generalizes the shape library logic, so that dtype rules for ops can also be expressed using the same mechanism. In other words, each op can now have a shape function and a dtype function specified in Python that is imported during lowering to calculate the shapes and dtypes throught a program. For more information about how to specify a dtype function, see the updated `docs/adding_a_shape_and_dtype_function.md`. For those not familiar with how the shape library works, the file `docs/calculations_lib.md` provides an overview.	2022-12-13 08:25:41 -08:00
Chi_Liu	163d19cce6	[TOSA] Add aten.add/sub.Scalar/Tensor si64 type support (#1604 )	2022-12-12 12:13:07 -08:00
Ramiro Leal-Cavazos	73bd32d06c	Make `getTensorRank` safer by changing return to `Optional<unsigned>` (#1707 ) Currently `getTensorRank` returns -1 if it was unable to get the rank of the tensor. However, not every use in the codebase was checking the return value, and in some cases, the return value was casted to unsigned leading to some infinte loops when an unranked tensor reached a decomposition. This commit changes the return of `getTensorRank` to `Optional<unsigned>` to make it clear to the user that the function can fail. This commit also changes a couple of for loops that iterate a vector in reverse order that can potentially become infinite loops into range-based for loops.	2022-12-12 08:56:28 -08:00
Vivek Khandelwal	d4862ec611	[MLIR][TORCH] Add e2e support for aten.var_mean op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-12 15:46:54 +05:30
Vivek Khandelwal	f783e19dcb	Revert "[MLIR][TORCH] Fix mean and mean.dim op for large-sized inputs" This reverts commit `55c7e66aa7`.	2022-12-09 19:30:46 +05:30
Sambhav Jain	f8a2592905	[Bazel] Resolve circular dependency and add targets for conversion to MLProgram dialect (#1694 ) A circular dependency was introduced in `e7edcc62fd`. Specifically, the `makeShapeLLVMCompatible` and `makeShapeTorchCompatible` utilities were being called from `lib/Dialect/Torch/IR/TorchTypes.cpp` and `lib/Dialect/Torch/IR/TorchOps.cpp` defined under the `:TorchMLIRTorchDialect` bazel target, leading it to take a dependency on `:TorchMLIRConversionUtils` which already depends on `:TorchMLIRTorchDialect`, hence creating a circular dependency. This commit resolves the same by moving said utilities from `lib/Conversion/Utils/Utils.cpp` to `lib/Dialect/Torch/Utils/Utils.cpp`. Please LMK if there's a better way to fix this and I will update the code. This commit also adds the required targets to support building the new conversions from Torch to ML Program dialect that was introduced in `f416953600`. Bazel build GHA triggered manually to verify: https://github.com/sjain-stanford/torch-mlir/actions/runs/3645944517	2022-12-08 09:49:54 -08:00
Ramiro Leal-Cavazos	a54b334578	Allow running DecomposeComplexOps more than once (#1671 ) The current implementation of `DecomposeComplexOps` fails if an op expected to be decomposed does not get decomposed in the first iteration of the `createTorchSimplificationPipeline` in `LowerToBackendContractPass`. However, some graphs require multiple iterations of `createTorchSimplificationPipeline` to fully propagate all statically knowable information, such as dtypes and shapes, to the entire graph, sometimes resulting in the need to run `DecomposeComplexOps` more than once. This commit changes `DecomposeComplexOps` to use a greedy algorithm for pattern application and moves the legalization check of ops to the `LowerToBackendContractPass` to allow for the `DecomposeComplexOps` to run more than once.	2022-12-08 09:26:38 -08:00
Ramiro Leal-Cavazos	dd35488da5	build: update llvm tag to 798fa4b4 (#1684 ) - Support for non-prefixed accessors has been removed. See: https://reviews.llvm.org/D136727 - Rename `operands` to `methodOperands` in `prim.CallMethod` since the name `operands` overlaps with a builtin method name. See: https://reviews.llvm.org/D136727 - Add passes in refbackend to lower memref.subview. See: https://reviews.llvm.org/D136377 - Replace `CopyToValueTensorOps` first in `RewriteViewLikeSubgraph` in maximize-value-semantics. The current implementation of the `RewriteViewLikeSubgraph` pass in maximize-value-semantics creates temporarily invalid IR. In particular, given a forward slice starting from a `CopyToNonValueTensorOp` and ending in `CopyToValueTensorOp`s, the pass first replaces all uses of the `CopyToNonValueTensorOp` with its operand, which results in all the `CopyToValueTensorOp` users having their operand have type `!torch.vtensor`, which is invalid. The correct way to do things is to first replace all the `CopyToValueTensorOp`s with their operand, and then replace all uses of the `CopyToNonValueTensorOp` with its operand. This only started failing now because the generated accessor `getOperand` for the `CopyToValueTensorOp` now returns a `TypedValue<NonValueTensorType>`, which has an assert checking that the value returned is of the expected type.	2022-12-07 12:20:41 -08:00
Vivek Khandelwal	3e4bb2bd8e	[MLIR][TORCH] Add E2E support for randn and randn.generator op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-06 22:41:24 +05:30
Vivek Khandelwal	f416953600	[MLIR][TORCH] Add TorchConversionToMLProgram and MLProgramBufferize pass This commit changes the `InsertRngGlobalsPass` to `TorchConversionToMLProgram` pass. This commit also adds the `MLProgramBufferize` pass for the bufferization of ml_program dialect ops to run on refbackend. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-02 13:20:46 +05:30
Eric Kunze	3fc27cf6ca	Update LLVM Tag to 2c1fa734 (#1670 ) Summary of changes: - Change ShapedType::kDynamicSize -> ShapedType::kDynamic - llvm::NoneType has been deprecated, change convertScalarToDtype to use llvm::None	2022-12-01 20:38:28 -08:00
Ramiro Leal-Cavazos	b4b92c990e	Replace LCG algorithm with squares64 algorithm in AtenUniformOp (#1633 ) This commit replaces the LCG algorithm that was being used by the `TorchToLinalg` lowering of `AtenUniformOp` to generate random numbers with the `squares64` algorithm, for the LCG algorithm was producing tensors that were highly correlated with one another. Squares64 algorithm: https://arxiv.org/abs/2004.06278 Closes https://github.com/llvm/torch-mlir/issues/1608	2022-12-01 08:30:10 -08:00
Vivek Khandelwal	e7edcc62fd	build: update llvm tag to 147fe9de Summary of changes: - Replace call to `MemoryEffectOpInterface::hasNoEffect` with `isMemoryEffectFree`. - Make fix for the dynamic dims, since `kDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` in llvm - `makeShapeLLVMCompatible` and `makeShapeTorchCompatible` utilities convert shapes in order to remain consistent with the Torch and MLIR semantics. - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-12-01 13:36:50 +05:30
Abhishek Varma	47f67853ac	[RefineTypes] Add Float16Type dtype knowledge support for trivial ops -- This commit adds Float16Type dtype knowledge support for trivial ops. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-12-01 10:22:43 +05:30
Ramiro Leal-Cavazos	0983a7f93a	Fix modulus calculation in LCG algorithm of refbackend (#1658 ) The current implementation sets the `nextSeed` value to `temp & 127`, which is wrong. The last step of the LCG algorithm for the multiplier and increment chosen should be `temp % 2^{64} = temp & (1 << 63)`. However, because we are dealing with i64 values, the modulus operation happens automatically, so it is not needed. See Donald Knuth's values for LCG here: https://en.wikipedia.org/wiki/Linear_congruential_generator	2022-11-30 08:46:52 -08:00
Abhishek Varma	c27c1791f1	[MLIR][TORCH] Add e2e support for `aten.amax` op -- This commit adds e2e support for `atend.amax` op. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-11-30 17:54:37 +05:30
Abhishek Varma	2c643adcb9	[TORCH][DECOMPOSE] Fix bug in computeReductionType API -- This commit fixes a bug in computeReductionType API. -- The bug pertains to removal of `dim` from the `sizes` array. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-11-30 17:54:37 +05:30
Tanyo Kwok	bbcdb38d99	Revert "Decompose torch.slice_scatter (#1622 )" (#1659 ) This reverts commit `f3f2f10030`.	2022-11-30 12:47:13 +08:00
Sean Silva	ecb09c2fc3	[torchdynamo] Fix output size computation for upsample_nearest2d	2022-11-29 01:46:29 -08:00
Abhishek Varma	bb259f918a	[MLIR][TORCH] Add lowering for `aten._softmax` when `half_to_float=True` -- This commit adds decompose logic for `aten._softmax` when `half_to_float` is `True`. -- An e2e test case will be added once support for half to float conversion for `aten._softmax` is added upstream. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-11-28 22:32:00 +05:30
Vivek Khandelwal	d9cbf01d1e	Revert "build: update llvm tag to 147fe9de" This reverts commit `e45ad313d4`.	2022-11-25 12:41:56 +05:30
Vivek Khandelwal	e45ad313d4	build: update llvm tag to 147fe9de Summary of changes: - Update call to `hasNoEffect` utility - `KDynamicSize` value changed to `std::numeric_limits<int64_t>::min()` from `-1` - Update tags llvm: 147fe9de29dc13c14835127b35280c4d95c8e8ba mhlo: 1944b5fa6062ec4c065d726c9c5d64f1487ee8c5 Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-24 12:44:43 +05:30
Tanyo Kwok	14f1260ac4	Add more mhlo basic converters (#1628 ) * Add more mhlo basic converters * remove unused pinnedMemory constraints * refine naming	2022-11-24 14:28:34 +08:00
Tanyo Kwok	f3f2f10030	Decompose torch.slice_scatter (#1622 ) * Decompose torch.slice_scatter * fix compilation error * update file check * fix ci * fix i64 torch.tensor dtype	2022-11-23 18:14:12 +08:00
Vivek Khandelwal	da8fdc9f96	[MLIR][TORCH] Fix refine types crash This commit fixes https://github.com/llvm/torch-mlir/issues/1599. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-23 15:17:37 +05:30
Tanyo Kwok	4aad5ccf39	fix #1626 return type mismatch (#1634 )	2022-11-23 15:02:41 +08:00
Vivek Khandelwal	68f568b704	[MLIR][TORCH] Add E2E support for prims.convert_element_type op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-22 09:36:36 +05:30
Vivek Khandelwal	55c7e66aa7	[MLIR][TORCH] Fix mean and mean.dim op for large-sized inputs This commit fixes the aten.mean and aten.mean.dim op decomposition for supporting large-sized inputs. This commit also fixes the formatting for the file stats.py Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-22 08:38:51 +05:30
Tanyo Kwok	a9fb0c5459	fix mhlo e2e ci crashes (#1620 ) * fix mhlo e2e ci crashes * add passed tests * calc dynamic positive dim	2022-11-21 21:50:35 +08:00
Vivek Khandelwal	4cbd3927d7	[MLIR][TORCH] Add aten.sort.int op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-20 19:00:41 +05:30
Chi_Liu	29c8f47723	[TOSA] Add aten.clamp op tosa support (#1609 ) Co-authored-by: AmosLewis <Amos_Lewsi@foxmail.com>	2022-11-18 13:32:13 -08:00
Abhishek Varma	1d949f3ac2	[MLIR][TORCH] Fix aten.upsample_nearest2d op -- aten.upsample_nearest2d.vec op is not present owing to https://github.com/pytorch/pytorch/pull/85638 -- So this commit adds a lowering on aten.upsample_nearest2d. Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>	2022-11-18 13:41:47 +05:30
Sean Silva	39de4d6265	[cleanup] Make diagnostics better Also remove some unused imports.	2022-11-17 02:09:54 -08:00
Vivek Khandelwal	5f7177da35	[MLIR][TORCH] Add decomposition for aten.var_mean.correction op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-17 13:00:09 +05:30
Gaurav Shukla	0d209998d1	llvm: update tag to e864ac6945 (#1600 ) Summary of changes: 1. Replace `string` iterator types by `IteratorType` enum. (`e6598b053d`) 2. Update `includes` wrt new directory layout of MLIR HLO codebase. (`9fd8d251a8`) 3. Update tags llvm: e864ac694540342d5e59f59c525c5082f2594fb8 MHLO: eab364ba2a66bd0613efb94f8a738c1c97aaee92 Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com> Signed-off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-11-16 14:40:36 -08:00
Ramiro Leal-Cavazos	09ca07bca0	`m_TorchConstant{Int/Bool}List` -> `m_TorchListOfConstant{Int/Bool}s` (#1601 ) This commit renames the patterns used to match on lists of constant values to `m_TorchListOfConstant{valueType}s`. This is needed to avoid ambiguity for when `valueType` has `Optional` in it. In particular, it makes it clear whether the values in the list are optional or the list itself is optional.	2022-11-16 20:33:12 +00:00
Vivek Khandelwal	a1d3afdba9	[MLIR][TORCH] Add E2E support for aten.randint.low op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-16 09:54:18 +05:30
AmosLewis	22a5067242	[TOSA] Add more tosa::cast type support	2022-11-16 09:53:10 +05:30
George Petterson	92f385bd9f	[MLIR][TORCH] Add E2E support aten.convolution_backward op This commit adds the decomposition for the `aten.convolution_backward` and `aten.convolution_backward_overrideable` op.	2022-11-15 07:38:26 +05:30
Chi_Liu	dfe7513a45	[MLIR][TORCH] Fix aten.unsqueeze op (#1578 ) The range of the unsqueeze dim is: [-input.dim() - 1, input.dim() + 1), the bug forgets to add 1.	2022-11-14 09:09:15 -08:00
Vivek Khandelwal	a558034c1a	[MLIR][TORCH] Fix aten.upsample_nearest2d_backward op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-12 00:05:36 +05:30
Yuanqiang Liu	2793a2bd41	fix TorchToMhlo Conversion cmake dependency (#1549 )	2022-11-09 18:34:53 -06:00
Vivek Khandelwal	fedf8c0640	[MLIR][TORCH] Add E2E support for aten.upsample_nearest2d_backward.vec op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-11-04 22:10:07 +05:30
Ashay Rane	0409595ccc	mlir: add missing dependency on TableGen targets (#1537 ) lib/Dialect/Torch/Utils/Utils.cpp includes TorchOps.h, which, by way of included header files, refers to both TorchOps.h.inc as well as TorchTypes.h.inc. However, the build rules do not specify the dependency of the `TorchMLIRTorchUtils` target on the TableGen generated header files, causing spurious build errors. This patch fixes the problem by adding `MLIRTorchOpsIncGen` and `MLIRTorchTypesIncGen` to the list of dependencies of `TorchMLIRTorchUtils`.	2022-11-01 14:59:11 -05:00
Tanyo Kwok	17bc7c89cc	build: update llvm tag to 74fb770d (#1539 ) * build: update llvm tag to 74fb770d This commit makes the following changes needed to update bump LLVM: + replace usages of `tensor::createPadScalarOp`, see https://reviews.llvm.org/D136493 + Update file checks	2022-11-01 15:27:09 +08:00
xndcn	759057cbdd	[MLIR][TORCH] Fix wrong parameter name "supportFPInputOnly" The parameter "supportFPInputOnly" of function createPoolingOp() is supposed to be "supportNonFPInput", which was added to distinguish between "MaxPool2d" and "AvgPool2d" op in #718	2022-10-30 23:18:08 +08:00
Vivek Khandelwal	c86177730d	[MLIR][TORCH] Add E2E support for aten.fill.Tensor op This commit adds the decomposition for `aten.fill.Tensor` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-10-30 18:40:47 +05:30
Ramiro Leal-Cavazos	b723186983	Remove all but one of valsem ops + move fill.Scalar to elementwise (#1531 ) This commit removes almost all of the valsem ops, since the value semantics version of the ops now exist in PyTorch. The only op missing is `aten.bernoulli_.float`. In addition, this commit also simplifies the implementation of `aten.fill.Scalar` by moving it to the pattern that converts elementwise ops.	2022-10-28 15:06:11 +00:00
Daniel Ellis	3e199aaf11	Add better error message for single-tensor tuple returns.	2022-10-25 12:48:55 -04:00
Vivek Khandelwal	ca87033d2f	[MLIR][TORCH] Add E2E support for aten.mse_loss op This commit adds decomposition for the `aten.mse_loss` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-10-25 21:06:58 +05:30
Chi_Liu	ad6f5848cb	[MLIR][TORCH] Add TorchToTosa lowering for aten.where.self op (#1454 )	2022-10-18 09:39:39 -07:00
Ramiro Leal-Cavazos	82a3860e25	build: update llvm tag to 4546397e (#1502 ) This commit makes the following changes needed to update bump LLVM: - Replace `linalg.init_tensor` with `tensor.empty` (see: https://reviews.llvm.org/D135129) - Replace `NoSideEffect` with `Pure` (see https://reviews.llvm.org/D135505) - Replace `body` region accessor for `ReduceOp` and `ReduceWindowOp` with `getBody` - Fix incorrect use of `tosa::ReduceSumOp` in `AtenNativeLayerNormOp` conversion pattern. The result type of `tosa::ReduceSumOp` must have the same rank as the input type. (see: https://www.mlplatform.org/tosa/tosa_spec.html#_reduce_sum) Co-authored-by: Ashay Rane <ashay@users.noreply.github.com> Co-authored-by: Ashay Rane <ashay@users.noreply.github.com>	2022-10-18 04:22:53 +00:00
Prashant Kumar	3a2cd23380	[LINALG] Add lowering for aten::round op. -- Added the lowering for aten::round op. -- Added the folding for integer cases.	2022-10-13 02:41:26 +05:30
Ramiro Leal-Cavazos	8f76c74be9	Remove unused input tensor from linalg.generic in aten.convolution (#1487 ) This commit removes the `weight` tensor from the inputs of one of the `linalg.generic` ops generated by the `aten.convolution` linalg lowering, since the indexed values are not actually used by the body of the `linalg.generic`. Moreover, in general the `weight` tensor does not have the same shape as the output tensor of the `linalg.generic`, so both tensors being indexed by the same indexing maps is wrong.	2022-10-12 14:01:24 -07:00
Abhishek Varma	61db1b5c4d	[MLIR][TORCH] Add e2e support for `aten.Mish` op (#1470 ) -- This commit adds e2e support for `aten.Mish` op. -- `aten.Mish` op is decomposed as following :- Mish(x) = x * Tanh(Softplus(x)) Signed-off-by: Abhishek Varma <avarma094@gmail.com> Signed-off-by: Abhishek Varma <avarma094@gmail.com>	2022-10-11 14:03:10 -07:00
Gaurav Shukla	da90a25f90	[MLIR][TORCH] Add E2E support for `aten.[div.int\|bitwise_or.Tensor]` ops This commit adds lowering of `aten.div.int` and `aten.bitwise_or.Tensor` ops. Both these ops are required in order to support bloom_560m model. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-10-10 22:28:51 +05:30
Vivek Khandelwal	d3cc3f1aff	[tosa] Add lowering for aten.to.dtype and aten._to_copy op This commit adds the TorchToTosa lowering for `aten.to.dtype` and `aten._to_copy` op. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-10-06 12:00:25 +05:30
Vivek Khandelwal	56f9a9b5de	[tosa] Add TorchToTosa lowering for torch.prim.NumToTensor.Scalar op Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-10-06 12:00:25 +05:30
Ramiro Leal-Cavazos	8201e7b067	[LINALG] Make `AtenMaxDimOp` use `arith.maxf` to calculate maximum (#1466 ) This commit updates the linalg conversion of `AtenMaxDimOp` to use `arith.maxf` instead of `arith.select` to calculate the maximum. This allows better vectorization further downstream, since the operation can be converted to a simple max reduction when the `indices` result is not used. See: https://github.com/iree-org/iree/issues/10666.	2022-10-05 18:22:59 -07:00
Ashay Rane	faa9a78e38	build: update llvm tag to 6f46ff37 (#1448 ) Summary of changes: - Updated references to the Arith dialect (https://reviews.llvm.org/D134762) - Switched to prefixed accessors for MemRef dialect (https://reviews.llvm.org/D134995) - Fixed warnings about signed/unsigned comparisons, ignored return values, and unused variables	2022-10-05 08:28:06 -05:00
Gleb Kazantaev	708fa346a6	Fix Base Lazy Backend Type Conversion (#1412 ) * Fix c10::prim::Constant conversion; Added CAPI for passes; Added passes to base lazy backend * Update ivalue_importer to use ImportOptions; Added tests for non-value/value tensor types * Added tests for scalar Constant import; Updated MB::importFunction to use ImportOptions * Test updates * Move back module variable name * Remove RefineTypes from TorchMlirLoweringContext::Build() * Rename pass; Remove passes from base lazy backend * Rename pass to VerifyBackendContractPass * Aligned cmd pass name; Fixed TorchConversion passes registration	2022-10-04 15:53:28 -07:00
Daniel Ellis	2ba71af651	Add support for mv decomposition.	2022-10-04 11:34:45 -04:00
Prashant Kumar	6777a9484d	[LINALG] Add lowering for the aten.upsample_nearest2d op.	2022-10-04 17:20:29 +05:30
Ashay Rane	855d267c57	build: update shape library after PyTorch version update (#1449 ) The auto-update of the PyTorch version broke the Torch-MLIR build because it did not update the shape library. Going forward, we should add the shape library update to the PyTorch version update action.	2022-10-02 14:05:53 -05:00
Vivek Khandelwal	9dd5ae8239	[tosa] Add TorchToTosa lowering for aten.arange.start_step op (#1442 )	2022-09-30 07:33:41 -07:00
AmosLewis	940959589b	[MLIR][TORCH] Add Byte and Char Dtype support	2022-09-30 13:19:31 +05:30
Vivek Khandelwal	6db513c51d	[tosa] Add support for some cases of aten.broadcast_to op (#1429 ) This commit adds support for TorchToTosa lowering of `aten.broadcast_to` op for cases: 1.) When the rank of input and output tensor is equal. 2.) When the rank of input tensor is zero. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-29 09:40:56 -07:00
Ramiro Leal-Cavazos	0f15b3a594	Bump shape library (#1427 )	2022-09-29 09:02:28 -07:00
Vivek Khandelwal	bce00c8ed1	[tosa] Fix torch.vtensor.literal lowering Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-29 17:03:10 +05:30
JakopinA	8ef0c874c2	Implement Expand/Collapse Functionality for Aten.View (#1353 )	2022-09-27 11:08:14 -07:00
Eric Kunze	cb1b8796a2	Convert torch si literals into signless for TOSA (#1421 )	2022-09-26 16:54:27 -07:00
武家伟	c03aa63325	[MLIR] Add canonicalizer for aten.slice.t op (#1413 ) * [MLIR] Add canonicalizer for aten.slice.t op * Add mlir tests and strength the canonicalizer * rename variable Co-authored-by: Vremold <xremold@gamil.com>	2022-09-26 14:35:50 -07:00
Ashay Rane	a60acf272d	build: update llvm tag to bebc9695 (#1415 ) Summary of changes: - Renamed OptionalArrayRefParameter since the name conflicts with an upstream symbol that has a different meaning (https://reviews.llvm.org/D133819) - Removed extraneous dependency between TorchMLIRTorchToMhlo and ChloOps, since the existing dependency on MhloDialect is sufficient - Fixed code to prevent warnings related to comparisons between signed and unsigned values	2022-09-26 11:44:54 -05:00
武家伟	ab7aa01b1e	[MHLO] Add torch-to-mhlo e2e support for aten.gather op (#1410 ) * Add torch-to-mhlo e2e support for aten.gather op * Add more e2e tests for torch.aten.gather op	2022-09-25 22:07:46 +08:00
Quinn Dawkins	53bf09ceef	Fix iterator types for embedding bag sum mode (#1371 )	2022-09-23 13:13:47 -04:00
Ashay Rane	b0b2b3a199	build: add missing dependency on MLIRTorchTypesIncGen (#1405 )	2022-09-23 08:08:16 -05:00
Tanyo Kwok	16dd7e2e5f	Fix dynamic shapes type verifications (#1409 ) * Fix dynamic shapes type verifications	2022-09-23 20:50:29 +08:00
Tanyo Kwok	72e422b589	Add relu6 and binary broadcasts (#1408 ) * Add relu6 and binary broadcasts	2022-09-23 20:39:15 +08:00
Tanyo Kwok	061a97c3f2	Replace empty_like && empty_memory_format with full/full_like (#1398 ) * Replace empty_like && empty_memory_format with full/full_like * fix broadcast rank0 tensor	2022-09-23 10:24:36 +08:00
Vivek Khandelwal	4ef6e69ed4	[MLIR][TORCH] Add TorchToTosa lowering for aten.clone op (#1388 ) Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com> Co-authored-by: Suraj Sudhir <16977902+sjarus@users.noreply.github.com>	2022-09-20 15:07:46 -07:00
Vivek Khandelwal	1ffd42bbde	[MLIR][TORCH] Add TorchToTosa lowering for aten.broadcast_to op (#1386 ) Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-09-20 10:04:51 -07:00
武家伟	0e2e94d542	Add torch-to-mhlo e2e support for AtenArangeStartStepOp (#1385 ) Co-authored-by: Vremold <xremold@gamil.com>	2022-09-20 22:31:24 +08:00
武家伟	4f3cd236dd	Strength the shape inference for aten.arange-like op (#1367 ) Strength the shape inference for aten.arange-like op by 1. registering aten.sub and aten.ceil.Scalar op and design folders for them. 2. register a new constant-like op: Torch::ConstantNumberOp and design canonicalizer for it.	2022-09-20 12:40:19 +08:00
Sambhav Jain	bb47b36eac	Add a `AllowedInModuleInitializer` trait to denote ops that are permitted in the module initializer (#1379 ) This PR adds an `AllowedInModuleInitializer` trait to keep track of ops that are permitted in the module initializer. We have a handful of such ops that are produced by the IValue importer, and so this change avoids maintaining a list of ops in `TorchOps.cpp` that could lead to spurious merge conflicts, and help us integrate torch-mlir in our downstream compiler better. Please let me know if you'd prefer a better name for the trait itself. Feedback is welcome!	2022-09-19 14:56:35 -07:00
long.chen	797feaf129	[torch-mlir][Tosa] fix during torch.max.dim lower to tosa the reshape's new shape attr mismatch reshape's result type (#1378 )	2022-09-16 21:29:56 -07:00
Vivek Khandelwal	04f3a4ffce	[MLIR][TORCH] Add support for bool element type for aten.sum[.dim_IntList] op This commit adds bool element type support for `aten.sum` and `aten.sum.dim_IntList` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-17 09:18:34 +05:30
Ashay Rane	1895b581c4	shape-lib: generate string as multiple lines to work with MSVC (#1370 ) As @oroppas identified, literal strings that are over 16,380 characters cause the MSVC compiler to throw an error (C2026), eventually causing the Windows build of Torch-MLIR to fail because the length of the generated MLIR for the shape library crosses the allowed threshold. This patch fixes the problem by making the Python script generate one literal string per line to satisfy the MSVC compiler. Thanks to @oroppas for the bulk of the effort required to resolve this!	2022-09-16 15:16:01 -05:00
武家伟	b316918947	Add AtenClampOp conversion pattern to MHLO (#1356 ) Add AtenClampOp conversion pattern to MHLO	2022-09-16 15:09:21 +08:00
Sean Silva	851ce0c940	Remove TorchLoweringPipelineOptions from TorchConversion pipelines TorchLoweringPipelineOptions only applies to the frontend lowering pipeline.	2022-09-14 11:20:29 -07:00
Ashay Rane	2bb5f4d8fe	build: update llvm tag to 4d4ca6c9 (#1359 ) Summary of changes: - Updated emitAccessorPrefix since the default value has changed (https://reviews.llvm.org/D133179) - Updated RefineTypes pass since Lattice::isUninitialized() is removed (https://reviews.llvm.org/D132800) - Updated MHLO tag so that it builds with the updated LLVM tag - Disabled two tests that cause segfaults in the TOSA backend (see Issue #1361)	2022-09-13 21:24:43 -05:00
gpetters94	48418b9c22	Fold away type_as (#1358 )	2022-09-12 18:59:12 -04:00
Tanyo Kwok	7f63a17a46	[MHLO] add new options to pipeline (#1331 )	2022-09-12 10:27:41 -07:00
Vivek Khandelwal	71b1f0dd7a	[MLIR][TORCH] Add E2E support for aten.index.Tensor_hacked_twin op This commit adds lowering of `index.Tensor_hacked_twin` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-12 21:47:18 +05:30
George Petterson	a12b9c4492	Add lowering for aten::cumsum	2022-09-12 09:28:07 +05:30
Vivek Khandelwal	326f21229e	[MLIR][TORCH] Fix shape calculation for aten::pow.Tensor_Tensor op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-08 21:14:12 +05:30
Vivek Khandelwal	e35741fb1d	[MLIR][TORCH] Add E2E support for aten.bitwise_not op This commit adds lowering of `aten.bitwise_not` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-08 17:52:12 +05:30
Vivek Khandelwal	7dfadc2498	[MLIR][TORCH] Add E2E support for aten.lift_fresh_copy op This commit adds lowering of `aten.lift_fresh_copy` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-08 12:32:16 +05:30
Vivek Khandelwal	c19fccfca2	[MLIR][TORCH] Add E2E support for aten.pow.Tensor_Tensor op This commit adds lowering of `aten.pow.Tensor_Tensor` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-09-08 10:01:42 +05:30
武家伟	6a1893a517	[MLIR][MHLO] Add AtenFrobeniusNormDimOp and add its conversion pattern to MHLO and linalg (#1306 ) * Add aten.frobenius_norm.dim op and init its conversion pattern to linalg and MHLO, * run symbolic-shape-optimization before hlo-legalize-to-linalg to fit more mhlo e2e tests.	2022-09-08 10:15:36 +08:00
Ashay Rane	93f7c0ceb5	build: update llvm tag to d2613d5b (#1343 ) Summary of changes: - Update the dataflow analysis in RefineTypes.cpp - Add tosa-to-arith pass after tosa-to-linalg pass, since tosa-to-linalg (and canonicalizations) can produce tosa.const() ops - Fixed warning about not making `matchAndRewrite` as override	2022-09-07 14:35:14 -05:00
Gaurav Shukla	99093d0623	[TORCH] Add decomposition of `aten.linear` op This commit adds decomposition of `aten.linear` op. Due to limited support at tosa backend in case of dynamic dimensions, this decomposition is currently disabled for tosa backend. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-09-07 16:58:27 +05:30
Quinn Dawkins	cc86cc0f02	Revert "Implement Non-Expand/Collapse Functionality for Aten.View (#1309 )" (#1347 ) Reverting commit `a6a48ba233` to revise unit tests and address dynamic shape handling based on comments in #1309	2022-09-07 01:38:11 -04:00
JakopinA	a6a48ba233	Implement Non-Expand/Collapse Functionality for Aten.View (#1309 ) Focuses on statically sized cases such as [2, 3] -> [3, 2].	2022-09-06 14:46:04 -04:00
Tanyo Kwok	37f57a9828	Delete ConvertAtenNativeLayerNormOp from TorchToLinalg (#1336 ) The ConvertAtenNativeLayerNormOp is delete because we have decomposition already see https://github.com/llvm/torch-mlir/pull/1332	2022-09-05 10:19:20 +08:00
Tanyo Kwok	512f2d9c23	Add decomposition to aten.native_layer_norm (#1332 ) * Add decomposition to aten.native_layer_norm * fix ci error	2022-09-02 09:29:22 +08:00
Tanyo Kwok	57d8ec151f	[MHLO] add VerifyMhloBackendContract (#1321 ) * [MHLO] add VerifyMhloBackendContract * guard with macro	2022-09-01 17:08:17 +08:00
Tanyo Kwok	29cafdbb61	[MHLO] refactor pass configurations (#1315 ) Related to https://github.com/llvm/torch-mlir/issues/1227 1. Reduce MHLO #ifdefs 2. Dismiss compilation warnings	2022-09-01 10:36:02 +08:00
Ashay Rane	e52e886845	build: update llvm tag to 00d648bd (#1307 ) - Update MHLO commit to build with LLVM commit hash 00d648bd - Update TorchToMhlo code to work with Stablehlo - Re-enabled two failing TOSA tests, thus resolving Github Issue #1231	2022-08-30 14:44:00 -05:00
Sean Silva	51ef1b141c	Add some missing dependencies. Caught in the wild here: https://github.com/llvm/torch-mlir/runs/8046660640?check_suite_focus=true It is common for a missing dependency to only surface as an issue on the CI machines since they have fewer cores which prevents a "race" that happens to cause the dependency to be built before the dependent.	2022-08-30 11:52:30 -07:00
Sean Silva	bcccf41d96	Add CI for generated files. This ensures that they are always up to date. This also updates the shape lib to make the new CI actually pass :)	2022-08-29 12:07:16 -07:00
Sean Silva	26231853ab	Rename an outdated class name We used to not have "value-semantic" tensors but rather "immutable" tensors	2022-08-29 10:08:59 -07:00
Sean Silva	0e3ddbac91	Remove VerifyInvariantsBeforeBackendLowering LowerToBackendContract now checks all this consistently.	2022-08-26 10:24:43 -07:00
Sean Silva	b1fa7a2b9d	Fix a few build warnings	2022-08-26 10:24:22 -07:00
Ashay Rane	1d9d925f6e	mlir: fix replacement of `OpaqueElementsAttr` (#1274 ) An earlier patch (`bb47c166`) incorrectly replaced the now-dropped `OpaqueElementsAttr` with `SparseElementsAttr` in one place and with `DenseElementsAttr` in another. This patch fixes the problem by making both replacements use the dense-equivalent type.	2022-08-24 17:10:40 -05:00
gpetters94	f012279fa2	Add transposed case for at::convolution (#917 ) Also adds a decomposition for aten::conv_transposed2d.input	2022-08-24 12:19:35 -04:00
Tanyo Kwok	3d0e18bbe7	Add decomposition for aten.roll (#1170 ) * Add decomposition for aten.roll * add e2e unittest * refine type of torch.roll * fix aten::cat output type	2022-08-24 08:36:05 +08:00
Tanyo Kwok	2374098d71	[MHLO] Init end to end unit tests (#1223 )	2022-08-23 16:47:21 +08:00
Vivek Khandelwal	8cad02f87e	[MLIR][TORCH] Add torch.Device type to backend contract scalar types Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-08-23 10:50:09 +05:30
Tanyo Kwok	9176b5ed29	Add decomposition for aten.flatten.using_ints (#1161 )	2022-08-23 11:52:54 +08:00
Sean Silva	01290d134a	Add a way for backends to control which ops are legal for them. We were already hitting many cases where backends different in terms of the legal ops that they wanted. This caused unnecessary coupling between the backends. Examples: - https://github.com/llvm/torch-mlir/pull/1161 - https://github.com/llvm/torch-mlir/pull/862 This PR centralizes all compilation to go through `torch_mlir.compile` so that we can keep the logic centralized there. We should move these lists closer to each backend. Especially cases like https://github.com/llvm/torch-mlir/pull/862 where blocking a decomposition is necessary to avoid a crash emphasize that the set of decompositions is tightly coupled to the backend, and should be "controlled by the backend" and not something arbitrarily tweakable. Also: - Fix a small bug in the way we passed through the backendLegalOps option. - Add better error messages in `torch_mlir.compile` for import errors.	2022-08-22 14:16:13 -07:00
Alex Tsao	c38308f3ef	Add lowering for _convolution.deprecated (#1259 ) * Add lowering for _convolution.deprecated	2022-08-22 11:17:36 +08:00
武家伟	99fb4c8637	Add folder for ToF64Op and FromF64Op (#1257 )	2022-08-22 09:49:39 +08:00
Vivek Khandelwal	65d811e267	[MLIR][TORCH] Fix dynamic cases for aten.index.Tensor	2022-08-19 12:13:20 +05:30
武家伟	7bd173a1c4	[MHLO] Eliminate explicit dynamic output shape generating in converting AtenSliceTensorOp (#1245 ) [MHLO] Eliminate explicit dynamic output shape generating in converting AtenSliceTensorOp	2022-08-19 10:14:57 +08:00
Ramiro Leal-Cavazos	9bc606c384	Add support for returning more than one copy of the same tensor (#1228 ) One of the simplifications made by the pass `RefinePublicReturn` currently only happens if the tensor in question only has one user. However, the current method of checking this does not correctly handle the case of a user having multiple uses of the same tensor. This commit makes sure only unique users are considered.	2022-08-18 22:41:45 +00:00
Sean Silva	283e0f141a	Add a concept of "backend legal ops". This is a first step towards formalizing the set of ops in our backend contract. The goal is to eventually formalize `torch` dialect ops into 3 categories: 1. Legal in backend contract 2. Illegal in backend contract 3. Conditionally legal in backend contract The "conditionally legal" set are the ops that we can optionally decompose for backends. This patch adds relevant pass options for this throughout the compiler, in preparation for a new set of traits which will formalize this classification.	2022-08-18 11:46:50 -07:00
Ramiro Leal-Cavazos	f07f7d20f9	Clean up shape functions that use `sum_mean_dim` (#1217 ) I recently fixed the handling of the `dim` argument in `sum_mean_dim` (`59fccab857`). Therefore, the checks that the `dim` input is `None` or `[]` are no longer needed.	2022-08-18 08:23:43 -07:00
Sean Silva	57681f7947	Iteratively run the main simplification pipeline. This introduces a new pass LowerToBackendContract (better name very welcome) which performs the bulk of the simplifications that we do, such as - shape refinement - dtype refinement - maximizing value semantics - inlining global slots - decomposing complex ops The key difference from before is that it iterates the set of transformations, which can help to break a number of "catch-22" issues where one simplification depends on another, the latest example being here: https://github.com/llvm/torch-mlir/issues/1131 This also exposed that RefineTypes was sometimes crashing/asserting for certain inputs. This commit hardens it a bit.	2022-08-17 14:54:33 -07:00
Yan Xu	9be8997536	Revert "add native_dropout and related ops pattern (#1211 )" (#1230 ) This reverts commit `c935795086`.	2022-08-17 13:48:10 +08:00
Quinn Dawkins	85f383ce0b	Bump the shape lib to match the upstream functions currently in PyTorch (#1236 ) Bumps the shape library: - Updates the function signature for aten.arange.start_step - upstream_shape_functions.mean_dim -> upstream_shape_functions.sum_mean_dim	2022-08-17 00:11:04 -04:00
武家伟	11a5b5ac52	[MHLO] Add AtenRSubScalarOp conversion pattern to MHLO (#1233 ) * [MHLO] Add AtenRSubScalarOp conversion pattern Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-17 09:07:36 +08:00
nithinsubbiah	fde390c766	Re-enable custom op support	2022-08-16 22:49:08 +05:30
Ashay Rane	84d345c650	build: update llvm tag to 2dde4ba6 (#1229 ) Summary of changes: - Tensor dialect now sets `emitAccessorPrefix` to prefixed, thus requring updates to methods that retrieve arguments [https://reviews.llvm.org/D131361] - Update MHLO to build with LLVM commit hash 2dde4ba6 - Replace `AbsOp` with `AbsFOp` [https://reviews.llvm.org/D131325] - Replace deprecated `getValue()` with `value()` [https://reviews.llvm.org/D131349] - Remove `AnalysisState::defaultInitialize()` [https://reviews.llvm.org/D131746] - Update MHLO MLIR tests to use the updated assembly format - Disabled two failing TOSA tests (Github Issue link: https://github.com/llvm/torch-mlir/issues/1231)	2022-08-15 23:54:45 -07:00
武家伟	3b3cb99ef8	Generalize canonicalization pattern for more aten.sub/div/mul/add op (#1209 ) Generalize canonicalization pattern for more sub/div/mul/add op, but for AtenDivTensorModeOp in 'trunc' rounding mode, we try to fold it.	2022-08-16 13:24:08 +08:00
Ramiro Leal-Cavazos	9d6ee48661	Fix unused-variables warnings about EmbeddingBag ops (#1220 ) According to the documentation for `torch.embedding_bag` (https://pytorch.org/docs/stable/generated/torch.nn.functional.embedding_bag.html), the default value for `scale_grad_by_freq` is False.	2022-08-15 09:43:55 -07:00
Yan Xu	c935795086	add native_dropout and related ops pattern (#1211 )	2022-08-15 09:28:47 +08:00
Prashant Kumar	b1a506624c	Add decomposition of `aten.masked.tensor` op. `aten.masked.tensor` op has been decomposed to `aten.masked.scalar` op.	2022-08-11 07:48:04 +05:30
Yan Xu	d96ec64be1	remove torch dialect from legal list (#1192 )	2022-08-11 09:22:41 +08:00
Vidush Singhal	dd2da5a038	E2E support for AtenRemainderScalarOp (#1200 )	2022-08-10 20:02:06 -04:00
gpetters94	79b9cf9468	Add lowering for aten.to.device (#1107 )	2022-08-10 19:24:02 -04:00
Ramana Radhakrishnan	738f4fe96a	Rename TorchToStd pass as TorchToArith (#1163 ) All the converters in this pass appear to create ops from the arith dialect. Hence the full rename. Fix GH Issue #409.	2022-08-10 20:12:51 +01:00
武家伟	87562773f8	[MHLO] Add AtenCatOp conversion pattern to MHLO (#1208 ) Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com> Co-authored-by: Vremold <xremold@gamil.com>	2022-08-09 22:12:34 -07:00
Marius Brehler	202076c6e3	Add CMake dep to Func dialect (#1196 ) The Torch dialect has an include to `mlir/Dialect/Func/IR/FuncOps.h` and should therefore have a CMake dependency to the MLIRFuncDialect. Otherwise, the build can fail since it may occur that `mlir/Dialect/Func/IR/FuncOps.h.inc` isn't generated yet.	2022-08-09 06:54:30 -07:00
Yan Xu	f83a905856	[MHLO]fix lowering failed on reduction op with i32 shape (#1185 ) fixed lowering failed on torch::max.dim while shape type is i32	2022-08-09 17:02:50 +08:00
powderluv	e55fc4deb5	Revert "E2E support for AtenRemainderScalarOp (#1119 )" (#1190 ) This reverts commit `34e207eeb5`.	2022-08-08 22:59:57 -07:00
Ashay Rane	bb47c166a0	llvm: update tag to 061e0189 (#1180 ) Summary of changes: - Switch to C++17 (similar to https://reviews.llvm.org/D131348) - Update MHLO to build with LLVM commit hash 061e0189 - Replace deprecated `hasValue()` and `getValue()` with `has_value()` and `value()` respectively (https://reviews.llvm.org/D131349) - Use `TypedAttr` (https://reviews.llvm.org/D130092) - Use updated assembly format of `mhlo.compare` op (commit d03ef01e70fbf9afd0fa1976fbb7ed31838929b3 in MHLO repo)	2022-08-08 20:17:35 -07:00
武家伟	351f15424e	[MHLO] Add transposed convolution conversion pattern (#1171 ) Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-09 09:50:07 +08:00
Sean Silva	504de5e701	Rework how global slot initializers work. Rather than a per-global-slot initializer region, we now have one for the whole module. For example, it might look like this: ``` torch.global_slot "private" @tensor : !torch.tensor torch.global_slot "private" @list : !torch.list<tensor> torch.global_slot.module_initializer { %0 = torch.tensor.literal(dense<0.0> : tensor<f32>) : !torch.tensor %1 = torch.prim.ListConstruct %0 : (!torch.tensor) -> !torch.list<tensor> torch.initialize.global_slots [ @tensor(%0 : !torch.tensor) @list(%1 : !torch.list<tensor>) ] } ``` This new structure allows GlobalizeObjectGraph to create the initializer in a much simpler way, avoiding the need to reason about whether different slots alias each other. Reasoning about whether slots alias each other now is the responsibility of InlineGlobalSlots, which has to do a much more complicated analysis, implemented using MLIR's dataflow analysis framework. Recommended review order: - Check out the new IR constructs in the .mlir files of various passes - Op definitions (*.td) - Changes to GlobalizeObjectGraph pass. - InlineGlobalSlots pass (~total rewrite) - Misc changes: - Moving torchMlirAdjustStaticInformation for sharing with C++ code. - EraseModuleInitializer pass To make this a bit nicer, it would be good to have a `torch.module` op with an initializer region attached. That would be more invasive though. This change has highlighted certain aspects of our project layering which are worth calling out. None of our backends can handle global slots, so we enforce that there are no global slots before backend lowering. At an earlier stage in the project, we had aspirations of transparently handling mutable global state and such, but for reasons described below, that is no longer a goal. So really global slots should be seen as a progressive lowering step as part of inlining all the IValue's in the original program (GlobalizeObjectGraph is also one such step). Over time, with insights from work like IREE-JAX, it has become clear that there isn't a reliable programming model we can compile for users where we just transparently handle mutable global state (and some other things, like lists and dictionaries). There is a need for an "outer program" that orchestrates more restricted subroutines of the kind we can handle in our compile flow here. The benefit of that is that it decouples considerations like shapes, dtypes, etc. from the program constructs used in the outer program. As long as the outer program can efficiently invoke (pipelining/async/etc.) high-performance data-parallel numerical subroutines of the kind we compile in our flow here, then there is a complete programming model. This is also consistent with the direction of upstream PyTorch which is becoming more tracing-based (which inherently loses a lot of program structure, which then has to be applied back with an "outer program" orchestrating the traced subroutines).	2022-08-08 18:12:06 -07:00
Vidush Singhal	34e207eeb5	E2E support for AtenRemainderScalarOp (#1119 ) * E2E support for AtenRemainderScalarOp	2022-08-08 20:02:52 -04:00
Vidush Singhal	b70548edff	Add decomposition and E2E support for Aten_EmbeddingBag (#1137 ) * Add decomposition and E2E support for Aten_EmbeddingBag	2022-08-08 18:56:49 -04:00
Tanyo Kwok	290d7755fb	importer: add initial support for loading Float16 tensors (#1169 ) follow up #761: This patch updates the `torch_mlir::convertTensorToMlirElementsAttr()` method to enable the creation of tensors whose base type is Float16. This patch also adds a test to validate the IR generation, and it updates the test for importing tensors of various types.	2022-08-08 12:37:31 +08:00
Tanyo Kwok	1ee865983b	[MHLO] fix tensor mode aten.div op pattern (#1160 ) * [MHLO] fix tensor mode aten.div op pattern See RFC #999 Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-06 23:38:06 +08:00
Vivek Khandelwal	c129a6de93	[MLIR][TORCH] Add support for dim=None to Aten[Var\|Std]DimOp PyTorch recently added support for `dim=None` in the `torch.var` (`5ca9b2b6fa`) and `torch.std`op (`eb0e30e0bc`). This commit adds the corresponding support in torch-mlir. Signed-Off By: Vivek Khandelwal<vivek@nod-labs.com>	2022-08-05 20:28:56 +05:30
武家伟	c94431f71c	[MHLO] Add convolution op pattern (#1152 ) Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-04 00:41:35 -07:00
gpetters94	08fc2d89bb	Add non-unit groups support to aten.convolution (#858 )	2022-08-04 02:18:38 -04:00
武家伟	d030591df9	[MHLO] Init MHLO pooling-like op conversion (#1141 ) * [MHLO] Init MHLO pooling-like op conversion and remove 'op' suffix in filenames Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com> See RFC #999	2022-08-04 12:34:22 +08:00
Tanyo Kwok	f0a24f59f6	[MHLO] Init MHLO linear op patterns (#1132 ) See RFC https://github.com/llvm/torch-mlir/issues/999 Co-authored-by: Bairen Yi yibairen.byron@bytedance.com Co-authored-by: Jiawei Wu xremold@gmail.com Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan yancey.yx@alibaba-inc.com Co-authored-by: Ziheng Jiang ziheng.jiang@bytedance.com	2022-08-03 19:10:54 -07:00
武家伟	636f5acb10	[MHLO] Init MHLO reduce-like op conversion (#1133 ) * [MHLO] init reduce-like op conversion from Torch to MHLO Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-08-03 10:47:52 +08:00
Tanyo Kwok	0b23af27d3	[MHLO] support non-constant torch scalar in BasicOps (#1134 ) See RFC https://github.com/llvm/torch-mlir/issues/999 Co-authored-by: Bairen Yi yibairen.byron@bytedance.com Co-authored-by: Jiawei Wu xremold@gmail.com Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan yancey.yx@alibaba-inc.com Co-authored-by: Ziheng Jiang ziheng.jiang@bytedance.com	2022-08-03 08:16:31 +08:00
Ramiro Leal-Cavazos	a7af1fd873	Add support for `dim=None` to `AtenMeanDimOp` (#1129 ) PyTorch recently added support for `dim=None` in the `torch.mean` op (`2bfae07a79`). This commit adds the corresponding support in torch-mlir.	2022-08-02 16:08:06 +00:00
Quinn Dawkins	38d8498b21	add e2e support for aten.atan2 (#1117 ) - Includes math-to-libm pass in refbackend for math::atan2 support	2022-08-02 11:39:41 -04:00
Yan Xu	704efdc259	[MHLO] add aten::gelu op pattern (#1127 ) add aten::gelu op pattern, and moved some unit tests from basic.mlir to elementwise.mlir	2022-08-02 15:01:30 +08:00
武家伟	76c976682c	[MHLO] Support for dynamic shape in basic op conversion by introducing CHLO dialect (#1123 ) * [MHLO] Support for dynamic shape in basic op conversion by introducing CHLO dialect Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com> * [MHLO] Support I32 as shape tensor dtype * [NFC] Add a 'TODO' annotation	2022-08-02 12:53:24 +08:00
Tanyo Kwok	3772e0bd91	[NFC][MHLO] move util funcs to MhloLegalizeUtils.h/cpp (#1128 ) See RFC: https://github.com/llvm/torch-mlir/issues/999 Co-authored-by: Bairen Yi yibairen.byron@bytedance.com Co-authored-by: Jiawei Wu xremold@gmail.com Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan yancey.yx@alibaba-inc.com Co-authored-by: Ziheng Jiang ziheng.jiang@bytedance.com	2022-08-02 09:21:37 +08:00
Vidush Singhal	ed13ebfd8d	E2E support for AtenEmbeddingBagPaddingIdxOp SUM Mode (#1066 )	2022-08-01 16:44:11 -04:00
Alec	554570f3ab	Implemented a decomposition of aten::narrow	2022-08-01 18:32:14 +05:30
Henry Tu	70395de197	Resolve CI testing failure for Lazy Tensor Core (#1088 ) * Xfail unsupported ops * Register FuncDialect * Include dynamic_ir in build * Code reformat * Enable LTC tests for macOS and Source Build	2022-07-30 09:40:02 -04:00
Henry Tu	0c35e607b3	Add static shape for scalar tensors (#833 ) * Assume zero rank tensors are scalar * Run RefineTypes pass on JIT Graph * Rollback assumption that zero rank tensors are scalar * Set numSizes to -1 for non-ranked tensors * Rename RefineTypes to RefineTupleTypes	2022-07-30 09:40:02 -04:00
PhaneeshB	8b5631d4c5	[MLIR][TORCH] Add decomposition for aten.std.dim Op Signed-Off By: Phaneesh Barwaria <phaneesh@nod-labs.com>	2022-07-29 23:52:54 +05:30
Vivek Khandelwal	c681c3497a	[MLIR][TORCH} Fix empty dim cases for the .dim ops This commit fixes the shape calculation for: 1.) aten.mean.dim 2.) aten.var.dim 3.) aten.sum.dim_IntList op Also, it fixes the lowering of `aten.mean.dim` and `aten.sum.dim_IntList` for handling the cases of empty dim list. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com	2022-07-29 11:08:57 +05:30
Vivek Khandelwal	d386b8f9e5	[MLIR][TORCH] Add decomposition for aten.var.correction op This commit adds the decomposition for `aten.var.correction` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com	2022-07-29 11:08:57 +05:30
Vivek Khandelwal	7247c6a3a7	[MLIR][TORCH] Add E2E support for aten.ge.int op This commit adds lowering of `aten.ge.int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-29 11:08:57 +05:30
Quinn Dawkins	11a8901078	[MLIR][TORCH] Add support for multiple indexing tensors for aten.index.Tensor (#1097 ) - Includes a canonicalizer for `aten.add.t`needed for successfully lowering the shape function - Only offers support for statically sized index tensors when there is more than one - Dynamic shape support remains for single indexing tensors	2022-07-28 19:00:02 -04:00
Quinn Dawkins	3c9addf19c	Add e2e support for aten.expm1	2022-07-27 12:31:35 +05:30
武家伟	052d2f84dc	[MHLO] Init MHLO basic op conversion (#1092 ) * [MHLO] Init MHLO basic Op Conversion Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com> * [NFC] Remove 'from @llvm-project' annotation Co-authored-by: wujiawei.jw <wujiawei.jw@bytedance.com>	2022-07-27 13:07:51 +08:00
Kevin Kiningham	e8f327cc00	Add lowering to linalg for softplus and log1p Follows existing conventions for unary operators.	2022-07-25 21:25:57 +05:30
Tanyo Kwok	44ead68772	[MHLO] Init MHLO gather op patterns (#1104 ) See RFC https://github.com/llvm/torch-mlir/issues/999 Co-authored-by: Bairen Yi yibairen.byron@bytedance.com Co-authored-by: Jiawei Wu xremold@gmail.com Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan yancey.yx@alibaba-inc.com Co-authored-by: Ziheng Jiang ziheng.jiang@bytedance.com	2022-07-25 23:47:46 +08:00
Tanyo Kwok	f50d7013cd	[MHLO] Add [un]squeeze op patterns (#1099 ) * [MHLO] Add [un]squeeze op patterns * Conform to llvm coding standard * minor update	2022-07-25 23:28:48 +08:00
Tanyo Kwok	b80ce79b9f	[MHLO] Init MHLO view like op patterns (#1090 ) * [MHLO] Init MHLO view like op patterns See RFC: https://github.com/llvm/torch-mlir/issues/999 Co-authored-by: Bairen Yi yibairen.byron@bytedance.com Co-authored-by: Jiawei Wu xremold@gmail.com Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan yancey.yx@alibaba-inc.com Co-authored-by: Ziheng Jiang ziheng.jiang@bytedance.com * update filecheck test cases * rebase, remove chlo and clang-format	2022-07-22 15:18:18 +08:00
Tanyo Kwok	a02dbb2d5e	[MHLO] Init MHLO slice like op patterns (#1091 ) See RFC: https://github.com/llvm/torch-mlir/issues/999 Co-authored-by: Bairen Yi yibairen.byron@bytedance.com Co-authored-by: Jiawei Wu xremold@gmail.com Co-authored-by: Tianyou Guo tianyou.gty@alibaba-inc.com Co-authored-by: Xu Yan yancey.yx@alibaba-inc.com Co-authored-by: Ziheng Jiang ziheng.jiang@bytedance.com	2022-07-22 11:32:45 +08:00
Ramiro Leal-Cavazos	f271e6a88c	Add verifiers for ToBuiltinTensorOp and FromBuiltinTensorOp (#1089 ) This commit adds verifiers to the ops `ToBuiltinTensorOp` and `FromBuiltinTensorOp` that make sure that the input and output have the same shape and data type.	2022-07-21 21:41:45 +00:00
Sean Silva	c0ef192865	Improve error message The unknown dtype case can come from RefineTypes.	2022-07-21 13:52:24 -07:00
Ashay Rane	72dd04cdb3	Revert "python: trim registration and loading of dialects and passes" (#1093 ) This reverts commit `ad283c1043`, since it's causing nightly build failures for all platforms.	2022-07-21 09:35:42 -07:00
Ashay Rane	ad283c1043	python: trim registration and loading of dialects and passes (#1084 ) In the interest of merging upstream LLVM quickly, a previous patch (`7f08169`) updated the torch-mlir build to register all dialects and passes through Python bindings. This patch limits the dialects and passes to only those that are used in torch-mlir. Key to this change are the removal of `MLIRPythonExtension.RegisterEverything` and the introduction of a new Python module (`_mlir_libs/_site_initialize_0.py`), where we register the dialects and passes used by torch-mlir.	2022-07-20 18:34:17 -07:00
Ziheng Jiang	c61c99e887	[MHLO] Init MHLO integration. (#1083 ) Co-authored-by: Bairen Yi <yibairen.byron@bytedance.com> Co-authored-by: Jiawei Wu <xremold@gmail.com> Co-authored-by: Tianyou Guo <tianyou.gty@alibaba-inc.com> Co-authored-by: Xu Yan <yancey.yx@alibaba-inc.com> Co-authored-by: Ziheng Jiang <ziheng.jiang@bytedance.com>	2022-07-20 16:18:16 -07:00
Quinn Dawkins	647e75e029	Allow expanding and collapsing in aten::view (#1082 ) - Supports cases where the view op expands and collapses dims simulataneously. This does not handle the case where it is neither expanding nor collapsing (e.g. [2, 3] -> [3, 2]) - Additionally fixes a previous bug with adding 1-sized dims on both sides of a tensor with aten.view	2022-07-20 17:35:51 -04:00
Ashay Rane	e06ee08506	torch: [nfc] use `WalkResult::isInterrupted()` instead of booleans (#1081 ) An upstream MLIR bug (that was recently fixed) caused the result to be ignored for Region- and Block-visitor functions. Now that the bug is fixed, we don't need an auxiliary boolean to track whether the visitor function has succeeded.	2022-07-19 10:17:57 -07:00
Quinn Dawkins	c73a39e40a	Add support for index.Tensor on dimensions other than the first This patch still only supports a single indexing tensor.	2022-07-19 11:36:52 +05:30
Vivek Khandelwal	df0b1e77a4	[MLIR][TORCH] Add negative dim support for aten.cat and aten.slice op This commit adds the support for negative dim cases for `aten.cat`, `aten.slice.Tensor` and `aten.slice_scatter` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-18 14:01:33 +05:30
Vivek Khandelwal	4c25878e64	[MLIR][TORCH] Add canonicalization pattern for prim.ListUnpack op This commit adds the canonicalization pattern for the `prim.ListUnpack` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-18 13:51:25 +05:30
Jacques Pienaar	247dd64a66	Change to notifyMatchFailure (#1073 ) emitError is intended for error cases and not match failures of patterns. notifyMatchFailure is intended where pattern reports reason for not matching. Op verification should also not happen inside patterns but as part of verify/verification, but left ones that were obviously verification to emitError inside patterns to keep this change small.	2022-07-17 18:39:54 -07:00
Sean Silva	85858d2743	Bump LLVM to 889c6f3996769a991a24da957f597e7500d158e7 The biggest change here is to upgrade RefineTypes to the new sparse dataflow framework. Smaller changes: - minor changes to type parsing - suppress warnings in e2e tests	2022-07-15 13:36:04 -07:00
Ramiro Leal-Cavazos	afdaa60dd4	Fix typo in `inputRank` check of `AtenBatchNormOp` (#1046 ) The original conversion pattern for `AtenBatchNormOp` required that the input rank be greater than 2; however, the only expectation in the conversion pattern and in Pytorch is that the input rank is greater than 1, since the second dimension of the input must match the size of the `weight`, `bias`, `runningMean`, and `runningVar` inputs. This commit fixes the `inputRank` check.	2022-07-15 09:35:59 -07:00
Vivek Khandelwal	3589134d31	[MLIR][TORCH] Add decomposition for aten.var.dim op This commit adds the decomposition for `aten.var.dim` op. This commit also make changes in the decomposition for `aten.var` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-07-15 09:53:42 +05:30
Ashay Rane	29bc48aedb	torch: add pass to catch non-value tensors (#1052 ) This patch adds a new pass `torch-verify-conversion-to-value-semantics`, which looks for non-value semantics tensors to catch such tensors early during compilation. This pass requires `torch-refine-public-return` pass to ensure that return operations are updated to use value tensors, followed by the canonicalize pass to remove any dead ops that may use or produce non-value tensors.	2022-07-13 17:11:15 -07:00
Suraj Sudhir	5e2012c7dd	[tosa] aten.max.dim , aten.slice.tensor ops (#1027 ) Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-07-13 10:10:18 -07:00
Prateek Gupta	3592e0ba7f	[TORCH][MLIR] Fix some comments in slice_scatter/select_scatter lowering. This commit addresses the remaining comments on lowering of slice_scatter and select_scatter. Signed-Off-By: Prateek Gupta <gprateek93@gmail.com>	2022-07-13 09:40:06 +05:30
Ashay Rane	ac4d7d10e0	canonicalizer: propagate type information across copy and cast ops (#1030 ) Prior to this patch, the canonicalizers for `AtenSizeOp` and `AtenSizeIntOp` succeeded only if the tensor operand's type information included the size of the requested dimension(s). We can extend the set of optimizable cases by propagating types across operations whose result type matches the input tensor type. Specifically, this patch enables the canonicalizers for `AtenSizeOp` and `AtenSizeIntOp` to see past `tensor_static_info_cast`, `copy.to_vtensor`, and `copy.to_tensor` ops until it reaches the first op whose result type contains size information for the requested dimensions, with a maximum bound of 6 parent lookups to avoid indefinite compilation times. All other encountered ops cause the canonicalizer to give up.	2022-07-12 12:38:37 -07:00
Sean Silva	e5e11e214b	GlobalizeObjectGraph: Clean up handling of unused slots The way we did it previously still created the slot and copied the initializer even if unused.	2022-07-12 10:47:28 -07:00
Ashay Rane	9017be9e9e	torch: copy uses to prevent iterator invalidation (#1033 ) Prior to this patch, the code in the `torch-simplify-shape-calculations` pass iterated on the uses of an op's result while also modifying the value. This caused the iterator to get invalidated, thus terminating the loop early and producing incorrect IR. This patch makes use of `llvm::make_early_inc_range()` to ensure that the iterator is not invalidated while executing the loop body.	2022-07-11 18:47:04 -07:00
Ramiro Leal-Cavazos	11148e60d6	Undo shape lib changes + update function signature of sum + zero (#1035 ) This commit does three things: 1. Reverts some of the shape lib changes merged in https://github.com/llvm/torch-mlir/pull/844 2. Updates the signature of `aten.sum_dim_IntList` that was recently updated in `23bdb570cf` 3. Replaces `aten.zero.functional` with `aten.zero`, updated in `960758b0b7`	2022-07-11 10:56:12 -07:00
Prateek Gupta	2d75654b2c	[TORCH][MLIR] Add lowering of `aten.slice_scatter` and `aten.select_scatter` op. This commit adds: 1. Lowering of `aten.slice_scatter` op into `tensor.insert_slice` op. 2. Decomposes the `aten.select_scatter` op into `aten.slice_scater` op. Signed-Off-By: Prateek Gupta <gprateek93@gmail.com>	2022-07-11 14:07:21 +05:30
George Petterson	a08ff0d7f2	Add lowering for _convolution	2022-07-11 11:03:03 +05:30
Ashay Rane	340d8af28a	torch: handle `torch.prim.dtype` ops during type refinement (#1013 ) The canonicalizer converts `torch.prim.dtype` ops into integer constants for valid types, but the type may not be known until type refinement is complete. However, type refinement cannot make progress until `torch.prim.dtype` ops have been resolved to their corresponding integer constants, thus creating a circular dependency. This patch creates a tight coupling between type refinement and the lowering of `torch.prim.dtype` ops by handling such ops as they are encountered during type refinement. The unit test in this patch aims to check whether the type refinement pass can now handle chains of operations that alternate between type construction and type refinement.	2022-07-08 16:38:51 -07:00
Ramiro Leal-Cavazos	6a72ab4502	Add basic support for list of optional tensors in reduce-op-variants (#971 ) This commit adds support for lists of type `list<optional<tensor>>` where each element in the list is either a `!torch.tensor` or a `!torch.none`.	2022-07-08 11:12:15 -07:00
Ashay Rane	6491c69539	torch: use ScalarType enum instead of raw constants (#1020 ) This patch replaces the use of raw integers like 6, 4, etc. (that represent PyTorch's scalar types) with named values from the ScalarType enum (e.g. `ScalarType::Float`, `ScalarType::Long`, etc.) in code for folding `prim.dtype` ops into numeric constants. This patch isn't strictly a non-functional change, since its use of `Torch::getScalarTypeForType()` implies that the input type has to be one among the supported types, otherwise compilation will abort, whereas previously, compilation proceeded without folding the unsupported data type into a numeric constant.	2022-07-07 14:21:05 -07:00
Suraj Sudhir	d38f2cae5b	[tosa] aten.transpose.int support (#1017 ) Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-07-07 13:05:33 -07:00
Quinn Dawkins	f0c3b5a7ed	Add E2E support for aten.len.str (#969 )	2022-07-07 10:41:55 -07:00
Ashay Rane	88316b3b4e	torch: fold prim.dtype(bf16) to integer constant 15 (#1012 ) A prior patch (`63538de2`) that added support for bfloat16 type did not add the canonicalization pattern to fold `torch.prim.dtype` operations on bfloat16 tensors into the integer constant 15. This patch fixes the problem.	2022-07-06 18:21:43 -07:00
Andrew Cain	6885f1ed8a	fix: Broaden range of tosa.matmul outputs that don't need to be reshaped (#1015 ) Co-authored-by: Andrew Cain <acain@d-matrix.ai>	2022-07-06 17:24:16 -07:00
Ramiro Leal-Cavazos	bbb648410e	Fix compilation warning Wsign-compare (#1003 )	2022-07-06 09:06:10 -07:00
Tanyo Kwok	d4f1f41435	[MLIR][TORCH] Add decomposition of aten.repeat (#932 ) * [MLIR][TORCH] Add decomposition of aten.repeat * refine & rebase * refine static shapes * add e2e test * Rebase and Refine naming style	2022-07-01 13:02:31 +08:00
Ramiro Leal-Cavazos	f204210266	[LINALG] Fix handling of size-1 dims in `aten.view` again. (#992 ) A previous fix to the handling of size-1 dims in `aten.view` (https://github.com/llvm/torch-mlir/pull/962) resulted in the wrong grouping of dimensions when size-1 dims where between two dims of size greater than 1. This commit fixes that.	2022-06-30 16:39:25 -07:00
Suraj Sudhir	bb576c2cb3	[tosa] aten.embedding op support (#991 ) Enables BERT legalization. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-06-30 13:13:52 -07:00
Sean Silva	227dea7b2e	Add support for ScalarType::QUInt8 I ran into this while poking around at https://github.com/llvm/torch-mlir/issues/959	2022-06-29 15:33:28 -07:00
JakopinA	5888c4f7dc	Added E2E support for torch::aten.__contains__int_list	2022-06-27 19:30:00 +05:30
Ashay Rane	163fa57cde	torch: allow torch dialect ops after running drop-shape pass (#979 ) In the `pyhpc_turbulent_kinetic_energy` TorchBench benchmark, the shape calculation occurs inside loops, but because `DropShapeCalculationsPass` does not explicitly mark the Torch dialect as legal, the pass execution fails. This patch adds Torch to the list of legal dialects, and adds a test to validate the translation.	2022-06-25 07:27:47 -07:00
Gaurav Shukla	1be604bfd3	[LINALG] Lower `aten.Matmul` to `linalg.BatchMatmul` This commit lowers `aten.matmul` to `linalg.BatchMatmul` under the following conditions: 1. The result of matrix multiplication must have batch dimensions, i.e., rank greater than 2. 2. The resultant matrix must have at most 1 dynamic batch dimension. It also handles broadcasting of batch dimensions when batch dimensions of the matrices are broadcastable. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-06-25 10:58:06 +05:30
Ramiro Leal-Cavazos	400fecc1e5	[LINALG] Fix shape function of index.Tensor + support N-rank inputs (#972 ) This commit fixes the shape function for `index.Tensor`, adding support for multiple index tensors and `None`s in the indices list. This commit also adds support for input tensors of rank greater than 1. The lowering for `index.Tensor` still has the the limitation that only a single index tensor along the first dimension of the input tensor is supported.	2022-06-24 09:45:44 -07:00
Ashay Rane	234fc7fe0c	linalg: lower `aten.triu` op to `linalg.generic` (#965 ) Prior to this patch, the torch dialect included `AtenTriuOp` for computing the upper triangular part of the input matrix, but there was no code for lowering the op to the linalg dialect. This patch adds code to generate a `linalg.generic` operation that compares indices (computed using `linalg.index`) to choose between zero or the original value (using `arith.select`). The lowering fails if the number of dimensions are less than two. This patch also adds a few end-to-end tests.	2022-06-23 22:45:48 -07:00
Tanyo Kwok	143a7bcb76	[MLIR][TORCH] Add folder for torch_c.from_i64 & torch_c.to_i64 (#933 ) * [MLIR][TORCH] Add folder for torch_c.from_i64 & torch_c.to_i64 * add unit tests for each individual fold * fix failure of NumelZeroRankModule & TestMultipleTensorAndPrimitiveTypesReturn	2022-06-24 09:34:39 +08:00
Ramiro Leal-Cavazos	189afa82c5	Update shape library with LLVM bump changes (#973 )	2022-06-23 18:13:03 -07:00
erman-gurses	5cff40c88a	Add canonicalization for aten.add.tensor op	2022-06-23 17:24:59 -04:00
Maksim Levental	829717c96e	Bump LLVM (#958 )	2022-06-22 22:23:46 -05:00
Ramiro Leal-Cavazos	8b94759303	[LINALG] Fix handling of size-1 dims in `aten.view` (#962 ) This commit adds support for several size-1 dims in a row in an expansion using `aten.view`.	2022-06-22 15:58:40 -07:00
Maksim Levental	a34dad2e07	Fix `verifyLinalgCompatibleTypes` which currently doesn't successfully catch `torch.tensor`. (#947 )	2022-06-15 18:21:36 -05:00
Vivek Khandelwal	77ab31641f	[MLIR][TORCH] Add decomposition of aten.numpy_T op This commit adds the decomposition of `aten.numpy_T` op into `aten.t` or `aten.permute` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-16 00:01:22 +05:30
Vivek Khandelwal	4605dc9c99	[MLIR][TORCH] Add support for bool type in convertScalarToDtype utility Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-16 00:00:47 +05:30
Albert Sandru	708a51ae2e	Add E2E support for aten.is_floating_point	2022-06-15 11:54:00 -05:00
Ramiro Leal-Cavazos	246c2df65a	[LINALG] Fix typo in conversion pattern of `aten.embedding` (#942 )	2022-06-15 09:45:10 -07:00
Vivek Khandelwal	aed5517fda	[MLIR][TORCH] Add integer dtype support for aten.rsub.Scalar op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-15 16:46:28 +05:30
Bob Adolf	b90837ee24	Temporarily revert support for custom op extensions. (#944 ) The MacOS builders are having linking trouble with the extension library. Until it's fixed, all support for op extensions is disabled. It should be easy to restore once the issue is resolved.	2022-06-14 18:24:40 -07:00
Ramiro Leal-Cavazos	93f6d8e776	[LINALG] Add 0-rank case for `aten.permute` (#940 ) The function `AffineMap::inferFromExprList` does not work if the first vector of expressions is empty, because it uses these expressions to obtain the context. This prevented `aten.permute` from working for inputs of 0-rank. This commit adds support for 0-rank inputs.	2022-06-14 12:50:46 -07:00
Vivek Khandelwal	33fa8e7761	[MLIR][TORCH] Add decomposition of aten.floor_divide op This commit adds the decomposition of `aten.floor_divide` op into `aten.div.Tensor_mode` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-14 08:56:25 +05:30
Bob Adolf	0a7ba62438	Allow torch-mlir to support PyTorch extensions. (#895 ) PyTorch allows new operators to be registered dynamically in modules. Torch-mlir already makes it fairly straightforward to add support for new operators, and this commit just extends that support to allow new PyTorch ops to come from a external module. This does not allow ops to be dynamically loaded into torch-mlir. Torch-mlir must still be compiled with support built-in. Add a `_torch_mlir_custom_op_example` subpackage to `torch_mlir` which registers an demonstration op. It will not be imported by default when importing torch_mlir. It's strictly for testing and documentation. Adds an end-to-end test for the `torch_mlir_custom_op_example::identity` op. With all these changes, we should now be actively testing PyTorch extension support with all future patches.	2022-06-13 14:51:30 -07:00
Maksim Levental	5c85ac3100	Handle `nn.Linear(..., bias=False)` case for TorchToLinalg (#919 )	2022-06-08 21:13:43 -05:00
Sean Silva	e1b38e74dd	Use upstream shape functions directly. Now that upstream exposes them nicely, we can use them. I noticed that we had added stuff into the upstream_shape_helpers.py file (which was supposed to stay pristine), so some more shape functions need to be upstreamed. Going forward, all shape functions should be upstreamed similar to https://github.com/pytorch/pytorch/pull/76889 instead of added in this file.	2022-06-07 11:15:03 -07:00
Vivek Khandelwal	b95b3d844d	[MLIR][TORCH] Add E2E support for aten.div.Tensor_mode op This commit adds lowering of `aten.div.Tensor_mode` op. This commit also fixes formatting for the test file elementwise.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-07 22:26:44 +05:30
Vivek Khandelwal	a11ef674a7	[MLIR][TORCH] Add E2E support for aten.baddbmm op This commit decomposes `aten.baddbmm` op into `aten.bmm`, `aten.mul.Scalar`, and `aten.add.Tensor` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-07 22:26:28 +05:30
Vivek Khandelwal	2718b4d838	[MLIR][TORCH] Add E2E support for aten.clamp_[min\|max] op This commit decomposes `aten.clamp_min` and `aten.clamp_max` op into `aten.clamp` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-06-06 11:52:29 +05:30
Vidush Singhal	fc419b1e7d	Add E2E support for AtenLogicalOrOp. (#883 )	2022-06-03 16:21:03 -07:00
Henry Tu	abf5c94a1b	Replace valsem.aten.zero with aten.zero.functional (#893 )	2022-06-03 16:27:31 -04:00
Vidush Singhal	0a913bc904	Add E2E support for AtenAllBoolOp (#874 )	2022-06-01 18:20:25 -07:00
Ashay Rane	7fdc1cff02	build: remove manual changes to ShapeLibrary.cpp (#894 ) The patch bumped up the LLVM tag made manual fixes to the code in `ShapeLibrary.cpp`. However, since that file is generated by the `update_shape_lib.sh` script, its contents were reverted each time the script was run. This patch fixes the problem by removing the manual changes to that file.	2022-06-01 14:11:29 -07:00
Vivek Khandelwal	06750815d1	[tosa] Support for AtenAvgPool2d op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-27 07:56:37 +05:30
Vivek Khandelwal	6f548fc3ad	[MLIR][TORCH] Add decomposition of aten.adaptive_avg_pool2d op This commit adds the decomposition of `aten.adaptive_avg_pool2d` op into `aten.avg_pool2d` op. The current decomposition only supports cases where input size is equal to the output size. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-27 07:56:37 +05:30
Ashay Rane	029cd54327	build: fix code so that the compiler does not emit warnings (#871 ) When compiling without assertions (i.e. in `NDEBUG` mode), a handful of statements turn to NOPs, which results in warnings such as missing return statement or unused variables and function. This patch replaces such statements with `llvm_unreachable()`, which informs the compiler about program termination regardless of the `NDEBUG` mode. This also enables torch-mlir to be compiled using the flags `-Wall`, `-Wextra`, `-Wpedantic`, and `-Werror`.	2022-05-25 14:04:59 -07:00
Vivek Khandelwal	56e77d4213	[MLIR][TORCH] Add E2E support for aten.Bool.[float\|int] op This commit adds lowering of `aten.Bool.float` and `aten.Bool.int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-24 21:18:34 +05:30
Vivek Khandelwal	014a6d16c7	[MLIR][TORCH] Add E2E support for aten.any.bool op This commit adds lowering of `aten.any.bool` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-24 17:24:28 +05:30
Vivek Khandelwal	bc9b2156e3	[MLIR][TORCH] Add E2E support for aten.sqrt.int op This commit adds lowering of `aten.sqrt.int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-24 16:50:39 +05:30
Ashay Rane	f18b2be911	torch,linalg: add support for translating aten.linalg.vector_norm (#839 ) This patch adds support for the torch.linalg.vector_norm op to the torch dialect, including the necessary shape function. It also extends the conversion of reduction operators to support lowering of AtenLinalgVectorNormOp, in addition to adding a handful of end-to-end tests to validate the lowering. There exist several opportunities to make this lowering optimal and robust. For instance, in its current form, the translation does not support ord = 0, +inf, or -inf. For L1 norms, we don't need to raise each element to the power 1.0. Similarly, L2 norms could benefit from strength reduction. Since the canonicalization pass is not able to apply these optimizations, we should consider applying them during the linalg lowering itself.	2022-05-19 15:48:15 -07:00
Sean Silva	3fb54cba4c	torch.prim.TupleIndex: Adjust tensor types when folding. In cases where a refinement/derefinement was needed, we didn't fold. Fixes https://github.com/llvm/torch-mlir/issues/863	2022-05-19 09:36:27 -07:00
Ashay Rane	bb52a460cb	mlir: bump llvm tag to 5380e3 (#856 ) In addition to updating the llvm-project submodule, this patch also: 1. updates shape functions and tests so that `func` and `call` operations refer to the `func` dialect 2. avoid duplicate registration of dialects	2022-05-16 12:54:35 -07:00
Ramiro Leal-Cavazos	96f90efd16	Add shape info to `rand_like` + support for `dtype` flag (#851 ) The op `aten.rand_like` was missing a shape function, unit tests, and the `dtype` argument was being ignored in its decomposition. This commit fixes all three things.	2022-05-12 16:00:59 -07:00
Vivek Khandelwal	f15d257aac	[MLIR][TORCH] Add support for ceil_mode = true for pooling ops This commit adds support for aten.max_pool2d, aten.max_pool2d_with_indices, and aten.avg_pool2d op for the cases where ceil_mode = true. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-11 12:52:47 +05:30
Vivek Khandelwal	c69a1e5688	[MLIR][TORCH] Add E2E support for ScalarImplicit, Int.Scalar op This commit adds lowering of `aten.ScalarImplicit` and `aten.Int.Scalar` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-10 22:40:49 +05:30
Prashant Kumar	12b3af70d3	[TORCH] Add folding of aten.detach op. `aten.detach` op is folded and returns the first operand since it's an identity function(kind of identity just remove the has_grad attribute).	2022-05-10 21:54:45 +05:30
Prashant Kumar	2b1b0f6e19	[LINALG] Add support for preserve memory format in aten_empty_like op. The preserve memory specifies that `If any of the input tensors is in channels_last format, operator output should be in channels_last format` and hence can be added as is in aten_empty_like op.	2022-05-10 09:37:55 +05:30
Yi Zhang	28be6511d2	Fix type promotion code for scalar only operations Fix the type promotion code for scalar only operation to return TorchType which is the type tracked in ValueKnowledge.scalarType. - Fix `getPromotedResultScalarType` to return Torch type. - Add `getBuiltInTypeForTorchScalar` helper to convert scalar type to builtin type before passing to the next level type promotion helper `updateResultTypeState`. - Add `setScalarType` helper to make setting ValueKnowledge.scalarType easier.	2022-05-07 10:37:21 -04:00
Vivek Khandelwal	96fabc0036	[MLIR][TORCH] E2E support for [ge\|ceil].float, [ge\|ne\|gt].float_int op This commit adds lowering of `aten.ge.float`, `aten.ge.float_int`, `aten.ne.float_int`, `aten.gt.float_int` and `aten.ceil.float` op. This commit also fixes formatting for the file scalar.py and scalar_comparison.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-05 21:48:35 +05:30
Kristof Denolf	e682b1d0f3	changed name option to decompose-complex-ops	2022-05-05 00:38:51 -07:00
Kristof Denolf	5243638e33	add no decompose option	2022-05-05 00:38:51 -07:00
Yi Zhang	9f7264a7a4	Add support for scalar type propagation The main changes are: - Added `ValueKnowledge.scalarType` to track scalar type information. - Added `ValueKnowledge.kind` to indicate the value kind. - Modified the meet and join helper functions. The ValueKnowledge has slightly more complicated state now so the meet and join function need to look at the `kind` field in addition to just the type field.	2022-05-04 16:57:56 -04:00
Gaurav Shukla	4b911ada40	[LINALG] Add E2E support for `aten.mean.dim` op - This commit adds support for `aten.mean.dim` op. - It also adds a new test script `stats.py` for statistics related ops. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-05-04 20:11:42 +05:30
Sean Silva	32159c4e54	Fix TupleIndex canonicalizer. It would change the result type.	2022-05-03 09:08:49 -07:00
Sean Silva	ab5ad7af09	Add tracing suport to `torch_mlir.compile`. This also has a fix for the adjustment of types of TupleConstruct inputs, which I found when using this new functionality on a model. Some scenarios in tracing create situations where the output of TupleConstruct has a more refined type than the inputs. This introduces a helper `adjustStaticInformationForValues` which subsumes the `derefineValues` helper and the tensor static information adjustment we were doing.	2022-05-03 09:08:40 -07:00
Vivek Khandelwal	c0634bc996	[MLIR][TORCH] Add E2E support for aten.to.dtype_layout op This commit decomposes `aten.to.dtype_layout` op into `aten.to.dtype` op. This commit also fixes the formatting for the file type_conversion.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-03 12:48:58 +05:30
gpetters94	c4dcdd1e34	Add aten.flip (#817 )	2022-05-02 16:01:15 -04:00
Vivek Khandelwal	8a06419980	[MLIR][TORCH] Add E2E support for aten.masked_fill.Scalar op This commit adds lowering of `aten.masked_fill.Scalar` op. This commit also fixes the formatting of the file constant_alloc.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-02 22:27:33 +05:30
Vivek Khandelwal	4b11284440	[MLIR][TORCH] Add E2E support for aten.avg_pool2d op This commit adds lowering of `aten.avg_pool2d` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-05-02 12:31:44 +05:30
Prateek Gupta	81ee5bb58c	[TORCH][MLIR] Fix ConstantPad2dStaticModule test. This commit fixes the `ConstantPad2dStaticModule` test case by adding the lowering of `aten.pad` operation. Previously the test case mapped to `aten.constant_pad_nd` operation. The `aten.pad` now decomposes into `aten.constant_pad_nd` operation. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2022-04-29 21:57:01 +05:30
Ashay Rane	809f240f01	importer: add initial support for loading BFloat16 tensors (#761 ) This patch updates the `torch_mlir::convertTensorToMlirElementsAttr()` method to enable the creation of tensors whose base type is BFloat16. This patch also adds a test to validate the IR generation, and it updates the test for importing tensors of various types.	2022-04-29 09:01:49 -07:00
Prateek Gupta	e1db318a3c	[TORCH][MLIR]Add lowering for control flow operations. 1. This commit adds lowering of "while-like" prim loop to scf.while operation. 2. Adds lowering of "for-like" prim loops to scf.for operation. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2022-04-29 16:25:58 +05:30
Sean Silva	44c7b181d3	Revert "[MLIR][TORCH] Add E2E support for aten.ge.float op" This reverts commit `564734b2d7`.	2022-04-28 07:49:58 -07:00
Sean Silva	eff144c0b7	Revert "[MLIR][TORCH] Add E2E support for aten.ge.float_int op" This reverts commit `1f102cc400`.	2022-04-28 07:49:58 -07:00
Sean Silva	7669ee4e4a	Revert "[MLIR][TORCH] Add E2E support for aten.ne.float_int op" This reverts commit `51dd462592`.	2022-04-28 07:49:58 -07:00
Sean Silva	5ef9f501fa	Revert "[MLIR][TORCH] Add E2E support for aten.ceil.float op" This reverts commit `78f5747568`.	2022-04-28 07:49:58 -07:00
Vivek Khandelwal	e57e1968bc	[MLIR][TORCH] Add E2E support for aten.index_put.hacked_twin op This commit decomposes `aten.index_put.hacked_twin` op into `valsem.aten.index_put_impl` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-28 13:41:47 +05:30
Vivek Khandelwal	78f5747568	[MLIR][TORCH] Add E2E support for aten.ceil.float op This commit adds lowering of `aten.ceil.float` op. This commit also fixes formatting for the file scalar.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-28 11:49:35 +05:30
Vivek Khandelwal	51dd462592	[MLIR][TORCH] Add E2E support for aten.ne.float_int op This commit adds lowering of `aten.ne.float_int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-27 21:16:48 +05:30
Vivek Khandelwal	1f102cc400	[MLIR][TORCH] Add E2E support for aten.ge.float_int op This commit adds lowering of `aten.ge.float_int` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-27 21:16:48 +05:30
Vivek Khandelwal	564734b2d7	[MLIR][TORCH] Add E2E support for aten.ge.float op This commit adds lowering of `aten.ge.float` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-27 21:16:48 +05:30
Vivek Khandelwal	f5b6c4b601	[MLIR][TORCH] Add E2E support for aten.div.float op This commit adds lowering of `aten.div.float` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-27 21:16:48 +05:30
Ashay Rane	9208bf0eb6	llvm: bump tag to e1318078 (#781 ) The updated LLVM code includes a patch to create bfloat16 array attributes, thus enabling a different patch to torch-mlir to flesh out support for the bfloat16 type.	2022-04-26 12:27:51 -07:00
Ashay Rane	9ec4712516	types: allow bf16 as result type for various tensor ops (#798 ) Prior to this patch, the result type for several tensor operations could only be float32, float64, or null. This patch adds bf16 to the list of allowed result types.	2022-04-26 11:55:58 -07:00
Prashant Kumar	5cdef0213d	[LINALG] Bug fix i64 vs i32 type comparison. Comparing index type instead of integer types solves the problem.	2022-04-22 08:09:58 +05:30
Prashant Kumar	33c9d256ea	[REFBACKEND] Add support for returning multiple different return types. Added the dynamic registration of return function to the execution engine. This makes sure that different/multiple return types are supported. Also, updated the .style.yapf indentation to 4.	2022-04-21 09:02:30 +05:30
Vivek Khandelwal	769f3a8870	[MLIR][TORCH] Add E2E support for max_pool2d_with_indices op This commit adds lowering of `max_pool2d_with_indices` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-18 21:05:19 +05:30
Ashay Rane	a893c7d5cf	Add shape transfer function and lowering to linalg for aten.neg (#759 ) * shape: add shape transfer function for aten.neg Prior to this patch, the list of shape transfer functions did not include `aten.neg`, which resulted in errors like below. ``` error: unsupported by backend lowering: tensor with unknown rank or dtype note: see current operation: %0 = "torch.aten.neg"(%arg0) : (!torch.vtensor<[256,256],f32>) -> !torch.vtensor<,f32> note: this is likely due to a missing shape transfer function in shape_lib_gen.py ``` This patch fixes the problem by adding a shape transfer function to reflect the point-wise nature of this operation. linalg: add translation of aten.neg operation This patch adds a translation rule to lower `aten.neg` operations on tensors to an `arith.negf` operation wrapped inside a `linalg.generic` operation. This patch also adds a rudimentary test.	2022-04-15 11:11:22 -07:00
Vivek Khandelwal	1bccb4fc8a	[MLIR][TORCH] Add E2E support for aten::max_pool2d_with_indices_backward op This commit adds lowering of `aten::max_pool2d_with_indices_backward` op. This commit also fixes formatting issues in basic.py. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-04-14 21:46:47 +05:30
Maksim Levental	24f9de7120	Fixes https://github.com/llvm/torch-mlir/issues/751 where `torch.bool` is parsed as signless `i1`. (#752 )	2022-04-13 12:28:27 -05:00
gpetters94	9ec0683e92	Add 2D case for convolution (#693 )	2022-04-08 00:47:57 -04:00
Sean Silva	e7721fb784	Fix error message. RefineTypes doesn't handle shape refinement anymore.	2022-04-07 14:46:44 -07:00
Prashant Kumar	1d5b5a89e8	[LINALG] Add torch.layout information torch.layout information has been added.	2022-04-07 20:47:49 +05:30
Prashant Kumar	fb8cb0c5f3	[LINALG] Add the lowering of `aten.ne.Scalar` op The lowering of `aten.ne.Scalar` op has been added to the linalg backend.	2022-04-05 21:07:28 +05:30
Ramiro Leal-Cavazos	5620fe030e	Add 1D, weight, and reduction support to nll_loss_backward (#729 ) This commit adds the following support to the op `nll_loss_backward`: - `input` tensor can be rank-1 - `weight` parameter - `reduction` parameter - `target`, `grad_output`, `total_weight` can be rank-0 - Checks that input tensors are of the expected type	2022-04-04 10:57:49 -07:00
Ramiro Leal-Cavazos	51d4d55f8a	Add support for multi-dim input to `index_put_impl` (#722 ) This commit adds support for multi-dimensional tensors as input to the `_index_put_impl_` op. The support was to some degree already there, since `ScatterOp` already supports multi-dimensional tensors. This commit also adds a bit more error checking to `index_put` and refactors the code for creating `ScatterOp`s to mimic the way one would make a `Linalg::GenericOp`.	2022-03-31 09:27:21 -07:00
Anup Gangwar	ccf924d3df	tosa] Support for Aten[Gelu\|GeluBackward] ops (#720 ) Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2022-03-30 17:00:55 -07:00
Sean Silva	c17c0a6ba2	Fix for 0-size dim inferred incorrectly. The issue was in the canonicalizer for torch.aten.ge.int -- in cases where the operands were swapped, it would miscompile. This issue is fixed and folding support generalized to `torch.aten.size.int < 0` as well. Fixes #716	2022-03-30 16:36:15 -07:00
Gaurav Shukla	969785d1b6	[LINALG] Add E2E support for `aten.where.[Scalar\|ScalarSelf\|ScalarOther]` ops This commit decomposes different variants of `aten.where.*` op into `aten.where.Self` op. It covers `aten.where.Scalar`, `aten.where.ScalarSelf` and `aten.where.ScalarOther` ops. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-30 20:36:48 +05:30
Vivek Khandelwal	2597c481f6	[MLIR][TORCH] Add E2E support for aten.new_empty op This commit decomposes `aten.new_empty` op into `aten.empty.memory_format` op. This commit also made a dtype fix to the constant tensor allocation like ops. Earlier the dtype for the result was inferred from the result type; now, it's being evaluated as per the original definition of the op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-30 13:21:01 +05:30
Sean Silva	140babd952	Add minimal support for Union types. A recent PyTorch commit made ConstantPad2d call a helper function with a `Union[int, float]` type annotated. This commit adds minimal support for representing and dealing with that. https://github.com/pytorch/pytorch/pull/73287 Changes: - Adding support for `!torch.union<T1, T2, T3>`/`Torch::UnionType`, along with the importer and CAPI code. - Add support in isValidSubtype for union types. - Adding a canonicalizer for `torch.derefine` to help simplify some code that derefines to a UnionType (this also fixes #664). There is still more work to do for really supporting UnionType well, such as canonicalizing UnionType's so that they can be compared with pointer equality.	2022-03-29 17:45:48 -07:00
Liam Fitzpatrick	f2269ced80	Improve list index normalization SimplifyShapeCalculations. (#710 ) The reified code to compute the shape of torch.aten.constant_pad_nd uses negative indices when setting list elements. This was not converted to a positive offset in one place in SimplifyShapeCalculations which prevented computation of the static shape.	2022-03-29 22:21:47 +02:00
Maksim Levental	25ba51b2af	This commit decomposes aten._reshape_alias op into aten.view op. (#690 )	2022-03-28 23:54:28 -05:00
Sean Silva	520725cdc5	Fix bad rename from "pseudo" to "valsem".	2022-03-28 20:40:42 +00:00
Sean Silva	776426ea4e	[SimplifyShapeCalculations] Fix AbstractlyInterpretListOpsWithinABlock The logic in the rewriting phase had a bug in case of a read-only op coming before mutation ops. The logic would use the op itself as the "latest literal", but that is not correct, because later on we replace the op itself with the final "latest literal", assuming that all uses of the op have been rewritten -- that was working in general, except for any read-only ops at the beginning. Big thanks to @ljfitz for the tiny reproducer! Fixes #704	2022-03-28 13:18:35 -07:00
Anup Gangwar	5d7a6c2976	[tosa] Support for Aten[Unsqueeze\|Contiguous\|Dropout\|Reshape\|View] ops (#700 )	2022-03-25 14:15:07 -07:00
Vivek Khandelwal	88c216da13	[MLIR][TORCH] Add support for same input and output shapes for view op This commit adds support for the cases of view op where the rank and the shapes of the input and result are equal. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-25 22:26:10 +05:30
Gaurav Shukla	02b6d04eb4	[LINALG] Add E2E support for `aten.zero_` op This commit adds decomposition of `aten.zero_` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-25 12:46:50 +05:30
Ramiro Leal-Cavazos	e966112c8d	Add final cast to TorchToLinalg conversions missing it (#692 ) In order to make sure that the TorchToLinalg conversions leave the graph in a valid state, the final result of the conversion has to be casted to the result type of the op. This commit adds this cast to ops that did not have it.	2022-03-23 13:52:32 -07:00
Qiang Fu	f7c7bb800c	Add non-default dtype support for a few elementwise math ops. (#687 ) * fix type inference * fix Torch2Linalg conversion * add test cases	2022-03-23 13:35:43 -07:00
Ahmed Taei	f9d34596e8	[NFC] Split BackendTypeConversion -> (BackendTypeConversion, BackendTypeConversionPasses)	2022-03-22 13:56:18 -07:00
Gaurav Shukla	7c3ba25238	[LINALG] Add decomposition of `aten.dropout` op - This commit adds decomposition of `aten.dropout` op. It also covers the training mode of the same op. - It also adds lowering of `aten.sub.float` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-22 13:14:49 +05:30
Sean Silva	729402c3f4	Reduce compilation time for TorchOps.cpp.inc The `assemblyFormat` stuff (which generates unrolled, per-op C++ code) was taking up a lot of compile time, and all the ops are essentially printed with the same logic. So this PR makes them all call the same helper function. This is done by using `let hasCustomAssemblyFormat = 1` and then implementing `FooOp::parse` and `FooOp::print`. Additionally, the `Generated*Ops.td` files are all collapsed into just `GeneratedTorchOps.td` (there is no reason to have the files separate, since the files are very large anyway so one is always having to search within them -- editors don't care that the file to search is now a bit bigger :) ). This reduces TorchOpsODSGenerated.cpp compile time (which is now GeneratedTorchOps.cpp) from 39 to 31 seconds on my machine. This is actually less than I expected, but this PR is an overall cleanup to the code anyway. The next step will be to introduce (better) functionality upstream for sharding the TorchOps.cpp.inc file, so that we can truly parallelize the O(#ops) costs. This is also necessary, because after this PR, TorchDialect.cpp is now the slowest file to compile, due to the `addOperations<... all the ops ...>` call, which needs to be shareded too.	2022-03-21 14:42:26 -07:00
Vivek Khandelwal	5b9bdfaf3f	[MLIR][TORCH] Add E2E support for aten._to_copy op This commit decomposes `aten._to_copy` op into `valsem.aten.copy` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 19:12:37 +05:30
Vivek Khandelwal	13383b03b8	[MLIR][TORCH] Add value tensor variant to aten::copy_ op This commit adds the op `ValsemVariantAtenCopyOp` that represents `AtenCopy_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenCopyOp`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 19:12:37 +05:30
Vivek Khandelwal	4c0cd5c23d	[MLIR][TORCH] Add E2E support for aten.expand_as op This commit decomposes `aten.expand_as` op into `aten.broadcast_to` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-21 12:47:39 +05:30
Vigilans	63fb1e5aad	Bump LLVM at 8361c5da30588d3d4a48eae648f53be1feb5cfad	2022-03-18 13:16:14 -04:00
Ramiro Leal-Cavazos	218b4875d5	Make conditions for type refinement of static cast less strict (#680 ) This commit adds support for type refinement when `torch.tensor_static_info_cast`s are involved, even when there are users of the casted tensor that don't allow type refinements. Originally the canonicalization pattern for `torch.tensor_static_info_cast` would check if all the users of the casted tensor allowed type refinements before making any changes. This means that if at least one of the users did not allow type refinements, the pattern would fail. This becomes an issue when doing shape calculations because the calculations need the shape information of each input tensor to be available before the calculation can be simplified.	2022-03-18 09:10:12 -07:00
Prateek Gupta	7256c9e395	[TORCH][MLIR] Fix the return types of `aten.native_layer_norm`. This commit fixes the 2nd and 3rd return types of the `aten.native_layer_norm`. Previously the mean and rSTD were returned with reduction dims removed. This commit fixes this and keeps the reduction dims of the results. Signed-Off-By: Prateek Gupta <prateek@nord-labs.com>	2022-03-17 12:08:32 +05:30
Sean Silva	3b66b4925a	Make TorchOps.cpp faster to iterate on. The ODS-generated code included via the `TorchOps.cpp.inc` file takes a very long time to compile. This PR isolates it into its own file so that the build system can cache it. This PR creates a new file `TorchOpsODSGenerated.cpp` just to include the `TorchOps.cpp.inc` file. Doing so required moving to the "new" way to define verifiers, since the static `verify` free functions in TorchOps.cpp weren't accessible from the .inc file after it was moved to `TorchOpsODSGenerated.cpp`. On my machine, this drops the build time of TorchOps.cpp (such as when iterating on a canonicalizer) from >40 seconds to <10 seconds. 10 seconds still isn't great though, but at least it isn't "go get a coffee" type of waiting.	2022-03-16 09:33:12 -07:00
Vivek Khandelwal	8da7d90611	[MLIR][TORCH] Add E2E support for aten.index_put op This commit decomposes `aten.index_put` op into `valsem.aten.index_put_impl` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-16 22:02:02 +05:30
Vivek Khandelwal	3d95c3d6c9	[MLIR][TORCH] Add value tensor variant to aten::_index_put_impl_ This commit adds the op `ValsemVariantAtenIndexPutImplOp` that represents `Aten_IndexPutImpl_Op` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly. This commit also adds the lowering of `ValsemVariantAtenIndexPutImplOp` op. This commit also updates the `torch.bincount` op test cases.	2022-03-16 22:02:02 +05:30
Ramiro Leal-Cavazos	0bcc6d1075	Add maximize-value-semantics support for multiple non-value tensor inputs (#659 ) This commit adds value semantics support for ops such as `aten.view_as` and `aten.expand_as` that take two non-value tensors as input.	2022-03-15 18:13:45 -07:00
Sean Silva	92da4988f0	Improve "pseudo" op terminology. The term "pseudo" is very vague and was getting confusing (I felt I had to explain it in every comment referencing it). Instead, rework the "pseudo" ops to instead be named: - MLIR Syntax: `torch.valsem.` - C++ / ODS: `ValsemVariantOp` This makes it clear what the concept is, and avoids confusion with other things that might be called "pseudo", since these are very specific and should be 100% consistently named w.r.t. the non-valsem-variant ops that they correspond to.	2022-03-15 17:57:52 -07:00
Sean Silva	7ea50a537a	Avoid `using` the `torch_upstream` namespace. This is code that we always want to treat as "foreign" and not get too comfortable using in many functions. One way to accomplish that is to make it a bit clunkier to use. Also, fix Utils.cpp to match the LLVM/MLIR coding conventions (don't define functions inside namespaces -- prefer `using` and explicit qualification).	2022-03-15 17:24:17 -07:00
Sean Silva	84a9693006	Elide `!torch.` prefix in nested dialect types. This leads to much more succinct types in many cases: ``` !torch.list<!torch.int> !torch.list<int> !torch.tuple<!torch.list<!torch.int>, !torch.list<!torch.int>> !torch.tuple<list<int>, list<int>> !torch.optional<!torch.list<!torch.int>> !torch.optional<list<int>> !torch.list<list<list<tensor>>> !torch.list<!torch.list<!torch.list<!torch.tensor>>> ``` I would like to take this further and allow omitting the `!torch.` prefix in all cases, but that's harder -- for example, we currently use `FuncOp` for functions, and so I don't think we can customize the printing there. It seems like it will be a longer road to getting that level of customization.	2022-03-15 17:24:08 -07:00
Sean Silva	a5fe0cf063	Introduce new shape library design. See the documentation in `docs/shape_lib.md` and `docs/adding_a_shape_function.md` for an overview of the system. This completely overhauls how we represent shape functions. In particular, RefineTypes does not infer shapes anymore (only dtypes). Shape functions are now written in (TorchScript'able) Python. Recommended review order: 1. Read `docs/shape_lib.md` and `docs/adding_a_shape_function.md`. 1. Code and tests for ReifyShapeCalculations, DropShapeCalculations. 1. Code and tests for SimplifyShapeCalculations. 1. shape_lib_gen.py 1. Code and tests for new RefineTypes pass. 1. Random folders/canonicalizers in TorchOps.cpp and associated test in `canonicalize.mlir`. 1. New ReadOnly trait inferred from the registry. 1. Any miscellaneous remaining stuff. Example `-print-ir-after-all` for ElementwiseUnaryModule: [IR lowering dump](https://gist.github.com/silvasean/e4dc8cbc8d00aac7819602e3cbd8e212). Example `-print-ir-after-all` for ElementwiseBinaryModule: [IR lowering dump](https://gist.github.com/silvasean/daf6860ecced732af3568af6b1899113).	2022-03-15 12:41:58 -07:00
Sean Silva	5d9222383c	Split up TorchToLinalg.cpp This helps keep things organized and also exposes more parallelism to the build system. It seems though that most of the compile time is actually spent in the headers though, so the wall time doesn't decrease as much as I had hoped (and now that the headers are being included multiple times, the cpu time actually increases a lot, sadly -- will try to dig into this).	2022-03-14 10:19:41 -07:00
Ramiro Leal-Cavazos	51e267aa37	Combine maximize-value-semantics rewrite patterns into one pattern (#642 ) This commit replaces the two rewrite patterns of maximize-value-semantics with a single pattern that captures the behavior of both as well as other edge cases previously not supported. The new pattern works by first performing alias analysis on a subgraph to see if pattern is applicable, then rewriting all non-value tensors to value tensors in a single go.	2022-03-10 09:36:52 -08:00
Prateek Gupta	3d9ba5e525	[MLIR][TORCH] Add E2E support for aten.erf op. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2022-03-09 22:22:03 +05:30
Vivek Khandelwal	1a2a9e066f	[MLIR][TORCH] Add TorchToTMTensor pass This pass is added to lower ops, which can not be lowered via the TorchToLinalg pass, such as `torch.bincount` op. This pass also uses torch-mlir's TMTensor Dialect to lower the complex ops. Also add torch.bincount op lowering with the help of TMTensor dialect Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Vivek Khandelwal	b2952b12dd	[MLIR][TORCH] Move common helper functions to Utils.cpp This commit moves the helper function which are common across different torch-mlir conversion passes into a common directory Utils. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Vivek Khandelwal	bf463d1f36	[MLIR][TORCH]Add support for integer-type inputs for sum and max op This commit adds support for integer type inputs for `AtenMaxOp`, `AtenSumOp`, `AtenSumDimIntListOp`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-08 22:52:34 +05:30
Gaurav Shukla	e57d3f9774	[LINALG] Fix `aten.bernoulli` op lowering - This commit adds E2E support for `aten.rand_like` and `aten.bernoulli_.Tensor` ops. - The `aten.bernoulli(x)` was implemented as: `aten.bernoulli(x) = rand_like(x) < 0.5`, assuming 0.5 as default probability, whereas according to the pytorch documentation: https://pytorch.org/docs/stable/generated/torch.bernoulli.html#torch.bernoulli the input x in `aten.bernoulli(x)` is itself a tensor containing probabilities to be used for drawing the binary random number. - So this commit fixes the `aten.bernoulli(x)` implementation as: `aten.bernoulli(x) = rand_like(x) < x`. - It also fixes the case where the input to `aten.bernoulli_.float` is an integer tensor. In this case the input must be casted to float type before passing it as operand to `aten.rand_like` op. `aten.bernoulli_.float(x, p) = rand_like(float(x)) < p`. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-03-05 09:38:22 +05:30
Vivek Khandelwal	af551bd9cd	[MLIR][TORCH] Add E2E support for aten.full_like op This commit decomposes `aten.full_like` op into `aten.empty_like` and `aten.fill` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-04 21:58:23 +05:30
Vivek Khandelwal	d61ae92eee	[MLIR][TORCH] Add E2E support for aten.full op This commit decomposes `aten.full` op into `aten.empty` and `aten.fill` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-04 21:58:23 +05:30
Ramiro Leal-Cavazos	9ce62473f9	Add static type information support to `aten.bmm` (#636 ) This commit adds static type information support to `aten.bmm`. This is needed for the forward pass of Bert training.	2022-03-03 13:01:17 -08:00
Ramiro Leal-Cavazos	5ec70c175d	[LINALG] Add torch-to-linalg lowering for `TensorStaticInfoCastOp` (#634 ) This commit adds a lowering for `TensorStaicInfoCastOp` that simply replaces the op with the `tensor::CastOp`.	2022-03-02 13:35:26 -08:00
Ramiro Leal-Cavazos	298eeb79ca	[LINALG] Add handling of unknown dimension in size list of `view` op (#633 ) The view op allows for the new shape argument to have a -1 value for one of the dimensions, and the op is expected to deduce the size of that dimension by looking at the sizes of the other dimensions and comparing it to the total number of elements in the original tensor. This commit adds this functionality.	2022-03-02 13:35:01 -08:00
Yi Zhang	1d285f0153	Add aten.hardtanh e2e support.	2022-03-02 12:28:06 -05:00
Prashant Kumar	819f29316f	Decompose aten.silu op Decomposition of aten.silu.op is added as silu(x) = x * sigmoid(x).	2022-03-01 23:24:19 +05:30
Vivek Khandelwal	ddd45d6068	[MLIR][TORCH] Add E2E support for aten.new_zeros, aten.new_ones op This commit adds lowering of `aten.new_zeros` and `aten.new_ones` op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-03-01 22:09:47 +05:30
Ramiro Leal-Cavazos	1dba4fcbd7	[LINALG] Support for contiguous memory format in `clone` and `empty` (#628 ) This commit adds support for the contiguous memory format for the ops `AtenCloneOp` and `AtenEmptyMemoryFormatOp`.	2022-02-28 13:58:04 -08:00
Ramiro Leal-Cavazos	58abec5c0a	Add `reduction` support to `torch.nll_loss_forward` (#624 ) This commit does a couple of things. First, it fixes a bug in the `linalg.generic` body of the `nll_loss_forward` lowering where the `ignoreIndex` was being compared with the loop index rather than the current element of the `target` tensor. This was not being caught by the tests because they were not testing the case where `ingnoreIndex` actually corresponds to a value in `target`. This has been fixed. Second, this commit adds support for the `reduction` argument in `torch.nll_loss_forward` as well as support for 1-D inputs. In order to simplify the lowering code, I've refactored the code that creates the `linalg.generic` ops for elementwise and reduction ops into static functions, to avoid having boilerplate code for indexing maps, etc that can be very error prone. Note: The function `convertScalarToDtype` was moved to before all the conversion patterns, but nothing in it was modified.	2022-02-28 11:01:23 -08:00
Prashant Kumar	7c637eebc3	[LINALG] Decompose aten_hardswish op. `aten.hardswish` op is decomposed into (x/6) * Relu6(x+3).	2022-02-25 21:59:27 +05:30
Gaurav Shukla	056cd2078d	Revert "[LINALG] Decompose `aten.batch_norm` into `aten.native_batch_norm`" This reverts commit `442ff4605c`.	2022-02-25 15:46:55 +05:30
Ramiro Leal-Cavazos	ba29d4f250	Add operand type invariant to `torch.overwrite.tensor.contents` (#606 ) This commit adds the invariant to the op `torch.overwrite.tensor.contents` that both of its operands have the same shape and size. In order to maintain the invariant, special handling of this op is added to the `RefineTypes` pass.	2022-02-22 11:41:46 -08:00
Ramiro Leal-Cavazos	ea371a9bf2	Fix handling of view-like ops in `maximize-value-semantics` (#611 ) This commit adds handling to the `maximize-value-semantics` pass for the case where a view-like op depends on a tensor that has been overwritten by a value tensor. The approach for removing the dependency is to change the input to the view-like op to be a copy of the value tensor that is being used to overwrite. This commit also removes `AtenFill_ScalarOp` and `AtenBernoulli_FloatOp` from the list of view-like ops, since these ops now have a corresponding op with value semantics into which they get converted in the `reduce-op-variants` pass.	2022-02-18 10:19:07 -08:00
Ramiro Leal-Cavazos	2823277f7c	Add static type information support to `aten.mm` (#602 ) This commit adds static type information support to `aten.mm`. This is needed for the forward pass of Bert training.	2022-02-18 09:56:48 -08:00
Prashant Kumar	abbde7d439	[TORCH] The torch definition related to aten.gelu has changed. New str argument approximation is added.	2022-02-18 21:57:46 +05:30
Prashant Kumar	ed9bd556b3	Fix bug for aten_nll_loss op in the refine types pass The check for `self.hasSizes` was missing before performing `.size()` operation.	2022-02-17 19:02:12 +05:30
Nirvedh	f8cb32faf0	LLVM bump Major changes: opTrait changed to Trait, selectOp moved to arith dialect assertOp moved to cf dialect	2022-02-16 15:28:13 -05:00
Gaurav Shukla	442ff4605c	[LINALG] Decompose `aten.batch_norm` into `aten.native_batch_norm` - This commit decomposes the `aten.batch_norm` op into the `aten.native_batch_norm` op, instead of lowering it to the `linalg.generic` op. - It also adds run-time asserts in the `aten.native_batch_norm` lowering to make sure that the shape of the weight, bias, running_mean, and running_var must match the num of features. - Since the `aten.native_batch_norm` op is not supported at TOSA backend, all the modules that are dependent on the `aten.native_batch_norm` op will fail and therefore they should be removed from the TOSA `passing` set. - It also moves `checkNotNone` to utility. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-16 23:41:38 +05:30
Anup Gangwar	c60468f141	[tosa] Support for Aten[Zeros\|Ones\|Fill_Scalar] ops (#604 ) Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2022-02-16 09:53:51 -08:00
Prashant Kumar	8b79b5f48f	Modify aten._log_softmax op decomposition for numerical stability. `aten.log_softmax` is decomposed to be more numerically stable.	2022-02-16 12:26:17 +05:30
Yi Zhang	869daf3c22	Add TMTensor dialect to torch-mlir This is intended to explore support for non-structured ops that can't be modeled by Linalg dialect. `tm_tensor.scan` and `tm_tensor.scatter` are added as the first such ops. The dialect should aim to be upstreamed in the future.	2022-02-15 16:45:38 -05:00
Gaurav Shukla	cd21dda867	[LINALG] Add E2E support for `aten.Hardsigmoid` op This commit adds lowering of `aten.Hardsigmoid` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-16 02:35:18 +05:30
Ramiro Leal-Cavazos	00a6e9c1bb	[LINALG] Add value tensor variant to `fill_.Scalar` (#600 ) This commit adds the op `PseudoAtenFillScalarOp` that represents `AtenFill_ScalarOp` without the underscore. The approach is the same as in commit `dd998fa4d4`. Adding this op allows for a simpler and more consistent version of the `empty` and `empty_like` op e2e tests.	2022-02-15 11:58:03 -08:00
Gaurav Shukla	41acde599b	[LINALG] Add E2E support for `aten.[le\|ge].Scalar` ops - This commit adds lowering of `aten.le.Scalar` and `aten.ge.Scalar` ops as a part of `convert-torch-to-linalg` pass. - It also creates a new test script `elementwise_comparison.py` for all element-wise comparison ops. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-15 12:21:09 +05:30
Ramiro Leal-Cavazos	413e6000d2	[LINALG] Add value tensor variant to `bernoulli_.float` (#597 ) This commit adds the op `PseudoAtenBernoulliFloatOp` that represents `AtenBernoulli_FloatOp` without the underscore. This is needed to make sure that the `ReduceOpVariants` pass turns the in-place op into an op that takes value tensors as inputs, otherwise the `MaximizeValueSemantics` pass will not be able to add value semantics correctly.	2022-02-14 18:58:48 -08:00
Anup Gangwar	dfc07d11d7	Fix compiler warning introduced in PR575 (#593 )	2022-02-14 12:45:19 -08:00
Gaurav Shukla	78c7844c6c	[LINALG] Add E2E support for `aten.eq.int` op - This commit adds lowering of `aten.eq.int` op as a part of `convert-torch-to-std` pass. - It also refactors the code for binary comparison ops lowering. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-15 01:37:35 +05:30
Gaurav Shukla	f00d1686c8	[LINALG] Add E2E support for `aten.[Bool.Tensor\|Float.Tensor]` op - This commit adds lowering of `aten.Bool.Tensor` and `aten.Float.Tensor` op as a part of `convert-torch-to-linalg` pass. - It also adds support for returning bool types. - It also fixes lowering of the `aten.Int.Tensor` op for non-zero rank input tensors. - If a scalar number is converted to a 0-d tensor and passed on to the `aten.Float.Tensor` op, it folds to the scalar number. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-14 23:09:20 +05:30
Yi Zhang	9e7b6cab08	Add folder for aten.gt/lt.float	2022-02-14 12:34:01 -05:00
Ramiro Leal-Cavazos	3dc7847348	[LINALG] Fix linalg generic result type argument in TorchToLinalg (#588 ) Some of the lowerings use the result type obtained from the op itself to tell the `linalg::GenericOp` what the type of the result should be rather than using the type of the result tensor given to the `linalg::GenericOp`. This becomes a problem when the result type of the op has static size information and the result tensor used in `linalg::GenericOp` has dynamic dimensions, for `linalg::GenericOp` expects the result type to be equal to the type of the output tensor. This commit replaces the use of the result type from the op itself with the type of the result tensor passed to `linalg::GenericOp`. In order to not create too many dynamic/static versions of the same e2e test, e2e tests have only been added to the ops that currently fail when used with static sizes.	2022-02-11 19:42:18 -08:00
Yi Zhang	ce4d6d1f83	Remove hacky aten.select.int lowering code	2022-02-11 18:14:58 -05:00
Anup Gangwar	756b75fb2d	[tosa] Support for some ops and fix for Issue #532 (#575 ) * [tosa] Support for AtenNe[Tensor\|Scalar]Op, AtenLog2Op, AtenBitwiseAndTensorOp, AtenSquareOp and AtenThresholdOp * Fix for Issue #532 - Mixed input types for few ops and updated few tests to use i32 instead of i64 Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2022-02-11 12:30:02 -08:00
Ramiro Leal-Cavazos	c1167853db	Fix error in RefineTypes for constant alloc ops (#579 ) This commit fixes an error in the refine types pass of constant allocation ops. The function used to set the dtype, `fillInDtypeGivenDtypeAndDataType`, takes two torch types as arguments, but a torch type and a standard MLIR type were being passed into it. This commit also fixes the way the dtype was calculated in `visitAtenToDtypeOp`. This op was also passing a standard MLIR type as an argument to the `fillInDtypeGivenDtypeAndDataType` function. Moreover, since the op `aten.to.dtype` has the dtype argument as not optional, all that is needed is to match against the int value to extract the dtype.	2022-02-10 18:02:18 -08:00
Prashant Kumar	258660deb6	Add aten.bernoulli decomposition. aten.bernoulli is decomposed to aten.gtTensor(aten.uniform(x), x).	2022-02-11 00:35:33 +05:30
Prashant Kumar	102c497c4c	Add decomposition of _log_softmax op. Decompose _log_softmax into log(softmax(x)).	2022-02-10 23:17:26 +05:30
Prateek Gupta	318946a650	[TORCH][MLIR] Add E2E support for `aten._unsafe_view` op. This commit adds decomposition of `aten._unsafe_view` op into `aten.view` op. Signed-Off-By: Prateek Gupta<prateek@nod-labs.com>	2022-02-10 22:28:58 +05:30
Ramiro Leal-Cavazos	9b89f8eb3f	[TORCH][MLIR] Add E2E support for aten.clone (#571 ) This commit adds support for the aten.clone op.	2022-02-09 19:31:03 -08:00
Gaurav Shukla	bd177bdfc7	[TORCH][MLIR] Add run-time assert support in Torch-dialect - This commit adds `aten.assert` op in the Torch dialect. - The `aten.assert` op is lowered to `mlir::Assert` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-09 12:03:01 -05:00
Gaurav Shukla	2fefe68ffd	[TORCH][MLIR] Add E2E support for `aten.native_batch_norm` op - This commit adds support for `aten.native_batch_norm` operation. - The current implementation only supports inference mode of `aten.native_batch_norm` op. Signed-Off-By: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-08 02:54:03 +05:30
Prashant Kumar	d4ea39b616	Convert bool to float or integer type. Conversion of torch.bool tensor type to float and integer type is handled.	2022-02-07 21:22:22 +05:30
Anup Gangwar	f9f97ea184	* [tosa] Support for AtenNativeLayerNormOp * [tosa] Support for AtenPermuteOp Signed-off-by: Anup Gangwar <anup.gangwar@arm.com>	2022-02-04 14:46:31 -05:00
Prashant Kumar	ccf546f14c	Add aten::nll_loss_backward op The lowering of aten::nll_loss_backward op has been added from torch to linalg dialect. The changes has been made as a part of -torch-convert-to-linalg pass. Signed-off-by: Prashant Kumar prashant@nod-labs.com	2022-02-04 21:57:53 +05:30
Prashant Kumar	68acc8696e	Modify softmax decomposition to be more numerically stable. The softmax decomposition is modified according to https://github.com/pytorch/functorch/blob/main/functorch/_src/decompositions.pytorch to account for numerical stability. Also, modified aten.argmax lowering to handle negative dimension.	2022-02-03 21:20:36 +05:30
Gaurav Shukla	0079901039	[TORCH][MLIR] Add E2E support for `aten.reshape` op This commit decomposes `aten.reshape` into `aten.view` op in the case of value tensor type operand. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-02-02 20:41:47 +05:30
Suraj Sudhir	1b505cbac5	RefineTypes fixes for TOSA backend (#557 ) Handles Linear, Adaptive_AvgPool2D and FlattenUsintInts Adds ResNet18 static model for TOSA Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-02-01 14:08:54 -08:00
Yi Zhang	0cb216a1ad	[Torch][Linalg] Add basic support for RNG This PR include the following pieces: - Add torch `Generator` type. `Generator` type is converted to i64 in refbackend type converter. - Add seed managment support for the default global generator. `torch_c.getNextSeed` op is used to get the seed. On refbackend, the `torch_c.getNextSeed` is lowered to load/store from [0] of global variable `default_generator` memref<i64> in `InsertRngGlobals` pass. - Add `aten.uniform_` and testing as an example op for RNG ops. Add `torch.pseudo.aten.uniform` op. It has the same operands and return as the `aten.uniform_` from the op registry except for value semantics.	2022-01-31 18:56:42 -05:00
Suraj Sudhir	0f083e770a	[tosa] Add maxpool2d and adaptive_avgpool2d support (#550 ) Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-01-31 13:34:09 -08:00
Yi Zhang	5d9a15263a	[TORCH] Add aten.std e2e support	2022-01-31 15:17:49 -05:00
Prashant Kumar	e58b66bc3b	Add lowering of `aten.max.dim` op. Lowering of `aten.max.dim` op has been added.	2022-01-31 21:41:22 +05:30
Anup Gangwar	454fa9d123	* [tosa] Support for AtenFlattenUsingIntsOp (#548 )	2022-01-28 21:38:56 -08:00
Liam Fitzpatrick	8bc028af05	Fold __is__ and unchecked_cast of derefine The added e2e maxpool testcase from #545 was not getting a static shape due to an unfolded prim.If when RefineTypes was called. This was because of unfolded torch.iaten.__is__ and torch.prim.unchecked_cast operators with torch.derefine operands.	2022-01-28 17:54:40 -05:00
stephenneuendorffer	52ed3313b4	Bump LLVM to 84fe34a0b7fdd7bbf179981d1583693d5d5ec68b (#544 ) * external/llvm-project 881ff4e4ebe8...84fe34a0b7fd (466): > [MLIR] Workaround for python detection problems.	2022-01-27 17:21:09 -08:00
Anup Gangwar	7a5736facd	* [tosa] Support for AtenReshapeOp (#543 ) * [tosa] Support for AtenBatchNormOp Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2022-01-27 14:38:59 -08:00
Suraj Sudhir	eb06d21765	[tosa] Implement conv2d support (#541 ) Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-01-26 19:16:13 -08:00
stephenneuendorffer	3fd9b7789e	Bump LLVM to 881ff4e4ebe8cc0cc045c7c167cffb01f94f27f8 (#539 )	2022-01-25 22:16:30 -08:00
Suraj Sudhir	cadea678e5	[tosa] Implement torch.linear support. (#535 ) Refactor matmul into separate class and derive variants: - matmul - mm, bmm - linear Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-01-25 08:48:58 -08:00
Yi Zhang	ad4b9e0369	Minor fixes	2022-01-24 19:21:15 -05:00
Anup Gangwar	f8080bd1c5	* [tosa] Support for AtenRsubScalarOp for scalar constants (#531 ) * [tosa] Support for AtenCeilOp and AtenReciprocalOp * [tosa] Support for comparator ops, Aten[Gt\|Lt\|Eq][Tensor\|Scalar]Op with scalar constant * [tosa] Support for Scalar variants of Aten[Mul\|Div\|Add\|Sub] Ops with scalar constants Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2022-01-20 10:58:30 -08:00
Vivek Khandelwal	6fe70c7794	[MLIR][TORCH] Add E2E support for aten.index.Tensor op This commit adds lowering of `aten.index.Tensor` op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-01-19 13:37:56 +05:30
Suraj Sudhir	0188ca5498	[tosa] Implement matmul, mm and bmm support (#526 ) - Also handles braodcasting n-D tensors, dynamic shapes Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-01-18 13:37:32 -08:00
dan	3745f54489	Update external/llvm-project - Add `qualified` to ods because of https://reviews.llvm.org/D113873 and https://reviews.llvm.org/D116905 - Needed to revert https://github.com/llvm/torch-mlir/pull/520 as it was based on an old torch version. https://github.com/llvm/torch-mlir/pull/527 will bring this back with a better design. - Change ConvertAtenCatOp to use more accurate tensor shape info and as much static info as possible to pass `tensor.insert_slice` verification code added by https://reviews.llvm.org/D114715 - Other minor fixes	2022-01-18 13:25:42 -05:00
Suraj Sudhir	edf4a0e729	[tosa] Add more common utility functions (#525 ) - Common code as TF repository, being moved to MLIR core. - Will support further legalizations to be published. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-01-14 13:57:27 -08:00
Anup Gangwar	abd61b4974	* Workaround for Issue 521, remove createTosaToStandard from Passes.cpp and disable ElementwisePowModule_basic * Update nll_loss_forward to align to the change in PyTorch Signed-off-by: Anup Gangwar <anup.gangwar@arm.com>	2022-01-12 14:30:58 -06:00
Anup Gangwar	d69d29b7a6	* [tosa] Support for AtenPowTensorScalarOp with constant Scalar as input Signed-off-by: Anup Gangwar <anup.gangwar@arm.com>	2022-01-11 22:55:54 -05:00
Liam Fitzpatrick	077e55d756	Add support for constant_pad_nd Note that to enable folding of the code coming from an example like the ConstantPad2dStaticModule e2e test, support for other operations had to be added/improved: - aten::neg.int - aten::eq.float - aten::eq.str - prim::Uninitialized	2022-01-11 10:25:25 -05:00
Vivek Khandelwal	35cf8d18f7	Add support for two return values This commit adds support for two return values of type memref f32 and i64. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-01-11 11:07:10 +05:30
Vivek Khandelwal	ca662dc9cc	[MLIR][TORCH] Add E2E support for aten.threshold, aten.threshold_backward op This commit adds lowering of `aten.threshold` op This commit adds lowering of `aten.threshold_backward` op Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2022-01-10 11:56:56 +05:30
Yi Zhang	7cf7b91664	[MLIR][TORCH] Fix tensor literal int elem type to be signless The element type of tensor literal should be signless when converted to builtin tensor types.	2022-01-07 16:34:24 -05:00
Suraj Sudhir	d6b6c0268c	[tosa] Add missing overrride-s to fix compiler warnings (#514 ) Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2022-01-07 10:57:54 -08:00
Yi Zhang	732a76f45c	Make broadcasting result shape more static This involes the following 2 parts: - Change refine type to propagate more static shape info. - Get as much static shape info as possible when creating the result tensor when converting to linalg.	2022-01-06 18:39:27 -05:00
Suraj Sudhir	b4842d9863	[tosa] Implement squeeze.dim support (#511 ) Templated variants for squeeze and squeeze.dim	2022-01-06 08:31:29 -08:00
Gaurav Shukla	3c40539b34	[TORCH][MLIR] Add E2E support for `aten.[ones_like\|zeros_like]` - This commit adds E2E support for `aten.ones_like` and `aten.zeros_like` ops. - Adds support for non-None `dtype` argument of `aten.empty_like` op. - All the unit test cases related to constant tensor allocation like ops are moved to a different file named `constant_alloc.py`. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2022-01-06 20:24:40 +05:30
Liam Fitzpatrick	ccfdfd1b80	Refine static shapes for conv2d and maxpool2d	2022-01-03 11:09:23 -06:00
Vivek Khandelwal	4486de5ef3	[MLIR][TORCH] Add E2E support for torch.arange op This commit adds lowering of `aten.arange.start_step` op. This commit decomposes `aten.arange` and `aten.arange.start` into `aten.arange.start_step` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2021-12-27 22:45:48 +05:30
Gaurav Shukla	a83004c806	[TORCH][MLIR] Fold trivial cases of `aten.to.dtype` and `aten.view` op - It folds `aten.to.dtype` when the input tensor type and result type are exactly same. - It folds `aten.view` when the rank of both the input tensor type and result type is unity. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-24 13:32:34 +05:30
Prashant Kumar	9e1ecf2c0b	Add Add and Sub scalar op conversions. `aten.add.Scalar` and `aten.sub.Scalar` op conversions have been added. The changes have been made as a part of `-convert-torch-to-linalg` pass.	2021-12-22 21:41:49 +05:30
Nirvedh	3cb46cecef	Added aten::t() Op	2021-12-22 10:57:10 -05:00
xndcn	5eed562e19	add aten.sub.int/aten.mul.int lowering in TorchToStd	2021-12-17 10:35:15 -08:00
Yi Zhang	d8ba68119e	Lower aten::view with linalg.collapse and linalg.expand We only handle the expanding OR collapsing cases, we do not handle expanding And collapsing happening at the same time or cases where it's neither collapsing nor expanding like view of [2,3] for 3x2 tensor. It's assumed that if a shape list element is got from `aten.size(tensor, dim)` the corresponding dim is not splitted or collapsed. This assumption makes it easier to deal with dynamic shapes.	2021-12-16 17:58:20 -05:00
Gaurav Shukla	bc9abbc1c9	[TORCH][MLIR] Add E2E support for `aten.empty_like` op This commit adds decomposition of `aten.empty_like` into `aten.empty` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-16 20:17:39 +05:30
Gaurav Shukla	eddc09aa55	[TORCH][MLIR] Add E2E support for `aten.eq` and `aten.lt` ops - Added E2E support for `aten.eq.Tensor` and `aten.lt.Tensor` ops. Both the operands are expected to be of the same type, i.e., type promotion is not addressed as a part of this commit. - Added E2E support for `aten.eq.Scalar` and `aten.lt.Scalar` ops. Tensor operand type to Scalar operand type promotion has not been handled in this commit. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-16 18:47:22 +05:30
Suraj Sudhir	0cd95b5c68	[tosa] Support for Torch.squeeze (#487 )	2021-12-15 21:40:29 -08:00
Daniel Garvey	396ab35c9d	Small fixes for slice edge cases (#476 )	2021-12-15 15:54:41 -06:00
Anup Gangwar	a6c3050dd0	* [tosa] Support for Maximum and Minimum Signed-off-by: Anup Gangwar <anup.gangwar@arm.com>	2021-12-15 11:58:19 -08:00
Suraj Sudhir	829cf8afc3	[tosa] Implement Argmax support (#485 ) Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2021-12-15 11:01:01 -08:00
Gaurav Shukla	d13bb0e5c1	[TORCH]MLIR] Fix C++17 extension warning The existing implementation of `ConvertConstantTensorAllocOp<>` requires a C++17 feature `if constexpr ()`. This commit removes the use of that feature to support the implementation even for lower C++ versions. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-15 23:35:06 +05:30
Prashant Kumar	ab81f871e4	Add aten.tensor.int and aten.tensor.float op lowerings. Add the required lowerings and correct test cases. These op produce zero-d tensors and it was incorrectly mentioned in refine types to produce 1d tensor of size 1.	2021-12-15 17:21:34 +05:30
Anup Gangwar	cce490d71d	* [tosa] Support for Rsqrt legalization (#480 ) Signed-off-by: Anup Gangwar <anup.gangwar@arm.com> Co-authored-by: Anup Gangwar <anup.gangwar@arm.com>	2021-12-14 10:03:58 -08:00
Prashant Kumar	6dabf185f5	Add support for int types in gtScalar op. Support for integer types in gtScalar op has been added. The code share same logic with gtTensor op and can be merged which is added as a TODO.	2021-12-14 01:29:52 +05:30
Gaurav Shukla	8d4879feb0	[TORCH][MLIR] Add and templatize lowering of [`aten.zeros\|aten.ones\|aten.empty`] ops - Templatize `aten.zeros` and `aten.ones` ops lowering. - Add E2E support for `aten.empty` op. - Add Integer type support in `aten.mul.Scalar` op lowering. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-14 00:07:11 +05:30
Prashant Kumar	528354de84	Add `aten.gt.Tensor` op `aten.gt.Tensor` op has been added in torch dialect and the lowering of the op has been done to the linalg dialect. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-12-13 00:08:52 +05:30
Gaurav Shukla	a778f990e9	[TORCH][MLIR] Add E2E support for `aten.ceil` op This commit adds lowering of `aten.ceil` op as a part of element-wise ops lowering. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-12 01:15:47 +05:30
Prateek Gupta	cfc8de36f8	[MLIR][TORCH] Add E2E support for `aten.native_layer_norm`. (#470 ) This commit adds support for aten.native_layer_norm operation. Here the previous code for aten.layer_norm is tweaked a little bit to accomodate both mean and variance values alongwith the layer norm value. This commit also adds decomposition of aten.layer_norm into aten.native_layer_norm, which was previously getting lowered directly to linalg. Signed-Off-By: Prateek Gupta<prateek@nod-labs.com>	2021-12-10 19:06:19 +05:30
Gaurav Shukla	5a47f92390	[TORCH][MLIR] Add E2E support for `aten.squeeze.dim` op This commit adds lowering of `aten.squeeze.dim` op into `linalg.TensorCollapseShape` op. Here, the dim(th) dimension of the input tensor is not supposed to be dynamic. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-10 17:01:20 +05:30
Vivek Khandelwal	8130354c09	[MLIR][TORCH] Add E2E support for aten.index_select op This commit adds lowering of `aten.index_select` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2021-12-09 23:13:36 +05:30
Vivek Khandelwal	0a0a1b4476	[MLIR][Torch] Resolve styling issues related to aten zeros/ones op https://github.com/llvm/torch-mlir/pull/464#discussion_r765065092 Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2021-12-09 17:42:28 +05:30
Gaurav Shukla	f34eb66124	[TORCH][MLIR] Add E2E support for [`aten.gt.Scalar`\|`aten.where.self`] This commit adds lowering of `aten.gt.Scalar` and `aten.where.self` as a part of element-wise ops lowering. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-12-09 12:47:10 +05:30
Liam Fitzpatrick	2414bdb1f0	Linalg lowering for aten.conv2d(bias=True) Previously aten.conv2d was only lowered if there was no bias. Here lowering is extended to support bias.	2021-12-08 14:44:36 -08:00
Prashant Kumar	c598e01529	Add support for passing & returning memref of bool types Support for passing memref of bool types as a function argument and return is added in ref-backend. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-12-09 00:23:38 +05:30
Vivek Khandelwal	9958cf08b6	[MLIR][TORCH] Add E2E support for aten.zeros op This commit adds lowering of `aten.zeros` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2021-12-08 22:42:33 +05:30
Prashant Kumar	977b1b03ea	Add aten::nll_loss_forward op lowering. The op lowering has been added as a part of `torch-lower-to-linalg` pass. This takes care of ignore_index but the weight and reduction operand is still to be accounted for. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-12-07 17:11:08 +05:30
Prashant Kumar	5c7ce45c4e	Update external llvm to 966b72098363d44adf2882b9c34 The external llvm is updated to point to https://reviews.llvm.org/rG966b72098363d44adf2882b9c34fcdbe344ff913. Some of the changes wrt. NamedAttr has been addressed. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-12-06 23:33:58 +05:30
Daniel Garvey	b0cb49ca93	Add scalar type promotion for mul and div (#454 )	2021-12-03 13:51:25 -06:00
Suraj Sudhir	c9c9b68d1f	[tosa] Add Torch reduction operators - Supports variants with multiple dims, one dim, all dime - Leverages legalize_common and legalize_utils code from TensorFlow-TOSA work Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2021-12-03 09:01:48 -08:00
Prashant Kumar	ab6211184f	Bug fixes that pops up when updating generatedAten ops td There is an op name change that requires trivial changes. Also, some of the warning has been fixed. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-12-03 22:18:18 +05:30
Yi Zhang	24bc06fc8d	Fix compilation warnings.	2021-12-03 11:44:32 -05:00
Daniel Garvey	a52aded0b9	Add lowering for slice and selectInt (#398 )	2021-12-02 22:09:21 -06:00
Vivek Khandelwal	46a2189a41	[MLIR][TORCH] Add E2E support for aten.bitwise_and.tensor op This commit adds lowering of `aten.bitwise_and.tensor` op. Signed-Off By: Vivek Khandelwal vivek@nod-labs.com	2021-12-02 21:06:15 +05:30
Vivek Khandelwal	46a0668b3b	[MLIR][TORCH] Add E2E support for aten.mean and aten.numel op. This commit adds lowering of `aten.mean` and `aten.numel` op. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2021-12-02 11:51:13 +05:30
Suraj Sudhir	1251c186b5	[tosa] Add TosaMakeBroadcastable pass to torch-to-tosa pipeline. Fixes broken e2e test ElementwiseAddModule_basic Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2021-11-30 13:26:57 -08:00
Ramiro Leal-Cavazos	e6675a50d3	Add support for dtype argument in reduction ops Many reduction ops take as an argument an optional output dtype that can change the type of the input tensor before the reduction is performed. This commit adds support for the optional dtype flag that had been previously ignored. Test: /tools/torchscript_e2e_test.sh -f 'ReduceSumDtype' /tools/torchscript_e2e_test.sh -f 'ReduceSumDImIntListDtype'	2021-11-30 12:53:59 -05:00
Gaurav Shukla	73b27b32dc	[MLIR][TORCH] Add E2E support for `aten.squeeze` op This commit adds lowering of `aten.Squeeze` op into `linalg.TensorCollapseShape` op. The size 1 dynamic dimensions are not handled as a part of this commit. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-11-30 23:00:28 +05:30
ds1231h	9ad5954e41	aten.abs and aten.reciprocal to linalg	2021-11-30 11:31:55 -05:00
Yi Zhang	5d28549c2c	Add folder for torch.aten.Int.Tensor This is to fold the common pattern from Bert inference like: ``` %111 = torch.prim.NumToTensor.Scalar %110 : !torch.int -> !torch.vtensor<[],si64> %112 = torch.aten.Int.Tensor %111 : !torch.vtensor<[],si64> -> !torch.int ```	2021-11-30 21:55:48 +05:30
Prashant Kumar	36afa4a4d3	Add aten.fill.Scalar op lowering The lowering of aten.fill.Scalar has been added. The changes have been made as a part of -torch-convert-to-linalg pass. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-11-30 21:12:15 +05:30
Daniel Garvey	539511c19b	Add dropout op (#436 ) Co-authored-by: dan <dan@nod-labs.com>	2021-11-29 12:30:03 -06:00
dan	03fdf56f21	add aten.add.int lowering in TorchToStd	2021-11-29 13:22:50 -05:00
Liam Fitzpatrick	7616d28ce1	Add leakyrelu support	2021-11-27 23:04:46 +05:30
Prateek Gupta	f461a7ebce	[TORCH][MLIR] Add E2E support for aten._softmax operation. (#431 ) Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-11-25 11:19:02 +05:30
nodlabs	67ce816fca	lowered addcmul and addcdiv to linalg	2021-11-24 17:26:47 -05:00
Vivek Khandelwal	8d8d2c2fb8	[MLIR][TORCH] Add E2E support for aten.div.Scalar This commit adds lowering of `aten.div.Scalar`. Signed-Off By: Vivek Khandelwal <vivek@nod-labs.com>	2021-11-24 11:17:40 +05:30
Ramiro Leal-Cavazos	56c6e3676b	Fix bug in NumToTensor handling of float values This commit fixes a type promotion bug when NumToTensor was given a float as an argument. In particular, the rules for type promotion of a scalar vary depending on if the scalar is part of a tensor op or not. NumToTensor falls under the second category, but it was being treated as part of the first category.	2021-11-23 11:47:44 -05:00
Prashant Kumar	1dc374014b	Refactor to share code in DecomposeComplexOps pass Share code in `log_softmax_backward` and `softmax_backward` ops.	2021-11-20 00:39:34 +05:30
Prashant Kumar	ea7a30f9b9	Add e2e test for aten.log_softmax_back_data op aten.log_softmax_back_data op lowering and required tests has been added. Some NFC have also been added. Signed-off-by: Prashant Kumar prashant@nod-labs.com	2021-11-19 00:08:28 +05:30
Gaurav Shukla	663fc1ef51	[MLIR][TORCH] Add E2E support for [`aten.mul.Scalar`\|`aten.addmm`] This commit adds lowering of `aten.mul.Scalar` and also adds decomposition of `aten.addmm` to `aten.mul.Scalar`, `aten.add.Tensor` and `aten.mm` ops. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-11-18 22:26:41 +05:30
Prashant Kumar	f8ff6d84f4	Support aten::linear with rank 3 inputs Now, aten::linear supports rank 3 inputs. This is a fix for upcoming bert-inference task. The correct way should be to support broadcasting in `aten.matmul` op and decompose `aten.linear` into right ops.	2021-11-18 22:15:04 +05:30
Prateek Gupta	146f109152	[NFC] Cleanup code for aten.gelu_backward operation. This commit adds minor non functional changes to the aten.gelu_backward operation. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-11-18 11:24:04 -05:00
Prateek Gupta	ecf78b9849	[TORCH][MLIR] Add E2E support for `aten.gelu_backward` operation. (#418 ) This commit adds new operation `aten.gelu_backward` in the aten dialect and adds lowering of this operation from aten to linalg. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-11-17 14:59:38 +05:30
Yi Zhang	0fe70994e5	Add support for multiple return values This change is to unblock the work of some backprop ops returning more than one tensors. We will need to think of a more scalable approach in the future if more flexible return types combinations are needed.	2021-11-16 21:07:45 -05:00
Yi Zhang	53733933a4	Update llvm upstream to 0b17336f793108a7b10c3fa913039144ef1d0f61 Update AsmPrinter/Parser and MatchAndRewrite	2021-11-16 13:04:51 -05:00
Ramiro Leal-Cavazos	a2392a0f19	Fix bug in handling of pin_memory in AtenOnesOp conversion This commit fixes a bug with the way ConvertAtenOnesOp was matching on the pin_memory bool argument, which always resulted in a failed match.	2021-11-12 11:38:25 -05:00
Suraj Sudhir	628a21bb13	[mlir][tosa] Refactor conversions to use templates (#416 ) - Remove use of conversion construction macros - Add mul and div op conversions - Add corresponding tests Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2021-11-11 16:15:58 -08:00
Suraj Sudhir	1019ddf5a0	[tosa] Add structure for eltwise ops Add a bunch of op legalizations. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2021-11-11 11:03:24 -08:00
Prashant Kumar	909f7d7171	Add e2e testing for aten_tanh_backward op. The e2e testing for aten_tanh_backward op has been added. The testing is done for ref_backend.	2021-11-09 11:28:49 -05:00
George Petterson	2764e86f02	Add Rsqrt	2021-11-09 11:08:28 -05:00
Yi Zhang	3bd9d2a4c7	Add e2e support for aten._softmax_backward_data. Decompose aten._softmax_backward_data into aten math ops. Also decompose `aten.size` to facilitate decomposing _softmax_backward_data.	2021-11-09 13:09:30 +05:30
Yi Zhang	05c4dd8e39	Add convertScalarToDtype helper. This is to facilitate scalar type conversion in the TorchToLinalg. As part of adding the helper, this PR also: - Updated `AtenAddTensorOp`, `AtenSubTensorOp` to use the helpers to support more type variants. - Added e2e type promotion testing. - Added i32 memref return/arg type to support e2e testing.	2021-11-08 17:50:52 -05:00
George Petterson	e23cabf3a9	Add log2	2021-11-08 16:19:59 -05:00
George Petterson	f41958037a	Add NumToTensor	2021-11-08 15:56:52 -05:00
Prateek Gupta	18e8806b14	[TORCH][MLIR] Add E2E support for aten::to.dtype. This commit adds end to end support for AtenToDtypeOp from aten to linalg. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-11-08 12:56:03 -05:00
Wang Kangyu	4bb9b44775	Add lowering of "aten.pow.Tensor_Scalar" op Add e2e support for torch.pow(Tensor, Float)	2021-11-08 09:19:50 -08:00
Prashant Kumar	fd505db2c6	Adding support for returning elemental types. Support for returning elemental types. Previously, only memref types as returning types was supported. All the hacky ways to write tests which return elemental types should be taken care of. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-11-08 22:20:48 +05:30
Wang Kangyu	b33543af85	Add lowering of aten.floor op	2021-11-06 17:31:44 -04:00
nodlabs	5ff823ace9	lowerd Sqrt to linalg reused clang-format, as changes got deleted	2021-11-06 11:29:46 -04:00
Gaurav Shukla	2ce47dc8e4	[TORCH][MLIR] Add E2E support for aten.expand This commit adds decomposition of `aten.Expand` to `aten.BroadcastTo` op. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-11-03 23:58:59 +05:30
Prashant Kumar	ef897dbb19	Add lowering of `aten.log_softmax` op. The `aten.log_softmax` is decomposed into `aten.softmax` and `aten.log` op.	2021-11-03 22:10:05 +05:30
Prashant Kumar	127c7d8e27	Add lowering of `torch.log` op The lowering of `torch.log` op has been added. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-11-02 21:18:00 +05:30
George Petterson	6dde5b347e	Add rsub	2021-11-02 09:56:48 -04:00
Prashant Kumar	53b4275ef5	Add lowering of `aten.Int.Tensor` op. The lowering of `aten.Int.Tensor` op has been added. The changes has been made as a part of `convert-torch-to-linalg` pass. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-11-01 21:58:08 +05:30
Gaurav Shukla	69eaf9a154	[MLIR][TORCH] Add E2E support for `torch.aten.view` - This commit adds lowering of `aten.View` to `linalg.TensorExpandShape`. - This lowering will be successful only when one or more static dimensions are expanded. - It also fixes a typo in `ConvertAtenFlattenUsingIntsOp` conversion pattern. Signed-Off-by: Gaurav Shukla <gaurav@nod-labs.com>	2021-10-29 22:33:10 +05:30
Yi Zhang	752abc8d01	Add type promotion code to refine types. The types have different levels of categories: where complex > floating > integral > boolean (> means left hand side has higher category). The operands have different levels of priorities where: dimensioned tensor > 0-dim tensor > scalar == wrapped 0-dim tensor. This is represented by the `ResultTypeState.dimResult`, `ResultTypeState.zeroResult` and `ResultTypeState..wrappedResult` in the source code. For operands of the same priorities, the result type should be the highest categories with sufficient width to hold all operands. By default, only the highest priority operands participate in the type promotion logic. Lower priority operands participate if they are in a higher category than any higher priority operands. For example, <[],f32> (lower priority) and <[1], si64> tensor would result in <[?],f32> tensor because floating > integeral. Another example <[],f64> (lower priority) and <[1], f32> tensor would result in <[?], f32> tensor because f32 and f64 are the same category. The ScalarType enum definition, type promotion table, ResultTypeState struct definition and some helpers are copied from aten/src/ATen/native/TypeProperties.* Other references: - https://pytorch.org/docs/stable/tensor_attributes.html#type-promotion-doc - https://github.com/pytorch/pytorch/issues/9515 Other minor changes: 1. Fix `visitExpandLikeOp` to consider cases where the given sizes list size is larger than the input rank. 2. Add back the somehow deleted `torch.aten.softmax.int` tests in decompose-complex-ops.mlir.	2021-10-29 11:17:39 -04:00
George Petterson	2ea2ab518b	Add contiguous	2021-10-29 11:11:50 -04:00
Suraj Sudhir	7e4ef74774	[tosa] Add Torch.sigmoid fp32 to TOSA (#386 ) * [tosa] Add Torch.sigmoid fp32 to TOSA Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>	2021-10-28 10:09:12 -07:00
Prateek Gupta	c33a2ca952	[TORCH][MLIR] Add E2E support for aten.permute. This commit adds lowering of aten.permute to linalg.generic operation. Signed-Off-By: Prateek Gupta <prateek@nod-labs.com>	2021-10-28 10:25:26 -04:00
stephenneuendorffer	614b889dc6	Enable python extensions when building out of tree (#363 )	2021-10-27 17:04:12 -07:00
Sean Silva	30df2ec71b	Add min/max/clamp support. Part of #380 Also - BoolType is not considered as Scalar - e2e framework fixes for nan handling - `tu.rand(..., low=, high=)` support - delete unused variable (fix warning) - Add IouOfModule from #380 to e2e test suite (this is a common calculation in vision models) Your branch is ahead of 'origin/main' by 1 commit.	2021-10-27 13:29:21 -07:00
Prashant Kumar	5009cbf55c	Add lowering of aten.matmul op. Lowering of `aten.matmul` op is added from torch to linalg dialect. The different cases correspond to https://pytorch.org/docs/stable/generated/torch.matmul.html. TODO: Broadcasting in case of batch-matmul is yet to be taken care of. Signed-off-by: Prashant Kumar <prashant@nod-labs.com>	2021-10-26 12:45:09 -04:00
Boian Petkantchin	e276dbbaa6	Add aten::gelu lowering (#374 ) * Print more exception info on error during test execution * Fix formatting * Add aten::gelu lowering Co-authored-by: Boian Petkantchin <boian@nod-labs.com>	2021-10-25 16:16:01 -07:00
Stella Laurenzo	47209539a8	Bump llvm-project to f1b922188ead5ca492c8d8edd47921b013a22ae0. Includes a fix to use `add_mlir_public_c_api_library` for Torch-MLIR's CAPI library, which is now required (note: upstream sample has it the right way). Disabled a TOSA test per discussion: https://github.com/llvm/torch-mlir/issues/379	2021-10-25 13:22:07 -07:00
Ramiro Leal-Cavazos	8bfb819d35	Fix bug with transpose of negative dims Summary: This commit fixes an off-by-one error in how negative dimensiosn were being handled in the lowering of transpose. This commit also adds tests to transpose and unsqueeze to test negative dimensions.	2021-10-25 15:50:55 -04:00
George Petterson	22aeb967c5	Add ones	2021-10-21 14:46:59 -04:00
Yi Zhang	abfaf8c577	Add aten.ne.bool to make CI pass	2021-10-21 14:45:41 -04:00
George Petterson	7c47b9a0c8	Formatting fix	2021-10-19 13:33:31 -04:00
George Petterson	8853dfbc74	Add broadcast	2021-10-19 13:33:31 -04:00
Yi Zhang	a459e09ab7	E2e support for aten.softmax.int and aten.embedding - Added a DecomposeComplexOps pass to decompose complex torchOps. - Refactored `visitAtenArgmaxOp` and `visitAtenAnyDimOp` to `visitReductionAlongDimIntOp`. - Moved some helper functions into torch-mlir/Dialect/Torch/Utils/Utils.h to be shared by multiple files. - Added support for f64 tensor as argument and return types.	2021-10-18 17:57:45 -04:00
Yi Zhang	0902438882	Update llvm-project to a54f4eae0e1d0ef5adccdcf9f6c2b518dc1101aa This brings in https://reviews.llvm.org/D110797. PRs that are in progress will need to use scripts provided by https://llvm.discourse.group/t/psa-removed-arithmetic-ops-from-standard/4455.	2021-10-18 13:36:42 -04:00
dan	7750d2173a	add argmax lowering Add argmax lowering from torch to linalg	2021-10-13 14:31:16 -04:00
Sean Silva	0c5c84d63d	Add a basic TOSA E2E backend. We lower through linalg-on-tensors and use RefBackend to run it. This adds enough support for a "tanh" op. Adding more ops should be fairly mechanical now that things are wired up. Run with: ``` ./tools/torchscript_e2e_test.sh -c tosa ``` The backend structure is very similar to linalg-on-tensors based E2E backends and is a nice parallel (see `tosa_backend.py`). Actually, this forced a nice refactoring to the layering here. We removed `torchscript-module-to-linalg-on-tensors-backend-pipeline` and instead require separately running ``` torchscript-function-to-torch-backend-pipeline,torch-backend-to-linalg-on-tensors-backend-pipeline ``` This highlights the step that lowers to the "torch backend contract" of cleaned up `torch` dialect ops is a critical step in the lowering. Going forward, that is the key load-bearing contract of the torch-mlir project, not the linalg-on-tensors backend contract. Recommended review order: - `TorchToTosa.cpp` / `TorchToTosa/basic.mlir` - `python/torch_mlir_e2e_test/torchscript/configs/tosa_backend.py` and the new `utils.py` file there. - `python/torch_mlir_e2e_test/tosa_backends/linalg_on_tensors.py` and `abc.py` in that directory for the TOSA backend e2e interface. - other misc mechanical changes	2021-10-08 09:59:45 -07:00
dan	2e1498ad11	add i64 support to refbackend	2021-10-05 15:12:44 -04:00
Yi Zhang	98ba255288	E2e support for layernorm.	2021-10-04 14:15:13 -04:00
Sean Silva	5b6902e31c	Dual license the torch-mlir project. This commit (with approval from all contributors) dual licenses the torch-mlir project under both the standard LLVM license and the standard PyTorch license. This will facilitate moving code between torch-mlir and the two upstream projects. The standard file comment is now: ``` // This file is licensed under the Apache License v2.0 with LLVM Exceptions. // See https://llvm.org/LICENSE.txt for license information. // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception // Also available under a BSD-style license. See LICENSE. ``` See `LICENSE` in the project root for the terms of both licenses.	2021-10-01 10:46:08 -07:00
Yi Zhang	89225b0cd8	Add BertSequenceClassification model to e2e Use torch tracing to get the module because the original model is not TorchScriptable out of box.	2021-09-30 13:30:29 -04:00
Sean Silva	8b2c099914	Update llvm-project to 204d301bb1921431a853c0bfba32007c018df1d5 This brings in the fix for the obscure RefBackend bug we were hitting.	2021-09-28 17:38:10 -07:00
Ramiro Leal-Cavazos	b59f2cb673	Implement the lazytensor package (#331 ) Implement the `lazytensor` python package for converting lazy computations captured by the Lazy Tensor Core into MLIR. This PR also fixes a few things with `torchfx` and its example	2021-09-28 17:25:06 -07:00
Sean Silva	4fad753073	Move external/torch-mlir to the root of the repo.	2021-09-27 17:11:08 -07:00
Sean Silva	d8f603a4e5	Remove old stuff in prep for move-to-root.	2021-09-27 17:11:08 -07:00
Sean Silva	a99cbeeb7e	Move TorchConversion dialect and TorchTo* into torch-mlir	2021-09-23 21:39:31 -07:00
Sean Silva	2213584c4f	VerifyBackendContract -> VerifyLinalgOnTensorsBackendContract This moves it into TorchConversion since it is only needed there. This removes the Backend/ directory.	2021-09-23 21:39:31 -07:00
Yi Zhang	603e068e45	E2e implementation for `aten.cat`,`aten.gather`, `aten.bmm` Also contains the following changes: - Remove derefineOp canonicalizer because it's not safe. - Support for optional tensor and list tensors in reduceOpVariant. This only works for some special detected and easy to handle cases. For list, it covers the case list is got from a `ListConstruct`. For optional, it covers the case optional is constructed from a `DerefineOp`. - Remove the `inferReturnTypes` for `FromBuiltinTensorOp` because it's not safe to deduce types from the input. For example, a built-in tensor of i8 could be converted to si8 or ui8. It's better to let the user specify the return type explicitly.	2021-09-22 19:15:01 -04:00
Sean Silva	1a0b953ea7	Eliminate almost all mentions of IREE. A few remain in examples/docs that will be naturally be updated in due time. This regresses the list support and the general direction of more widely supported control flow, lists/dicts/globals that we were going for with the TorchScript path. The idea is that we are deferring that work to make torch-mlir a very clean standalone thing. We will reboot it, probably using some of the tools of iree_pydm to make it simpler, and in a more natural place (such as an iree-torch repo that depends on IREE and torch-mlir to build a working PyTorch frontend solution for IREE -- it was really weird that npcomp depended on IREE).	2021-09-22 16:06:38 -07:00
Sean Silva	a25163fbfa	Remove old RefBackend It is superceded by the new one.	2021-09-22 15:33:28 -07:00
Sean Silva	f9c48d0b89	Bring up new RefBackend. `tools/torchscript_e2e_test.sh` is all green. This needs a few passes I put into torch-mlir/lib/RefBackend (not to be confused with `npcomp/lib/RefBackend`, which will soon be deleted). For the sake of review, since this brings together a lot of things, I split this into its own commit. I temporarily commented out some "list" stuff that we are going to remove as part of the torch-mlir refocus.	2021-09-22 14:20:22 -07:00
George Petterson	ecc334123c	Added transpose lowering	2021-09-19 20:28:27 -04:00
Sean Silva	68fefe7e1f	Remove NPCOMP_ENABLE_IREE CMake flag. Our new dependency management solution relies: - on the C++ side with the public iree-dialects project, which we include and are using as representative of some missing upstream ops (so we treat them "as if" they were upstream, with the hope of upstreaming them after some codevelopment has happened) - on the Python side, with simple PYTHONPATH manipulation or installed Python packages. No CMake stuff required.	2021-09-17 09:27:49 -07:00
Sean Silva	b6be96d722	[torch-mlir earthmoving (2/N)] Python code movement. This moves the bulk of the Python code (including the Torch interop) from `frontends/pytorch` into `torch-mlir/TorchPlugin`. This also required reconciling a bunch of other Python-related stuff, like the `torch` dialects. As I did this, it was simpler to just remove all the old numpy/basicpy stuff because we were going to delete it anyway and it was faster than debugging an intermediate state that would only last O(days) anyway. torch-mlir has two top-level python packages (built into the `python_packages` directory): - `torch_mlir_dialects`: `torch` dialect Python bindings (does not depend on PyTorch). This also involves building the aggregate CAPI for `torch-mlir`. - `torch_mlir`: bindings to the part of the code that links against PyTorch (or C++ code that transitively does). Additionally, there remain two more Python packages in npcomp (but outside `torch-mlir`): - `npcomp_torch`: Contains the e2e test framework and testing configs that plug into RefBackend and IREE. - `npcomp_core`: Contains the low-level interfaces to RefBackend and IREE that `npcomp_torch` uses, along with its own `MLIR_PYTHON_PACKAGE_PREFIX=npcomp.` aggregation of the core MLIR python bindings. (all other functionality has been stripped out) After all the basicpy/numpy deletions, the `npcomp` C++ code is now very tiny. It basically just contains RefBackend and the `TorchConversion` dialect/passes (e.g. `TorchToLinalg.cpp`). Correspondingly, there are now 4 main testing targets paralleling the Python layering (which is reflective of the deeper underlying dependency structure) - `check-torch-mlir`: checks the `torch-mlir` pure MLIR C++ code. - `check-torch-mlir-plugin`: checks the code in `TorchPlugin` (e.g. TorchScript import) - `check-frontends-pytorch`: Checks the little code we have in `frontends/pytorch` -- mainly things related to the e2e framework itself. - `check-npcomp`: Checks the pure MLIR C++ code inside npcomp. There is a target `check-npcomp-all` that runs all of them. The `torch-mlir/build_standalone.sh` script does a standalone build of `torch-mlir`. The e2e tests (`tools/torchscript_e2e_test.sh`) are working too. The update_torch_ods script now lives in `torch-mlir/build_tools/update_torch_ods.sh` and expects a standalone build. This change also required a fix upstream related to cross-shlib Python dependencies, so we also update llvm-project to 8dca953dd39c0cd8c80decbeb38753f58a4de580 to get https://reviews.llvm.org/D109776 (no other fixes were needed for the integrate, thankfully). This completes most of the large source code changes. Next will be bringing the CI/packaging/examples back to life.	2021-09-15 13:40:30 -07:00
Sean Silva	28a7738189	[torch-mlir earthmoving (1/N)] C/C++ code movement. This creates the `external/torch-mlir` directory as an LLVM_EXTERNAL_PROJECTS-compatible project (analogous to `iree-dialects`) and completes movement/rename of all pure MLIR C/C++ compiler code into there. The next step will be to move all the Python code / code that links/includes PyTorch C++ code (which currently lives in `frontends/pytorch`) into a subdirectory here. I call this "earthmoving" because it is mostly mechanical changes and renames. As a quick summary (we can change this down the road easily) - C++ `mlir::NPCOMP::Torch -> mlir::torch::Torch` - CAPI `npcompTorchListTypeGet -> torchMlirTorchListTypeGet` - preprocessor `#ifndef NPCOMP_ -> #ifndef TORCHMLIR_` - CMake `NPCOMPFoo -> TorchMLIRFoo` The goal of this is to create a standalone project creating a center of mass for entry into the MLIR ecosystem from PyTorch, suitable in scope for eventual inclusion/ownership in PyTorch. The idea is that `external/torch-mlir` will some day be pulled out into its own repository, and then npcomp will simply pull it in as a submodule. Layering-wise, what lives in `torch-mlir` lowers code from PyTorch (currently TorchScript, but TorchFX or pytorch/xla-style tracing are possible extensions) down to what we have been calling the "Torch backend contract" which is cleaned up IR (inlining, simplifcation, conversion to value tensors, ...) entirely in the `torch` dialect. This is the branching off point for further lowering, of which npcomp takes one opinion (outside `torch-mlir` of course!), namely the `TorchConversion` dialect/transforms which lower to IR suitable for IREE and other linalg-on-tensors based lower-level compilers. Summary of changes: - move `{include,lib,test}/Dialect/Torch` into `torch-mlir` - move relevant parts of CAPI into `torch-mlir`. - leave a few things related to the `torch-mlir` Python build commented out, which should be resolved in a subsequent change.	2021-09-10 21:44:37 -07:00
Sean Silva	a7252f9a06	Add basic support for lists. This plumbs through a vertical slice of support for lists. The main chunk of new code here is AnnotateABIPass which captures the program signature at the Torch backend contract layer, right before we start `TorchConversion`. The `TorchConversion` lowering process is lossy w.r.t. types, so it's necessary to do this for all targets in general. Like using `!iree.list` directly, we use IREE's ABI annotation representation for this, although there is nothing very IREE-specific about it (see https://github.com/google/iree/blob/main/docs/developers/design_docs/function_abi.md) We change `ListLiteralModule_basic` to use `!torch.int` because IREE doesn't support f64 yet (and we don't yet have a way for users to say that they want `!torch.float` to lower as f32). Recommended review order: - AnnotateABIPass and tests - Arg marshaling in npcomp_backend.py and `iree.py` - Updates to `list_programs.py` / `xfail_sets.py` - Moving DeleteDeadIREEListsPass to Backend/Common, so that backends that don't support lists can use it. RefBackend uses that pass, for example.	2021-09-09 20:48:55 -07:00
Yi Zhang	73d553e168	MT model compilation minor changes This contains the following changes: - Fix optional knowledge propagation. The initial knowledge should always be NotNone for the operations we implemented. - Add Folder for `prim.dtype`	2021-09-09 19:02:48 -04:00
Sean Silva	5f3eb637c4	Fix lowering of reduce ops We were not filling the `outs` with the neutral element of the reduction, which resulted in reading uninitialized values (we were getting lucky that sometimes the uninitialized buffers were all zero's). Also, - Slight tweak to error messages in the e2e framework.	2021-09-08 15:30:15 -07:00
Ramiro Leal-Cavazos	6724de7692	Added sum lowering Added lowering to torch.sum into linalg	2021-09-03 17:37:06 -07:00
Sean Silva	ed2afe43e7	Fix TorchToIREE lowering. We needed to resize the list, not just reserve capacity.	2021-09-03 23:57:54 +00:00
Sean Silva	1dec561cfd	Update llvm-project to 830c0b9023cd0cf91955900e0d96283e7a8c3711 - builder.getSymbolRefAttr is gone. - OpAsmOpInterface's getAsmResultNames method needs explicit override - a bunch of churn for builtin.func needing to be made explicit (and sometimes implicit?) - operation printers no longer need to print the operation name themselves. - snuck in beneficial trivial addition to TmpDeleteDeadIREEListsPass to test a particular upstream change e2e with my local patchset.	2021-09-03 14:16:38 -07:00
Yi Zhang	3b0e5910a8	Refine types continue. This should cover all the ops that are left in MT.	2021-09-02 14:39:28 -04:00
dan	d9df4bfc95	Add sigmoid lowering Follows existing conventions for activation functions	2021-08-30 17:32:23 -04:00
Sean Silva	29e1b2fe89	Delete RestrictedCanonicalizer It doesn't work properly with the new dialect registration framework. This was latent and only was exposed when running through npcomp-opt. Not worth investing the brainpower to fix now.	2021-08-27 19:09:29 +00:00
Yi Zhang	d6b9709fa5	Changes to refine types - Add `!torch.optional` knowledge tracking - Changes to improve type propagation for branches and terminators. See examples in `refine-types-branch.mlir` - Refator to separate handling of different ops from `visitOperation` - Add refine types for a few new ops	2021-08-27 11:42:00 -04:00
Yi Zhang	bc5eae41ca	Add more folders to fold away branches Added folders to a few binary computing ops, `TupleUnpack`, `__contains__.str` and `__getitem__.Dict_str`.	2021-08-26 17:37:49 -04:00
Stella Laurenzo	32f56c67f4	Integrate llvm-project at a8de667af092c9b4b3b4a95827a521602ebf14ed. * Requires patch https://reviews.llvm.org/D108527	2021-08-22 18:59:59 -07:00
Stella Laurenzo	80ff744c56	Add a few missing deps exposed by stricter linking with BFD.	2021-08-22 11:56:48 -07:00
Sean Silva	cab8d922ec	Add TorchToIREE and factor out TorchConversion dialect. This converts a basic list op (torch.prim.ListConstruct) to the IREE dialect. ``` def forward(self, x: float): return [x, x] ``` turns into: ``` builtin.func @forward(%arg0: !torch.float) -> !torch.list<!torch.float> { %0 = torch.prim.ListConstruct %arg0, %arg0 : (!torch.float, !torch.float) -> !torch.list<!torch.float> return %0 : !torch.list<!torch.float> } ``` which turns into: ``` builtin.func @forward(%arg0: f64) -> !iree.list<f64> { %c1 = constant 1 : index %c0 = constant 0 : index %c2 = constant 2 : index %0 = iree.list.create %c2 : !iree.list<f64> iree.list.set %0[%c0], %arg0 : !iree.list<f64>, f64 iree.list.set %0[%c1], %arg0 : !iree.list<f64>, f64 return %0 : !iree.list<f64> } ``` As part of doing this, I realized that it was time to formalize the IR form that we reach right before running TorchTo{Linalg,Std,...}. We now call it the "Torch backend contract". We then lower the "Torch backend contract" to the "npcomp backend contract", which involves the new TorchConversion (`torch_c`) dialect, which holds ops that need to operate on both the npcomp backend types (e.g. builtin tensors, i1, IREE list, etc.) and the `!torch` types. This made more sense, as I realized that if I didn't factor out `torch_c` then the Torch dialect would have a dependency on IREE dialect (we previously didn't notice this was an issue because we only depended on `builtin` types), which seemed wrong to me. Recommended review order: - TorchToIREE.cpp / `TorchToIREE/basic.mlir` - Look at the new structure of createTorchScriptToNpcompBackendPipeline. It now lives in TorchConversion/Transforms/Passes.cpp and cleanly calls into `Torch::createTorchScriptToTorchBackendPipeline` for the frontend lowering to the Torch backend contract. - Mechanical change extracting `torch_c.{to,from}_{i1,i64,f64,builtin_tensor,iree_list}` into a new TorchConversion dialect, and a few passes specific to the lowering from the Torch backend contract to the npcomp backend contract. - Minor fixes to TorchToLinalg.cpp to use unconverted operands (now that we convert lists as part of operand materialization, we need to use the original operands). Also added test for AtenMaxPool2dOp and fixed m_TorchConstantIntList. - TmpDeleteDeadIREELists pass. Temporary pass for deleting dead IREE lists that are created as part of operand materialization for conv/max pool/avg pool ops in TorchToLinalg.	2021-08-16 15:01:58 -07:00
Yi Zhang	85ff8b692b	Fix compilation errors from MT model With the following changes the compilation can continue until RefineTypes pass: - Add operators without ODS into `torch_ods_gen.py` - Add some new optional and list types in `TorchTypes.td` - Add some folders for aten int type comparator ops - Modify GlobalizeObjectGraph.cpp. For global slots that's not used, dont check if an aliased value is stored in more than one of global slots. This can work around a failure where the same tensor is stored in multiple "version" slots which are not used.	2021-08-16 16:37:23 -04:00
Yi Zhang	bfc3ee35c6	Import Machine Translation model to MLIR. This includes the following changes to import MT model into MLIR. There are still a lot of work to for actual compilation. - Add `torch.dict<>`, `torch.any`, `torch.number` types - Add `torch.prim.DictConstruct` op - Fix `torch.prim.TupleConstruct` op assembly format to include resulting types	2021-08-10 15:22:06 -04:00
Sean Silva	a3bfd115ee	Remove npcomp-iree-backend-lower-linkage pass. This is no longer needed by IREE.	2021-08-09 15:28:02 -07:00
Sean Silva	902c2e579b	Add resnet inference jupyter notebook. This takes the example from torchscript_resnet18_e2e.py and puts it into a slightly cleaned up notebook form. It's still a little rough around the edges. Areas for improvement: - Installation / setup. - API usability. Also, - Add `npcomp-backend-to-iree-frontend-pipeline` since we will be adding more stuff there. - Slight cleanups.	2021-08-09 14:34:43 -07:00
Yi Zhang	0342b73bf1	Add torch.aten.flatten.using_ints and aten.MaxPool2d linalg lowering - torch.aten.flatten.using_ints to linalg lowering - torch.aten.max_pool2d to linalg lowering - Support torch.aten.conv2d for more flexible dilation and strides values	2021-08-04 12:00:43 -04:00
Sean Silva	f168cacd6d	Remove TCF and TCP. These were legacy concepts that are now superceded by direct Torch to linalg-on-tensors lowering. These were based on some very early thinking related to the layering of frontends vs codegen, which is now obsolete because: - We expected a lot more centralization at the frontend (TCF) level. It turns out that frontend needs really vary a lot, and there is no grand unifying TCF dialect plausible. The additional layer isn't worth it. - Linalg-on-tensors obsoletes the primary need for TCP. There are still a few things not representable with linalg-on-tensors, but the support is growing and the whole "not included in linalg-on-tensors" direction needs to be rethought. Our TCP dialect didn't cover any of the actually important things in this space (such as sort, FFT, top-k, etc.). See historical [slides](https://drive.google.com/file/d/1iljcpTQ5NPaMfGpoPDFml1XkYxjK_6A4/view) / [recording](https://drive.google.com/file/d/1jSPa8TwPKUt0WuLquGc8OgSUVYJHMvWZ/view) for more details on the origin story here. Their presence was confusing users too [bug](https://github.com/llvm/mlir-npcomp/issues/248). Also, - Trim down npcomp-run-mlir testing. It was testing TCF to TCP lowering for the most part. The essential stuff is retained and rephrased with linalg-on-tensors. (we should probably rename it "refback-run" or something, as it is just a way to invoke RefBackend) - test/Python/Backend/RefJIT/simple_invoke_numpy.py is XFAIL'ed. Our "anti-framework" direction seems to be the likely future path.	2021-08-02 12:08:39 -07:00
Stella Laurenzo	ec611c1e6f	Misc fixes for MacOS. (#255 ) * Change aligned_alloc -> malloc. It can fail (and does on MacOS) and is a bit over-aggressive optimization for a reference backend. * Fixed a fragile test that prints -0.0 on MacOS. * Fail the test (not the framework) on failure to trace (Torch on MacOS is missing features). * Fix .so -> .dylib for compiler runtime.	2021-07-27 17:48:47 -07:00
Stella Laurenzo	2dbab50444	Rework the python build to a static assembly of MLIR+NPCOMP (#251 ) * Adapt to python build system updates. * Bump llvm to 310c9496d80961188e8d8f8ad306cdf44bd7541f (includes python build updates) * Adds refback C-API. * Re-layers all python builds. * Rework CI.	2021-07-27 16:10:10 -07:00
Stella Laurenzo	2ecbcbf8c7	Bump llvm-project to a085c23aa3c8f91866d7f4588d4f683407dc775d. (#250 ) * Added additional ToLLVM conversion patterns (they were disaggregated from standard). Misc renames. * Spelling change on ConvNCHW op, and it now expects strides and dilations attributes.	2021-07-23 14:13:19 -07:00
Yi Zhang	89d4931324	Linalg lowering for aten.conv2d and aten.AdaptiveAvgPool2d 1. Add m_TorchConstantIntList 2. Lowering for aten.conv2d 3. Lowering aten.AdaptiveAvgPool2d	2021-07-09 15:04:29 -07:00
Sean Silva	83b5b5456d	Bump llvm-project to da289a174fc6617c7be37be2947480510fd4f02a - Build adjustments for `.cpp.inc` dialect files. - Renaming of `memref.dim` to `tensor.dim` for tensor case. Minor changes: - Renaming of `mlir::linalg::ReassociationIndices` to `mlir::ReassociationIndices`. - Adjust command line option parsing in npcomp-run-mlir.	2021-07-07 13:57:29 -07:00
Sean Silva	79928cd2dd	Generalize support for elementwise ops. We plumb through e2e a fair number of interesting cases: - unary, binary, ternary elementwise ops - ops like `torch.aten.add.Tensor` that also take a scalar parameter - static size-1 broadcasting We allow the static size-1 broadcasting case, but emit a runtime error in the case of dynamic size-1 broadcasting. This seems like a sweet spot subset of things that can be lowered directly to linalg, while not being overly constraining to users. This is consistent with what IREE is doing for CHLO->Linalg lowering as well ([code](`50bf7a87e4/iree/compiler/InputConversion/MHLO/BroadcastingToLinalgPatterns.cpp (L1)`)). To test the static size-1 case, we added support for the `torch.aten.unsqueeze` op and lowering for it through `linalg.tensor_expand_shape`. This involved a generalization of `MaximizeValueSemantics` able to handle it (the solution there also works for `torch.aten.flatten.using_ints` which we need for ResNet anyway) Also, a few minor additional changes: - Add `VerifyInvariantsBeforeBackendLowering` pass, which catches a large class of errors before we get to backend lowering (now that we are doing dialect conversion, the errors are way nicer if we just emit them up front rather than in the guts of a random pattern). - Minor change to RefBackend to allow `linalg.tensor_expand_shape`. Recommended review order: - e2e tests in elementwise.py - `ConvertElementwiseOp` in TorchToLinalg.cpp + elementwise.mlir test - `ConvertAtenUnsqueezeOp` in TorchToLinalg.cpp + unsqueeze.mlir test - RefineTypes.cpp + tests - MaximizeValueSemantics changes + test - VerifyInvariantsBeforeBackendLowering pass + test	2021-06-28 13:28:38 -07:00
Sean Silva	145d4ae23c	Bump llvm-project to a37cf17834d39411ed1d669098b428f8374c5b45 Changes: - Change to operand ordering of `linalg.fill`.	2021-06-23 10:03:29 -07:00
Sean Silva	90c6c64fd6	Make torch.constant.float print a little nicer. This printing is chosen to be similar to how MLIR prints the values by default.	2021-06-23 08:07:45 -07:00
Sean Silva	60a947b4a7	Add CastOpInterface to torch.prim.unchecked_cast. This allows it to fold away in trivial cases.	2021-06-23 08:07:45 -07:00
Yi Zhang	45f2edfc7a	Add TorchToSCF pass. 1. Add TorchToSCF pass. 2. Convert prim.If and prim.If.yield.	2021-06-23 08:06:43 -07:00
Yi Zhang	5ad144c4fe	More folding for aten.gt.int, aten.ne.int and Aten__Getitem__TOp. - Fold more for aten.gt.int, aten.ne.int and Aten__Getitem__TOp - Some format cleaning up	2021-06-23 08:06:37 -07:00
Sean Silva	79aade33da	Make MaximizeValueSemantics a bit smarter. This adds a pattern to MaximizeValueSemantics which does a simple abstract interpretation within a block, which handles simple cases of `torch.overwrite_tensor`, enough to remove all the unnecessary uses of non-value tensors in ResNet right now. Before/after IR: [gist](https://gist.github.com/silvasean/a3e1ef625b19dfc63579f73cd3b543b6) Also, - Split `torch.copy.tensor` into `torch.copy.to_tensor` and `torch.copy.to_vtensor` which convert between value and non-value semantic tensors. This is a much cleaner factorization as they have very separate use cases and properties (e.g. different side effects) - Remove the various canonicalization patterns they had, which were confusing because they resulted in limited forms of maximizing value semantics throughout the pipeline. We should structure our compilation pipeline such that only MaximizeValueSemantics should be maximizing value semantics. - Adjust pass pipeline to only run MaximizeValueSemantics once. - Make OverwriteTensorOp `$value` always be a value tensor and `$overwritten` be a non-value tensor.	2021-06-22 16:48:57 -07:00
Yi Zhang	6dddb4d4fe	Add torch.aten.batch_norm Linalg lowering support 1. Added a simplified version of torch.aten.batch_norm which only handles inference and assumes the weight, bias, running_mean, running_var are not None. 2. Removed the primitive types check in verifyLinalgCompatibleTypes check since now we have proper type converter to handle torch types conversion. The checks for RankedTensorType is kept because the type converter doesn't guarantee the converted builtin tensor type is ranked. A separate verification pass to verify the invariant expected by later passes will need to be added before those can be removed as well.	2021-06-22 16:45:21 -07:00
Yi Zhang	e6adecac83	Convert Torch constant ops to std.constant	2021-06-18 12:22:47 -07:00
Sean Silva	78d2cc0818	Make `torch.copy.tensor` canonicalization a bit smarter. This removes most of the trivial cases that MaximizeValueSemantics needs to handle, making it easier to see the nontrivial cases.	2021-06-17 18:11:58 -07:00
Sean Silva	40369c54dc	Adjust pass pipeline for changes to `dim` canonicalization. This results in cleaner IR. In particular, Mlp2LayerModule e2e test has a dim op that is eliminated by this change: https://gist.github.com/silvasean/734f11a291ae6236c955f65cffae285f	2021-06-17 16:59:55 -07:00
Sean Silva	333e07a74e	Add `torch.vtensor.literal` op. This op is much better behaved than the `torch.tensor.literal` op (which is the new name of the `torch.tensor` op). In particular `torch.tensor.literal`: - always has a maximally refined type. - always has value semantics. - can be constant folded / CSE'd. ReduceOpVariants is changed to perform the transformation from `torch.tensor.literal` to `torch.vtensor.literal` (which in general involves static information casts and copies. This new op also allowed tightening up `torch.tensor.literal` to only accept NonValueTensorType (instead of any tensor type). This new ".literal" name is more descriptive. It was getting too confusing seeing an op called just `torch.tensor` (we originally called it that because that's the name of the similar function in the Torch Python API, but it just doesn't fit here).	2021-06-17 14:37:04 -07:00
Sean Silva	4a0eb44d17	Add a !torch.float type. This removes the dependence of the `torch` dialect on the low-level builtin types. Now the `torch` dialect is a standalone layer, suitable for targeting from higher-level Python abstractions without any premature lowering to primitive types.	2021-06-17 09:24:18 -07:00
Sean Silva	f49ebf1690	Add `!torch.int` type. This replaces the ad-hoc use of `i64` throughout the Torch layer, and helps to keep it crystal clear the distinction between `!torch.int` (which is modeling the Python `int` type) and the various types that serve as dtypes of tensors, which are a totally different type universe. Changes: - `!torch.int` type and C bindings. - Change `torch.constant.int` parser to not need the `: i64` at the end. - `m_TorchConstantInt` matcher to aid with matching constants. - BackendTypeConversion changes for `!torch.int` -> `i64` type conversion. - Refactor finalizing patterns in FinalizingBackendTypeConversionPass (they were getting very repetitive). - Mechanical rewriting of `!torch.int` to `i64` in all the tests, and `AnyTorchIntType` to `Torch_IntType` in the `.td` files.	2021-06-17 07:28:23 -07:00
Sean Silva	224afb186e	Add folders for torch.aten.gt.int / torch.aten.ne.int This fixes a "regression" on ResNet where we weren't folding away all the control flow. For now, our policy is to "optimize hard enough" to make that control flow go away, because we don't yet have a way to lower to the backend the stuff guarded by the control flow (RaiseException, string operations, etc.). It remains to be seen how much optimization we decide to do at this level in the fullness of time -- the torch op set is not particularly well-designed (at least not idiomatically for MLIR) for general optimization. Ideally, with really good backend support for various features, all the heavy optimization will happen at that layer on `std` ops and `scf` control flow. But I have a suspicion we might end up needing more optimization earlier in the pipeline.	2021-06-16 14:04:31 -07:00
Sean Silva	8860b5c55d	Add `torch.prim.If` This removes the use of `scf.if`, which required laundering back and forth between `i1` and `!torch.bool` in the frontend. We will eventually lower this op to `scf.if`, but this results in a cleaner IR and layering at the frontend.	2021-06-16 14:04:31 -07:00
Sean Silva	784156a998	Add `!torch.bool` type. This finishes removing the dependence on the basicpy dialect! Changes: - Add `!torch.bool` type and replace use of `!basicpy.BoolType` in Torch-related code. - Rename BuiltinTensorize to BackendTypeConversion since now it handles bool conversions (and, when we add !torch.int and !torch.float, it will handle those as well), and generalize the related utilities (I also moved them to Torch/Transforms since they aren't really part of Torch/IR). - Add `torch.to_i1` and `torch.from_i1` ops for materializations - [cleanup] Reorganize `torch.constant.*` ops in TorchOps.td - Remove dependency of `torch` dialect on `basicpy` dialect and also `std` dialect. For `std`, we use some call related ops, but the `torch` dialect itself never produces them (we have passes that do though). This is fairly mechanical. Recommended review order: - New stuff in Torch/IR - New BuiltinTypeConversion files. - Mechnical fixups elsewhere.	2021-06-16 13:22:00 -07:00
Yi Zhang	7b7c9c5d3d	Add aten.relu Linalg lowering support	2021-06-16 08:18:14 -07:00
Sean Silva	3ccf6002af	Add `torch.constant.int` and `torch.constant.float`. - This removes reliance on basicpy.numeric_constant. - Also, add OpAsmOpInterface to the `torch.constant.none` and `torch.constant.str` ops.	2021-06-15 15:29:42 -07:00
Sean Silva	2e850ecb72	Add !torch.str type. - Remove dependence on `!basicpy.BytesType`. - Add `torch.constant.str "s"` analogous to `torch.constant.none`.	2021-06-15 10:10:59 -07:00
Sean Silva	92ee0fa98f	Add `!torch.tuple<T1, T2>` type. This further eliminates the need for the `basicpy` dependency. This required adding `torch.prim.TupleConstruct` to replace `basicpy.build_tuple`.	2021-06-15 08:15:22 -07:00
Sean Silva	ea1dd1cd90	Remove a few more comments I missed in the last commit.	2021-06-14 18:18:43 -07:00
Sean Silva	6b2424512b	Make C API files more consistent - Make consistent with MLIR Core - Use `//` or `///` comments. - Use `bool` type for booleans - No duplicated comments in .cpp files - Split types into separate files `{Basicpy,Numpy,Torch}Types.h` - Add dialect prefix consistently to C API symbols. We have lots of similarly named types (e.g. "list" type in basicpy and torch).	2021-06-14 15:34:43 -07:00
Sean Silva	db282fd1b4	Introduce native `!torch.none` type. - Add `torch.constant.none` op to construct it (naming is chosen to be analogous to Torch's representation of a prim::Constant with NoneType, rather than using the "singleton" terminology of Basicpy).	2021-06-14 13:30:58 -07:00
Sean Silva	81bcd7fb12	Move Torch type implementation code into TorchTypes.cpp	2021-06-10 16:46:47 -07:00
Yi Zhang	e0ff5248fb	Add TorchList type and prim::ListConstruct #218	2021-06-10 14:31:35 -07:00
Sean Silva	370e3270ab	Introduce `!torch.tensor` / `!torch.vtensor` types. This removes our reliance on the numpy dialect and avoids our off-label use of the builtin tnesor type for modeling unknown dtypes. The `!torch.vtensor` (`ValueTensorType`) type is a value-semantic tensor. The `!torch.tensor` (`NonValueTensorType`) type is a non-value-semantic tensor. The new types look as follows syntactically: ``` // Least-static-information, non-value-semantic tensor. !torch.tensor // Explicit form of least-static-information variant. !torch.tensor<,unk> // Least-static-information, value-semantic tensor. !torch.vtensor // Explicit form of least-static-information variant. !torch.vtensor<,unk> // Fixed-set of allowable element types, with first-class support for // Torch's frontend signedness semantics. !torch.tensor<*,si32> // First-class support for unknown dtypes. !torch.tensor<[?,?,?],unk> // Standard MLIR representation of `?` for unknown dimensions. !torch.tensor<[?,2,?,4],unk> // Statically shaped / dtyped example. !torch.vtensor<[1,2,3,4],f32> ``` This required fairly significant changes throughout the compiler, but overall it is a big cleanup. We now have a much clearer layering of "the Torch frontend lowering" vs "lowering to std + linalg + etc.". At the C++ level, there is `ValueTensorType`, `NonValueTensorType`. We also have a helper `BaseTensorType` (kind of like ShapedType) which interoperates with those two. Included changes: - New `torch.tensor(dense<0.0> : tensor<5xf32>) : !torch.tensor` op for creating torch tensor literals in the frontend. - Consistently use signedness for the types (except i1 which I didn't touch -- we need to sort out the situation with !basicpy.BoolType there anyway so will be attending to that soon) - Frontend can annotate whether an argument to the function has value semantics. We currently require this, as our backend contract does not currently allow us to even model the non-value-semantic case. Before, the value-semantic assumption was randomly injected in the middle of the pass pipeline. - Move ArrayToTensor (now called MaximizeValueSemantics) and RefinePublicReturn passes to torch dialect. - The TorchToStd and TorchToLinalg passes are now type conversions from `!torch.vtensor` to `tensor` and use the dialect conversion infra. The overall conversion pipeline is set up following the best practices of the "Type Conversions the Not-So-Hard Way" talk. This required introducing `torch-func-builtin-tensorize` and `torch-finalizing-builtin-tensorize` passes analogous to the upstream bufferization passes with the corresponding names (mostly just copypasta from there). - Misc Torch-level canonicalizations -- we now cleanly layer the lowering to std later in the pipeline, so we are gradually lessening our reliance on random std constant folding before we get to that point. Recommended review order: - New types in TorchTypes.td/TorchTypes.h/TorchDialect.cpp - New ops in TorchOps.td / TorchOps.cpp - Less important / more mechanical stuff - Frontend changes. - Pass changes/additions in `Torch/Transforms` and `Conversion/`	2021-06-10 10:56:48 -07:00
Sean Silva	b7b7fd4959	Rewrite error reporting of e2e tests. This now gives [much nicer output](https://gist.github.com/silvasean/f048e0f37b04542dae6469b86802bb3e). Embarrassingly, we previously couldn't even report failures for two different tests, and weren't able to report on compilation failures (besides just crashing).	2021-05-20 11:28:20 -07:00
Sean Silva	d66e8fe1f8	Get simple quantized model importing. This is enough to import the program and get it through the compilation pipeline. It of course fails at the VerifyBackendContract pass since there is a lot missing, but the final IR for a simple quantized MLP is looking pretty decent already: [IR](https://gist.github.com/silvasean/f76bccd76e9b193d396cfb2f9a11f54d) Main changes: - Add support for importing torch quantized tensors, including `torch.per_tensor_affine.create` op and `!torch.qint8` element type. - Add support for importing `LinearPackedParamsBase` (basically a weight + optional bias, but requires `torch.linear_params.create` op + `!torch.LinearParams` type to model it). This was less painful than I expected, as it has the necessary methods to opaquely unpack itself. I factored things so it should be easy to extend to other custom classes like `ConvPackedParamsBase`. - Add minimal boilerplate for importing `quantized::*` ops, with `quantized::linear` being a motivating example. - Add e2e test with simple quantized MLP (courtesy of @phoenix-meadowlark). This is somewhat of an abuse of `!numpy.ndarray` / `tensor`, as really the proper semantics of `!torch.qint8` dtype on a Torch tensor is "check the quantizer object of the tensor for side data (scale/offset, possibly per-channel) that defines the full semantics of the tensor". We don't have any such notion of "side data" for `!numpy.ndarray` / `tensor`, let alone anything that would have the associated behavior of keying off the dtype to determine if the side data is present. This will be fixed by a proper `!torch.tensor` type.	2021-05-20 11:28:20 -07:00
Sean Silva	2efda323ff	Significantly restructure torch/aten import design. This is a really major and invasive restructuring of the way we get torch operators (`torch::jit::Operator` / `c10::OperatorHandle`) into MLIR. Please forgive the challenging review, but due to the sheer invasiveness, it wasn't really practical do do it in sane smaller pieces. This fully replaces everything that was already working on the TorchScript path (actually, more -- we added tanh support to TorchToLinalg in order to delete the older code paths). Additionally, I've kept the lights on for the acap path too, including what little e2e stuff was working before (for expediency I made a few tiny compromises along the way that will be easy to undo when we give that path proper attention). Overview of the new design: - The torch operator `somens::someunqualname.someoverloadname` is imported as `torch.somens.someunqualname.someoverloadname` (skip the last dotted part if the overload name is empty), OR, if we don't have such an op registered, it is imported as `torch.operator "somens.someunqualname.someoverloadname" (...) : ...`. - The addition of the "overload name" is a critical element here, as the `(ns,unqual,overload)` triple is unique, which solves a lot of problems we were having. - This involves having separate MLIR ops for the `trailing_` and `.out` variants and all the different overloads. This seemed necessary, because the set of overloads is so wild and varied and unstructured. The previous design was leaning into some underlying structure that just isn't there -- the default situation is the "random overload that we want to manage on the MLIR side", rather than that being an exception. E.g. `aten::ne` (not-equal) has 21 overloads, only 4 of which are c10 dispatcher ops see [gist](https://gist.github.com/silvasean/190ba918c550c956260e21254e1b8aa1), and the "out" variant is really called `.Tensor_out` instead of `.out` as it frequently is for other ops. - Rationale for all being in `torch` namespace: the set of operators are so varied and unstructured that "dialect per namespace" doesn't result in anything resembling the typical MLIR dialect boundary expectations. We could maybe draw the boundary at dispatcher ops vs non-dispatcher ops, but that doesn't seem to really result in very much useful structure at this point in time. - Note: within the torch operator registry, we effectively have a mini-basicpy subdialect (already type-resolved), which is reasonably structured. - The existing Torch op interfaces are also removed -- now that we track the overload name, we can losslessly find the original operator. - Instead of `ATenRecognizeKernelsPass`, we now have a `ReduceOpVariantsPass` that keys off certain traits (and perhaps eventually interfaces) to reduce variants of ops to a smaller set, ideally operating on immutable tensors and using surrounding ops to model the mutability/aliasing aspects. - Note: `torch.ns.unqual.overload` ops allow both immutable and mutable tensors (unlike the previous hard distinction in the common case). This is a premonition for a future change that will introduce a bona fide `!torch.tensor` type that will clean up a bunch of stuff. - `TorchToLinalg` / `TorchToStd` supercede the existing "ATen->TCF->TCP->Linalg" path. - The new `torch_ods_gen.py` supercedes `torch_signature_ods_gen.py`. It should look somewhat familiar, but the benefit of hindsight has allowed a lot of simplifications. The overall trend seems to be to make the `torch` dialect a nice layer independent of anything else. It feels like as a natural result of various future changes we will be removing the reliance on basicpy+numpy dialects and have a nice self-contained type system too that properly models the TorchScript type system (including proper subtyping, mutable/immutable tensors, optional dtype, etc.). Recommended review order: - Start at some of the new import IR, e.g. in `frontends/pytorch/test/node_import/prim.py`, `frontends/pytorch/test/acap_export/test_export_add3.py`, and other tests. - `frontends/pytorch/python/torch_mlir_utils/codegen/torch_ods_gen.py` and associated generated files: - `include/npcomp/Dialect/Torch/IR/GeneratedAtenOps.td` - `include/npcomp/Dialect/Torch/IR/GeneratedPrimOps.td` - Inspect `ReduceOpVariants.cpp` / `reduce-op-variants.mlir` and the new traits in `include/npcomp/Dialect/Torch/IR/TorchTraits.h` - Various code changes in the import path in `frontends/pytorch/csrc/builder`. Probably most interesting is the new code in `torch_to_mlir_utils.cpp` that has the logic to create the `torch.operator` ops or `torch.ns.unqual.overload` ops. This is the [new ResNet IR](https://gist.github.com/silvasean/5407aafb710d07612b7b5b92eabecebe), just to be able to look at a substantial sample of IR in the new style.	2021-05-19 13:37:39 -07:00
Sean Silva	133bdf4b31	[cleanup] Add materializer for basicpy.singleton This allows the canonicalizer to coalesce it like other constants.	2021-05-03 09:54:44 -07:00
Sean Silva	3d08c83580	Add flatten op recognition + shape refinement. This op has complex aliasing semantics, so it is kept mutable for now. With this, we reduce ResNet18 to a single BB with all aten operators having rank + dtype: https://gist.github.com/silvasean/2fcb1c6e4d4ae27461204a43ae9c5031	2021-05-03 09:54:44 -07:00
Sean Silva	122cae2ee3	Add aten::len.t, aten::size, and aten::gt.int primitive ops Also add some canonicalizations that finally reduce ResNet down to a single block.	2021-04-30 10:57:02 -07:00
Sean Silva	ec6d06aa86	Add some more ResNet ops. - aten::relu_, aten::max_pool2d, aten::adaptive_avg_pool2d, aten::batch_norm, aten::conv2d No aten-to-linalg conversion for the latter ones, as they are fairly substantial. At this point, I'm trying to get shape inference and stuff working for them and the IR cleaned up.	2021-04-30 10:57:02 -07:00
Sean Silva	9257457d8a	Add AllowsTypeRefinement trait and use it to improve RefineTypes This trait lets us model the semantics of various aten/torch/numpy ops that are insensitive to type refinements. This replaces hardcoded/inconsistent checks for this property. To show usage of this new trait, we fix up some old uses, and improve RefineTypes to be smarter about rewriting with this trait.	2021-04-30 10:57:02 -07:00
Sean Silva	1c832604d2	Remove old aten-to-std / ATenLowering pass. It was confusing now that we have `convert-aten-to-std`.	2021-04-30 10:57:02 -07:00
Sean Silva	55c3cc6624	Add recognition/folder/lowering for aten::__is__, aten::ne.int, and aten::dim Interestingly, TorchScript has its own op (`torch::jit::Operator`) registry separate from the dispatcher (it is a superset of the dispatcher). This is where the "prim" ops and some "aten" ops (that should probably be renamed to "prim") live. In particular, `aten::__is__` is in that latter category of "aten but really prim". This registry is also the source of truth for what the TorchScript interpreter calls into when it executes. The bulk of the "not part of the dispatcher" ops live in `09feb5f579/torch/csrc/jit/runtime/register_prim_ops.cpp (L82)` And the registry itself lives in: `09feb5f579/torch/csrc/jit/runtime/operator.cpp (L196)` This fold further reduces the IR of ResNet by folding away some more not-taken branches. These not-taken branches in ResNet require first-class handling of the list type which we don't yet have on any backend.	2021-04-30 10:57:02 -07:00
Sean Silva	7eb36b4ae7	Constant fold through basicpy.bool_cast. This is the start of a push to getting ResNet running. This involves throwing in the towel on an O0 pipelinie for now. See note in the code. We keep an options struct with `optimize` flag, but it default to true for now.	2021-04-30 10:57:02 -07:00
Sean Silva	fb5f149e04	Reformat Passes.cpp and remove torch-globalize-pipeline. The pipeline is subsumed by our lowering pipelines.	2021-04-30 10:57:02 -07:00
River Riddle	4678a7fedd	Refactor RefineTypes to use the upstream ForwardDataFlowAnalysis engine This removes the need for defining all of the custom propagation logic, and also adds support for propagating value knowledge across branches, through regions, and across calls.	2021-04-27 13:17:56 -07:00
Sean Silva	642482429c	Bump llvm-project to 12011b5217929ef8a56c2099c6f3233934ea4fbc - Rename FrozenRewritePatternList -> FrozenRewritePatternSet	2021-04-27 13:12:33 -07:00
Sean Silva	179105ca3e	Add basic MLP's to the e2e curriculum. These tests pass on the reference backend. - Add aten.linear op + shape xfer function + ATen->Linalg lowering. - Note: this needs to be more automated, and needs to cover more cases. - Current not implemented caveats: - size-1 broadcasting for bias vector (either static-size-1 or ? case) - higher-rank aten.linear ops (not produced by torch.nn.Linear though) - type promotion (still don't even know the exact rules here) - Add folder for torch.derefine op. Now the inliner can clean it up as it inlines. (call boundaries are a main place we need to insert torch.derefine) This is brittle -- the other important case is control flow which will need to be handled via an extension to RefineTypes.cpp (as will more robust call handling). River has an in-flight patch to update it to the new dataflow framework so I didn't want to do anything intrusive here. - Also adjust torch.derefine syntax to use the keyword `to` instead of `->`, as most type-only, cast-like ops do.	2021-04-27 12:18:54 -07:00
Sean Silva	9ba77c6e13	Add InlineGlobalSlots pass. This inlines global slots if possible. This allows them to participate in folding, canonicalization, shape inference, etc. Example use cases: - inlining weights and biases that are readonly during inference - inlining the "training" bool to allow stuff to fold away For training use cases (especially internal training loop), we will need something smarter to get good performance. That would look like an "SSA formation" which promotes the global slots to tensors in the program, flushing them back to the slots at the minimal number of necessary places. We might want to let backends do that transformation though. This also interacts with shape inference (type bounds on the slots to even lower them to backends in the first place).	2021-04-27 12:18:54 -07:00
Sean Silva	3a890aa26c	Miscellaneous changes while trying to work on ResNet18 - Move frontend lowering pipelines to c++ (this helps with reproducing failures in npcomp-opt) - Add debugging printouts when compilation fails on RefBackendTestConfig The experience now when a test fails during MLIR lowering is now like this: ``` NPCOMP TorchScript Object Graph IR -> NPCOMP Backend IR lowering failed with the following diagnostics: failed to legalize operation 'torch.global_slot' Module does not conform to npcomp's backend contract. See dialect conversion legality information above. Error can be reproduced with: $ npcomp-opt -torchscript-to-npcomp-backend-pipeline /tmp/ResNet18Module.mlir ``` And when TorchScript->MLIR import fails it looks like this: ``` PyTorch TorchScript module -> NPCOMP Object Graph IR import failed with the following diagnostics: unhandled prim operation: %18 : int = prim::min(%17) # /usr/local/google/home/silvasean/.local/lib/python3.9/site-packages/torch/nn/functional.py:4532:4 ``` Also, - Add `--filter=<regex>` to e2e test harness to filter tests. - Add a few prim ops that were needed to import ResNet18 - Fix torch.prim.Loop.condition assemblyFormat (it previously would not round-trip in the case of no loop-carried variables)	2021-04-27 11:51:11 -07:00
Sean Silva	544cb4ef54	Bump llvm-project to 484b6648fdd4b104eaf7a2504dd07b60af2c9f8d - add_mlir_doc arg order - fix some dependent dialects on passes that were now causing errors - "encoding" attribute on mlirRankedTensorTypeGetChecked	2021-04-22 18:12:55 -07:00
Sean Silva	fef1733e12	Fix issue with unused functions in torch::jit::CompilationUnit As described in the code comment: ``` When we import TorchScript IR, we import their entire "compilation unit", which can contain numerous functions unrelated to the current program, which breaks torch-globalization-pipeline; for example, there can be random functions referencing types that haven't been imported as part of the root `torch.nn.Module` we imported. Those will be unreferenced private functions which symbol-dce will clean up nicely. ``` This situation is really easy to hit in jupyter notebooks, where the same cell is evaluated multiple times. That results in the same class name (at the Python level, e.g. class `Foo` in the top-level main module). Internally to PyTorch, it handles this situation by mangling in a unique number to the names of ClassType's and such. When we import the new ClassType's, we see not just the new torch::jit::Function's in the CompilationUnit, but, also all the old ones, which reference ClassType's that are not reachable from the `torch.nn.Module` that we imported. Note: there is no way to avoid importing the whole CompilationUnit (including these old remnants) without doing a fairly complicated call graph reachability analysis of which functions are reachable from the methods of the ClassType's we imported. It turns out that once we are inside MLIR, we model visibility correctly so that `symbol-dce` "Just Works" for this use case. That is to say, this is not a quick hack, but rather seems like a totally palatable long-term solution.	2021-04-20 12:00:35 -07:00
Sean Silva	c4123d4d4d	Add npcomp-verify-backend-contract pass. This pass verifies that a given module satisfies the contract that we have for backends. This is phrased as an "allowlist", because we want to keep this interface tight. Also, this gives much better diagnostics than a backend randomly crashing or failing to compile would (though they could still be improved). This was especially painful because if we had `tensor<?x!numpy.any_dtype>` slip through, at some point RefBackend would convert it to a memref type and trip the "verify type invariants" assertion which gives no location or anything and crashed the process, which was very unpleasant. We implement this with the dialect conversion framework, which works reasonably well and was quick to put together and familiar, but is still very "op oriented". We probably want to make this hand-rolled eventually, especially the error reporting (the most useful kind of error for a dialect conversion user is not necessarily the best for this use case). Also, in production, these error will go to users, and need to be surfaced carefully such as "the compiler needs a type annotation on this function parameter" which in general requires some special analysis, wordsmithing, and overall awareness of the e2e use case (such as how much we can lean into certain source locations) to provide a meaningful user-level diagnostic. Also, add `inline` to the current frontend lowering pass pipeline to allow slightly more complicated programs that otherwise would fail on shape inference.	2021-04-20 12:00:35 -07:00
Sean Silva	f5dfa02523	Add `aten.mm` to linalg lowering. This is our first op with error semantics, and stresses the system. There are a few design notes of special interest: - RefineTypes.cpp's note about shape inference in the presence of code that dynamically produces and error, and it is provable statically. - ATenToLinalg.cpp's notes about future automation of the ATen->linalg path. - The notes in Passes.td about using low-tech `std.assert` ops instead of `shape.assuming`. Note: Doesn't work on IREE yet due to the `std.assert` op (needs to be lowered to `vm.fail` on the IREE side).	2021-04-16 12:03:31 -07:00
Sean Silva	28a0f02746	Add support for compiling through IREE. Recommended review order: - Changes in frontends/pytorch/examples/ - Changes in python/npcomp/compiler/pytorch/backend/ - Boilerplate for the `npcomp-iree-backend-lower-linkage` pass. This change separates out a `npcomp.compiler.pytorch.backend.frontend_lowering` module that does the common lowering for all backends. The individual compiler backends `npcomp.compiler.pytorch.backend.{refjit,iree}` now accept a loosely defined "TCP + scalar code" IR mix that will be formalized in the future as the interface to codegen backends. This also required adding a small pass `npcomp-iree-backend-lower-linkage` which adds `iree.module.export` onto functions, and layering that into the frontend flow. The pass doesn't require a C++-level dependency on IREE, which is nice for now. TBD how we are going to handle lists (we hope we can get away with sneakerneting some td files and relying on loose IR compatibility). Running through IREE requires the ability to import `iree.compiler` and `iree.runtime`, which can be obtained as follows: ``` python3 -m pip install iree-compiler-snapshot iree-runtime-snapshot -f https://github.com/google/iree/releases/tag/snapshot-20210406.200 PYTHONPATH="${PYTHONPATH}:${MY_IREE_BUILD}/bindings/python/" ``` This patch makes it painfully clear that we don't have any e2e testing harness to really plug into, and also don't have a usable Python API to our compiler stack (something usable in a jupyter notebook). That will be addressed in subsequent commits. We've been flying by the seat of our pants with this `examples` directory that isn't subject to any kind of testing or real usability concerns.	2021-04-09 13:15:07 -07:00
Aaron J Arthurs	f9d9518f6e	Declare TCP dialect dependency in TCFToTCP conversion	2021-04-07 14:23:56 -07:00
Sean Silva	927546b3c5	Add RefinePublicReturn pass. This pass allows shape information to be propagated to return types, which is nontrivial and cannot be cleanly put anywhere else as it changes the public ABI, which is a concern that we want to keep concentrated in one place.	2021-04-07 11:06:34 -07:00
Sean Silva	1e357ae680	Add simple type refinement pass. Currently implemented as a simple intraprocedural dataflow analysis over a standard ShapedType lattice (hasRank, sizes, and elementType). It currently hardcodes a few key pieces of information: - shape transfer functions - whether it is legal to update the operand type of an op This needs to be made pluggable obviously and the core propagation logic moved somewhere agnostic.	2021-04-07 11:06:34 -07:00
Sean Silva	6431b0f11f	Add primitive ArrayToTensor (numpy-array-to-tensor) pass. The current implementation is just sufficient to do a unary aten.tanh from the e2e spike, and just applies some local rewrite patterns. I've sketched out the more full explanation of where this pass eventually need to go in the pass docs. Adding this required adding `numpy.tensor_static_info_cast`, which is the tensor analog of `numpy.static_info_cast`. This op encapsulates the same numpy-specific "no runtime code" casting semantics, in particular the interpretation of `!numpy.any_dtype`. The `numpy.tensor_static_info_cast` I see in practice now are "information erasing" and will be removed by a later pass that exploits the fact that aten ops are agnostic to the static info in the operand types (so substituting a type with more static info is fine). Side note: we need to do dtype and rank inference before aten->tcf (which will eventually mostly be aten->linalg+guards), because each aten op is idiosyncratically overloaded based on dtype and rank. Without copying that idiosyncratic overloading into lower layers (layering violation), we cannot really lower it to anything until we do that.	2021-04-05 17:56:35 -07:00
Sean Silva	30356c41c8	Add torch-adjust-calling-conventions pass. This pass incorporates torch.type_bound info and also removes NoneType returns (eventually it will rewrite tuple types too, but can't yet because !basicpy.TupleType doesn't track element types). Recommend looking at adjust-calling-conventions.mlir first to see what it is doing, and holding your nose for the implementation of the pass. I decided to implement this with the conversion framework, because it gives us some goodies for type conversion -- mainly avoiding large amounts of tricky RAUW dances. Unfortunately, the conversion framework isn't a perfect fit for a couple reasons: - the incorporation of torch.type_bound is a context-sensitive rewrite (requires looking at the arg attr, not just the type). - NoneType conversion is 1->0, which requires some special handling - (not implemented yet) 1->N tuple type conversions require special handling. It's a little bit scary, but on balance doing it the other way would have its own downsides.	2021-04-05 17:56:35 -07:00
Sean Silva	464feacba9	Bump llvm-project to 223dcdcfbe23affdf17ada7f023ee1872fd76160 - ModuleOp no longer has a terminator.	2021-04-05 17:56:35 -07:00
Sean Silva	e749074bae	Basic infra for annotate shapes and dtypes on arguments. These allow users to annotate a known "type bound" on the argument, which can seed shape/dtype inference. We don't rewrite the function types as part of the import process (it will happen in a yet-to-be-written pass) because: 1. We would need to interprocedurally rewrite all calls to keep the IR consistent. Currently, we have a place after GlobalizeObjectGraph but before we convert to tensors where this is convenient to do. Ideally, we would do this on the object graph representation. 1. We don't necessarily know that adjusting the function type is a legal calling convention change. The pass will have blessed knowledge (by the pass pipeline author) that adjusting the argument type based on the type bound is safe (which it frequently is). 2. Note that in principle, a type bound could be a fairly general thing (such as maximum sizes of dimensions, unions of multiple concrete types, etc.). The pass will in principle have logic to interpret the type bounds and to determine a suitable "best" (and legal) argument type.	2021-04-01 18:40:03 -07:00
Sean Silva	7a4043b7c4	Add ability to compile from object graph ir.	2021-03-31 09:25:13 -07:00
Sean Silva	c6d56fed8a	Add unary tanh lowering.	2021-03-30 16:39:49 -07:00
Sean Silva	641098be54	Clean up some compiler warnings on my machine.	2021-03-23 14:29:05 -07:00
Sean Silva	99178a167d	Bump llvm-project to 0524a09cc7e1a0797982feacf505825231efbee7 - renames of OwningRewritePatternList -> RewritePatternSet - also `insert` to `add` - RewritePatternSet holds a context now - memref dialect split from std	2021-03-23 14:29:05 -07:00
Bryce Arden	4591884d06	[refbackrt] Scalar arg support * Adds f32 scalar argument support across the ABI boundary. * Adds support for passing input type / shape information across the ABI boundary * Adds support for parsing / creating input FloatAttr's in `npcomp-run-mlir`	2021-03-23 13:16:44 -07:00
Sean Silva	703428eff4	Add support for "trailing_" and "out" variants of various ops. We already had the `promoteTrailingOutTensor` flag, but weren't using it. A inplaceVariantKernelName flag needed to be added. This change is a little dissatisfying, as the conversions done by the RecognizeKernelsPass are currently non-orthogonal. In particular, `kDropResultAndAliasArg0` probably won't work as intended if mixed with these (we probably need to promote kDropResultAndAliasArg0 to not be an arg-level thing anyway, as we have done with promoteTrailingOutTensor). This involved adding a new op `numpy.overwrite_array`. ``` numpy.overwrite_array %arg2 overwrites %arg0 : tensor<2x3xf32>, !numpy.ndarray<[2,3]:f32> ``` This models the destructive update behavior. Note that in the above op, we cannot simply RAUW %arg0 with a suitably conveted %arg2 (for example, %arg0 might have uses that are not dominated by %arg2, or might have an alias relation with some other array in the program). In general, we need a pass analogous to "SSA-formation" which knows how to see through these to uncover an underlying tensor program. Also, add tanh_out_e2e.py/div_inplace_e2e.py and fix some bitrot in refjit.py which is my running example I'm trying to get working.	2021-03-19 10:34:50 -07:00
Aaron Arthurs	4fd9b4afb5	Import ATen conv2d conversion and test (#180 ) * Import ATen conv2d conversion and test This is a first attempt at expanding ATen-to-TCF conversion for the conv2d operator. Eventually, this will come in use when lowering a high-level conv-based model.	2021-03-12 17:21:16 -08:00
Sean Silva	58c7030104	Support multiple instances of a class in GlobalizeObjectGraph. This happens in practice with e.g. ResNet from torchvision (multiple instances of the same BatchNorm class). The key observation is that for this program, and the expected set of programs, we can convert the program to the same globalized form with a bit more static analysis and effort to suitably monomorphize the program. Though what we are doing here is fairly annoying to implement, it saves any nontrivial later pass from having to do similar analyses (or worse). E.g. shape inference would need to be object-graph aware, mutation/lifetime analyses would have to be aware, etc. Additionally, it would make us front-load what it means to have a !torch.nn.Module type on an ABI boundary, which we are just not ready to handle. I'm really, really hoping that in practice we can get away with this, otherwise it's going to be really rough designing a representation (and implementing everything to back it) that is convenient to transform and gracefully scales from full object graph (in the most dynamic case) down to a fixed set of global slots like we have here (in the most static case, which we presume a lot of practical programs fall into). This also involved introducing a `torch-prepare-for-globalize-object-graph` pass that does a minimal set of lowerings to simplify the IR into a more orthogonal and analyzable form, and a `torch-globalize-pipeline` helper. Recommended review order: - updated documentation in Passes.td - new tests in `globalize-object-graph-multiple-instances*.mlir` - implementation of GlobalizeObjectGraph.cpp - PrepareForGlobalizeObjectGraph.cpp + prepare-for-globalize-object-graph.mlir - misc stuff like torch-globalize-pipeline pipeline definition. With this, we can import, globalize, and inline resnet18 from torchvision: https://gist.github.com/silvasean/821586afc19b67d9fb72030b2e0adeb8	2021-03-11 19:21:07 -08:00
Sean Silva	2750d2084c	Add prim::device and handle derefining for prim::CallMethod	2021-03-11 14:10:09 -08:00
Bairen Yi	5fed296904	Address missing default label in switch statement Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-11 11:55:59 -08:00
Bairen Yi	5315598947	Update .getAttrs to ->getAttrs as it is deprecated. Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-11 11:55:59 -08:00
Bryce Arden	e7a8fd76e2	[refbackrt] Update Invoke API to support more than just Tensor's (#181 )	2021-03-10 15:39:26 -08:00
Bairen Yi	53b01cb9ba	Bump llvm-project to e31c77b1827fa4dd3511f21af11cfab18ecf6d38 Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-10 11:01:16 -08:00
Sean Silva	43dba03afd	Properly model "derefinement". In terms of IR structure, TorchScript allows types to vary in many circumstances where MLIR requires pointer-identical types. In particular, it is valid to pass any subtype in place of a type. For example, if an `Optional[int]` is required somewhere in the IR, it is legal to pass a value of just `int` (but not the other way around; see `torch.prim.unchecked_cast`). In effect, every use can have a different type. We introduce a new op `torch.derefine` that models that impedance mismatch. This op allows casting a value from one type to a type that it is a subtype of to model this behavior. Recommended review order: - TorchOps.td for new torch.derefine (and updated docs for `torch.prim.unchecked_cast`) - new test code in if.py, loop.py, function-derefine.py - new code in node_importer.cpp for handling derefinement insertion - function_importer.cpp and utils changes in torch_to_mlir_utils.cpp Properly handling derefinement on function boundaries required relayering the code so that graph_importer.cpp/.h is now function_importer.cpp/.h because only the `torch::jit::Function` (actually the `c10::FunctionSchema` it holds) knows the derefined types that are actually needed at the boundary (see `function-derefine.py` for a test). Annoyingly, this churns all the functions which are now prefixed with `__torch__.` but that is more correct anyway (that is their linkage name in the `torch::jit::CompilationUnit`; the previous `mb.import_function` was actually buggy in the case of functions calling each other as it would reference their unqualified name). With this change, we can import `resnet18` from `torchvision` :) IR: https://gist.github.com/silvasean/6426a5272d8a6c7caae533fce05ab704	2021-03-03 15:09:44 -08:00
Sean Silva	939d36906f	Add support for prim::Loop op. This is a funny one. It combines a `for` and `while` loop in one op. We will need to write some conversions to `scf`.	2021-03-02 16:01:34 -08:00
Sean Silva	c837dbb077	Properly import the entire torch::jit::CompilationUnit This primarily unlocks proper handling of free functions (that is, functions that are not methods of any torch.nn.Module). Recommended review order: - `ivalue_importer.cpp` + `ivalue_import/functions*.py` - `GlobalizeObjectGraph.cpp` + test case - misc other stuff The `torch::jit::CompilationUnit` is basically a backing store or "context" holding all the possible functions in the program. The previous code was not explicitly accessing this data structure, since it just imported the `torch::jit::Function`'s that it saw attached to methods. Subtly, any time a TorchScript module called into a free function, the free function gets incorporated into the torch::jit::CompilationUnit, but doesn't show up anywhere when dumping the module, except in the curious pattern: ``` %5 : Function = prim::Constant[name="adaptive_avg_pool2d"]() %6 : Tensor = prim::CallFunction(%5, %input.1, %4) ``` That is, calls are indirect calls, and are accessed via `prim::Constant` materializing a function object. Even stranger, the `name` attribute here doesn't really even tell the full story -- it doesn't correspond to anything. It turns out that the c10::FunctionType itself actually holds a pointer to the `torch::jit::Function` in the compilation unit directly (so there is actually no indirection in prim::CallMethod, because any two values of the same FunctionType call the same function!). E.g. when converting the IR to bytecode, the "name" is ignored [code link](`1d6bd15790/torch/csrc/jit/runtime/interpreter.cpp (L937)`). We do import `prim::CallFunction` as a `std.call_indirect` though because it's more braindead to do it that way (it gets canonicalized to a direct call easily).	2021-03-01 12:08:01 -08:00
Sean Silva	79a3f639bf	Give torch.global_slot an initializer region. This is a much simpler representation than the ad-hoc initializer function we had before. It is also less general, but given the rationale in Passes.td it seems like the right tradeoff right now. We can probably carry this representation for quite a while, and when we can't, it likely means that TorchScript has fixed their object identity bug and we probably need to just upgrade to a more general object graph modeling (more general than GlobalizeObjectGraph). In particular, we don't want to deal with defining and carrying around this initializer function concept until we need it. For example, if we want to constant-fold the global slots into uses, this is a much better representation, and it plays better with symbol-dce (the initializer function counts as a "use" of the symbol). (the alternative would have been to write a pass that converts the initializer function to this form when possible, but I realized that lots of information had been lost which made that fairly annoying -- it was all self-inflicted anyway, so best to just go to the source (GlobalizeObjectGraph) before the information is lost) Now symbol-dce works nicely (no more "training" bools) ``` pt_util ~/tmp/classifier.pt --import --exported-name forward \ \| npcomp-opt -torch-globalize-object-graph -inline -symbol-dce ``` IR: https://gist.github.com/silvasean/8abe63d70d24e29d6db9170ccc8d512b	2021-02-26 16:24:19 -08:00
Sean Silva	a375ccf9da	Add ability to annotate TorchScript classes. The first use case is to annotate certain program constructs as either exported or private. In this commit we plumb it down to GlobalizeObjectGraph which makes use of this information. Recommended review order: 1. class_annotator.h/.cpp + `test/module_import/annotations/*` - New abstractions to communicate with Python code and annotate. 2. IR changes in TorchOps.td - Adding "private" attribute to various things. 3. ivalue_import.cpp changes - Module + ClassAnnotator = annotated IR 4. GlobalizeObjectGraph.cpp + tests - use new "private" attributes to create "private" IR. - also, tweak some of the op deleting mechanics, which was triggering some memory errors / assertions With this, we can run the classifier through and inline it as follows: ``` frontends/pytorch/utils/pt_util.py --import --exported-name forward ~/tmp/classifier.pt \ \| npcomp-opt -torch-globalize-object-graph -inline ``` IR: https://gist.github.com/silvasean/32dcad9f6270557f412094a77cecdd69	2021-02-25 11:28:34 -08:00
Sean Silva	c424c24ed8	Bump llvm-project to c68d2895a1f4019b387c69d1e5eec31b0eb5e7b0 - dialect registration - StringAttr::get: order of context arg - math dialect - LogicalResult nodiscard - error message for invalid broadcast	2021-02-22 12:23:24 -08:00
Sean Silva	8486968925	Add trivial inliner interfaces. With this + manually setting private visibility on everything, a simple classifier can be reduced to this IR, which is looking pretty lean and mean: https://gist.github.com/silvasean/19e7e2e21a61ff197aeac0dd864d188f Also, include a utility script for importing `.pt` models. ``` pt_util.py --import classifier.pt \| npcomp-opt -torch-globalize-object-graph ```	2021-02-22 10:40:38 -08:00
Sean Silva	1b769f7841	Extend GlobalizeObjectGraph to handle torch.prim.GetAttr returning NnModuleType This happens in practice. With this, we can globalize slots for the non-trivial classifier layer obtained from https://github.com/NVIDIA/NeMo/blob/main/tutorials/nlp/Joint_Intent_and_Slot_Classification.ipynb This also adds support for tuple return types, which were needed by that model.	2021-02-19 10:23:25 -08:00
Sean Silva	158c5c484d	Implement GlobalizeObjectGraph transformation. This required restructuring of how we model TorchScript on import. The main difference is that now we split out a `torch.class_type` that holds methods and declarations of the types of each slot. This is more consistent with TorchScript (our previous representation was "denormalized"). Recommended reading order: 1. check out the description of `torch.class_type` in `TorchOps.td` and look at `test/Dialect/Torch/ops.mlir` and `frontends/pytorch/test/module_import/` to familiarize with the new representation. - Just look at the new IR. The diff between the old names and new names is confusing. 2. check out `test/Dialect/Torch/globalize-object-graph*.mlir` and read along with the pass description in `include/npcomp/Dialect/Torch/Transforms/Passes.td` 3. Read the code in `GlobalizeObjectGraph.cpp` and miscellaneous changes in `ivalue_importer.cpp`, `TorchOps.cpp`, etc.	2021-02-18 18:18:47 -08:00
Sean Silva	7f7bf39551	Add prim::Print and fix prim::CallMethod For now, we are treating strings as bytes.	2021-02-10 15:15:56 -08:00
Aaron J Arthurs	484fe0d9bd	Reformat code	2021-01-28 12:01:35 -08:00
Aaron J Arthurs	c0e14da888	Fix TensorFromElementsOp reference	2021-01-28 12:01:35 -08:00
Aaron J Arthurs	fc650c9447	Import TCP pad	2021-01-28 12:01:35 -08:00
Sean Silva	689b40c7a6	Add initial TorchScript module importer It turns out that this was easiest to structure as a general IValue importer, since torch module are just one of the possible IValue's. We import the IValue object graph in a braindead fashion into basicpy ops and a new `torch.nn_module` op that is used to model the attributes/methods of a torch::jit::Module IValue. See `Torch/ops.mlir` for an example, and also check out the .py import tests in `frontends/pytorch/test/module_import`. As part of this change, a few housekeeping tasks: - extract some helpers from graph_importer.cpp - more helpers around the C API - misc touchups	2021-01-28 11:55:17 -08:00
Sean Silva	1965ac4d67	NFC: mark some methods as `override` This silences some warnings I was seeing locally.	2021-01-21 11:48:41 -08:00
Sean Silva	3f4161635c	Bump llvm-project to be7352c00d51f4358db3a23ed6a077f7cb48eafd - TensorFromElementsOp -> tensor::FromElementsOp - `cmpi "eq", ...` -> `cmpi eq, ...`. Same for `cmpf` - syntax change for private func ops - some changes to the python bindings	2021-01-21 11:16:55 -08:00
Sean Silva	6351474382	Bump llvm-project to bc556e5685c0f97e79fb7b3c6f15cc5062db8e36 - `let typeDesription` -> `let description` - LLVMIntegerType -> IntegerType	2021-01-08 14:18:09 -08:00
Stella Laurenzo	3f706473fd	NFC: Delete npcomp python API and switch to upstream. * Most updates are mechanical except: * python/npcomp/__init__.py and python/NpcompModule.cpp: New init/registration bits to replace some automatic things being done in the old bindings. Also an annoying linkage hack that I'll need to triage next. * NpcompModule.cpp: New python helpers for custom types and other hard to reach items (for the new bindings). * PybindUtils.h: Extended type casting so that the local extension can directly exchange Mlir* C types. * python/npcomp/dialects/: Build support and ODS bindings for local dialects. mlir_utils.py: Defines an ImportContext to replace the old/bad "Helper" class that tracked locations, and insertion points. This has a number of methods on it that would be good candidates to think about better ways to do them upstream. * Also hoisted a few stand-alone samples to dedicated unit tests as they covered important things. * More cleanup can be done, but keeping this patch as mechanical as possible to stay in NFC land (this is big enough).	2021-01-08 10:46:24 -08:00
Sean Silva	97d6d04d41	Bump llvm-project to 16c6e9c58e9ae50a775945e6b407f1891f353d2f Changes: - linalg init tensor change (outs+init -> just outs) - IntegerType::get and other builtin types now take the context as the first arg - LLVMType::* is gone. Now LLVM Types are just regular Type's.	2021-01-05 16:12:11 -08:00
powderluv	4237172bbf	Fix OSX builds. (#143 ) --version_script doesn't work on OSX. Shared libs are .dylibs on OSX. TEST=Build on iMac Pro. M1 has other issues will be fixed later Change-Id: I2bda46349a878b8265e273c05d8db6b46c0df633	2020-12-28 01:30:45 -08:00
Aaron Arthurs	85898aaf10	Add TCF convolutional op with bias addition (#137 )	2020-12-15 12:53:12 -08:00
Sean Silva	d818043986	Bump llvm-project to d50d7c37a159802c89454a6c53c0ec2e7949d84a Fixes: - use `op->(method on Operation)` - update for MlirIdentifier in signature of mlirNamedAttributeGet	2020-12-14 14:30:51 -08:00
Sean Silva	b2077738ca	Bump llvm-project to 444822d77a7fea28aa49edf24533c987efa1b2ee Fixes: - renames StandardTypes -> BuiltinTypes - std.extract_element -> tensor.extract	2020-12-11 14:43:38 -08:00
Sean Silva	251aa6e435	Bump llvm-project to 774f1d3ffd458d6cb82d5039758ef1cf6370957f Date: Mon Nov 30 15:20:30 2020 -0800 Changes: - finalizing-bufferize is stricter now, and we need to pull in a DimOp bufferization which was previously working by happenstance. The offending DimOp's are actually created by the linalg bufferization (which creates dim ops on the original tensor values, not the converted memrefs), so the fix is moving std-bufferize after linalg-bufferize.	2020-11-30 18:40:13 -08:00
Sean Silva	f9b32a99fc	Bump llvm-project to 164410324d8bf3b5a99e39f7dfe3c6d6972dab30 Date: Mon Nov 30 12:44:35 2020 -0800 Fixes: - func-bufferize is no longer finalizing, so we need to add finalizing-bufferize.	2020-11-30 13:58:13 -08:00
Sean Silva	955fd3eeda	Add some much-needed comments around refbackrt::invoke. This code is really tricky, and was not commented.	2020-11-25 15:39:41 -08:00
Sean Silva	46aa6d0a24	[RefBackend] Fix leaks related to ABI boundaries. Best as I can tell (e.g. from LeakSanitizer), this fixes all the leaks except for those due to buffers created internally to the codegenned code itself (up next I'll add the buffer deallocation pass to fix those). The main change is that instead of attempting to pass `refbackrt::Tensor` to the codegenned function directly, we make all the ABI types be UnrankedMemRef which gets passed awkwardly (but workably) as a `{size_t rank, void ptrToDescriptor}` on the ABI. The reason why refbackrt::Tensor wasn't workable is that is that MLIR doesn't really have a way to deal with the lifetime of unranked memref descriptors that happen inside the function, which is inevitably what would happen in the old code that would emit runtime calls to `refbackrt.to_memref/refbackrt.from_memref` to convert back and forth to `refbackrt::Tensor` inside the codegenned code. So, instead of the `refbackrt.to_memref/refbackrt.from_memref` with no real sound basis for valid lifetime management, we now have a lovely piece of code in `refbackrt::invoke` in `Runtime.cpp` that just barely seems to be sound. We rely on the codegenned code having these properties, which it seems to have: - it won't free memref descriptors or their backing buffer for arguments of UnrankedMemRef type. - it will allocate a separate memref descriptor for each result UnrankedMemRef (which is ensured by having a separate memref_cast for each) - we can sniff the `allocatedPtr`'s (i.e. the backing buffer pointers) to avoid double-freeing in the case of aliasing of the backing buffer (including backing buffers for arguments feeding into results) - to catch the case of statically allocated data (which we need to avoid passing to `free`) , check if the `allocatedPtr` is (no joke) equal to `0xDEADBEEF`, because there is otherwise no way to distinguish statically allocated from malloc'ed data... (std.global_memref lowering to LLVM by happenstance sets the allocatedPtr equal to `0xDEADBEEF`, presumably mainly as a debugging thing) Even with all this, we still* need to (internally to refbackrt::invoke) make copies of all inputs/outputs! And the details of how the LLVM-level ABI gets laid out for e.g. function arguments/returns is still super tricky. This really highlights how deficient memref is as the general runtime type for our use case. It's stewing in my mind how best to improve the situation. My general gut feeling is that IREE's abstractions for this are "right", but I need to think more how to distill those aspects of IREE's design in a "reference" way for RefBackend. Some implementation notes: - In terms of how this is implemented, this did catch a bug in our ABI wrapper functions in LowerToLLVM.cpp, which I had to fix (it happened to work before through some combination of npcomprt::Tensor being passed as a single pointer + probably me infinite-monkey-ing it until it worked) - This actually removes 2 out of the 3 compiler runtime functions (the only one left is "abort_if". (most of the memref descriptor code moved from CopmilerRuntime.cpp to Runtime.cpp) - this also means deleting `refbackrt.from_memref` and `refbackrt.to_memref`	2020-11-25 13:09:58 -08:00
Stella Laurenzo	3937dd14cb	Add basicpy.numeric_constant op. * Going through TODOs on the PyTorch side, this is a big cause of them (not being able to have constants for signed/unsigned). * Added complex while in here since we're at the phase where it is better to just have things complete than partially done.	2020-11-24 16:44:40 -08:00
Stella Laurenzo	bea0af419d	NFC: Prefactor some basicpy ops in advance of more type work. * Organizes the BasicPyOps.td file by function. * Renamed `to_boolean` -> `as_predicate_value` (trying to consistently use "predicate" to refer to i1/low-level types and Bool/Boolean to refer to Python bool types).	2020-11-24 15:49:37 -08:00
Sean Silva	0b7c443256	[RefBackend] Properly initialize refbackrt::Tensor refcount. Although `refCount` is initialized as `std::atomic<int> refCount{0};` in the definition of Tensor, our tail-allocating malloc would ignore it, resulting in bogus values that led to leaks. Caught with LeakSanitizer, but I added an assertion that the refcount is non-negative to begin with, which should catch this bug in the future fairly consistently (assuming the garbage refcount is negative half the time).	2020-11-24 12:01:35 -08:00
Stella Laurenzo	78a3c90758	Add TorchScript graph importer. * Does not handle all features yet but should conservatively fail on unsupported things. * Location tracking is still somewhat mismatched between what TorchScript and MLIR do. Likely need a better heuristic for tracking locations from defs for nodes that do not carry location. * Sets the ground-work for a specialized/generic split but only implements the generic side. * Had some evidence that this requires a recent bump of PT nightly (within the last month) to pick up pybind11 2.6, which includes some cross-module symbol fixes (vs the previously sync'd version). No source changes, but older versions fail to cast function types at runtime.	2020-11-23 14:20:09 -08:00
Stella Laurenzo	f03225b1f1	Bump llvm-project to f4f8a67aaf13bc66a2b7d55561b14a3724a5e0de. * Incorporates source fixes. * Uses upstream pybind11 detection logic. * Patches CI. * This may break the CI, which will need to be fixed manually in a followup.	2020-11-22 13:14:44 -08:00
Sean Silva	1dfcfa9cd1	Add aten.mm op and "test" it e2e. Note that unlike aten.matmul which has dynamic behavior depending on the argument ranks (can do matrix-matrix, matrix-vector, batch matmul, etc.), aten.mm is just a vanilla matrix multiply, which can be lowered precisely to tcf.matmul. The "test" is really just an example that I stared at while getting my feet wet with this. We probably want something that actually tests this as part of `ninja check-npcomp`.	2020-11-20 17:21:24 -08:00
Sean Silva	32b2dc6ce7	Revert "Bump llvm-project to 369c51a74b5327464e27e0749ca7ac59ac1349ce" This reverts commit `c60d7b4aae`. It seems to have tickled some sort of pybind version issue: https://github.com/llvm/mlir-npcomp/runs/1433414550?check_suite_focus=true	2020-11-20 15:09:18 -08:00
Sean Silva	c60d7b4aae	Bump llvm-project to 369c51a74b5327464e27e0749ca7ac59ac1349ce	2020-11-20 13:03:24 -08:00
Sean Silva	64a7e83184	[RefBackend] Add refback-tcf-to-tcp-pipeline This allows invoking TCF to TCP-level conversion more easily, and starts us towards a path of factoring it out of the RefBackend.	2020-11-17 12:33:37 -08:00
Sean Silva	358159a6eb	[RefBackend] Open-code shape.get_extent as extract_element It was annoying that we were creating shape.get_extent in the middle of the bufferization pipeline, as it required running convert-shape-to-std at an awkward place. To make that cleaner, just open-code the extract_element ops that shape.get_extent expands into. This is a little gross, but it helps with the macroscopic pipeline ordering issues. Anyway, the train is long-gone of trying to treat shapes as some special data type that should only be operated on with shape ops. Also, - reorder tensor constant bufferize (which is a module pass) to bracket all the bufferization function passes, to make the parallelism opportunities there clearer. Now we have a very clean little bufferization segment of our pipeline construction.	2020-11-17 11:00:38 -08:00
Stella Laurenzo	a7ff87a922	Sever C++ level depend on IREE and rebase on exe and python interface. * IREE doesn't have proper install support, so there is some temporary hoaky hacking in our CMakeLists.txt to shuttle some symlinks around. * Reworked the original numpy e2e with IREE test to pipe through iree-translate. * Removed all of the C++-level dependencies. * Will generalize and apply to the PyTorch backend in a followup.	2020-11-16 21:32:56 -08:00
Sean Silva	5227d52c26	[RefBackend] Use std.global_memref instead of homegrown thing This vastly simplifies our code, allowing deleting multiple ops, simplifying multiple passes, and removing a whole pass. Now `refback` dialect is down to one op (refback.alloc_memref, which simplifies allocations to just take a shape instead of individual extents).	2020-11-13 18:43:50 -08:00
Sean Silva	32388d938b	Make some passes run on FuncOp so they can run in parallel.	2020-11-13 16:12:18 -08:00
Stella Laurenzo	b4c7ae1e0c	Repurpose numpy-compiler compiler/runtime flow for PyTorch. * A bit gross because I took the chance to upgrade all of the backend bits to the new MLIR Python bindings and we still co-mingle the old and new for now. * Since the Python created PassManagers are configured for explicit nesting, I had to upgrade some of the pass pipelines to be explicit. * The demo in mul_maximum_e2e.py now compiles, runs through PyTorch and through the JIT, prints and asserts the same results. * I am not claiming that this is the prettiest API in this patch: consider that this is just directly using low-level APIs and there should be an intervening high level API.	2020-11-11 10:38:13 -08:00
Sean Silva	1c7c362e29	[TCP] Replace tcp.matmul with linalg.matmul. This involved adding a `tcp.splatted` op to splat a dynamically sized init tensor. See rationale in TCPOps.td docs. One interesting observation is that when lowering tcf.matmul to linalg.matmul, we need to both 1) create the error checks and 2) calculate a shape transfer function to create the init tensors. Previously, 2) was deferred to bufferizing tcp.matmul later. I'm not sure if this is a conflation of concerns or not. For now, it's not a big burden.	2020-11-10 18:58:28 -08:00
Sean Silva	0427aacb0b	[TCP] Replace elementwise ops with std elementwise ops.	2020-11-10 18:58:28 -08:00
Stella Laurenzo	e60dc2470e	Add aten.maximum op and conversions from aten->tcf. * Conversions are very simple, suporting mul, maximum and add (alpha=1 only). * Example added with pass pipeline needed to run. * Much missing off of the golden path but sufficient for such simple cases.	2020-11-04 17:20:54 -08:00
Stella Laurenzo	6c702b149f	Add a number of kernels and new patterns. * convolution, convolution_backward, _log_softmax, _log_softmax_backward_data, nll_loss_forward, nll_loss_backward, nll_loss2d_forward, nll_loss2d_backward, copy_ * Extends the recognition logic and metadata for handling inplace transformations, optional tensors, ints, lists and dropped args. * The kernel_calls generated by test_conv_nllloss_grads.py now convert to ATen. * The result almost comes out as a pure tensor program with the exception of the copy_ op, which I will do some followup work to deal with. * More progress on #97	2020-11-04 14:36:59 -08:00
Sean Silva	57e58b9272	[RefBackend] Use upstream func-bufferize pass. Now, the only bufferization we have left is lowering tensor constants to memref, which will hopefully proceed soon after Rahul's new std.global_memref lands + the lowering to LLVM IR. Then I'll port LowerConstantTensorsToMemref to upstream and we'll be 100% upstream bufferization, except for our local TCP dialect (which will probably go away and be replaced by std elementwise + linalg named ops on tensors :) ).	2020-11-02 17:38:33 -08:00
Sean Silva	1874bf5eb1	NFC: Clean up some minor nits - Remove GreedyPatternRewriteDriver.h from files that don't need it - fix typo shouldBeCloned -> wouldBeCloned	2020-10-30 18:48:25 -07:00
Sean Silva	f9c2f8eb0d	[RefBackend] Use upstream SCF bufferization pass.	2020-10-30 18:12:41 -07:00
Stella Laurenzo	a3f4db9fe8	Bump llvm-project to c8c07b76b2cf2ada8e7ec132f7f57b97d76743cf. * Several NFC changes to signatures/includes.	2020-10-29 15:25:55 -07:00
Stella Laurenzo	c08935a418	Rewrite ATen ODS code generator to be based on new op registry and new signature recognition system. * Deletes prior code generator from previous attempt (moved some of it into this one). * Renames old generated tablegen source to "Legacy". * Generates ODS and import rules for most binary and unary arithmetic ops. * Removes old generated ops and integration tests that were testing details of the prior setup.	2020-10-28 10:37:37 -07:00
Aaron J Arthurs	94ea6f7c92	[RefBackend] Support element-wise multiply op Register the following for the multiply op: - tcf.mul - tcp.mul - TCP->TCP lowering - Shape transfer, broadcasted multiplicands - Lower to standard `MulFOp` op	2020-10-27 19:41:23 -07:00
Stella Laurenzo	510f226df2	Expose signature metadata to ops and implement ATenRecognizeKernelsPass pass. * Two op interfaces, one for querying instance metadata and one for getting static data needed to construct an op from a generic form. * For torch.generic_kernel ops, metadata is splatted in during capture from Torch (it comes from the op registry, which will work for either device capture or graph import). * Moved the 'add' out of the generated set so I can experiment on it. It implements the TorchBuildableKernelOpInterface interface which provides its metadata. * The ATenRecognizeKernelsPass pass generically lowers from a torch.generic_kernel to recognized ops that implement the TorchBuildableKernelOpInterface, handling the various types of transformations that we allow at this stage.	2020-10-26 20:31:45 -07:00
Mehdi Amini	f3c75d957b	Add missing dependency on NPCOMPCAPI from NPCOMPPythonCommon Fix linker error: lib/Python/libNPCOMPPythonCommon.a(MlirInit.cpp.o): in function `mlir::npcomp::python::npcompMlirInitialize()': mlir-npcomp/build/../lib/Python/MlirInit.cpp:46: undefined reference to `npcompInitializeLLVMCodegen'	2020-10-22 22:44:18 -07:00
Stella Laurenzo	91fc83d2e7	NFC: Transition ATen passes to tablegen registration.	2020-10-22 17:12:44 -07:00
Stella Laurenzo	9618c2dbf7	NFC: Re-organize ATen directory structure and fix warnings. * Still some more work to do on the Transforms tree to bring it in line with the others (will do that as I add things).	2020-10-22 14:13:26 -07:00
Sean Silva	14470f9ff6	[RefBackend] Use upstream std bufferization. It now subsumes the one we had.	2020-10-21 16:46:56 -07:00
Sean Silva	b6ae53b312	[RefBackend] Use new upstream SCF type conversions.	2020-10-21 16:46:56 -07:00
Stella Laurenzo	fe5ceed18d	NFC: Format a file that had not been.	2020-10-21 12:47:12 -07:00
Stella Laurenzo	029815152e	Add remaining pieces to capture full example models. * Adds Basicpy List, Tuple, Dict types and plumbs through C API. * Started debugging the issues around aten::conv2d capture, but a PyTorch bug is suspected. * Was able to manually verify that the basic conv2d forward test captures correctly with a workaround. * Need to resolve some printing issues upstream and move these tests to an integration test target (they take ~seconds to run).	2020-10-19 22:16:59 -07:00
Stella Laurenzo	9e52f6235b	More progress on PyTorch acap device capture. * Now gets far enough to capture batch_norm. * Has some issues still with in-place ops. * Can materialize constants. * Includes an upgrade to PyTorch nightly, which has important bug fixes for fallback and boxed kernel dispatch. * Fixes #78, #79, #80. * Will do more testing in a follow-up once further bugs are fixed that facilitate getting at the other features.	2020-10-15 21:43:21 -07:00
Sean Silva	06a8ba6900	[RefBackend] Use more idiomatic bufferize pattern for TCP. The time has come for BypassShapes/LowerShapedResultsToMemref to go away :( For the reference backend, being consistent with upstream conventions is the name of the game now. This is a step down in a number of ways, e.g. test clarity and separation of concerns. But it is fewer files and fewer tests, and does address the "TODO: This is really fragile". It also eliminates two more ops from the refback dialect (sadly, they are the shaped_results/yield that we were getting kind of fond of, but alas).	2020-10-15 20:15:53 -07:00
Sean Silva	b6bdc8cc4f	[RefBackend] Use upstream BufferizeTypeConverter Now that it has grown source/target materialization capabilities (spelled with ops tensor_load/tensor_to_memref), we can use it. We can also now delete refback.memref_to_tensor/refback.tensor_to_memref. This is also a first step to reducing the downstream functionality needed in the refback dialect.	2020-10-15 15:58:51 -07:00
Sean Silva	f2d5c26c97	Bump llvm-project to 820e65f9e2369d2990fde4b3e7cfceb64f0df9c8 Date: Mon Oct 12 11:26:50 2020 -0700	2020-10-12 13:30:22 -07:00
Sean Silva	93fc21dad0	[RefBackend] Split out TCF->TCP conversion. Now the reference backend is cleanly accepts "TCP"+scalar ops. We introduce tcf-refback-lowering-pipeline which also does TCF->TCP conversion for convenience until we have a "target interface".	2020-10-12 11:56:39 -07:00
Stella Laurenzo	af4edb63ae	Start reworking towards a shared library build. * Need to have a dag of shared library deps in order to interop across python extensions (as presented in ODM). * Introduced add_npcomp_library and friends to mirror the MLIR setup. * Adds a libNPCOMP.so shared library. * Redirects tools and extensions to link against libNPCOMP.so (instead of static libs). * Moves all libraries to lib/, all binaries to bin/ and all python extensions to python/. The invariant is that the rpaths are setup to have a one level directory structure. * Reworks the _torch_mlir extension to build like the others (still need to come up with a consolidated rule to do this instead of open coded). * Includes an upstream version bump to pick up needed changes. Sizes with dynamic linking (stripped, release, asserts enabled): libNPCOMP.so: 43M (includes much of the underlying LLVM codegen deps) libMLIR.so: 31M _npcomp.so: 1.6M (python extension) _torch_mlir.so: 670K (python extension) npcomp-capi-ir-test: 6.3K npcomp-opt: 351K npcomp-run-mlir: 461K mnist-playground: 530K Still more can be done to normalize and optimize but this gets us structurally to the starting point.	2020-10-09 16:02:58 -07:00
Sean Silva	631c8070df	[RefBackend] Put JITModule in refback namsepace.	2020-10-08 09:07:00 -07:00
Sean Silva	7edb5f3641	[RefBackend] Rename RefBackend dialect to Refback I now realize that VerboseCamelCase is not the best choice for dialect directory/file names and C++ identifiers (take e.g. "Linalg", "Basicpy", etc. as prior art here; not LinearAlgebra or BasicPython). If I had to name the convention it seems to be "Shortword" (or of course just acronym dialects like LLVM, SCF, etc.). This rename also has the side benefit of differentiating RefBackend directories, which now refer to the actual backend itself, from Refback/Refbackrt, which are the dialects which happen to be used by that backend.	2020-10-08 09:07:00 -07:00
Sean Silva	bf99a82832	[RefBackend] Rename Npcomprt dialect to Refbackrt.	2020-10-08 09:07:00 -07:00
Sean Silva	83ad70ef54	[RefBackend] Move runtime related code under npcomp/RefBackend/ Other than the dialect definitions (which will live in standard Dialect/ subdirectory), the goal here is to keep RefBackend-related code nested in {include/npcomp,lib,test}/RefBackend.	2020-10-08 09:07:00 -07:00
Sean Silva	21255d5f8e	[RefBackend] Rename "E2E" to RefBackend.	2020-10-07 10:29:48 -07:00
Sean Silva	03846ed8e7	Rename a couple CMake targets. NPCOMPFoo to NPCOMPFooDialect for consistency with others.	2020-10-07 10:29:48 -07:00
Sean Silva	5017430dc7	[RefBackend] Split out RefBackend (refback) dialect from TCP. This is the first in a patch series that is refactoring the constellation of things variously called or associated with "E2E", "RefE2E", "npcomprt", and "TCP" into a more cleanly layered result. Concretely, this first patch fixes the fact that TCP was basically acting like a dumping ground needed by the reference backend. This splits it out, which is fairly mechanical, but touches a lot of lines of code (basically replacing `tcp` with `refback` and `TCP` with `RefBackend). Now, the RefBackend dialect is that dumping ground, which is slighly better, as it starts allowing TCP to become a nice clean middle layer that is not related per se to the reference backend. The previous name RefE2E or "reference e2e flow" was super confusing. Now that we are seeing more clearly where the "backend" distinction lies, the [RefBackend] commit tag is born :)	2020-10-07 10:29:48 -07:00
Stella Laurenzo	58b6033537	Bump llvm to ed46e84c7aaffd847656ac559acb06089096ec33. * Minor change of MLIRStandardOps -> MLIRStandard	2020-10-06 22:02:57 -07:00
Sean Silva	8022dfaf1a	[RefE2E] Initialize the linalg matmul accumulator buffer. I was seeing some miscompiles due to the uninitialized data read here before. Interestingly, this was masked in some of our previous test cases, since the uninitialized data "always" was so small that it would present as a rounding error for the 1.0-10.0 sized values that the matmul was computing on.	2020-10-02 16:24:52 -07:00
Stella Laurenzo	e5433e314f	Add capture function arguments. * Adds at::Tensor -> MlirValue tracking. * Adds conversions for tensor and scalar types to MLIR types. * Adds npcomp C APIs for constructing custom types. * Reworks pybind include so as to get Torch pybind helpers (needed to pass at::Tensor type from Python->C++).	2020-10-01 18:59:58 -07:00
Stella Laurenzo	3d74337be0	Add a torch.kernel_call op and associated predicates.	2020-09-29 15:10:38 -07:00
Stella Laurenzo	ba03ecc652	Add public API for constructing a module/function to capture PyTorch ops. * Uses the MLIR-C API since that will save us a lot of grief down the road (i.e. will give PyTorch and libMLIR/libNPCOMP the ability to skew version-wise). * Quite a few TODOs and not yet populating the function in any way.	2020-09-29 14:23:22 -07:00
Stella Laurenzo	2c9ca79c89	Add boilerplate for Torch dialect.	2020-09-28 15:26:17 -07:00

... 12 13 14 15 16 ...

1470 Commits (ec6d7aa5d28f110aa5b893e16e502e6198988801)