torch-mlir

Commit Graph

Author	SHA1	Message	Date
Stella Laurenzo	ec611c1e6f	Misc fixes for MacOS. (#255) * Change aligned_alloc -> malloc. It can fail (and does on MacOS) and is a bit over-aggressive optimization for a reference backend. * Fixed a fragile test that prints -0.0 on MacOS. * Fail the test (not the framework) on failure to trace (Torch on MacOS is missing features). * Fix .so -> .dylib for compiler runtime.	2021-07-27 17:48:47 -07:00
Stella Laurenzo	2dbab50444	Rework the python build to a static assembly of MLIR+NPCOMP (#251) * Adapt to python build system updates. * Bump llvm to 310c9496d80961188e8d8f8ad306cdf44bd7541f (includes python build updates) * Adds refback C-API. * Re-layers all python builds. * Rework CI.	2021-07-27 16:10:10 -07:00
Stella Laurenzo	2ecbcbf8c7	Bump llvm-project to a085c23aa3c8f91866d7f4588d4f683407dc775d. (#250) * Added additional ToLLVM conversion patterns (they were disaggregated from standard). Misc renames. * Spelling change on ConvNCHW op, and it now expects strides and dilations attributes.	2021-07-23 14:13:19 -07:00
Yi Zhang	89d4931324	Linalg lowering for aten.conv2d and aten.AdaptiveAvgPool2d 1. Add m_TorchConstantIntList 2. Lowering for aten.conv2d 3. Lowering aten.AdaptiveAvgPool2d	2021-07-09 15:04:29 -07:00
Yi Zhang	5f1b2ba323	Bump LLVM version to 7c35aae35b2c386b59af58c56ed36908f3d68371	2021-07-09 10:44:44 -07:00
Sean Silva	83b5b5456d	Bump llvm-project to da289a174fc6617c7be37be2947480510fd4f02a - Build adjustments for `.cpp.inc` dialect files. - Renaming of `memref.dim` to `tensor.dim` for tensor case. Minor changes: - Renaming of `mlir::linalg::ReassociationIndices` to `mlir::ReassociationIndices`. - Adjust command line option parsing in npcomp-run-mlir.	2021-07-07 13:57:29 -07:00
Sean Silva	ef118eb1e1	Add E2E tests to CI This includes IREE and RefBackend. This includes a fixup to torchscript_e2e_test.sh for handling the situation where PYTHONPATH was not already exported.	2021-07-02 13:46:38 -07:00
Sean Silva	30400d5492	Pin PyTorch version in the CI. I'm seeing the following error: ``` CMake Error in frontends/pytorch/csrc/CMakeLists.txt: Imported target "torch" includes non-existent path "/usr/local/include/breakpad" in its INTERFACE_INCLUDE_DIRECTORIES. ``` Reported upstream in: https://github.com/pytorch/pytorch/issues/60485	2021-07-02 11:15:27 -07:00
Sean Silva	c289d83407	Fix README.md Use the new `tools/torchscript_e2e_test.sh`. Also, fix a few whitespace/comment issues.	2021-07-02 10:57:21 -07:00
Sean Silva	d5108b9dc1	Add IREE support in TorchScript e2e tests. - Add support for "expected failures" in test reporting. The new error reports look like [this](https://gist.github.com/silvasean/6ffd95e1d55302b699673da201da210d). - We will now be able to put these tests into CI, since the harness understands which tests are expected to pass and fail. - Refactor RefBackendTestConfig to NpcompBackendTestConfig which supports both RefBackend and IREE. - Add instructions for installing IREE dependencies (both from packages and for local builds of IREE) - Add `tools/torchscript_e2e_test.sh` for invoking the e2e test harness (this makes invoking a bit easier, as it doesn't rely on a loose Python invocation).	2021-06-30 16:19:25 -07:00
Sean Silva	79928cd2dd	Generalize support for elementwise ops. We plumb through e2e a fair number of interesting cases: - unary, binary, ternary elementwise ops - ops like `torch.aten.add.Tensor` that also take a scalar parameter - static size-1 broadcasting We allow the static size-1 broadcasting case, but emit a runtime error in the case of dynamic size-1 broadcasting. This seems like a sweet spot subset of things that can be lowered directly to linalg, while not being overly constraining to users. This is consistent with what IREE is doing for CHLO->Linalg lowering as well ([code](`50bf7a87e4/iree/compiler/InputConversion/MHLO/BroadcastingToLinalgPatterns.cpp (L1)`)). To test the static size-1 case, we added support for the `torch.aten.unsqueeze` op and lowering for it through `linalg.tensor_expand_shape`. This involved a generalization of `MaximizeValueSemantics` able to handle it (the solution there also works for `torch.aten.flatten.using_ints` which we need for ResNet anyway) Also, a few minor additional changes: - Add `VerifyInvariantsBeforeBackendLowering` pass, which catches a large class of errors before we get to backend lowering (now that we are doing dialect conversion, the errors are way nicer if we just emit them up front rather than in the guts of a random pattern). - Minor change to RefBackend to allow `linalg.tensor_expand_shape`. Recommended review order: - e2e tests in elementwise.py - `ConvertElementwiseOp` in TorchToLinalg.cpp + elementwise.mlir test - `ConvertAtenUnsqueezeOp` in TorchToLinalg.cpp + unsqueeze.mlir test - RefineTypes.cpp + tests - MaximizeValueSemantics changes + test - VerifyInvariantsBeforeBackendLowering pass + test	2021-06-28 13:28:38 -07:00
Sean Silva	577bf1600a	Undo CI pinning. The underlying issue seems to be resolved now: https://github.com/pytorch/pytorch/issues/60485	2021-06-28 11:01:36 -07:00
Sean Silva	49b5b7272b	Handle rank-0 annotations properly.	2021-06-23 12:24:51 -07:00
Sean Silva	145d4ae23c	Bump llvm-project to a37cf17834d39411ed1d669098b428f8374c5b45 Changes: - Change to operand ordering of `linalg.fill`.	2021-06-23 10:03:29 -07:00
Sean Silva	90c6c64fd6	Make torch.constant.float print a little nicer. This printing is chosen to be similar to how MLIR prints the values by default.	2021-06-23 08:07:45 -07:00
Sean Silva	60a947b4a7	Add CastOpInterface to torch.prim.unchecked_cast. This allows it to fold away in trivial cases.	2021-06-23 08:07:45 -07:00
Yi Zhang	45f2edfc7a	Add TorchToSCF pass. 1. Add TorchToSCF pass. 2. Convert prim.If and prim.If.yield.	2021-06-23 08:06:43 -07:00
Yi Zhang	5ad144c4fe	More folding for aten.gt.int, aten.ne.int and Aten__Getitem__TOp. - Fold more for aten.gt.int, aten.ne.int and Aten__Getitem__TOp - Some format cleaning up	2021-06-23 08:06:37 -07:00
Sean Silva	79aade33da	Make MaximizeValueSemantics a bit smarter. This adds a pattern to MaximizeValueSemantics which does a simple abstract interpretation within a block, which handles simple cases of `torch.overwrite_tensor`, enough to remove all the unnecessary uses of non-value tensors in ResNet right now. Before/after IR: [gist](https://gist.github.com/silvasean/a3e1ef625b19dfc63579f73cd3b543b6) Also, - Split `torch.copy.tensor` into `torch.copy.to_tensor` and `torch.copy.to_vtensor` which convert between value and non-value semantic tensors. This is a much cleaner factorization as they have very separate use cases and properties (e.g. different side effects) - Remove the various canonicalization patterns they had, which were confusing because they resulted in limited forms of maximizing value semantics throughout the pipeline. We should structure our compilation pipeline such that only MaximizeValueSemantics should be maximizing value semantics. - Adjust pass pipeline to only run MaximizeValueSemantics once. - Make OverwriteTensorOp `$value` always be a value tensor and `$overwritten` be a non-value tensor.	2021-06-22 16:48:57 -07:00
Yi Zhang	6dddb4d4fe	Add torch.aten.batch_norm Linalg lowering support 1. Added a simplified version of torch.aten.batch_norm which only handles inference and assumes the weight, bias, running_mean, running_var are not None. 2. Removed the primitive types check in verifyLinalgCompatibleTypes check since now we have proper type converter to handle torch types conversion. The checks for RankedTensorType is kept because the type converter doesn't guarantee the converted builtin tensor type is ranked. A separate verification pass to verify the invariant expected by later passes will need to be added before those can be removed as well.	2021-06-22 16:45:21 -07:00
Sean Silva	bbd749620e	Try again to pin the CI to a working PyTorch version. For some reason, pytorch_nightly was being installed for the LLVM build, and so the wrong line got updated in the previous attempt.	2021-06-22 15:04:49 -07:00
Sean Silva	f7ebd870f6	Pin torch to a specific version in the CI. This temporarily works around the CMake error: ``` CMake Error in frontends/pytorch/csrc/CMakeLists.txt: Imported target "torch" includes non-existent path "/pytorch/torch/lib" in its INTERFACE_INCLUDE_DIRECTORIES. ```	2021-06-22 13:11:48 -07:00
Yi Zhang	e6adecac83	Convert Torch constant ops to std.constant	2021-06-18 12:22:47 -07:00
Sean Silva	78d2cc0818	Make `torch.copy.tensor` canonicalization a bit smarter. This removes most of the trivial cases that MaximizeValueSemantics needs to handle, making it easier to see the nontrivial cases.	2021-06-17 18:11:58 -07:00
Sean Silva	40369c54dc	Adjust pass pipeline for changes to `dim` canonicalization. This results in cleaner IR. In particular, Mlp2LayerModule e2e test has a dim op that is eliminated by this change: https://gist.github.com/silvasean/734f11a291ae6236c955f65cffae285f	2021-06-17 16:59:55 -07:00
Sean Silva	1bc889130d	Bump llvm-project to 116841c623747972d0ae80239d3ea7b8409b868b This brings in a change to canonicalization of `dim` ops, which we need to adjust our pass pipeline for.	2021-06-17 16:59:55 -07:00
Sean Silva	333e07a74e	Add `torch.vtensor.literal` op. This op is much better behaved than the `torch.tensor.literal` op (which is the new name of the `torch.tensor` op). In particular, `torch.vtensor.literal`: - always has a maximally refined type. - always has value semantics. - can be constant folded / CSE'd. ReduceOpVariants is changed to perform the transformation from `torch.tensor.literal` to `torch.vtensor.literal` (which in general involves static information casts and copies). This new op also allowed tightening up `torch.tensor.literal` to only accept NonValueTensorType (instead of any tensor type). This new ".literal" name is more descriptive. It was getting too confusing seeing an op called just `torch.tensor` (we originally called it that because that's the name of the similar function in the Torch Python API, but it just doesn't fit here).	2021-06-17 14:37:04 -07:00
Sean Silva	4a0eb44d17	Add a !torch.float type. This removes the dependence of the `torch` dialect on the low-level builtin types. Now the `torch` dialect is a standalone layer, suitable for targeting from higher-level Python abstractions without any premature lowering to primitive types.	2021-06-17 09:24:18 -07:00
Sean Silva	f49ebf1690	Add `!torch.int` type. This replaces the ad-hoc use of `i64` throughout the Torch layer, and helps to keep it crystal clear the distinction between `!torch.int` (which is modeling the Python `int` type) and the various types that serve as dtypes of tensors, which are a totally different type universe. Changes: - `!torch.int` type and C bindings. - Change `torch.constant.int` parser to not need the `: i64` at the end. - `m_TorchConstantInt` matcher to aid with matching constants. - BackendTypeConversion changes for `!torch.int` -> `i64` type conversion. - Refactor finalizing patterns in FinalizingBackendTypeConversionPass (they were getting very repetitive). - Mechanical rewriting of `!torch.int` to `i64` in all the tests, and `AnyTorchIntType` to `Torch_IntType` in the `.td` files.	2021-06-17 07:28:23 -07:00
Sean Silva	224afb186e	Add folders for torch.aten.gt.int / torch.aten.ne.int This fixes a "regression" on ResNet where we weren't folding away all the control flow. For now, our policy is to "optimize hard enough" to make that control flow go away, because we don't yet have a way to lower to the backend the stuff guarded by the control flow (RaiseException, string operations, etc.). It remains to be seen how much optimization we decide to do at this level in the fullness of time -- the torch op set is not particularly well-designed (at least not idiomatically for MLIR) for general optimization. Ideally, with really good backend support for various features, all the heavy optimization will happen at that layer on `std` ops and `scf` control flow. But I have a suspicion we might end up needing more optimization earlier in the pipeline.	2021-06-16 14:04:31 -07:00
Sean Silva	8860b5c55d	Add `torch.prim.If` This removes the use of `scf.if`, which required laundering back and forth between `i1` and `!torch.bool` in the frontend. We will eventually lower this op to `scf.if`, but this results in a cleaner IR and layering at the frontend.	2021-06-16 14:04:31 -07:00
Sean Silva	784156a998	Add `!torch.bool` type. This finishes removing the dependence on the basicpy dialect! Changes: - Add `!torch.bool` type and replace use of `!basicpy.BoolType` in Torch-related code. - Rename BuiltinTensorize to BackendTypeConversion since now it handles bool conversions (and, when we add !torch.int and !torch.float, it will handle those as well), and generalize the related utilities (I also moved them to Torch/Transforms since they aren't really part of Torch/IR). - Add `torch.to_i1` and `torch.from_i1` ops for materializations - [cleanup] Reorganize `torch.constant.*` ops in TorchOps.td - Remove dependency of `torch` dialect on `basicpy` dialect and also `std` dialect. For `std`, we use some call related ops, but the `torch` dialect itself never produces them (we have passes that do though). This is fairly mechanical. Recommended review order: - New stuff in Torch/IR - New BuiltinTypeConversion files. - Mechanical fixups elsewhere.	2021-06-16 13:22:00 -07:00
Yi Zhang	7b7c9c5d3d	Add aten.relu Linalg lowering support	2021-06-16 08:18:14 -07:00
Sean Silva	3ccf6002af	Add `torch.constant.int` and `torch.constant.float`. - This removes reliance on basicpy.numeric_constant. - Also, add OpAsmOpInterface to the `torch.constant.none` and `torch.constant.str` ops.	2021-06-15 15:29:42 -07:00
Sean Silva	2e850ecb72	Add !torch.str type. - Remove dependence on `!basicpy.BytesType`. - Add `torch.constant.str "s"` analogous to `torch.constant.none`.	2021-06-15 10:10:59 -07:00
Sean Silva	31c15cab2b	Add 2021Q3 roadmap. This also restructures the docs to a "roadmap" directory, to preserve previous roadmaps / allow retrospective "grading" of how we did.	2021-06-15 10:05:25 -07:00
Sean Silva	92ee0fa98f	Add `!torch.tuple<T1, T2>` type. This further eliminates the need for the `basicpy` dependency. This required adding `torch.prim.TupleConstruct` to replace `basicpy.build_tuple`.	2021-06-15 08:15:22 -07:00
Sean Silva	ea1dd1cd90	Remove a few more comments I missed in the last commit.	2021-06-14 18:18:43 -07:00
Sean Silva	6b2424512b	Make C API files more consistent - Make consistent with MLIR Core - Use `//` or `///` comments. - Use `bool` type for booleans - No duplicated comments in .cpp files - Split types into separate files `{Basicpy,Numpy,Torch}Types.h` - Add dialect prefix consistently to C API symbols. We have lots of similarly named types (e.g. "list" type in basicpy and torch).	2021-06-14 15:34:43 -07:00
Sean Silva	db282fd1b4	Introduce native `!torch.none` type. - Add `torch.constant.none` op to construct it (naming is chosen to be analogous to Torch's representation of a prim::Constant with NoneType, rather than using the "singleton" terminology of Basicpy).	2021-06-14 13:30:58 -07:00
Sean Silva	6b293b695d	Use new "MLIR_ENABLE_BINDINGS_PYTHON" in the CI.	2021-06-10 18:06:46 -07:00
Sean Silva	81bcd7fb12	Move Torch type implementation code into TorchTypes.cpp	2021-06-10 16:46:47 -07:00
Sean Silva	0b6516c7cc	Bump llvm-project to cbd0054b9eb17ec48f0702e3828209646c8f5ebd Changes: - MLIR_BINDINGS_PYTHON_ENABLED -> MLIR_ENABLE_BINDINGS_PYTHON - canonicalizer constant insertion order - EDSC is gone now	2021-06-10 16:26:45 -07:00
Yi Zhang	e0ff5248fb	Add TorchList type and prim::ListConstruct #218	2021-06-10 14:31:35 -07:00
Sean Silva	370e3270ab	Introduce `!torch.tensor` / `!torch.vtensor` types. This removes our reliance on the numpy dialect and avoids our off-label use of the builtin tensor type for modeling unknown dtypes. The `!torch.vtensor` (`ValueTensorType`) type is a value-semantic tensor. The `!torch.tensor` (`NonValueTensorType`) type is a non-value-semantic tensor. The new types look as follows syntactically: ``` // Least-static-information, non-value-semantic tensor. !torch.tensor // Explicit form of least-static-information variant. !torch.tensor<*,unk> // Least-static-information, value-semantic tensor. !torch.vtensor // Explicit form of least-static-information variant. !torch.vtensor<*,unk> // Fixed-set of allowable element types, with first-class support for // Torch's frontend signedness semantics. !torch.tensor<*,si32> // First-class support for unknown dtypes. !torch.tensor<[?,?,?],unk> // Standard MLIR representation of `?` for unknown dimensions. !torch.tensor<[?,2,?,4],unk> // Statically shaped / dtyped example. !torch.vtensor<[1,2,3,4],f32> ``` This required fairly significant changes throughout the compiler, but overall it is a big cleanup. We now have a much clearer layering of "the Torch frontend lowering" vs "lowering to std + linalg + etc.". At the C++ level, there is `ValueTensorType`, `NonValueTensorType`. We also have a helper `BaseTensorType` (kind of like ShapedType) which interoperates with those two. Included changes: - New `torch.tensor(dense<0.0> : tensor<5xf32>) : !torch.tensor` op for creating torch tensor literals in the frontend. - Consistently use signedness for the types (except i1 which I didn't touch -- we need to sort out the situation with !basicpy.BoolType there anyway so will be attending to that soon) - Frontend can annotate whether an argument to the function has value semantics. We currently require this, as our backend contract does not currently allow us to even model the non-value-semantic case. Before, the value-semantic assumption was randomly injected in the middle of the pass pipeline. - Move ArrayToTensor (now called MaximizeValueSemantics) and RefinePublicReturn passes to torch dialect. - The TorchToStd and TorchToLinalg passes are now type conversions from `!torch.vtensor` to `tensor` and use the dialect conversion infra. The overall conversion pipeline is set up following the best practices of the "Type Conversions the Not-So-Hard Way" talk. This required introducing `torch-func-builtin-tensorize` and `torch-finalizing-builtin-tensorize` passes analogous to the upstream bufferization passes with the corresponding names (mostly just copypasta from there). - Misc Torch-level canonicalizations -- we now cleanly layer the lowering to std later in the pipeline, so we are gradually lessening our reliance on random std constant folding before we get to that point. Recommended review order: - New types in TorchTypes.td/TorchTypes.h/TorchDialect.cpp - New ops in TorchOps.td / TorchOps.cpp - Less important / more mechanical stuff - Frontend changes. - Pass changes/additions in `Torch/Transforms` and `Conversion/`	2021-06-10 10:56:48 -07:00
Sean Silva	b7b7fd4959	Rewrite error reporting of e2e tests. This now gives [much nicer output](https://gist.github.com/silvasean/f048e0f37b04542dae6469b86802bb3e). Embarrassingly, we previously couldn't even report failures for two different tests, and weren't able to report on compilation failures (besides just crashing).	2021-05-20 11:28:20 -07:00
Sean Silva	d66e8fe1f8	Get simple quantized model importing. This is enough to import the program and get it through the compilation pipeline. It of course fails at the VerifyBackendContract pass since there is a lot missing, but the final IR for a simple quantized MLP is looking pretty decent already: [IR](https://gist.github.com/silvasean/f76bccd76e9b193d396cfb2f9a11f54d) Main changes: - Add support for importing torch quantized tensors, including `torch.per_tensor_affine.create` op and `!torch.qint8` element type. - Add support for importing `LinearPackedParamsBase` (basically a weight + optional bias, but requires `torch.linear_params.create` op + `!torch.LinearParams` type to model it). This was less painful than I expected, as it has the necessary methods to opaquely unpack itself. I factored things so it should be easy to extend to other custom classes like `ConvPackedParamsBase`. - Add minimal boilerplate for importing `quantized::*` ops, with `quantized::linear` being a motivating example. - Add e2e test with simple quantized MLP (courtesy of @phoenix-meadowlark). This is somewhat of an abuse of `!numpy.ndarray` / `tensor`, as really the proper semantics of `!torch.qint8` dtype on a Torch tensor is "check the quantizer object of the tensor for side data (scale/offset, possibly per-channel) that defines the full semantics of the tensor". We don't have any such notion of "side data" for `!numpy.ndarray` / `tensor`, let alone anything that would have the associated behavior of keying off the dtype to determine if the side data is present. This will be fixed by a proper `!torch.tensor` type.	2021-05-20 11:28:20 -07:00
Sean Silva	0c89296075	Shore up error reporting for TorchScript import. This code was not exception safe -- it would leave an operation unattached to anything, which breaks MLIR's C++ data structure invariants (e.g. it cannot safely erase ops). Also, print out both the exception and any diagnostics, since they can both contain useful information.	2021-05-20 11:28:20 -07:00
Sean Silva	d50ea8d31e	Improve diagnostic handler It wasn't printing notes or putting the "error:" in front.	2021-05-20 11:28:20 -07:00
Sean Silva	2453805f7f	Bump llvm-project to 35454268cf93f5561439980d6baeb27a874a380c	2021-05-19 14:00:38 -07:00
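
Commit 79928cd2dd above describes a broadcasting policy for elementwise ops: static size-1 dimensions broadcast, while dynamic size-1 broadcasting produces a runtime error. The Python sketch below is one reading of that policy, not the compiler's implementation. The function name `elementwise_result_shape`, the use of `None` for a dynamic (`?`) dimension, and the equal-rank restriction are all assumptions made for illustration.

```python
from typing import List, Optional, Sequence

# Hypothetical modeling choice: None stands for a dynamic ("?") dimension.
Dim = Optional[int]


def elementwise_result_shape(
    static_shapes: Sequence[Sequence[Dim]],
    runtime_shapes: Sequence[Sequence[int]],
) -> List[int]:
    """Return the result shape, or raise if a dynamic dimension would need to broadcast."""
    rank = len(static_shapes[0])
    if any(len(shape) != rank for shape in static_shapes):
        raise ValueError("this sketch assumes operands of equal rank")

    result = []
    for i in range(rank):
        # Operands whose dimension is *statically* 1 broadcast freely,
        # so they place no constraint on the result size.
        sizes = {
            runtime[i]
            for static, runtime in zip(static_shapes, runtime_shapes)
            if static[i] != 1
        }
        # All remaining operands must agree at runtime. A dynamic dimension
        # that turns out to be 1 next to a larger one is rejected here.
        if len(sizes) > 1:
            raise RuntimeError(
                f"dimension {i}: mismatched sizes {sorted(sizes)}; "
                "dynamic size-1 broadcasting is not supported"
            )
        result.append(sizes.pop() if sizes else 1)
    return result
```

For example, operands typed `[?,1]` and `[?,4]` with runtime shapes `(3, 1)` and `(3, 4)` are accepted, whereas operands both typed `[?,4]` with runtime shapes `(1, 4)` and `(3, 4)` raise the error, because the size-1 dimension was not known statically.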

539 Commits (ec611c1e6f44eb5b49c658fd98740000935a1058)