torch-mlir

Commit Graph

Author	SHA1	Message	Date
Sean Silva	c4123d4d4d	Add npcomp-verify-backend-contract pass. This pass verifies that a given module satisfies the contract that we have for backends. This is phrased as an "allowlist", because we want to keep this interface tight. Also, this gives much better diagnostics than a backend randomly crashing or failing to compile would (though they could still be improved). This was especially painful because if we had `tensor<?x!numpy.any_dtype>` slip through, at some point RefBackend would convert it to a memref type and trip the "verify type invariants" assertion which gives no location or anything and crashed the process, which was very unpleasant. We implement this with the dialect conversion framework, which works reasonably well and was quick to put together and familiar, but is still very "op oriented". We probably want to make this hand-rolled eventually, especially the error reporting (the most useful kind of error for a dialect conversion user is not necessarily the best for this use case). Also, in production, these error will go to users, and need to be surfaced carefully such as "the compiler needs a type annotation on this function parameter" which in general requires some special analysis, wordsmithing, and overall awareness of the e2e use case (such as how much we can lean into certain source locations) to provide a meaningful user-level diagnostic. Also, add `inline` to the current frontend lowering pass pipeline to allow slightly more complicated programs that otherwise would fail on shape inference.	2021-04-20 12:00:35 -07:00
Sean Silva	f5dfa02523	Add `aten.mm` to linalg lowering. This is our first op with error semantics, and stresses the system. There are a few design notes of special interest: - RefineTypes.cpp's note about shape inference in the presence of code that dynamically produces and error, and it is provable statically. - ATenToLinalg.cpp's notes about future automation of the ATen->linalg path. - The notes in Passes.td about using low-tech `std.assert` ops instead of `shape.assuming`. Note: Doesn't work on IREE yet due to the `std.assert` op (needs to be lowered to `vm.fail` on the IREE side).	2021-04-16 12:03:31 -07:00
Sean Silva	28a0f02746	Add support for compiling through IREE. Recommended review order: - Changes in frontends/pytorch/examples/ - Changes in python/npcomp/compiler/pytorch/backend/ - Boilerplate for the `npcomp-iree-backend-lower-linkage` pass. This change separates out a `npcomp.compiler.pytorch.backend.frontend_lowering` module that does the common lowering for all backends. The individual compiler backends `npcomp.compiler.pytorch.backend.{refjit,iree}` now accept a loosely defined "TCP + scalar code" IR mix that will be formalized in the future as the interface to codegen backends. This also required adding a small pass `npcomp-iree-backend-lower-linkage` which adds `iree.module.export` onto functions, and layering that into the frontend flow. The pass doesn't require a C++-level dependency on IREE, which is nice for now. TBD how we are going to handle lists (we hope we can get away with sneakerneting some td files and relying on loose IR compatibility). Running through IREE requires the ability to import `iree.compiler` and `iree.runtime`, which can be obtained as follows: ``` python3 -m pip install iree-compiler-snapshot iree-runtime-snapshot -f https://github.com/google/iree/releases/tag/snapshot-20210406.200 PYTHONPATH="${PYTHONPATH}:${MY_IREE_BUILD}/bindings/python/" ``` This patch makes it painfully clear that we don't have any e2e testing harness to really plug into, and also don't have a usable Python API to our compiler stack (something usable in a jupyter notebook). That will be addressed in subsequent commits. We've been flying by the seat of our pants with this `examples` directory that isn't subject to any kind of testing or real usability concerns.	2021-04-09 13:15:07 -07:00
Aaron J Arthurs	f9d9518f6e	Declare TCP dialect dependency in TCFToTCP conversion	2021-04-07 14:23:56 -07:00
Sean Silva	2ab62aec12	MILESTONE: TorchScript unary tanh runs on RefBackend This revamps the TORCH_TO_TCF_PASSES to reflect the new layering that we are doing in the compiler. See comments there for the layering. Also adds `frontends/pytorch/examples/torchscript_tanh_e2e.py` as an "example". E2E testing story TBD (want to get IREE working first).	2021-04-07 11:06:34 -07:00
Sean Silva	927546b3c5	Add RefinePublicReturn pass. This pass allows shape information to be propagated to return types, which is nontrivial and cannot be cleanly put anywhere else as it changes the public ABI, which is a concern that we want to keep concentrated in one place.	2021-04-07 11:06:34 -07:00
Sean Silva	1e357ae680	Add simple type refinement pass. Currently implemented as a simple intraprocedural dataflow analysis over a standard ShapedType lattice (hasRank, sizes, and elementType). It currently hardcodes a few key pieces of information: - shape transfer functions - whether it is legal to update the operand type of an op This needs to be made pluggable obviously and the core propagation logic moved somewhere agnostic.	2021-04-07 11:06:34 -07:00
Sean Silva	6431b0f11f	Add primitive ArrayToTensor (numpy-array-to-tensor) pass. The current implementation is just sufficient to do a unary aten.tanh from the e2e spike, and just applies some local rewrite patterns. I've sketched out the more full explanation of where this pass eventually need to go in the pass docs. Adding this required adding `numpy.tensor_static_info_cast`, which is the tensor analog of `numpy.static_info_cast`. This op encapsulates the same numpy-specific "no runtime code" casting semantics, in particular the interpretation of `!numpy.any_dtype`. The `numpy.tensor_static_info_cast` I see in practice now are "information erasing" and will be removed by a later pass that exploits the fact that aten ops are agnostic to the static info in the operand types (so substituting a type with more static info is fine). Side note: we need to do dtype and rank inference before aten->tcf (which will eventually mostly be aten->linalg+guards), because each aten op is idiosyncratically overloaded based on dtype and rank. Without copying that idiosyncratic overloading into lower layers (layering violation), we cannot really lower it to anything until we do that.	2021-04-05 17:56:35 -07:00
Sean Silva	30356c41c8	Add torch-adjust-calling-conventions pass. This pass incorporates torch.type_bound info and also removes NoneType returns (eventually it will rewrite tuple types too, but can't yet because !basicpy.TupleType doesn't track element types). Recommend looking at adjust-calling-conventions.mlir first to see what it is doing, and holding your nose for the implementation of the pass. I decided to implement this with the conversion framework, because it gives us some goodies for type conversion -- mainly avoiding large amounts of tricky RAUW dances. Unfortunately, the conversion framework isn't a perfect fit for a couple reasons: - the incorporation of torch.type_bound is a context-sensitive rewrite (requires looking at the arg attr, not just the type). - NoneType conversion is 1->0, which requires some special handling - (not implemented yet) 1->N tuple type conversions require special handling. It's a little bit scary, but on balance doing it the other way would have its own downsides.	2021-04-05 17:56:35 -07:00
Sean Silva	464feacba9	Bump llvm-project to 223dcdcfbe23affdf17ada7f023ee1872fd76160 - ModuleOp no longer has a terminator.	2021-04-05 17:56:35 -07:00
Sean Silva	3f9760dc33	Add communication channels to README	2021-04-02 13:00:47 -07:00
Sean Silva	c3f1f8ebf4	[cleanup] Put the root class type for exportPath first. This is more consistent and intuitive -- usually the object being "indexed" or used as a "context" for a later parameter goes first.	2021-04-01 18:40:03 -07:00
Sean Silva	e749074bae	Basic infra for annotate shapes and dtypes on arguments. These allow users to annotate a known "type bound" on the argument, which can seed shape/dtype inference. We don't rewrite the function types as part of the import process (it will happen in a yet-to-be-written pass) because: 1. We would need to interprocedurally rewrite all calls to keep the IR consistent. Currently, we have a place after GlobalizeObjectGraph but before we convert to tensors where this is convenient to do. Ideally, we would do this on the object graph representation. 1. We don't necessarily know that adjusting the function type is a legal calling convention change. The pass will have blessed knowledge (by the pass pipeline author) that adjusting the argument type based on the type bound is safe (which it frequently is). 2. Note that in principle, a type bound could be a fairly general thing (such as maximum sizes of dimensions, unions of multiple concrete types, etc.). The pass will in principle have logic to interpret the type bounds and to determine a suitable "best" (and legal) argument type.	2021-04-01 18:40:03 -07:00
Sean Silva	7a4043b7c4	Add ability to compile from object graph ir.	2021-03-31 09:25:13 -07:00
Sean Silva	c6d56fed8a	Add unary tanh lowering.	2021-03-30 16:39:49 -07:00
Sean Silva	b0ac04001d	Update README.	2021-03-30 11:33:33 -07:00
Sean Silva	641098be54	Clean up some compiler warnings on my machine.	2021-03-23 14:29:05 -07:00
Sean Silva	99178a167d	Bump llvm-project to 0524a09cc7e1a0797982feacf505825231efbee7 - renames of OwningRewritePatternList -> RewritePatternSet - also `insert` to `add` - RewritePatternSet holds a context now - memref dialect split from std	2021-03-23 14:29:05 -07:00
Bryce Arden	4591884d06	[refbackrt] Scalar arg support * Adds f32 scalar argument support across the ABI boundary. * Adds support for passing input type / shape information across the ABI boundary * Adds support for parsing / creating input FloatAttr's in `npcomp-run-mlir`	2021-03-23 13:16:44 -07:00
Sean Silva	703428eff4	Add support for "trailing_" and "out" variants of various ops. We already had the `promoteTrailingOutTensor` flag, but weren't using it. A inplaceVariantKernelName flag needed to be added. This change is a little dissatisfying, as the conversions done by the RecognizeKernelsPass are currently non-orthogonal. In particular, `kDropResultAndAliasArg0` probably won't work as intended if mixed with these (we probably need to promote kDropResultAndAliasArg0 to not be an arg-level thing anyway, as we have done with promoteTrailingOutTensor). This involved adding a new op `numpy.overwrite_array`. ``` numpy.overwrite_array %arg2 overwrites %arg0 : tensor<2x3xf32>, !numpy.ndarray<[2,3]:f32> ``` This models the destructive update behavior. Note that in the above op, we cannot simply RAUW %arg0 with a suitably conveted %arg2 (for example, %arg0 might have uses that are not dominated by %arg2, or might have an alias relation with some other array in the program). In general, we need a pass analogous to "SSA-formation" which knows how to see through these to uncover an underlying tensor program. Also, add tanh_out_e2e.py/div_inplace_e2e.py and fix some bitrot in refjit.py which is my running example I'm trying to get working.	2021-03-19 10:34:50 -07:00
Sean Silva	a53ed850bd	Fix signature of unboxed aten::arange for torch HEAD	2021-03-18 17:53:52 -07:00
Bairen Yi	19b9398aee	Revert "Skip torchvision 0.9.0 as it is incompatible with torch nightly" This reverts commit `e7b96ebefc`.	2021-03-16 19:37:45 -07:00
Bairen Yi	fead0312f1	Revert "Also fallback autograd dispatch keys for torchvision::nms" This reverts commit `30a42dea32`.	2021-03-16 19:37:45 -07:00
Sean Silva	ba482cbb72	Generate Conv2d definition. We should generally be using torch_signature_ods_gen.py for generating these. Somehow this one slipped through manually. There is no `aten::conv2d_overridable` in the op registry AFAICT so I removed that alias.	2021-03-16 12:39:28 -07:00
Sean Silva	c607efa205	Make ATenOpRegistrations.txt dump more readable. Also add `is_write` field.	2021-03-16 12:39:28 -07:00
Bairen Yi	30a42dea32	Also fallback autograd dispatch keys for torchvision::nms Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-15 17:58:08 -07:00
Bairen Yi	e7b96ebefc	Skip torchvision 0.9.0 as it is incompatible with torch nightly torchvision nightly has not bump to 0.10.0 alpha, so pip installs torchvision==0.9.0 even with the --pre flag. Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-15 17:58:08 -07:00
Sean Silva	4cf8aef5d6	Add roadmap doc.	2021-03-15 14:43:51 -07:00
Aaron Arthurs	4fd9b4afb5	Import ATen conv2d conversion and test (#180 ) * Import ATen conv2d conversion and test This is a first attempt at expanding ATen-to-TCF conversion for the conv2d operator. Eventually, this will come in use when lowering a high-level conv-based model.	2021-03-12 17:21:16 -08:00
Sean Silva	58c7030104	Support multiple instances of a class in GlobalizeObjectGraph. This happens in practice with e.g. ResNet from torchvision (multiple instances of the same BatchNorm class). The key observation is that for this program, and the expected set of programs, we can convert the program to the same globalized form with a bit more static analysis and effort to suitably monomorphize the program. Though what we are doing here is fairly annoying to implement, it saves any nontrivial later pass from having to do similar analyses (or worse). E.g. shape inference would need to be object-graph aware, mutation/lifetime analyses would have to be aware, etc. Additionally, it would make us front-load what it means to have a !torch.nn.Module type on an ABI boundary, which we are just not ready to handle. I'm really, really hoping that in practice we can get away with this, otherwise it's going to be really rough designing a representation (and implementing everything to back it) that is convenient to transform and gracefully scales from full object graph (in the most dynamic case) down to a fixed set of global slots like we have here (in the most static case, which we presume a lot of practical programs fall into). This also involved introducing a `torch-prepare-for-globalize-object-graph` pass that does a minimal set of lowerings to simplify the IR into a more orthogonal and analyzable form, and a `torch-globalize-pipeline` helper. Recommended review order: - updated documentation in Passes.td - new tests in `globalize-object-graph-multiple-instances*.mlir` - implementation of GlobalizeObjectGraph.cpp - PrepareForGlobalizeObjectGraph.cpp + prepare-for-globalize-object-graph.mlir - misc stuff like torch-globalize-pipeline pipeline definition. With this, we can import, globalize, and inline resnet18 from torchvision: https://gist.github.com/silvasean/821586afc19b67d9fb72030b2e0adeb8	2021-03-11 19:21:07 -08:00
Sean Silva	2750d2084c	Add prim::device and handle derefining for prim::CallMethod	2021-03-11 14:10:09 -08:00
Sean Silva	572d198b68	Refactor prim node imports.	2021-03-11 14:10:09 -08:00
Sean Silva	01b8a01e1b	prim::dtype op	2021-03-11 14:10:09 -08:00
Bairen Yi	5fed296904	Address missing default label in switch statement Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-11 11:55:59 -08:00
Bairen Yi	5315598947	Update .getAttrs to ->getAttrs as it is deprecated. Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-11 11:55:59 -08:00
Bryce Arden	e7a8fd76e2	[refbackrt] Update Invoke API to support more than just Tensor's (#181 )	2021-03-10 15:39:26 -08:00
Bairen Yi	8f9d4f917d	Add LLVM_LINK_LLVM_DYLIB=ON and remove LLVM_ENABLE_LLD=ON when building LLVM in GitHub CI So CI build options are closer to those in `build_tools/install_mlir.sh`. Also append hash of CI spec file to LLVM commit hash when caching builds. Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-10 11:01:16 -08:00
Bairen Yi	53b01cb9ba	Bump llvm-project to e31c77b1827fa4dd3511f21af11cfab18ecf6d38 Signed-off-by: Bairen Yi <yibairen.byron@bytedance.com>	2021-03-10 11:01:16 -08:00
stephenneuendorffer	06373dcbbb	Add install options for npcomp libraries and executables (#183 )	2021-03-10 07:18:54 -08:00
Bryce Arden	b94a859e03	[torch] Add import support for IValue string Type(s) (#179 ) * [torch] Add import support for IValue string Type(s) * [test] Add test for Strings import	2021-03-04 13:08:50 -08:00
Sean Silva	a36113e586	Fix recent break due to PyTorch changes. Tracing seems now now capture a 4-operand version of aten::add instead of 3-operand. I fixed the tests that made sense. One test was XFAIL'ed, as I don't have in cache the exact way to fix it yet (requires touching aten-recogniz-kernels stuff). I'll be context switching to work on the kernel recognition stuff soon, and will fix it then.	2021-03-03 18:35:23 -08:00
Sean Silva	43dba03afd	Properly model "derefinement". In terms of IR structure, TorchScript allows types to vary in many circumstances where MLIR requires pointer-identical types. In particular, it is valid to pass any subtype in place of a type. For example, if an `Optional[int]` is required somewhere in the IR, it is legal to pass a value of just `int` (but not the other way around; see `torch.prim.unchecked_cast`). In effect, every use can have a different type. We introduce a new op `torch.derefine` that models that impedance mismatch. This op allows casting a value from one type to a type that it is a subtype of to model this behavior. Recommended review order: - TorchOps.td for new torch.derefine (and updated docs for `torch.prim.unchecked_cast`) - new test code in if.py, loop.py, function-derefine.py - new code in node_importer.cpp for handling derefinement insertion - function_importer.cpp and utils changes in torch_to_mlir_utils.cpp Properly handling derefinement on function boundaries required relayering the code so that graph_importer.cpp/.h is now function_importer.cpp/.h because only the `torch::jit::Function` (actually the `c10::FunctionSchema` it holds) knows the derefined types that are actually needed at the boundary (see `function-derefine.py` for a test). Annoyingly, this churns all the functions which are now prefixed with `__torch__.` but that is more correct anyway (that is their linkage name in the `torch::jit::CompilationUnit`; the previous `mb.import_function` was actually buggy in the case of functions calling each other as it would reference their unqualified name). With this change, we can import `resnet18` from `torchvision` :) IR: https://gist.github.com/silvasean/6426a5272d8a6c7caae533fce05ab704	2021-03-03 15:09:44 -08:00
Bryce Arden	1736ff0253	[prim] Add TupleIndex support I could not find a corresponding ListIndex in prim, which seems to translate to a __get_attr__ under the hood. I think the reason a tuple Index op can exist is because Tuple's are supposed to be frozen, where List operands can be mutable.	2021-03-02 17:28:32 -08:00
Bryce Arden	68338eafb7	[chore] Make variable names in prim.py more clear	2021-03-02 17:28:32 -08:00
Bryce Arden	ca3a02da28	[prim] Add support for List\|TupleUnpack	2021-03-02 17:28:32 -08:00
Sean Silva	df4c5764da	Add support for `prim::unchecked_cast`. This arises when casting optionals, which happens a lot especially around handling of default arguments (python `if arg is None` idiom). In this case, the offending code for the model is in max_pool2d: [code link](`b3bf08e67f/torch/nn/functional.py (L657)`)	2021-03-02 16:01:34 -08:00
Sean Silva	939d36906f	Add support for prim::Loop op. This is a funny one. It combines a `for` and `while` loop in one op. We will need to write some conversions to `scf`.	2021-03-02 16:01:34 -08:00
Sean Silva	7dfd6f697e	Add support for prim::RaiseException. Used by resnet18. It seems to originate from a helper `_verify_batch_size`: [code link](`b3bf08e67f/torch/nn/functional.py (L2099)`). I couldn't find a way to test `prim::RaiseException` without also having `prim::Uninitialized`.	2021-03-02 16:01:34 -08:00
Yi Zhang	7bb3b2eb6e	Fix the import path in python samples	2021-03-02 13:40:08 -08:00
Sean Silva	c837dbb077	Properly import the entire torch::jit::CompilationUnit This primarily unlocks proper handling of free functions (that is, functions that are not methods of any torch.nn.Module). Recommended review order: - `ivalue_importer.cpp` + `ivalue_import/functions*.py` - `GlobalizeObjectGraph.cpp` + test case - misc other stuff The `torch::jit::CompilationUnit` is basically a backing store or "context" holding all the possible functions in the program. The previous code was not explicitly accessing this data structure, since it just imported the `torch::jit::Function`'s that it saw attached to methods. Subtly, any time a TorchScript module called into a free function, the free function gets incorporated into the torch::jit::CompilationUnit, but doesn't show up anywhere when dumping the module, except in the curious pattern: ``` %5 : Function = prim::Constant[name="adaptive_avg_pool2d"]() %6 : Tensor = prim::CallFunction(%5, %input.1, %4) ``` That is, calls are indirect calls, and are accessed via `prim::Constant` materializing a function object. Even stranger, the `name` attribute here doesn't really even tell the full story -- it doesn't correspond to anything. It turns out that the c10::FunctionType itself actually holds a pointer to the `torch::jit::Function` in the compilation unit directly (so there is actually no indirection in prim::CallMethod, because any two values of the same FunctionType call the same function!). E.g. when converting the IR to bytecode, the "name" is ignored [code link](`1d6bd15790/torch/csrc/jit/runtime/interpreter.cpp (L937)`). We do import `prim::CallFunction` as a `std.call_indirect` though because it's more braindead to do it that way (it gets canonicalized to a direct call easily).	2021-03-01 12:08:01 -08:00

... 36 37 38 39 40 ...

2314 Commits (d50d3aa5e77117fbb7078c25831ea2913a1c5566) All Branches Search

2314 Commits (d50d3aa5e77117fbb7078c25831ea2913a1c5566)

All Branches