torch-mlir

Commit Graph

Author	SHA1	Message	Date
Stella Laurenzo	6c702b149f	Add a number of kernels and new patterns. * convolution, convolution_backward, _log_softmax, _log_softmax_backward_data, nll_loss_forward, nll_loss_backward, nll_loss2d_forward, nll_loss2d_backward, copy_ * Extends the recognition logic and metadata for handling inplace transformations, optional tensors, ints, lists and dropped args. * The kernel_calls generated by test_conv_nllloss_grads.py now convert to ATen. * The result almost comes out as a pure tensor program with the exception of the copy_ op, which I will do some followup work to deal with. * More progress on #97	2020-11-04 14:36:59 -08:00
Sean Silva	3dab9056f0	Bump llvm-project to eb8d386d513bf4243d0adb814d862af25b8c4e2f Two changes: - no more "verifyPasses" constructor arg for PassManager - OpPassManager defaults to requiring explicit "nest" calls when created via the C++ API. The behavior upstream for mlir-opt still obeys the "implicit" mode, so I just slapped that onto all our pass managers. I pinged https://reviews.llvm.org/D90671 to get a signal for whether we are expected to migrate to explicit mode. If so, I'll do that too later.	2020-11-04 14:14:46 -08:00
Stella Laurenzo	59b7c559f4	Tweak build flags for efficiency and document building without a container. * Enables -gsplit-dwarf for both LLVM and NPCOMP, reducing the occurrence of the ~GB scale binaries. * CMake shared linking seems incompatible with this, so shared objects are still "too big" but there are few of them. * Reduces disk thrash on clean/install of everything.	2020-11-03 13:46:46 -08:00
Sean Silva	57e58b9272	[RefBackend] Use upstream func-bufferize pass. Now, the only bufferization we have left is lowering tensor constants to memref, which will hopefully proceed soon after Rahul's new std.global_memref lands + the lowering to LLVM IR. Then I'll port LowerConstantTensorsToMemref to upstream and we'll be 100% upstream bufferization, except for our local TCP dialect (which will probably go away and be replaced by std elementwise + linalg named ops on tensors :) ).	2020-11-02 17:38:33 -08:00
Sean Silva	94bee9ec23	Bump llvm-project to 773ad135a30dbe0f969086e3ed518ab17502e9f5	2020-11-02 17:38:33 -08:00
Harsh Menon	c2d3820e48	Fix insertion point bug #102 The current code was inserting all build_list ops after the last constant op since it was assuming that all elements being passed in were constants. This patch replaces that patch with a new function that inserts the build_list ops before the terminator. Also modifies test_export_conv2d_fwd.py since its output no longer matches. TEST: Added test_export_cat.py which is the code in #102	2020-11-02 16:41:26 -08:00
Stella Laurenzo	0c73c535d6	Capture backward conv and copy_ kernels. * This is sufficient to capture the forward and backward pass and gradients of a convolutional model with an nllloss. * As with the forward conv, the backward conv is a special case wrapped in an enigma on the PyTorch side. There aren't many like it, so special casing is just what we do. * When I traced this, I found that the copy_ op is not yet boxing compatible so I had to map it manually. If there are many more like this, I'll probably do something a bit more clever to reduce duplication. * This exposes new signature patterns that will need to be handled by the ATen lowering. Will take care of that next: It will be nice to have an e2e of a non-trivial case with full gradients. * Fixes #97.	2020-10-30 22:59:26 -07:00
Sean Silva	1874bf5eb1	NFC: Clean up some minor nits - Remove GreedyPatternRewriteDriver.h from files that don't need it - fix typo shouldBeCloned -> wouldBeCloned	2020-10-30 18:48:25 -07:00
Sean Silva	f9c2f8eb0d	[RefBackend] Use upstream SCF bufferization pass.	2020-10-30 18:12:41 -07:00
Sean Silva	0761df9f58	Bump llvm-project to 72ddd559b8aafef402091f8e192e025022e4ebef - Fixup to OpBuilderDAG - Update for affine map naming	2020-10-30 18:12:41 -07:00
Aaron J Arthurs	29c715b6b1	Add TCP mul test	2020-10-30 15:11:52 -07:00
Stella Laurenzo	8d98dd4551	Support optional args/returns and other odds and ends. * None's out Device? args. * Emits bool tensors if needed. * Adds some stderr tracing to better see what is going on. * Test case that exercises NLLLoss. * This test case emits something for backward calculations but there are some issues still to be worked out, so that part is left out of the test case. * Progress on #97	2020-10-30 14:50:28 -07:00
Stella Laurenzo	a3f4db9fe8	Bump llvm-project to c8c07b76b2cf2ada8e7ec132f7f57b97d76743cf. * Several NFC changes to signatures/includes.	2020-10-29 15:25:55 -07:00
Marius Brehler	30adf9e6b0	Fix TCP_MulOp tablegen definition	2020-10-28 19:28:15 +01:00
Stella Laurenzo	c08935a418	Rewrite ATen ODS code generator to be based on new op registry and new signature recognition system. * Deletes prior code generator from previous attempt (moved some of it into this one). * Renames old generated tablegen source to "Legacy". * Generates ODS and import rules for most binary and unary arithmetic ops. * Removes old generated ops and integration tests that were testing details of the prior setup.	2020-10-28 10:37:37 -07:00
Aaron J Arthurs	94ea6f7c92	[RefBackend] Support element-wise multiply op Register the following for the multiply op: - tcf.mul - tcp.mul - TCP->TCP lowering - Shape transfer, broadcasted multiplicands - Lower to standard `MulFOp` op	2020-10-27 19:41:23 -07:00
Stella Laurenzo	510f226df2	Expose signature metadata to ops and implement ATenRecognizeKernelsPass pass. * Two op interfaces, one for querying instance metadata and one for getting static data needed to construct an op from a generic form. * For torch.generic_kernel ops, metadata is splatted in during capture from Torch (it comes from the op registry, which will work for either device capture or graph import). * Moved the 'add' out of the generated set so I can experiment on it. It implements the TorchBuildableKernelOpInterface interface which provides its metadata. * The ATenRecognizeKernelsPass pass generically lowers from a torch.generic_kernel to recognized ops that implement the TorchBuildableKernelOpInterface, handling the various types of transformations that we allow at this stage.	2020-10-26 20:31:45 -07:00
Mehdi Amini	f3c75d957b	Add missing dependency on NPCOMPCAPI from NPCOMPPythonCommon Fix linker error: lib/Python/libNPCOMPPythonCommon.a(MlirInit.cpp.o): in function `mlir::npcomp::python::npcompMlirInitialize()': mlir-npcomp/build/../lib/Python/MlirInit.cpp:46: undefined reference to `npcompInitializeLLVMCodegen'	2020-10-22 22:44:18 -07:00
Stella Laurenzo	91fc83d2e7	NFC: Transition ATen passes to tablegen registration.	2020-10-22 17:12:44 -07:00
Stella Laurenzo	9618c2dbf7	NFC: Re-organize ATen directory structure and fix warnings. * Still some more work to do on the Transforms tree to bring it in line with the others (will do that as I add things).	2020-10-22 14:13:26 -07:00
Stella Laurenzo	d09300886a	NFC: Use new print with large_elements_limit in tests. * For tests with large constants, decreases issues with lit pipelines. * Bumps llvm-project to pick up the update.	2020-10-22 13:04:24 -07:00
Sean Silva	14470f9ff6	[RefBackend] Use upstream std bufferization. It now subsumes the one we had.	2020-10-21 16:46:56 -07:00
Sean Silva	b6ae53b312	[RefBackend] Use new upstream SCF type conversions.	2020-10-21 16:46:56 -07:00
Sean Silva	4f2aa12d1a	Bump llvm-project to 57b338c08a4942bda6e58c77870c657c53b6fb5b Date: Mon Oct 19 15:59:03 2020 -0700	2020-10-21 16:46:56 -07:00
Stella Laurenzo	fe5ceed18d	NFC: Format a file that had not been.	2020-10-21 12:47:12 -07:00
Stella Laurenzo	58adb6bd8e	Work around various PyTorch issues in support of convolution. * Enables the conv2d fwd test and ResA (which are both small). * Deletes resnet18 and vgg, which both run but generate output that crashes FileCheck and lit (or at least makes them take an eternity).	2020-10-21 12:44:31 -07:00
Stella Laurenzo	029815152e	Add remaining pieces to capture full example models. * Adds Basicpy List, Tuple, Dict types and plumbs through C API. * Started debugging the issues around aten::conv2d capture, but a PyTorch bug is suspected. * Was able to manually verify that the basic conv2d forward test captures correctly with a workaround. * Need to resolve some printing issues upstream and move these tests to an integration test target (they take ~seconds to run).	2020-10-19 22:16:59 -07:00
Sean Silva	81119aa0a1	Only build mnist-playground if Torch was found.	2020-10-19 15:43:01 -07:00
Marius Brehler	54c0a3a1e9	Replace deprecated add-path in GitHub Action The add-path command is deprecated and should be replaced as stated in https://github.blog/changelog/2020-10-01-github-actions-deprecating-set-env-and-add-path-commands/	2020-10-16 09:40:37 -07:00
Stella Laurenzo	9e52f6235b	More progress on PyTorch acap device capture. * Now gets far enough to capture batch_norm. * Has some issues still with in-place ops. * Can materialize constants. * Includes an upgrade to PyTorch nightly, which has important bug fixes for fallback and boxed kernel dispatch. * Fixes #78, #79, #80. * Will do more testing in a follow-up once further bugs are fixed that facilitate getting at the other features.	2020-10-15 21:43:21 -07:00
Sean Silva	06a8ba6900	[RefBackend] Use more idiomatic bufferize pattern for TCP. The time has come for BypassShapes/LowerShapedResultsToMemref to go away :( For the reference backend, being consistent with upstream conventions is the name of the game now. This is a step down in a number of ways, e.g. test clarity and separation of concerns. But it is fewer files and fewer tests, and does address the "TODO: This is really fragile". It also eliminates two more ops from the refback dialect (sadly, they are the shaped_results/yield that we were getting kind of fond of, but alas).	2020-10-15 20:15:53 -07:00
Sean Silva	ebb3c90355	run_lit.sh: build all the things npcomp-capi-ir-test was missing, causing the tests to fail if it wasn't built. Just build everything instead.	2020-10-15 20:15:53 -07:00
Sean Silva	87b6bc305a	[build_tools] Fix install_mlir.sh to look for Python3 Otherwise, on my machine MLIR somehow gets configured with Python 2, which is not supported. The `probe_python` stuff is copied from cmake_configure.sh	2020-10-15 20:15:53 -07:00
Sean Silva	b6bdc8cc4f	[RefBackend] Use upstream BufferizeTypeConverter Now that it has grown source/target materialization capabilities (spelled with ops tensor_load/tensor_to_memref), we can use it. We can also now delete refback.memref_to_tensor/refback.tensor_to_memref. This is also a first step to reducing the downstream functionality needed in the refback dialect.	2020-10-15 15:58:51 -07:00
Sean Silva	1d118b0845	Update llvm-project to ee491ac91e123b90eeec3cce7e494936ea8cb85d Date: Wed Oct 14 11:26:22 2020 -0700	2020-10-15 15:58:51 -07:00
Sean Silva	a7a1d1be2a	Fix up bash_helpers.sh for new build directory layout. Also, remove the PYTHONPATH stuff for now, I think it is out of date and I haven't been able to verify the new setup.	2020-10-15 15:58:51 -07:00
Stella Laurenzo	abb6fe8aa2	Port prior acap export tests to new dispatcher based versions. * Sadly, non-trivial ones fail. * Bugs filed and marked XFAIL.	2020-10-13 16:37:46 -07:00
Stella Laurenzo	30cfc6499f	Create public API for torch_mlir python code. * Adds a trampoline/loader 'torch_mlir' module. * Plumbs through the MLIR python Context and Module creation, interoping with the MLIR Python API (resolves TODO on creating with own context and accessing the module being built). * Inter-module Python API interop is still a bit rough but workable via the capsule mechanism. Can be evolved later. * Exports the frontends/pytorch python sources to the project python/ build directory. * Requires D89294 to land.	2020-10-13 16:36:49 -07:00
Stella Laurenzo	86df4cabeb	Enable MLIR python bindings in CI.	2020-10-12 21:49:39 -07:00
Stella Laurenzo	5c5b8db70f	Update test configuration to import mlir from LLVM install location. * Also adds two lit tests to verify that all of our extensions load without fireworks, which is a good indication that the shared library deps are sane. * Bumps llvm-project version to use D89167.	2020-10-12 15:25:07 -07:00
Sean Silva	f2d5c26c97	Bump llvm-project to 820e65f9e2369d2990fde4b3e7cfceb64f0df9c8 Date: Mon Oct 12 11:26:50 2020 -0700	2020-10-12 13:30:22 -07:00
Sean Silva	93fc21dad0	[RefBackend] Split out TCF->TCP conversion. Now the reference backend is cleanly accepts "TCP"+scalar ops. We introduce tcf-refback-lowering-pipeline which also does TCF->TCP conversion for convenience until we have a "target interface".	2020-10-12 11:56:39 -07:00
Stella Laurenzo	83342e8715	Add another config var to the CI for shared lib builds. Missed this one in the previous change.	2020-10-09 16:57:00 -07:00
Stella Laurenzo	af4edb63ae	Start reworking towards a shared library build. * Need to have a dag of shared library deps in order to interop across python extensions (as presented in ODM). * Introduced add_npcomp_library and friends to mirror the MLIR setup. * Adds a libNPCOMP.so shared library. * Redirects tools and extensions to link against libNPCOMP.so (instead of static libs). * Moves all libraries to lib/, all binaries to bin/ and all python extensions to python/. The invariant is that the rpaths are setup to have a one level directory structure. * Reworks the _torch_mlir extension to build like the others (still need to come up with a consolidated rule to do this instead of open coded). * Includes an upstream version bump to pick up needed changes. Sizes with dynamic linking (stripped, release, asserts enabled): libNPCOMP.so: 43M (includes much of the underlying LLVM codegen deps) libMLIR.so: 31M _npcomp.so: 1.6M (python extension) _torch_mlir.so: 670K (python extension) npcomp-capi-ir-test: 6.3K npcomp-opt: 351K npcomp-run-mlir: 461K mnist-playground: 530K Still more can be done to normalize and optimize but this gets us structurally to the starting point.	2020-10-09 16:02:58 -07:00
Stella Laurenzo	d9dc16a9be	Enable libMLIR.so in CI.	2020-10-09 13:57:01 -07:00
Sean Silva	d6b05c507a	Fix up the docker script / instructions after attempting to use it.	2020-10-09 10:27:06 -07:00
Sean Silva	631c8070df	[RefBackend] Put JITModule in refback namsepace.	2020-10-08 09:07:00 -07:00
Sean Silva	7edb5f3641	[RefBackend] Rename RefBackend dialect to Refback I now realize that VerboseCamelCase is not the best choice for dialect directory/file names and C++ identifiers (take e.g. "Linalg", "Basicpy", etc. as prior art here; not LinearAlgebra or BasicPython). If I had to name the convention it seems to be "Shortword" (or of course just acronym dialects like LLVM, SCF, etc.). This rename also has the side benefit of differentiating RefBackend directories, which now refer to the actual backend itself, from Refback/Refbackrt, which are the dialects which happen to be used by that backend.	2020-10-08 09:07:00 -07:00
Sean Silva	bf99a82832	[RefBackend] Rename Npcomprt dialect to Refbackrt.	2020-10-08 09:07:00 -07:00
Sean Silva	83ad70ef54	[RefBackend] Move runtime related code under npcomp/RefBackend/ Other than the dialect definitions (which will live in standard Dialect/ subdirectory), the goal here is to keep RefBackend-related code nested in {include/npcomp,lib,test}/RefBackend.	2020-10-08 09:07:00 -07:00

... 18 19 20 21 22 ...

1276 Commits (738f4fe96a5416191858d105039977f1d91100ff) All Branches Search

1276 Commits (738f4fe96a5416191858d105039977f1d91100ff)

All Branches