torch-mlir/examples/ltc_backend_mnist.py

# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
# See https://llvm.org/LICENSE.txt for license information.
# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
# Also available under a BSD-style license. See LICENSE.
"""
Example use of the example Torch MLIR LTC backend.
"""
import argparse
import sys

import torch
import torch._lazy
import torch.nn.functional as F


def main(device='lazy'):
    """
    Load model to specified device. Ensure that any backends have been initialized by this point.

    :param device: name of device to load tensors to
    """
    torch.manual_seed(0)

    inputs = torch.tensor([[1, 2, 3, 4, 5]], dtype=torch.float32, device=device)
    assert inputs.device.type == device

    targets = torch.tensor([3], dtype=torch.int64, device=device)
    assert targets.device.type == device

    print("Initialized data")

    class Model(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.fc1 = torch.nn.Linear(5, 10)

        def forward(self, x):
            out = self.fc1(x)
            out = F.relu(out)
            return out

    model = Model().to(device)
    model.train()
    assert all(p.device.type == device for p in model.parameters())

    print("Initialized model")

    criterion = torch.nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

    num_epochs = 3
    losses = []
    for _ in range(num_epochs):
        optimizer.zero_grad()

        outputs = model(inputs)
        loss = criterion(outputs, targets)
        loss.backward()
        losses.append(loss)

        optimizer.step()

        if device == "lazy":
            print("Calling Mark Step")
            torch._lazy.mark_step()

    # Get debug information from LTC
    if 'torch_mlir._mlir_libs._REFERENCE_LAZY_BACKEND' in sys.modules:
        computation = lazy_backend.get_latest_computation()
        if computation:
            print(computation.debug_string())

    print(losses)

    return model, losses


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "-d",
        "--device",
        type=str.upper,
        choices=["CPU", "TS", "MLIR_EXAMPLE"],
        default="MLIR_EXAMPLE",
        help="The device type",
    )
    args = parser.parse_args()

    if args.device in ("TS", "MLIR_EXAMPLE"):
        if args.device == "TS":
            import torch._lazy.ts_backend
            torch._lazy.ts_backend.init()

        elif args.device == "MLIR_EXAMPLE":
            import torch_mlir._mlir_libs._REFERENCE_LAZY_BACKEND as lazy_backend

            lazy_backend._initialize()

        device = "lazy"
        print("Initialized backend")
    else:
        device = args.device.lower()

    main(device)
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00			`# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.`
			`# See https://llvm.org/LICENSE.txt for license information.`
			`# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception`
			`# Also available under a BSD-style license. See LICENSE.`
			`"""`
			`Example use of the example Torch MLIR LTC backend.`
			`"""`
			`import argparse`
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`import sys`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`import torch`
			`import torch._lazy`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00			`import torch.nn.functional as F`


Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`def main(device='lazy'):`
			`"""`
			`Load model to specified device. Ensure that any backends have been initialized by this point.`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`:param device: name of device to load tensors to`
			`"""`
			`torch.manual_seed(0)`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
			`inputs = torch.tensor([[1, 2, 3, 4, 5]], dtype=torch.float32, device=device)`
			`assert inputs.device.type == device`

			`targets = torch.tensor([3], dtype=torch.int64, device=device)`
			`assert targets.device.type == device`

			`print("Initialized data")`

			`class Model(torch.nn.Module):`
			`def __init__(self):`
			`super().__init__()`
Generate MLIR with shape information via LTC frontend (#742) 2022-05-27 03:53:15 +08:00			`self.fc1 = torch.nn.Linear(5, 10)`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
			`def forward(self, x):`
			`out = self.fc1(x)`
			`out = F.relu(out)`
			`return out`

			`model = Model().to(device)`
			`model.train()`
			`assert all(p.device.type == device for p in model.parameters())`

			`print("Initialized model")`

			`criterion = torch.nn.CrossEntropyLoss()`
			`optimizer = torch.optim.SGD(model.parameters(), lr=0.01)`

Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`num_epochs = 3`
			`losses = []`
			`for _ in range(num_epochs):`
			`optimizer.zero_grad()`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`outputs = model(inputs)`
			`loss = criterion(outputs, targets)`
			`loss.backward()`
			`losses.append(loss)`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`optimizer.step()`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`if device == "lazy":`
			`print("Calling Mark Step")`
			`torch._lazy.mark_step()`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`# Get debug information from LTC`
Fix LTC lib_torch_mlir_ltc.so import error (#1283) * Build LTC to _mlir_libs directory * Update CMakeLists.txt 2022-08-26 06:25:01 +08:00			`if 'torch_mlir._mlir_libs._REFERENCE_LAZY_BACKEND' in sys.modules:`
Reference Lazy Backend (#1045) * Changed Example MLIR backend to Reference MLIR backend * Moved reference_ltc_backend into csrc * Merged sys_utils.h * Renamed reference_ltc_backend to reference_lazy_backend * Addressed review comments * Update docs with new library name * Removed _REFERENCE_LAZY_BACKEND from .gitignore * Added reference_lazy_backend to the TorchMLIRPythonModules dependency list Fixed typo in `ltc_examples.md` Missed instance where `ltc_backend` was used instead of `lazy_backend`. 2022-07-13 03:56:52 +08:00			`computation = lazy_backend.get_latest_computation()`
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`if computation:`
			`print(computation.debug_string())`
E2E HuggingFace Bert using LTC Backend (#912) * Update native function definitions * Add ops to support bert lowering - Add empty_strided and as_strided - Restore zeros_like to op blacklist (Without this, tensors will be unintentionally created with a CPU device rather than lazy) - Check for composite implicit ops and add device data IR - Also fix codegen for functionalization * Add autogen to CMakeList * Remove PyTorch submodule * Reduced BERT model size * Print Mark Step status in Torch MLIR LTC debug string * Apply fixes to work with latest upstream/main - Pass importOptions into getMlirTypeFromTorchType during NodeImporter::importNode Without this, the tensor type created may have a mismatched type as ImportOptions may cause vtensor to be used instead of tensor * Update shape inference functions - Fixed compute_shape_native_batch_norm when mean and var are uninitialized Previously, the number of shapes returned would be <3 if either mean or val was didn't exist. Instead, we now initialize them with a vector matching the number of channels. - Implemented compute_shape_mul - Fixed bug in reshape shape inference error message * Get MLIR backend more consistent with TS backend - Remove LazyNativeFunctions::_unsafe_view from autogen - Blacklist ops to make JIT graph more like output of TS backend - Print graph when SSA value has mismatch of types and results - Remove normalize_index from LazyShapeInference - Fix seeds for LTC example models * Update and clean up shape inference functions - Prune shape inference functions - Add shape inference function for GenerateSlice - Add shape inference function for GenerateCopy Co-authored-by: Henry Tu <henry.tu@cerebras.net> 2022-06-08 02:38:50 +08:00
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`print(losses)`

			`return model, losses`


			`if __name__ == "__main__":`
Add example Torch MLIR LTC Backend (#725) 2022-04-15 00:53:00 +08:00			`parser = argparse.ArgumentParser()`
			`parser.add_argument(`
			`"-d",`
			`"--device",`
			`type=str.upper,`
			`choices=["CPU", "TS", "MLIR_EXAMPLE"],`
			`default="MLIR_EXAMPLE",`
			`help="The device type",`
			`)`
			`args = parser.parse_args()`
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00
			`if args.device in ("TS", "MLIR_EXAMPLE"):`
			`if args.device == "TS":`
Only import the LTC backend that's used (#939) 2022-06-15 00:09:55 +08:00			`import torch._lazy.ts_backend`
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00			`torch._lazy.ts_backend.init()`

			`elif args.device == "MLIR_EXAMPLE":`
Fix LTC lib_torch_mlir_ltc.so import error (#1283) * Build LTC to _mlir_libs directory * Update CMakeLists.txt 2022-08-26 06:25:01 +08:00			`import torch_mlir._mlir_libs._REFERENCE_LAZY_BACKEND as lazy_backend`
Integrate Functionalization Pass (#998) * Fix autogen build dir issue * Got functionalization pass to compile * Add slice/diagonal backwards functionalization * Fix codegen invocation in CMakeLists.txt * Add functionalization view ops * Fix logsumexp out functionalization * Fix ComputationPtr * Blacklist new_empty op * Add op comparison * Remove unnecessary ops Co-authored-by: Henry Tu <henry.tu@cerebras.net> 2022-07-01 03:19:05 +08:00
Reference Lazy Backend (#1045) * Changed Example MLIR backend to Reference MLIR backend * Moved reference_ltc_backend into csrc * Merged sys_utils.h * Renamed reference_ltc_backend to reference_lazy_backend * Addressed review comments * Update docs with new library name * Removed _REFERENCE_LAZY_BACKEND from .gitignore * Added reference_lazy_backend to the TorchMLIRPythonModules dependency list Fixed typo in `ltc_examples.md` Missed instance where `ltc_backend` was used instead of `lazy_backend`. 2022-07-13 03:56:52 +08:00			`lazy_backend._initialize()`
Added e2e LTC tests (#916) * Added e2e LTC Torch MLIR tests * Fix seed for reproducability * Check if computation is None before getting debug string * Updated unit tests, and added numeric tests * Print name of the model layer that fails numeric validation * Run LTC e2e test with CI/CD * Set seed in main function, instead of beginning of execution * Add comment to specify number of digits of precision * Fixed typo * Remove tests for LTC example models * Added LTC option to torchscript e2e * Implement compile and run for LTC e2e test * xfail all tests that use ops that aren't currently supported 2022-06-10 03:56:01 +08:00
			`device = "lazy"`
			`print("Initialized backend")`
			`else:`
			`device = args.device.lower()`

			`main(device)`