torch-mlir/examples/torchscript_resnet18_e2e.py

# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
# See https://llvm.org/LICENSE.txt for license information.
# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
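
"""End-to-end example: import a TorchScript ResNet18 into torch-mlir, lower it
through the npcomp backend pipeline, run it on the reference backend, and
compare its top-3 ImageNet predictions against eager-mode PyTorch."""
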
import sys
import typing

from PIL import Image
import requests

import torch
import torchvision.models as models
from torchvision import transforms

import torch_mlir
import npcomp
from npcomp.passmanager import PassManager
from npcomp.compiler.pytorch.backend import refbackend, iree
from npcomp.compiler.utils import logging
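
# The ModuleBuilder accumulates imported TorchScript modules into a single MLIR module.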
mb = torch_mlir.ModuleBuilder()
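

# Download the input image and apply the standard ImageNet preprocessing
# (resize, center crop, convert to tensor, normalize), returning a batch of one.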
def load_and_preprocess_image(url: str):
    headers = {
        'User-Agent':
        'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_5) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/50.0.2661.102 Safari/537.36'
    }
    img = Image.open(requests.get(url, headers=headers,
                                  stream=True).raw).convert("RGB")
    # preprocessing pipeline
    preprocess = transforms.Compose([
        transforms.Resize(256),
        transforms.CenterCrop(224),
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])
    img_preprocessed = preprocess(img)
    return torch.unsqueeze(img_preprocessed, 0)
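

# Fetch the ImageNet class names used to print human-readable predictions.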
def load_labels():
    classes_text = requests.get(
        "https://raw.githubusercontent.com/cathyzhyi/ml-data/main/imagenet-classes.txt",
        stream=True,
    ).text
    labels = [line.strip() for line in classes_text.splitlines()]
    return labels
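

# Convert raw logits to softmax percentages and return the top-3 (label, score) pairs.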
def top3_possibilities(res):
    _, indexes = torch.sort(res, descending=True)
    percentage = torch.nn.functional.softmax(res, dim=1)[0] * 100
    top3 = [(labels[idx], percentage[idx].item()) for idx in indexes[0][:3]]
    return top3
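

# Print the top-3 predictions from eager-mode PyTorch (the golden reference)
# and from the compiled module, side by side.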
def predictions(torch_func, jit_func, img, labels):
    golden_prediction = top3_possibilities(torch_func(img))
    print("PyTorch prediction")
    print(golden_prediction)
    prediction = top3_possibilities(torch.from_numpy(jit_func(img)))
    print("NPCOMP prediction")
    print(prediction)
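

# Pretrained torchvision ResNet18, put into inference mode.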
class ResNet18Module(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.resnet = models.resnet18(pretrained=True)
        self.train(False)

    def forward(self, img):
        return self.resnet.forward(img)
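

# Wrapper module whose forward simply delegates to the ResNet18 submodule.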
class TestModule(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.s = ResNet18Module()

    def forward(self, x):
        return self.s.forward(x)


image_url = (
    "https://upload.wikimedia.org/wikipedia/commons/2/26/YellowLabradorLooking_new.jpg"
)

print("load image from " + image_url, file=sys.stderr)
img = load_and_preprocess_image(image_url)
labels = load_labels()
test_module = TestModule()
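
# Script the module and tell the importer what to import: only `forward` is
# exported, and its tensor argument is annotated as a dynamically shaped
# float32 tensor; the trailing `True` marks the argument as having value
# semantics (the leading `None` entry corresponds to `self`).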
class_annotator = torch_mlir.ClassAnnotator()
recursivescriptmodule = torch.jit.script(test_module)
torch.jit.save(recursivescriptmodule, "/tmp/foo.pt")
class_annotator.exportNone(recursivescriptmodule._c._type())
class_annotator.exportPath(recursivescriptmodule._c._type(), ["forward"])
class_annotator.annotateArgs(
    recursivescriptmodule._c._type(),
    ["forward"],
    [
        None,
        ([-1, -1, -1, -1], torch.float32, True),
    ],
)
# TODO: Automatically handle unpacking Python class RecursiveScriptModule into the underlying ScriptModule.
mb.import_module(recursivescriptmodule._c, class_annotator)
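
# Lower the imported TorchScript IR to the npcomp backend contract, compile it
# with the reference backend, load the compiled artifact, and compare its
# predictions against eager PyTorch.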
backend = refbackend.RefBackendNpcompBackend()
PassManager.parse("torchscript-to-npcomp-backend-pipeline").run(mb.module)
compiled = backend.compile(mb.module)
jit_module = backend.load(compiled)
predictions(test_module.forward, jit_module.forward, img, labels)