# The Torch-MLIR Project

The Torch-MLIR project aims to provide first-class compiler support from the [PyTorch](https://pytorch.org) ecosystem to the MLIR ecosystem.

> This project is participating in the LLVM Incubator process: as such, it is
not part of any official LLVM release.  While incubation status is not
necessarily a reflection of the completeness or stability of the code, it
does indicate that the project is not yet endorsed as a component of LLVM.

[PyTorch](https://pytorch.org)
PyTorch is an open source machine learning framework that facilitates the seamless transition from research and prototyping to production-level deployment.

[MLIR](https://mlir.llvm.org)
The MLIR project offers a novel approach for building extensible and reusable compiler architectures, which address the issue of software fragmentation, reduce the cost of developing domain-specific compilers, improve compilation for heterogeneous hardware, and promote compatibility between existing compilers.

[Torch-MLIR](https://github.com/llvm/torch-mlir)
Several vendors have adopted MLIR as the middle layer in their systems, enabling them to map frameworks such as PyTorch, JAX, and TensorFlow into MLIR and subsequently lower them to their target hardware. We have observed half a dozen custom lowerings from PyTorch to MLIR. A shared frontend makes it easier for hardware vendors to focus on their unique value, rather than implementing yet another PyTorch frontend for MLIR. The ultimate aim is to resemble how hardware vendors today add LLVM target support, rather than each one implementing Clang or a C++ frontend.

[![pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit)](https://github.com/pre-commit/pre-commit)

## All the roads from PyTorch to Torch MLIR Dialect

There are a few paths for lowering to the Torch MLIR Dialect:
 - ONNX as the entry point.
 - Fx as the entry point.
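
As a rough illustration of the Fx path, the sketch below is modeled on the FxImporter ResNet18 example referenced later in this README. It assumes the `torch_mlir.fx.export_and_import` entry point and the `"torch"` output type are available in the installed snapshot; `lower_with_fx`, `have`, and the toy `Tanh` module are hypothetical names used only for illustration.

```python
import importlib.util


def lower_with_fx(model, *example_args, output_type="torch"):
    """Export `model` via the Fx path and return the imported MLIR module."""
    # Imported lazily so this file can be loaded without torch-mlir installed.
    # Assumption: the installed snapshot provides torch_mlir.fx.export_and_import.
    from torch_mlir import fx

    return fx.export_and_import(model, *example_args, output_type=output_type)


def have(package):
    """Return True if `package` can be imported in this environment."""
    return importlib.util.find_spec(package) is not None


if have("torch") and have("torch_mlir"):
    import torch

    class Tanh(torch.nn.Module):  # hypothetical toy model
        def forward(self, x):
            return torch.tanh(x)

    # Print the module in the Torch MLIR Dialect.
    print(lower_with_fx(Tanh(), torch.ones(2, 3)))
else:
    print("torch and torch-mlir are required to run this sketch")
```

Keeping the `torch_mlir` import inside the function means the helper can be defined even where the snapshot is not installed; the import only happens when a model is actually lowered.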

## Project Communication

- `#torch-mlir` channel on the LLVM [Discord](https://discord.gg/xS7Z362) - this is the most active communication channel
- GitHub issues [here](https://github.com/llvm/torch-mlir/issues)
- [`torch-mlir` section](https://llvm.discourse.group/c/projects-that-want-to-become-official-llvm-projects/torch-mlir/41) of LLVM Discourse

## Install torch-mlir snapshot

At the time of writing, we release [pre-built snapshots of torch-mlir](https://github.com/llvm/torch-mlir-release) for Python 3.11 and Python 3.10.

If you have a supported Python version, the following commands create and activate a virtual environment.
```shell
python3.11 -m venv mlir_venv
source mlir_venv/bin/activate
```

Or, if you want to switch between multiple versions of Python using conda, you can create a conda environment with Python 3.11.
```shell
conda create -n torch-mlir python=3.11
conda activate torch-mlir
python -m pip install --upgrade pip
```
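
Before installing, it can help to confirm that the active interpreter is one of the versions with pre-built snapshots. The following is a minimal sketch; the `SUPPORTED` set simply mirrors the versions named above at the time of writing, and `is_supported` is a hypothetical helper.

```python
import sys

# Versions this README names as having pre-built snapshots at the time of writing.
SUPPORTED = {(3, 10), (3, 11)}


def is_supported(version_info=sys.version_info):
    """Return True if (major, minor) is one of the snapshot-supported versions."""
    return tuple(version_info[:2]) in SUPPORTED


if is_supported():
    print("This Python version has pre-built torch-mlir snapshots.")
else:
    print("This Python version is not in the supported list above.")
```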

Then, we can install torch-mlir with the corresponding torch and torchvision nightlies.
```shell
pip install --pre torch-mlir torchvision \
  --extra-index-url https://download.pytorch.org/whl/nightly/cpu \
  -f https://github.com/llvm/torch-mlir-release/releases/expanded_assets/dev-wheels
```
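
To check what the command above actually installed, one option is to query the package metadata from Python. This is a small sketch using only the standard library; `installed_version` is a hypothetical helper, and the distribution names are taken from the install command.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


for name in ("torch", "torchvision", "torch-mlir"):
    print(name, installed_version(name) or "not installed")
```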

## Using torch-mlir

Torch-MLIR is primarily a project that is integrated into compilers to bridge them to PyTorch and ONNX. If contemplating a new integration, it may be helpful to refer to existing downstreams:

* [IREE](https://github.com/iree-org/iree.git)
* [Blade](https://github.com/alibaba/BladeDISC)

While most of the project is exercised via testing paths, there are some ways that an end user can directly use the APIs without further integration:

### FxImporter ResNet18
```shell
# Get the latest example if you haven't checked out the code
wget https://raw.githubusercontent.com/llvm/torch-mlir/main/projects/pt1/examples/fximporter_resnet18.py

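# Run ResNet18 as a standalone script.
# (From a repository checkout; if you downloaded the file with wget above,
# run `python fximporter_resnet18.py` instead.)
python projects/pt1/examples/fximporter_resnet18.py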

# Output
load image from https://upload.wikimedia.org/wikipedia/commons/2/26/YellowLabradorLooking_new.jpg
...
PyTorch prediction
[('Labrador retriever', 70.65674591064453), ('golden retriever', 4.988346099853516), ('Saluki, gazelle hound', 4.477451324462891)]
torch-mlir prediction
[('Labrador retriever', 70.6567153930664), ('golden retriever', 4.988325119018555), ('Saluki, gazelle hound', 4.477458477020264)]
```
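
The two prediction lists above agree on labels and differ only slightly in score. A comparison like the following makes that check explicit; it is a sketch that reuses the values printed above, and the `1e-3` tolerance is an assumption chosen for illustration, not a project-defined threshold.

```python
import math

# Values copied from the example output above.
pytorch_pred = [
    ("Labrador retriever", 70.65674591064453),
    ("golden retriever", 4.988346099853516),
    ("Saluki, gazelle hound", 4.477451324462891),
]
torch_mlir_pred = [
    ("Labrador retriever", 70.6567153930664),
    ("golden retriever", 4.988325119018555),
    ("Saluki, gazelle hound", 4.477458477020264),
]


def predictions_match(a, b, abs_tol=1e-3):
    """Return True if labels are identical and scores agree within abs_tol."""
    return len(a) == len(b) and all(
        label_a == label_b and math.isclose(score_a, score_b, abs_tol=abs_tol)
        for (label_a, score_a), (label_b, score_b) in zip(a, b)
    )


print(predictions_match(pytorch_pred, torch_mlir_pred))
```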

## Repository Layout

The project follows the conventions of typical MLIR-based projects:

* `include/torch-mlir`, `lib` structure for C++ MLIR compiler dialects/passes.
* `test` for holding test code.
* `tools` for `torch-mlir-opt` and such.
* `python` top-level directory for Python code.

## Developers
If you would like to develop and build torch-mlir from source, please see the [Development Notes](docs/development.md).