
Automating PyTorch ARM Builds with Docker BuildX for Nvidia CUDA and Python > 3.6

 2 years ago
source link: https://dev.to/xaviergeerinck/automating-pytorch-arm-builds-with-docker-buildx-for-nvidia-cuda-and-python-36-h31

My Workflow

One of the more urgent needs in the community was to automate the building of ARM wheels of PyTorch so that PyTorch can run with CUDA enabled on Nvidia devices (e.g. the Nvidia Jetson Nano). Nvidia currently offers such wheels, but only for Python 3.6, while we are at Python 3.11 now and many packages require Python > 3.6.

Therefore, a way had to be found to automate the building of these wheels. However, this is not straightforward, because the build requires CUDA (for GPU acceleration on edge devices), which AI models need to run smoothly.

The entire process took me around 11 full days, starting off with figuring out how to build the Dockerfile and finally automating the CI process.

Full Write Up & Source Code

To be able to keep this post within reading limits, I decided to publish the entire build on my personal blog: https://xaviergeerinck.com/post/iot/nvidia-building-pytorch.

The source code can be found on GitHub, together with a build of the resulting wheel and the GitHub Actions workflow.

Contributions

In any project of this size, specific contributions are made. In this project I believe I made the following:

  • Install CUDA on non-GPU devices
  • Compile PyTorch with CUDA enabled on non-GPU devices
  • Compile PyTorch for Python > 3.6
  • Build for ARM with CI through Docker Buildx
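The first three contributions hinge on the fact that PyTorch can be compiled with CUDA support on a machine that has no GPU, as long as the CUDA toolkit is present and the target GPU architectures are declared up front. A minimal sketch of that idea is below; the base image tag, PyTorch version, and exact commands are illustrative assumptions, and the real Dockerfile lives in the linked repository.

```dockerfile
# Illustrative sketch - the real Dockerfile lives in the linked repository.
# An Nvidia L4T base image ships the CUDA toolkit, so CUDA headers and
# libraries are available even though the build machine has no GPU.
FROM nvcr.io/nvidia/l4t-base:r32.6.1

# Tell the PyTorch build to compile CUDA kernels without probing for a GPU:
# listing the target compute capabilities explicitly (5.3 = Jetson Nano,
# 6.2 = TX2, 7.2 = Xavier) skips the runtime GPU detection that would
# otherwise fail on a GPU-less build machine.
ENV USE_CUDA=1 \
    USE_CUDNN=1 \
    TORCH_CUDA_ARCH_LIST="5.3;6.2;7.2"

# Build the wheel with whichever Python (> 3.6) is installed in the image.
RUN git clone --recursive --branch v1.10.0 \
      https://github.com/pytorch/pytorch /pytorch \
 && cd /pytorch \
 && python3 setup.py bdist_wheel
```

`TORCH_CUDA_ARCH_LIST` is the key setting: it pins the CUDA compute capabilities to compile for, so the build never needs to query an actual GPU.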

Utilized Actions

Since I did not want to reinvent the wheel, I reused some existing actions:

  1. docker/setup-buildx-action

    • Sets up Buildx so I can cross-compile for ARM on the AMD64 machines in the pipeline
  2. docker/setup-qemu-action

    • Configures QEMU and installs the QEMU static binaries so the build can emulate ARM
  3. actions/checkout

    • Checks out the repository
  4. actions/cache

    • Allows us to cache the Docker layers
  5. actions/upload-artifact

    • Uploads the contents of a directory to GitHub artifacts
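A workflow skeleton wiring these actions together might look like the following; the workflow name, action versions, and cache key are illustrative assumptions, and the actual file is in the linked repository.

```yaml
# Illustrative skeleton - the actual workflow lives in the linked repository.
name: build-pytorch-arm

on:
  release:
    types: [created]

jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2

      # Register QEMU static binaries so ARM binaries can run on AMD64
      - uses: docker/setup-qemu-action@v1
        with:
          platforms: arm64

      # Enable Buildx for multi-platform builds
      - uses: docker/setup-buildx-action@v1

      # Cache Docker layers between runs to shorten the (very long) build
      - uses: actions/cache@v2
        with:
          path: /tmp/.buildx-cache
          key: ${{ runner.os }}-buildx-${{ github.sha }}
          restore-keys: ${{ runner.os }}-buildx-
```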

Workflow Outline

  1. Trigger the action when a release is created (or, at the moment, on a push to master - this will change given the LONG compilation time)
  2. Clone the repository
  3. Set up Docker with Buildx
  4. Run our container
  5. Copy the built wheel to a GitHub artifact
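Steps 4 and 5 of this outline can be sketched as two more workflow steps (again illustrative, with assumed names and paths): `--platform linux/arm64` makes Buildx run the build through QEMU emulation, and `--output type=local` exports the build result, including the compiled wheel, to a host directory that is then uploaded as an artifact.

```yaml
      # Build for ARM64 under QEMU and export the build output
      # (including the compiled wheel) to a local directory
      - name: Build wheel
        run: |
          docker buildx build \
            --platform linux/arm64 \
            --cache-from type=local,src=/tmp/.buildx-cache \
            --cache-to type=local,dest=/tmp/.buildx-cache \
            --output type=local,dest=./out \
            .

      # Publish the wheel as a GitHub artifact
      - uses: actions/upload-artifact@v2
        with:
          name: pytorch-wheel
          path: ./out
```

The `--cache-from`/`--cache-to` flags plug into the actions/cache step above, which matters most here because a full PyTorch compilation under emulation takes many hours.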

Submission Category:

DIY Deployments, Interesting IoT

Yaml File or Link to Code

https://github.com/XavierGeerinck/Jetson-Linux-PyTorch

Note: I had to put the link as the recommended liquid syntax gave an error ({% https://github.com/XavierGeerinck/Jetson-Linux-PyTorch %})

Additional Resources / Info

