Deploy and Integration

This page contains guidelines on how to deploy TVM to various platforms as well as how to integrate it with your project.

Unlike traditional deep learning frameworks. TVM stack is divided into two major components:

  • TVM compiler, which does all the compilation and optimizations
  • TVM runtime, which runs on the target devices.

In order to integrate the compiled module, we do not need to build entire TVM on the target device. You only need to build the TVM compiler stack on your desktop and use that to cross-compile modules that are deployed on the target device. We only need to use a light-weight runtime API that can be integrated into various platforms.

For example, you can run the following commands to build the runtime API on a Linux based embedded system such as Raspberry Pi:

git clone --recursive
cd tvm
mkdir build
cp cmake/config.cmake build
cd build
cmake ..
make runtime

Note that we type make runtime to only build the runtime library. If you want to include additional runtime such as OpenCL, you can modify config.cmake to enable these options. After you get the TVM runtime library, you can link the compiled library

The easiest and recommended way to test, tune and benchmark TVM kernels on embedded devices is through TVM’s RPC API. Here are the links to the related tutorials.

After you finished tuning and benchmarking, you might need to deploy the model on the target device without relying on RPC. see the following resources on how to do so.