Build·May 23, 2026

Fixing Camera Latency: Migrating from ONNX to TensorRT

Tackling the camera lag that dropped us from a target of 720p @ 60 fps down to 0.5 fps, and why we moved our model runtime from ONNX to TensorRT on the Jetson Orin Nano.


Camera + inference latency, ONNX vs. TensorRT (placeholder)

Overview

Today, we worked on fixing the camera's high latency and on implementing localization. The camera is supposed to run at 720p @ 60 fps, but in practice the feed was updating far more slowly, at around 0.5 fps.
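To put that gap in numbers, the two frame rates can be converted into time per frame. This is a small sketch that uses only the two figures quoted above; the function name `frame_time_ms` is made up for illustration.

```python
def frame_time_ms(fps: float) -> float:
    """Time spent on (or available for) one frame, in milliseconds."""
    return 1000.0 / fps


TARGET_FPS = 60.0    # what the camera is supposed to deliver
OBSERVED_FPS = 0.5   # what we actually saw

target_ms = frame_time_ms(TARGET_FPS)      # budget per frame at the target rate
observed_ms = frame_time_ms(OBSERVED_FPS)  # time per frame we were actually getting
slowdown = TARGET_FPS / OBSERVED_FPS       # how many times slower than the target

print(f"target:   {target_ms:.1f} ms per frame")
print(f"observed: {observed_ms:.1f} ms per frame")
print(f"slowdown: {slowdown:.0f}x")
```

At 60 fps, capture and inference together have to fit in roughly 16.7 ms per frame. At 0.5 fps each frame was taking about 2000 ms, which is 120 times over budget.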

Diagnosing the Lag

To find the cause of the lag, we looked at two main components:

The runtime of the model
The model itself
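One way to tell these two apart is to time the inference call in isolation, then repeat the measurement after changing only the runtime or only the model. The sketch below is a hypothetical helper, not the code used on the robot: `mean_latency_ms` and `fps_from_latency` are illustrative names, and `fake_inference` is a stand-in that just sleeps and should be replaced with the real model call.

```python
import time
from typing import Callable


def mean_latency_ms(fn: Callable[[], object], warmup: int = 1, repeats: int = 5) -> float:
    """Mean wall-clock time of one call to fn, in milliseconds."""
    # Warm-up calls are not timed; the first call often includes one-off setup cost.
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(repeats):
        fn()
    elapsed = time.perf_counter() - start
    return elapsed * 1000.0 / repeats


def fps_from_latency(latency_ms: float) -> float:
    """Best-case frame rate if each frame costs latency_ms."""
    return 1000.0 / latency_ms


def fake_inference() -> None:
    # Stand-in for the real model call (assumption): pretend one inference takes 10 ms.
    time.sleep(0.01)


latency_ms = mean_latency_ms(fake_inference)
print(f"{latency_ms:.1f} ms per call, at most {fps_from_latency(latency_ms):.1f} fps")
```

If the measured latency barely changes when the runtime is swapped, the model itself is the more likely bottleneck, and the reverse holds if it changes a lot.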

Switching from ONNX to TensorRT

Previously, we ran the model as an ONNX file. We chose ONNX because it is a standardized model format that we could run anywhere, and we had used it in previous years for its ease of use, accessibility, and compatibility with our software stack.

However, the Jetson Orin Nano, our main processing unit, has limited compute power, so we decided to switch to TensorRT. TensorRT is NVIDIA's inference runtime: it gives up ONNX's portability in exchange for engines optimized for inference speed and throughput on NVIDIA hardware, which is ideal for our situation.
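For illustration, converting an existing ONNX file into a TensorRT engine could look like the sketch below. It assumes the `trtexec` tool that ships with TensorRT is on the PATH; the file names `model.onnx` and `model.engine` are hypothetical, the helper `trtexec_command` is made up for this example, and the FP16 flag is an optional choice rather than something confirmed above.

```python
import shlex
from typing import List


def trtexec_command(onnx_path: str, engine_path: str, fp16: bool = True) -> List[str]:
    """Build the argument list for a trtexec ONNX-to-engine conversion."""
    cmd = [
        "trtexec",
        f"--onnx={onnx_path}",          # input model in ONNX format
        f"--saveEngine={engine_path}",  # where to write the serialized engine
    ]
    if fp16:
        # Allow half-precision kernels (assumption: the model tolerates FP16).
        cmd.append("--fp16")
    return cmd


# Hypothetical file names, for illustration only.
cmd = trtexec_command("model.onnx", "model.engine")
print(shlex.join(cmd))

# To actually build the engine, run the command on the Jetson itself, e.g.:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

A TensorRT engine is tied to the GPU and TensorRT version it was built with, so the conversion should happen on the Jetson Orin Nano rather than on a development laptop.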
