From d18f2282c86998ab41431f49b3e3438b68d5bb9a Mon Sep 17 00:00:00 2001
From: Nicolas Mowen
Date: Thu, 31 Jul 2025 07:21:41 -0600
Subject: [PATCH] Update tensorrt inference time docs (#19338)

* Update tensorrt inference times

* Update hardware.md
---
 docs/docs/frigate/hardware.md | 14 +++++---------
 1 file changed, 5 insertions(+), 9 deletions(-)

diff --git a/docs/docs/frigate/hardware.md b/docs/docs/frigate/hardware.md
index ce097a0a0..650151a35 100644
--- a/docs/docs/frigate/hardware.md
+++ b/docs/docs/frigate/hardware.md
@@ -166,16 +166,12 @@ There are improved capabilities in newer GPU architectures that TensorRT can ben
 
 Inference speeds will vary greatly depending on the GPU and the model used. `tiny` variants are faster than the equivalent non-tiny model, some known examples are below:
 
-| Name            | YOLOv7 Inference Time | YOLO-NAS Inference Time   | RF-DETR Inference Time    |
+| Name            | YOLOv9 Inference Time | YOLO-NAS Inference Time   | RF-DETR Inference Time    |
 | --------------- | --------------------- | ------------------------- | ------------------------- |
-| GTX 1060 6GB    | ~ 7 ms                |                           |                           |
-| GTX 1070        | ~ 6 ms                |                           |                           |
-| GTX 1660 SUPER  | ~ 4 ms                |                           |                           |
-| RTX 3050        | 5 - 7 ms              | 320: ~ 10 ms 640: ~ 16 ms | 336: ~ 16 ms 560: ~ 40 ms |
-| RTX 3070 Mobile | ~ 5 ms                |                           |                           |
-| RTX 3070        | 4 - 6 ms              | 320: ~ 6 ms 640: ~ 12 ms  | 336: ~ 14 ms 560: ~ 36 ms |
-| Quadro P400 2GB | 20 - 25 ms            |                           |                           |
-| Quadro P2000    | ~ 12 ms               |                           |                           |
+| RTX 3050        | 320: 15 ms            | 320: ~ 10 ms 640: ~ 16 ms | 336: ~ 16 ms 560: ~ 40 ms |
+| RTX 3070        | 320: 11 ms            | 320: ~ 8 ms 640: ~ 14 ms  | 336: ~ 14 ms 560: ~ 36 ms |
+| RTX A4000       |                       | 320: ~ 15 ms              |                           |
+| Tesla P40       |                       | 320: ~ 105 ms             |                           |
 
 ### ROCm - AMD GPU
 