GStreamer Qt Overlay for Embedded Systems/Performance/jetson: Difference between revisions
Line 298: | Line 298: | ||
For average behavior: | For average behavior: | ||
<pre> | <pre> | ||
gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! nvvidconv ! qtoverlay | gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! nvvidconv ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf print-cpu-load=1 ! fakesink | ||
</pre> | </pre> | ||
For limit behavior: | For limit behavior: | ||
<pre> | <pre> | ||
gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! nvvidconv ! qtoverlay | gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! imagefreeze ! nvvidconv ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf print-cpu-load=1 ! fakesink | ||
</pre> | </pre> | ||
Line 317: | Line 317: | ||
|- | |- | ||
| Max Framerate (fps) | | Max Framerate (fps) | ||
| | | 54.279 | ||
| | | 37.62 | ||
| 11.83 | | 11.83 | ||
|- | |- | ||
| CPU(%) | | CPU(%) | ||
| | | 7 | ||
| | | 8 | ||
| | | 8 | ||
|- | |- | ||
| RAM(MiB) | | RAM(MiB) | ||
| | | 98 | ||
| | | 105 | ||
| | | 126 | ||
|} | |} | ||
</center> | </center> | ||
Line 344: | Line 344: | ||
|- | |- | ||
| Max Framerate (fps) | | Max Framerate (fps) | ||
| | | 254 | ||
| | | 202 | ||
| | | 84.5 | ||
|- | |- | ||
| CPU(%) | | CPU(%) | ||
| | | 7 | ||
| | | 14.1 | ||
| | | 15.6 | ||
|- | |- | ||
| RAM(MiB) | | RAM(MiB) | ||
| | | 89.141 | ||
| | | 89.141 | ||
| | | 110.16 | ||
|} | |} | ||
</center> | </center> | ||
Line 362: | Line 362: | ||
'''NVMM''': | '''NVMM''': | ||
For average behavior: | |||
<pre> | <pre> | ||
gst-launch-1.0 videotestsrc ! "video/x-raw, width=${W}, height=${H}, framerate=30/1" ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false | gst-launch-1.0 videotestsrc ! "video/x-raw, width=${W}, height=${H}, framerate=30/1" ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false | ||
</pre> | </pre> | ||
'''Results''' | For limit behavior: | ||
<pre> | |||
gst-launch-1.0 videotestsrc ! "video/x-raw, width=${W}, height=${H}, framerate=30/1" ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false | |||
</pre> | |||
'''Results for average behavior''' | |||
<center> | <center> | ||
Line 377: | Line 383: | ||
|- | |- | ||
| Max Framerate (fps) | | Max Framerate (fps) | ||
| | | 195.837 | ||
| | | 111.943 | ||
| | | 28.113 | ||
|- | |- | ||
| CPU(%) | | CPU(%) | ||
| 27 | |||
| 28 | | 28 | ||
| | | 29 | ||
| | |- | ||
| RAM(MiB) | |||
| 112 | |||
| 114 | |||
| 126 | |||
|} | |||
</center> | |||
'''Results for limit behavior''' | |||
<center> | |||
{| class="wikitable" | |||
|- style="font-weight:bold;" | |||
! | |||
! 720p | |||
! 1080p<br /> | |||
! 4K | |||
|- | |||
| Max Framerate (fps) | |||
| 390 | |||
| 228 | |||
| 73.3 | |||
|- | |||
| CPU(%) | |||
| 6 | |||
| 11 | |||
| 13 | |||
|- | |- | ||
| RAM(MiB) | | RAM(MiB) | ||
| | | 117 | ||
| | | 151 | ||
| | | 157 | ||
|} | |} | ||
</center> | </center> |
Revision as of 23:29, 5 October 2023
GStreamer Qt Overlay |
---|
Overview |
Getting Started |
Examples |
Performance |
Similar Solutions |
Troubleshooting |
FAQ |
Contact Us |
Xavier NX Platform
For testing purposes, take into account the following points:
- Maximum performance mode enabled: all cores, and Jetson clocks enabled
- Jetpack 4.4 (4.2.1 or earlier is not recommended)
- Base installation
- The GstQtOverlay is surrounded by queues to measure the actual capability of GstQtOverlay
For measuring and contrasting the performances with and without NVMM support, we used the following pipelines:
NVMM:
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080' ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! nvvidconv ! 'video/x-raw' ! fakesink sync=false
No NVMM
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080' ! queue ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false
The results obtained:
Measurement | Jetson NX |
---|---|
No NVMM | 190 fps |
NVMM |
265 fps |
With the addition of native UYVY and NV12 support for NVMM memory, we measured the performance for each format between using nvvidconv or the native support. The following pipelines were used:
NVMM using nvvidconv:
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080, format=RGBA' ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM), format=RGBA' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! nvvidconv ! 'video/x-raw' ! fakesink sync=false
NVMM with native UYVY and NV12 support:
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080, format=NV12' ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM), format=NV12' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! nvvidconv ! 'video/x-raw' ! fakesink sync=false
The results obtained:
Measurement | RGBA | UYVY | NV12 |
---|---|---|---|
NVMM with nvvidconv | 227 fps |
200 fps |
177 fps |
NVMM with native formats |
227 fps |
172 fps |
177 fps |
CPU usage
Take the following pipelines as reference:
No NVMM
gst-launch-1.0 videotestsrc is-live=true ! 'video/x-raw, width=1920, height=1080' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! 'video/x-raw, width=1920, height=1080' ! fakesink
NVMM:
gst-launch-1.0 videotestsrc is-live=true ! 'video/x-raw, width=1920, height=1080' ! nvvidconv ! 'video/x-raw(memory:NVMM), width=1920, height=1080' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! nvvidconv ! 'video/x-raw, width=1920, height=1080' ! fakesink sync=false
Results
Measurement | No NVMM |
NVMM |
---|---|---|
GstQtOverlay | 16.5% | 7.3% |
Rest of pipeline | 14% | 27.8% |
Total | 30.5% | 35.1% |
The total consumption of the pipeline is higher in the NVMM case since there are more elements. The GstQtOverlay consumes less CPU in NVMM mode.
Nano Platform
For testing purposes, take into account the following points:
- Maximum performance mode enabled: all cores, and Jetson clocks enabled
- Jetpack 4.5 (4.2.1 or earlier is not recommended)
- Base installation
- The GstQtOverlay is surrounded by queues to measure the actual capability of GstQtOverlay
For measuring and contrasting the performances with and without NVMM support for every format, we used the following pipelines:
NVMM using nvvidconv:
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080, format=NV12' ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM), format=RGBA' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! nvvidconv ! 'video/x-raw' ! fakesink sync=false
NVMM with native UYVY and NV12 support:
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080, format=NV12' ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM), format=NV12' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! nvvidconv ! 'video/x-raw' ! fakesink sync=false
No NVMM
gst-launch-1.0 videotestsrc pattern=black ! 'video/x-raw, width=1920, height=1080' ! queue ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false
The results obtained:
Measurement | RGBA | UYVY | NV12 |
---|---|---|---|
No NVMM | 57 fps |
N/A | N/A |
NVMM with nvvidconv |
121 fps |
67 fps | 61 fps |
NVMM with native formats |
123 fps |
67 fps | 61 fps |
While using the native formats may not provide big performance gains here as it still uses nvvidconv to upload to NVMM memory, it allows connecting directly to some cameras that output NV12 in NVMM memory like with the nvarguscamerasrc element.
CPU usage
Taking the following pipelines as reference:
No NVMM
gst-launch-1.0 videotestsrc is-live=true ! 'video/x-raw, width=1920, height=1080' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf print-cpu-load=true ! 'video/x-raw, width=1920, height=1080' ! fakesink
NVMM:
gst-launch-1.0 videotestsrc is-live=true ! 'video/x-raw, width=1920, height=1080' ! nvvidconv ! 'video/x-raw(memory:NVMM), width=1920, height=1080' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf print-cpu-load=true ! nvvidconv ! 'video/x-raw, width=1920, height=1080' ! fakesink sync=false
Results
Measurement | No NVMM |
NVMM |
---|---|---|
GstQtOverlay | 2% | 2% |
Rest of pipeline | 17% | 13% |
Total | 19% | 15% |
Orin Nano Platform
CPU usage
Taking the following pipelines as reference:
No NVMM
gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! nvvidconv ! qtoverlay name=main.qml ! perf print-cpu-load=1 ! fakesink
Results
720p | 1080p |
4K | |
---|---|---|---|
Max Framerate (fps) | 73.45 | 38.842 | 11.83 |
CPU(%) | 9 | 9 | 9 |
RAM(MiB) | 110.04 | 111.06 | 111.06 |
NVMM:
gst-launch-1.0 videotestsrc ! "video/x-raw, width=${W}, height=${H}, framerate=30/1" ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false
Results
720p | 1080p |
4K | |
---|---|---|---|
Max Framerate (fps) | 223.302 | 100.615 | 25.409 |
CPU(%) | 28 | 25 | 22 |
RAM(MiB) | 144 | 144 | 160 |
Xavier NX Platform
CPU usage
Taking the following pipelines as reference:
No NVMM
For average behavior:
gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! nvvidconv ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf print-cpu-load=1 ! fakesink
For limit behavior:
gst-launch-1.0 videotestsrc ! "video/x-raw,width=${W},height=${H},framerate=30/1" ! imagefreeze ! nvvidconv ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf print-cpu-load=1 ! fakesink
Results for average behavior
720p | 1080p |
4K | |
---|---|---|---|
Max Framerate (fps) | 54.279 | 37.62 | 11.83 |
CPU(%) | 7 | 8 | 8 |
RAM(MiB) | 98 | 105 | 126 |
Results for limit behavior
720p | 1080p |
4K | |
---|---|---|---|
Max Framerate (fps) | 254 | 202 | 84.5 |
CPU(%) | 7 | 14.1 | 15.6 |
RAM(MiB) | 89.141 | 89.141 | 110.16 |
NVMM:
For average behavior:
gst-launch-1.0 videotestsrc ! "video/x-raw, width=${W}, height=${H}, framerate=30/1" ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false
For limit behavior:
gst-launch-1.0 videotestsrc ! "video/x-raw, width=${W}, height=${H}, framerate=30/1" ! queue ! nvvidconv ! queue ! 'video/x-raw(memory:NVMM)' ! qtoverlay qml=gst-libs/gst/qt/main.qml ! perf ! queue ! fakesink sync=false
Results for average behavior
720p | 1080p |
4K | |
---|---|---|---|
Max Framerate (fps) | 195.837 | 111.943 | 28.113 |
CPU(%) | 27 | 28 | 29 |
RAM(MiB) | 112 | 114 | 126 |
Results for limit behavior
720p | 1080p |
4K | |
---|---|---|---|
Max Framerate (fps) | 390 | 228 | 73.3 |
CPU(%) | 6 | 11 | 13 |
RAM(MiB) | 117 | 151 | 157 |