GstCUDA - Performance Profiling: Difference between revisions

GstCUDA - Performance Profiling (view source)

Revision as of 00:39, 16 July 2019

31 bytes added , 16 July 2019

no edit summary

Dgarbanzo

1,433

edits

@@ Line 18: / Line 18: @@
 === Jetpack 3.3 - IMX274 camera 4K@60fps glass to glass latency ===
-==Simple Capture to Display pipeline (without GstCUDA)==
+====Simple Capture to Display pipeline (without GstCUDA)====
 This measurement should be used as a reference to compare the glass to glass latency of the below pipelines with GstCUDA.
 * '''''Glass to Glass latency = 112.2042693 ms'''''
@@ Line 26: / Line 26: @@
 </pre>
+==== Cudafilter ====
-=== Cudafilter ===
+===== NVMM Direct Handling =====
-==== NVMM Direct Handling ====
+====== In-place:True ======
-===== In-place:True =====
 * '''''Glass to Glass latency = 178.9331237 ms'''''
 Test pipeline:
@@ Line 35: / Line 34: @@
 gst-launch-1.0 nvcamerasrc queue-size=10 sensor-id=1 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=NV12,framerate=60/1" ! nvvidconv ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! cudafilter in-place=true location=/home/nvidia/gst-cuda/tests/examples/cudafilter_algorithms/gray-scale-filter/gray-scale-filter.so ! perf print-arm-load=true ! nvoverlaysink enable-last-sample=false
 </pre>
-===== In-place:False =====
+====== In-place:False ======
 * '''''Glass to Glass latency = 230.3850304 ms'''''
 Test pipeline:
@@ Line 41: / Line 40: @@
 gst-launch-1.0 nvcamerasrc queue-size=10 sensor-id=1 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=NV12,framerate=60/1" ! nvvidconv ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! cudafilter in-place=false location=/home/nvidia/gst-cuda/tests/examples/cudafilter_algorithms/gray-scale-filter/gray-scale-filter.so ! perf print-arm-load=true ! nvoverlaysink enable-last-sample=false
 </pre>
-==== Unified Memory Allocator ====
+===== Unified Memory Allocator =====
-===== In-place:True =====
+====== In-place:True ======
 * '''''Glass to Glass latency = 188.1192285 ms'''''
 Test pipeline:
@@ Line 48: / Line 47: @@
 gst-launch-1.0 nvcamerasrc queue-size=10 sensor-id=1 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! nvvidconv ! "video/x-raw,width=3840,height=2160,format=I420,framerate=60/1" ! cudafilter in-place=true location=/home/nvidia/gst-cuda/tests/examples/cudafilter_algorithms/gray-scale-filter/gray-scale-filter.so ! perf print-arm-load=true ! nvoverlaysink enable-last-sample=false
 </pre>
-===== In-place:False =====
+====== In-place:False ======
 * '''''Glass to Glass latency = 306.2578894 ms'''''
 Test pipeline:
@@ Line 56: / Line 55: @@
-=== Cudamux ===
+==== Cudamux ====
-==== NVMM Direct Handling ====
+===== NVMM Direct Handling =====
-===== In-place:True =====
+====== In-place:True ======
 * '''''Glass to Glass latency = 145.5713375 ms'''''
 Test pipeline:
@@ Line 64: / Line 63: @@
 gst-launch-1.0 -v cudamux name=cuda in-place=true location=/home/nvidia/gst-cuda/tests/examples/cudamux_algorithms/mixer/mixer.so nvcamerasrc queue-size=10 sensor-id=1 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=NV12,framerate=60/1" ! nvvidconv ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! queue max-size-buffers=3 leaky=2 ! cuda.sink_0 nvcamerasrc queue-size=10 sensor-id=2 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=NV12,framerate=60/1" ! nvvidconv ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! queue max-size-buffers=3 leaky=2 ! cuda.sink_1 cuda. ! perf print-arm-load=true ! nvoverlaysink enable-last-sample=false
 </pre>
-===== In-place:False =====
+====== In-place:False ======
 * '''''Glass to Glass latency = 332.9231919 ms'''''
 Test pipeline:
@@ Line 70: / Line 69: @@
 gst-launch-1.0 -v cudamux name=cuda in-place=false location=/home/nvidia/gst-cuda/tests/examples/cudamux_algorithms/mixer/mixer.so nvcamerasrc queue-size=10 sensor-id=1 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=NV12,framerate=60/1" ! nvvidconv ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! queue max-size-buffers=3 leaky=2 ! cuda.sink_0 nvcamerasrc queue-size=10 sensor-id=2 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=NV12,framerate=60/1" ! nvvidconv ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! queue max-size-buffers=3 leaky=2 ! cuda.sink_1 cuda. ! perf print-arm-load=true ! nvoverlaysink enable-last-sample=false
 </pre>
-==== Unified Memory Allocator ====
+===== Unified Memory Allocator =====
-===== In-place:True =====
+====== In-place:True ======
 * '''''Glass to Glass latency = 136.4211149 ms'''''
 Test pipeline:
@@ Line 77: / Line 76: @@
 gst-launch-1.0 -v cudamux name=cuda in-place=true location=/home/nvidia/gst-cuda/tests/examples/cudamux_algorithms/mixer/mixer.so nvcamerasrc queue-size=10 sensor-id=1 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! nvvidconv ! "video/x-raw,width=3840,height=2160,format=I420,framerate=60/1" ! queue max-size-buffers=3 leaky=2 ! cuda.sink_0 nvcamerasrc queue-size=10 sensor-id=2 fpsRange='60 60' ! "video/x-raw(memory:NVMM),width=3840,height=2160,format=I420,framerate=60/1" ! nvvidconv ! "video/x-raw,width=3840,height=2160,format=I420,framerate=60/1" ! queue max-size-buffers=3 leaky=2 ! cuda.sink_1 cuda. ! perf print-arm-load=true ! nvoverlaysink enable-last-sample=false
 </pre>
-===== In-place:False =====
+====== In-place:False ======
 * '''''Glass to Glass latency = 197.1957698 ms'''''
 Test pipeline: