Difference between revisions of "CUDA ISP for NVIDIA Jetson/Performance"

From RidgeRun Developer Connection
Jump to: navigation, search
(GStreamer elements performance)
(Blanked the page)
(Tag: Blanking)
 
Line 1: Line 1:
<noinclude>
 
{{CUDA ISP for NVIDIA Jetson/Head|previous=|next=|metakeywords=|metadescription=}}
 
</noinclude>
 
  
{{DISPLAYTITLE:CUDA ISP for NVIDIA Jetson: Performance|noerror}}
 
 
= Library API performance =
 
 
To measure the CUDA ISP API performance, we built a simple example that iterates over the apply methods and records performance metrics for each iteration. We recorded the duration of each apply method, the CPU and GPU usage during the application of the code, and the CPU RAM and GPU RAM usage. We recorded the performance on a Jetson Nano, Jetson Xavier NX, Jetson Xavier AGX, and Jetson Orin. We recorded the performance statistics over 3 buffer sizes:
 
* A minimum 2x2 case, to test the maximum speeds that the apply methods could achieve
 
* A medium 1920x1080 case, to illustrate the changes in performance as the buffer size increases
 
* A maximum 3840x2160 case, to test performance on large buffers
 
 
== Jetson Nano ==
 
 
=== Procesing Time ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Procesing time (In microseconds, averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| cudashift || 136 || 135 || 147
 
|-
 
| cudadebayer || 68 || 53 || 55
 
|-
 
| cudawhitebalancer || 317 || 5071 || 18903
 
|-
 
| cudacolorspaceconverter || 55 || 55 || 57
 
|-
 
|}
 
</center>
 
 
=== CPU and CPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| CPU usage (%)|| 0.797500 || 0.836478 || 0.819940
 
|-
 
| CPU RAM usage (kB) || 147071 || 146295 || 147580
 
|-
 
|}
 
</center>
 
 
=== GPU and GPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| GPU usage (%)|| 0.0 || 25.12 || 94.6
 
|-
 
| GPU RAM usage (kB) || 91967 || 91733 || 116833
 
|-
 
|}
 
</center>
 
 
== Jetson Xavier NX ==
 
 
=== Procesing Time ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Procesing time (In microseconds, averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| cudashift || 93 || 93 || 93
 
|-
 
| cudadebayer || 39 || 39 || 31
 
|-
 
| cudawhitebalancer || 375 || 1360 || 4249
 
|-
 
| cudacolorspaceconverter || 33 || 35 || 34
 
|-
 
|}
 
</center>
 
 
=== CPU and CPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| CPU usage (%)|| 0.482488 || 0.523657 || 0.477216
 
|-
 
| CPU RAM usage (kB) || 171679 || 173539 || 171987
 
|-
 
|}
 
</center>
 
 
=== GPU and GPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| GPU usage (%)|| 0.85 || 5.48 || 17.91
 
|-
 
| GPU RAM usage (kB) || 98719 || 100387 || 106288
 
|-
 
|}
 
</center>
 
 
== Jetson Xavier AGX ==
 
 
=== Procesing Time ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Procesing time (In microseconds, averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| cudashift || 129 || 135 || 131
 
|-
 
| cudadebayer || 54 || 48 || 39
 
|-
 
| cudawhitebalancer || 667 || 4844 || 8091
 
|-
 
| cudacolorspaceconverter || 38 || 45 || 52
 
|-
 
|}
 
</center>
 
 
=== CPU and CPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| CPU usage (%)|| 0.409836 || 0.491435 || 0.458062
 
|-
 
| CPU RAM usage (kB) || 172066 || 173613 || 173477
 
|-
 
|}
 
</center>
 
 
=== GPU and GPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| GPU usage (%)|| || ||
 
|-
 
| GPU RAM usage (kB) || 101984 || 105247 || 107641
 
|-
 
|}
 
</center>
 
 
== Jetson Orin ==
 
 
=== Procesing Time ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Procesing time (In microseconds, averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| cudashift || || ||
 
|-
 
| cudadebayer || || ||
 
|-
 
| cudawhitebalancer || || ||
 
|-
 
| cudacolorspaceconverter || || ||
 
|-
 
|}
 
</center>
 
 
=== CPU and CPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| CPU usage (%)|| || ||
 
|-
 
| CPU RAM usage (kB) || || ||
 
|-
 
|}
 
</center>
 
 
=== GPU and GPU RAM usage ===
 
<center>
 
{| class="wikitable"
 
|-
 
! Measurement (Averaged over 100 iterations) !! 2x2 Buffers !! 1080p Buffers !! 4K Buffers
 
|-
 
| GPU usage (%)|| || ||
 
|-
 
| GPU RAM usage (kB) || || ||
 
|-
 
|}
 
</center>
 
 
 
<noinclude>
 
{{CUDA ISP for NVIDIA Jetson/Foot||}}
 
</noinclude>
 

Latest revision as of 08:17, 28 March 2023