Difference between revisions of "CUDA ISP for NVIDIA Jetson/Performance/Library"

From RidgeRun Developer Connection
Jump to: navigation, search
Line 7: Line 7:
 
= Library API performance =
 
= Library API performance =
  
To measure the CUDA ISP API performance, we built a simple example that iterates over the apply methods and records performance metrics for each iteration. We recorded the duration of each apply method, the CPU and GPU usage during the application of the code, and the CPU RAM and GPU RAM usage. We recorded the performance on a Jetson Nano, Jetson Xavier NX, Jetson Xavier AGX, and Jetson Orin. We recorded the performance statistics over 3 buffer sizes:
+
To measure the CUDA ISP API performance, we built a simple example (not included in the production code) that iterates over the apply methods and records performance metrics for each iteration. We recorded the duration of each apply method separately. We then recorded the CPU and GPU usage, as well as the CPU RAM and GPU RAM usage for the complete processing pipeline running at 30 fps. We recorded the performance statistics for 1080p and 4K buffers. We recorded the performance on a Jetson Nano, Jetson Xavier NX, Jetson Xavier AGX, and Jetson Orin.
* A minimum 2x2 case, to test the maximum speeds that the apply methods could achieve
 
* A medium 1920x1080 case, to illustrate the changes in performance as the buffer size increases
 
* A maximum 3840x2160 case, to test performance on large buffers
 
  
 
<center>
 
<center>

Revision as of 12:39, 28 March 2023


  Index  






Library API performance

To measure the CUDA ISP API performance, we built a simple example (not included in the production code) that iterates over the apply methods and records performance metrics for each iteration. We recorded the duration of each apply method separately. We then recorded the CPU and GPU usage, as well as the CPU RAM and GPU RAM usage for the complete processing pipeline running at 30 fps. We recorded the performance statistics for 1080p and 4K buffers. We recorded the performance on a Jetson Nano, Jetson Xavier NX, Jetson Xavier AGX, and Jetson Orin.

Platform Jetson Orin Jetson Xavier AGX Jetson Xavier NX Jetson Nano
Buffer size 1080p 4K 1080p 4K 1080p 4K 1080p 4K
Processing time by algorithm (microseconds)
CudaShift 135 131 93 93 135 147
CudaDebayer 48 39 39 31 53 55
CudaWhiteBalancer 4844 8091 1360 4249 5071 18903
CudaColorSpaceConverter 45 52 35 34 55 57
Resource consumption profile
CPU usage (%) 0.491435 0.458062 0.523657 0.477216 0.836478 0.819940
CPU RAM (kB) 173613 173477 173539 171987 146295 147580
GPU usage (%) 5.48 17.91 25.12 94.6
GPU RAM (kB) 105247 107641 100387 106288 91733 116833



  Index