NVIDIA Xavier - JetPack 5.0.2 - Components - Cuda
CUDA is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). With CUDA developers are able to dramatically speed up computing applications by harnessing the power of GPUs.
Device Capabilities
Capability | Value |
---|---|
CUDA Driver Version / Runtime Version | 10.0 / 10.0 |
CUDA Capability Major/Minor version number | 7.2 |
Total amount of global memory | 15819 MBytes (16587702272 bytes) |
( 8) Multiprocessors, ( 64) CUDA Cores/MP | 512 CUDA Cores |
GPU Max Clock rate | 1500 MHz (1.50 GHz) |
Memory Clock rate | 1500 Mhz |
Memory Bus Width | 256-bit |
L2 Cache Size | 524288 bytes |
Maximum Texture Dimension Size (x,y,z) | 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) |
Maximum Layered 1D Texture Size, (num) layers | 1D=(32768), 2048 layers |
Maximum Layered 2D Texture Size, (num) layers | 2D=(32768, 32768), 2048 layers |
Total amount of constant memory | 65536 bytes |
Total amount of shared memory per block | 49152 bytes |
Total number of registers available per block | 65536 |
Warp size | 32 |
Maximum number of threads per multiprocessor | 2048 |
Maximum number of threads per block | 1024 |
Max dimension size of a thread block (x,y,z) | (1024, 1024, 64) |
Max dimension size of a grid size (x,y,z) | (2147483647, 65535, 65535) |
Max dimension size of a grid size (x,y,z) | (2147483647, 65535, 65535) |
Maximum memory pitch | 2147483647 bytes |
Texture alignment | 512 bytes |
Concurrent copy and kernel execution | Yes with 1 copy engine(s) |
Run time limit on kernels | No |
Integrated GPU sharing Host Memory | Yes |
Support host page-locked memory mapping | Yes |
Alignment requirement for Surfaces | Yes |
Device has ECC support | Disabled |
Device supports Unified Addressing (UVA) | Yes |
Device supports Compute Preemption | Yes |
Supports Cooperative Kernel Launch | Yes |
Supports MultiDevice Co-op Kernel Launch | Yes |
Device PCI Domain ID / Bus ID / location ID | 0 / 0 / 0 |
Compute Mode | Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) |
CUDA Driver | CUDART |
Build Device Query Tool
We obtained the properties of the CUDA devices present in the system using the deviceQuery example. In order to run it follow these steps:
1. Go to the sample's directory
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
2. Construct the sample using the makefile
sudo make
3. Run the Binary
./deviceQuery
Samples
For a complete summary of the samples go to: https://developer.ridgerun.com/wiki/index.php?title=Xavier/GPU/Software_Support/CUDA