NVIDIA Jetson Xavier - Volta GPU
< Xavier | Processors
The Volta GPU is an architecture that powers NVIDIA high-performance computing. In this section, you will learn more about this architecture and its features. You will learn about software available to work with the Volta GPU.
- Description section is a simple description of the Volta GPU.
- CUDA section explain the samples of CUDA.
- OPENGL section explains a HelloWorld with OpenGL.
Device Capabilities
Capability | Value |
---|---|
CUDA Driver Version / Runtime Version | 10.0 / 10.0 |
CUDA Capability Major/Minor version number | 7.2 |
Total amount of global memory | 15819 MBytes (16587702272 bytes) |
( 8) Multiprocessors, ( 64) CUDA Cores/MP | 512 CUDA Cores |
GPU Max Clock rate | 1500 MHz (1.50 GHz) |
Memory Clock rate | 1500 Mhz |
Memory Bus Width | 256-bit |
L2 Cache Size | 524288 bytes |
Maximum Texture Dimension Size (x,y,z) | 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) |
Maximum Layered 1D Texture Size, (num) layers | 1D=(32768), 2048 layers |
Maximum Layered 2D Texture Size, (num) layers | 2D=(32768, 32768), 2048 layers |
Total amount of constant memory | 65536 bytes |
Total amount of shared memory per block | 49152 bytes |
Total number of registers available per block | 65536 |
Warp size | 32 |
Maximum number of threads per multiprocessor | 2048 |
Maximum number of threads per block | 1024 |
Max dimension size of a thread block (x,y,z) | (1024, 1024, 64) |
Max dimension size of a grid size (x,y,z) | (2147483647, 65535, 65535) |
Max dimension size of a grid size (x,y,z) | (2147483647, 65535, 65535) |
Maximum memory pitch | 2147483647 bytes |
Texture alignment | 512 bytes |
Concurrent copy and kernel execution | Yes with 1 copy engine(s) |
Run time limit on kernels | No |
Integrated GPU sharing Host Memory | Yes |
Support host page-locked memory mapping | Yes |
Alignment requirement for Surfaces | Yes |
Device has ECC support | Disabled |
Device supports Unified Addressing (UVA) | Yes |
Device supports Compute Preemption | Yes |
Supports Cooperative Kernel Launch | Yes |
Supports MultiDevice Co-op Kernel Launch | Yes |
Device PCI Domain ID / Bus ID / location ID | 0 / 0 / 0 |
Compute Mode | Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) |
CUDA Driver | CUDART |
Build Device Query Tool
We obtained the properties of the CUDA devices present in the system using the deviceQuery example. In order to run it follow these steps:
1. Go to the sample's directory
cd /usr/local/cuda/samples/1_Utilities/deviceQuery
2. Construct the sample using the makefile
sudo make
3. Run the Binary
./deviceQuery