Template:RidgeRun CUDA Optimisation Guide/TOC
Revision as of 14:18, 5 August 2022 by Spalli
RidgeRun CUDA Optimisation Guide
GPU Architecture
Execution process
Memory hierarchy
Communication & Concurrency
Optimisation Workflow
Tools
CUDA Memcheck
CUDA Profiler
Computational Budget Tool
Optimisation Recipes
Types of optimisations
Coarse optimisations
Workload offloading
Problem size
Communication overlapping
Correct memory access patterns
Inter-thread communication
Fine optimisations
Increase arithmetic intensity
Function approximation
Condition and loops replacement
Inlining
Common pitfalls when optimising
Examples
Contact Us