Arm DDT with GPUs

Posted Sep 8, 2022 Updated Sep 15, 2022

By Robert Caddy

1 min read

Debugging GPU Codes with Arm DDT

See the Arm Forge User Guide for full details; this is just a place for some notes and tips.

Use -g -O0 to compile host code and -g -G -cudart shared to compile device code. The -g and -G flags enable host and device debugging symbols, respectively, and -cudart shared links the CUDA runtime as a shared library, which enables device memory debugging.
A device pointer's contents can be accessed by prepending @global to its type. A permanent version of this can be set up with expressions:
- ((@global TYPE *)VARIABLE_NAME) to get the proper pointer
- ((@global TYPE *)VARIABLE_NAME)[IDX] to get the value at IDX
- ((@global TYPE *)VARIABLE_NAME)[IDX]@N to get N values starting at IDX
Expressions panel: can also be used for any other debugging or math expression you want.
Array Viewer: Any expression can go in the brackets and be displayed in 2D. The correct indexing scheme for Cholla is xid + yid*nx + zid*nx*ny + field*n_cells.
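
To make the indexing scheme above concrete, here is a minimal C++ sketch of the same formula. The helper name flat_index is hypothetical and is not taken from Cholla's source, and it assumes n_cells = nx*ny*nz, with nx, ny and nz being the grid dimensions of the array as stored.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not from Cholla's source): flattens a
// (xid, yid, zid, field) coordinate into a 1D offset using
// xid + yid*nx + zid*nx*ny + field*n_cells.
// Assumption: n_cells = nx * ny * nz.
inline std::size_t flat_index(std::size_t xid, std::size_t yid, std::size_t zid,
                              std::size_t field, std::size_t nx, std::size_t ny,
                              std::size_t nz)
{
  // Coordinates must lie inside the grid for the offset to be meaningful.
  assert(xid < nx && yid < ny && zid < nz);
  const std::size_t n_cells = nx * ny * nz;
  return xid + yid * nx + zid * nx * ny + field * n_cells;
}
```

Under these assumptions, x is the fastest-varying index and each field occupies a contiguous block of n_cells values, so the same arithmetic can be typed by hand into an expression or into the Array Viewer brackets.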

For profiling, run perf-report mpirun -n 4 EXECUTABLE ARGS, or perf-report map-file.map on an existing MAP file. Either one generates an HTML and a text summary of the profile.

This post is licensed under CC BY 4.0 by the author.