Warps and blocks?
Warps
Blocks


Usage of Shared Memory




How does the GPU execute code?




It is always good choice to have pairs of independent instructions.

What if the threads of a warp try to do different things?

Compilation process and GPU assembly language


