Synchronization

Shuffle functions
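
As a minimal sketch of what shuffle functions are typically used for, the example below sums the 32 values held by one warp without touching shared memory. It assumes CUDA 9 or later (for `__shfl_down_sync`) and a single full warp; the names `warp_reduce_sum`, `warp_sum_kernel`, and `warp_sum_demo` are made up for illustration.

```cuda
#include <cuda_runtime.h>

// Each step adds the value held by the lane `offset` positions higher.
// After log2(32) = 5 steps, lane 0 holds the sum of all 32 lanes.
__device__ int warp_reduce_sum(int val) {
    for (int offset = warpSize / 2; offset > 0; offset /= 2) {
        val += __shfl_down_sync(0xffffffffu, val, offset);
    }
    return val;
}

__global__ void warp_sum_kernel(const int *in, int *out) {
    int sum = warp_reduce_sum(in[threadIdx.x]);
    if (threadIdx.x == 0) *out = sum;  // only lane 0 has the full sum
}

// Hypothetical host helper: sums 1..32 on the device.
// Returns the sum, or -1 if a CUDA call fails.
int warp_sum_demo() {
    int h_in[32];
    for (int i = 0; i < 32; ++i) h_in[i] = i + 1;

    int *d_in = nullptr, *d_out = nullptr;
    int h_out = -1;
    if (cudaMalloc((void **)&d_in, sizeof(h_in)) != cudaSuccess) return -1;
    if (cudaMalloc((void **)&d_out, sizeof(int)) != cudaSuccess) {
        cudaFree(d_in);
        return -1;
    }
    cudaMemcpy(d_in, h_in, sizeof(h_in), cudaMemcpyHostToDevice);

    warp_sum_kernel<<<1, 32>>>(d_in, d_out);  // exactly one warp

    if (cudaMemcpy(&h_out, d_out, sizeof(int), cudaMemcpyDeviceToHost) != cudaSuccess) {
        h_out = -1;
    }
    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}
```

Calling `warp_sum_demo()` from `main` should return 528, the sum of 1 through 32.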

Synchronization of memory operations

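GA[i] = ...;      // write the data to global memory
__threadfence();  // order the write to GA[i] before any later write by this thread, device-wide
G = 10;           // written after the fence: signals that GA[i] is ready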
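
The same write, fence, signal ordering can be sketched as a complete kernel. This is only one possible illustration, not necessarily the usage the snippet above had in mind: each block writes a partial value, issues `__threadfence()`, then bumps a counter, and the last block to arrive sums all the partial values. The names `blocks_done`, `publish_then_sum`, and `fence_demo` are hypothetical, and the counter takes the place of the flag `G`.

```cuda
#include <cuda_runtime.h>

__device__ unsigned int blocks_done = 0;  // hypothetical arrival counter

__global__ void publish_then_sum(volatile int *partial, int *total) {
    if (threadIdx.x != 0) return;          // one thread per block is enough here

    partial[blockIdx.x] = blockIdx.x + 1;  // 1. write the data
    __threadfence();                       // 2. order the data write before the signal
    unsigned int ticket = atomicAdd(&blocks_done, 1u);  // 3. signal arrival

    if (ticket == gridDim.x - 1) {         // this block arrived last
        int sum = 0;
        // volatile reads bypass stale cached values of other blocks' writes
        for (unsigned int b = 0; b < gridDim.x; ++b) sum += partial[b];
        *total = sum;
        blocks_done = 0;                   // reset so the kernel can be launched again
    }
}

// Hypothetical host helper: runs 8 blocks and returns the combined total,
// or -1 if a CUDA call fails.
int fence_demo() {
    const int blocks = 8;
    int *d_partial = nullptr, *d_total = nullptr;
    int h_total = -1;
    if (cudaMalloc((void **)&d_partial, blocks * sizeof(int)) != cudaSuccess) return -1;
    if (cudaMalloc((void **)&d_total, sizeof(int)) != cudaSuccess) {
        cudaFree(d_partial);
        return -1;
    }

    publish_then_sum<<<blocks, 32>>>(d_partial, d_total);

    if (cudaMemcpy(&h_total, d_total, sizeof(int), cudaMemcpyDeviceToHost) != cudaSuccess) {
        h_total = -1;
    }
    cudaFree(d_partial);
    cudaFree(d_total);
    return h_total;
}
```

With 8 blocks writing 1 through 8, `fence_demo()` should return 36. Without the fence, the last block could in principle observe the counter update before another block's partial value.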

Memory access optimization

Caching

Partition camping

Transfer between host and GPU memory
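
One common way to speed up host-to-GPU transfers is to use page-locked (pinned) host memory, which also allows asynchronous copies on a stream. The sketch below assumes that is the technique of interest here. The helper name `pinned_roundtrip_ok` is invented for illustration; it copies a buffer to the device and back and checks that it arrives unchanged.

```cuda
#include <cuda_runtime.h>

// Hypothetical helper: round-trips n ints through device memory using
// pinned host buffers. Returns true if the data comes back unchanged.
bool pinned_roundtrip_ok(int n) {
    int *h_src = nullptr, *h_dst = nullptr, *d_buf = nullptr;
    cudaStream_t stream = nullptr;
    size_t bytes = (size_t)n * sizeof(int);

    // cudaMallocHost returns page-locked host memory, which the GPU can
    // read and write by DMA without an intermediate staging copy.
    bool ok = cudaMallocHost((void **)&h_src, bytes) == cudaSuccess
           && cudaMallocHost((void **)&h_dst, bytes) == cudaSuccess
           && cudaMalloc((void **)&d_buf, bytes) == cudaSuccess
           && cudaStreamCreate(&stream) == cudaSuccess;

    if (ok) {
        for (int i = 0; i < n; ++i) { h_src[i] = i; h_dst[i] = -1; }

        // Copies issued on the same stream execute in order.
        cudaMemcpyAsync(d_buf, h_src, bytes, cudaMemcpyHostToDevice, stream);
        cudaMemcpyAsync(h_dst, d_buf, bytes, cudaMemcpyDeviceToHost, stream);
        ok = cudaStreamSynchronize(stream) == cudaSuccess;

        for (int i = 0; ok && i < n; ++i) ok = (h_dst[i] == h_src[i]);
    }

    if (stream) cudaStreamDestroy(stream);
    if (d_buf) cudaFree(d_buf);
    if (h_dst) cudaFreeHost(h_dst);
    if (h_src) cudaFreeHost(h_src);
    return ok;
}
```

In a real program the asynchronous copies would overlap with kernels or copies on other streams; here they are issued back to back only to keep the example short.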