CUB provides state-of-the-art, reusable software components for every layer
of the CUDA programming model:
* Parallel primitives
* Warp-wide "collective" primitives
* Block-wide "collective" primitives
* Device-wide primitives
* Utilities
* Fancy iterators
* Thread and thread block I/O
* PTX intrinsics
* Device, kernel, and storage management
Installed Size: 2.9 MB
Architectures: all