The fastest sorting algorithm for CUDA I know of is radix sort.
It is included in the CUDA unbound library.
RE: Sort: implementation of sorting algorithms in C and CUDA
You are viewing a single comment's thread from:
Sort: implementation of sorting algorithms in C and CUDA