CUDA-quicksort: An improved GPU-based implementation of quicksort