Copies data from the device to the host.
The buffer that the data from the CUDA device will be written to.
See Implementation
Copies data from the device to the host.