CUDA programming model
API - Application Programming Interface
Global memory: allocate, copy, free
cudaError_t cudaMalloc(void **devPtr, size_t size)
cudaError_t cudaMemcpy(void *dst, const void *src, size_t count, enum cudaMemcpyKind kind)
// kind: cudaMemcpyHostToDevice, cudaMemcpyDeviceToHost
cudaError_t cudaMemset(void *devPtr, int value, size_t count)
cudaError_t cudaFree(void *devPtr)
Device memory allocation
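As a minimal sketch of how the allocation calls listed above fit together, the following allocates a buffer in global memory, zeroes it, and releases it. The helper name allocateAndFree and the buffer size are assumptions made for illustration; only cudaMalloc, cudaMemset, cudaFree, and cudaGetErrorString come from the CUDA runtime API.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper (not part of CUDA): allocate, zero, and free a device
// buffer of `bytes` bytes. Returns true only if every call succeeds.
bool allocateAndFree(size_t bytes) {
    float *d_data = nullptr;  // will hold a device address, not a host one

    // cudaMalloc writes the device address into d_data.
    cudaError_t err = cudaMalloc((void **)&d_data, bytes);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed: %s\n", cudaGetErrorString(err));
        return false;
    }

    // cudaMemset fills the buffer byte by byte; 0 gives all-zero floats.
    err = cudaMemset(d_data, 0, bytes);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMemset failed: %s\n", cudaGetErrorString(err));
        cudaFree(d_data);
        return false;
    }

    // Every successful cudaMalloc should be paired with a cudaFree.
    err = cudaFree(d_data);
    return err == cudaSuccess;
}

int main() {
    const size_t bytes = 1024 * sizeof(float);  // illustrative size
    printf("allocation %s\n", allocateAndFree(bytes) ? "succeeded" : "failed");
    return 0;
}
```

Note that the pointer returned by cudaMalloc refers to device memory and should not be dereferenced in host code.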
Host-Device data transfer
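A host-to-device-to-host round trip with cudaMemcpy could look like the sketch below. The helper names check and roundTrip and the array size are hypothetical; the copy direction is selected by the kind argument, as in the signature above.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: abort with a message if a CUDA call failed.
static void check(cudaError_t err, const char *what) {
    if (err != cudaSuccess) {
        fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
        exit(EXIT_FAILURE);
    }
}

// Hypothetical driver: copy an array to the device and back,
// then return the number of elements that changed on the way.
int roundTrip() {
    const int n = 8;  // illustrative element count
    const size_t bytes = n * sizeof(float);
    float h_in[n], h_out[n];
    for (int i = 0; i < n; ++i) {
        h_in[i] = (float)i;
        h_out[i] = -1.0f;
    }

    float *d_buf = nullptr;
    check(cudaMalloc((void **)&d_buf, bytes), "cudaMalloc");

    // Destination comes first, source second, as in memcpy.
    check(cudaMemcpy(d_buf, h_in, bytes, cudaMemcpyHostToDevice), "copy H2D");
    check(cudaMemcpy(h_out, d_buf, bytes, cudaMemcpyDeviceToHost), "copy D2H");
    check(cudaFree(d_buf), "cudaFree");

    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (h_out[i] != h_in[i]) ++mismatches;
    }
    return mismatches;
}

int main() {
    printf("mismatches after round trip: %d\n", roundTrip());
    return 0;
}
```

The count argument is in bytes, not elements, so it is usually computed as the element count times sizeof of the element type.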
CUDA function declarations
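The sketch below illustrates the three standard CUDA function qualifiers: __global__, __device__, and __host__. The function names square, addOne, and squarePlusOne are made up for this example, and the kernel is only declared here, not launched.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// __device__: runs on the device and is callable only from device code.
__device__ float square(float x) {
    return x * x;
}

// __host__ __device__: compiled for both sides, so it can be called
// from ordinary host code as well as from kernels.
__host__ __device__ float addOne(float x) {
    return x + 1.0f;
}

// __global__: a kernel. It runs on the device, is launched from host code,
// and must return void.
__global__ void squarePlusOne(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < n) {
        data[i] = addOne(square(data[i]));
    }
}

int main() {
    // Only the __host__ __device__ function can be called directly here.
    printf("addOne(2.0f) on the host = %f\n", addOne(2.0f));
    return 0;
}
```

A function with no qualifier is treated as host-only, which is equivalent to marking it __host__.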
Kernel execution
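A kernel is launched from the host with the <<<blocks, threadsPerBlock>>> execution configuration. The following vector-addition sketch shows one common pattern; the names vecAdd and runVectorAdd, the block size of 256, and the input values are assumptions chosen for illustration.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Each thread adds one pair of elements.
__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];  // guard: the last block may be partial
}

// Hypothetical driver: returns the number of wrong results, or -1 on error.
int runVectorAdd(int n) {
    const size_t bytes = n * sizeof(float);
    std::vector<float> h_a(n), h_b(n), h_c(n);
    for (int i = 0; i < n; ++i) {
        h_a[i] = (float)i;
        h_b[i] = 2.0f * i;
    }

    float *d_a = nullptr, *d_b = nullptr, *d_c = nullptr;
    bool ok = cudaMalloc((void **)&d_a, bytes) == cudaSuccess &&
              cudaMalloc((void **)&d_b, bytes) == cudaSuccess &&
              cudaMalloc((void **)&d_c, bytes) == cudaSuccess;

    if (ok) {
        ok = cudaMemcpy(d_a, h_a.data(), bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
             cudaMemcpy(d_b, h_b.data(), bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    }

    if (ok) {
        const int threadsPerBlock = 256;  // assumed block size
        const int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;  // round up
        vecAdd<<<blocks, threadsPerBlock>>>(d_a, d_b, d_c, n);

        // The launch is asynchronous: check the launch itself, then wait.
        ok = cudaGetLastError() == cudaSuccess &&
             cudaDeviceSynchronize() == cudaSuccess &&
             cudaMemcpy(h_c.data(), d_c, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }

    // cudaFree accepts a null pointer, so this is safe on every path.
    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
    if (!ok) return -1;

    int wrong = 0;
    for (int i = 0; i < n; ++i) {
        if (h_c[i] != 3.0f * i) ++wrong;
    }
    return wrong;
}

int main() {
    printf("wrong results: %d\n", runVectorAdd(1000));
    return 0;
}
```

Rounding the block count up and guarding with i < n is a common way to handle sizes that are not a multiple of the block size.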

Exercises