Always check return codes of CUDA calls for errors. Do not use __syncthreads() in conditional code unless the condition is guaranteed to evaluate identically for all threads of each block. Run your program under cuda-memcheck to detect stray memory accesses. If your kernel dies for larger problem sizes, it might exceed the runtime limit and trigger the watchdog timer.
%.o : %.cu
$(NVCC) $(CUFLAGS) -c -o $@ $<
You must Log In to add a comment.
New Private Message
Follow Us On
Copyright © 2015 NVIDIA Corporation