detector for bank conflict
Does anyone know a detector for bank conflict in shared memory?

Thanks in advance!
Does anyone know a detector for bank conflict in shared memory?



Thanks in advance!

#1
Posted 04/25/2012 12:52 PM   
The visual profiler?
The visual profiler?

Always check return codes of CUDA calls for errors. Do not use __syncthreads() in conditional code unless the condition is guaranteed to evaluate identically for all threads of each block. Run your program under cuda-memcheck to detect stray memory accesses. If your kernel dies for larger problem sizes, it might exceed the runtime limit and trigger the watchdog timer.

#2
Posted 04/25/2012 01:24 PM   
Sorry, in Linux.
I don't know whether nvvp can detect this kind of problem.
Sorry, in Linux.

I don't know whether nvvp can detect this kind of problem.

#3
Posted 04/25/2012 01:57 PM   
The visual profiler runs on Linux as well and can provide this data. On compute capability 1.x it cannot distinguish though whether warp serialization is due to bank conflicts or due to nonuniform constant memory reads.
The visual profiler runs on Linux as well and can provide this data. On compute capability 1.x it cannot distinguish though whether warp serialization is due to bank conflicts or due to nonuniform constant memory reads.

Always check return codes of CUDA calls for errors. Do not use __syncthreads() in conditional code unless the condition is guaranteed to evaluate identically for all threads of each block. Run your program under cuda-memcheck to detect stray memory accesses. If your kernel dies for larger problem sizes, it might exceed the runtime limit and trigger the watchdog timer.

#4
Posted 04/25/2012 10:46 PM   
[quote name='tera' date='25 April 2012 - 10:46 PM' timestamp='1335394015' post='1401023']
The visual profiler runs on Linux as well and can provide this data. On compute capability 1.x it cannot distinguish though whether warp serialization is due to bank conflicts or due to nonuniform constant memory reads.
[/quote]

Thank you! It works on Linux!
[quote name='tera' date='25 April 2012 - 10:46 PM' timestamp='1335394015' post='1401023']

The visual profiler runs on Linux as well and can provide this data. On compute capability 1.x it cannot distinguish though whether warp serialization is due to bank conflicts or due to nonuniform constant memory reads.





Thank you! It works on Linux!

#5
Posted 05/03/2012 02:57 PM   
Scroll To Top