CUDA and OpenMP
Hi everybody,

I have two questions about using CUDA streams on a GTX 480 together with OpenMP.

1) Can I transfer data to and from the GPU from several OpenMP threads in parallel, using CUDA streams? Is it also possible to use OpenMP threads to launch concurrent kernels on different streams? If neither is allowed, is that because GPU/CPU communication can only go through thread 0?

2) I have a "square" SIMD algorithm, that is, a SIMD computation (SIMDB, B for Big) that calls another SIMD computation (SIMDS, S for Small). Is it better to use one big loop (the SIMDB) that launches the kernel performing SIMDS at each iteration, or to shorten that loop and launch concurrent kernels on different streams? I suspect it depends on the memory size, but I am not sure how.

Thank you for your response

#1
Posted 09/03/2010 04:27 AM   
This is the "CUDA Contests" forum. Please post this in the "General GPU discussion" or "CUDA programming and development" forum.

Ignorance Rules; Knowledge Liberates!

#3
Posted 09/28/2010 09:34 AM   