No, because the size must be known at compile time.
What you can do is use the dynamic shared memory allocation feature. Declare a_d as follows:
[font="Courier New"]extern __shared__ float a_d[];[/font]
and pass the required size in bytes as the third configuration parameter of the kernel invocation:
[font="Courier New"]my_kernel<<<gridsize, blocksize, blocksize.x*blocksize.y*blocksize.z*sizeof(float)>>>();[/font]
Note that this only works for a single variable-size array.
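To make that concrete, here is a minimal sketch of the pattern (the kernel name reverse_kernel, the array size n, and the reversal operation are just illustrative assumptions, not anything from the original question):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Size is NOT given here; it comes from the third launch parameter.
extern __shared__ float a_d[];

// Illustrative kernel: reverse one block's worth of floats
// by staging them in dynamically allocated shared memory.
__global__ void reverse_kernel(float *data, int n)
{
    int t = threadIdx.x;
    a_d[t] = data[t];          // stage input in shared memory
    __syncthreads();           // wait until all elements are staged
    data[t] = a_d[n - 1 - t];  // write back in reversed order
}

int main()
{
    const int n = 64;
    float h[n], *d;
    for (int i = 0; i < n; ++i) h[i] = (float)i;

    cudaMalloc(&d, n * sizeof(float));
    cudaMemcpy(d, h, n * sizeof(float), cudaMemcpyHostToDevice);

    // Third configuration parameter = bytes of dynamic shared memory.
    reverse_kernel<<<1, n, n * sizeof(float)>>>(d, n);

    cudaMemcpy(h, d, n * sizeof(float), cudaMemcpyDeviceToHost);
    printf("h[0] = %f, h[%d] = %f\n", h[0], n - 1, h[n - 1]);
    cudaFree(d);
    return 0;
}
```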
Hmmmm… continuation of the equation. Something is still not right.
So, I set in my main code:

dim3 dimGrid2(129,129), dimBlock2(1,1,33);
Kernel<<<dimGrid2, dimBlock2, 33*sizeof(REAL)>>>
…
__global__ void Kernel(REAL *COEF, REAL *P, REAL *PN, REAL *RHS,
                       int imax, int jmax, int kmax, REAL *cost)
{
    extern __shared__ REAL COEFs0[];
    extern __shared__ REAL COEFs1[];
    extern __shared__ REAL COEFs2[];
}
so that I can dynamically allocate arrays of the same size in the kernel.
Oh. I didn’t find the third parameter’s usage in the book “Programming Massively Parallel Processors”, but I looked it up again in NVIDIA’s documentation and now I know there is even a fourth parameter you can set… Thanks very much!
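One caution about the three-array snippet above: all extern __shared__ declarations in a kernel alias the same starting address, so COEFs0, COEFs1, and COEFs2 would overlap. The usual workaround is to request one buffer big enough for all three and carve it up with pointer offsets. A sketch under the assumptions of the code above (three arrays of 33 REALs each; the name smem and the kernel signature are illustrative):

```cuda
typedef float REAL;

// One extern array per kernel; its size comes from the launch.
extern __shared__ REAL smem[];

__global__ void Kernel(int n)  // n = 33 in the example above
{
    // Partition the single buffer into three non-overlapping views.
    REAL *COEFs0 = smem;          // elements [0,   n)
    REAL *COEFs1 = smem + n;      // elements [n,  2n)
    REAL *COEFs2 = smem + 2 * n;  // elements [2n, 3n)

    // ... use COEFs0/COEFs1/COEFs2 as independent arrays ...
    (void)COEFs0; (void)COEFs1; (void)COEFs2;
}

// Launch must then reserve room for all three arrays:
// Kernel<<<dimGrid2, dimBlock2, 3 * 33 * sizeof(REAL)>>>(33);
```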