Before anything else, here is my test machine and CUDA version:
Win7 64-bit with 6GB of RAM.
CUDA Toolkit 4.0, also 64-bit.
Now my problem:
I have a pretty big kernel to compile, and the compilation stops at be.exe.
After some investigation I found that be.exe stops with an out-of-memory error once it starts using more than 2GB of RAM, and that makes sense, because be.exe is a 32-bit application (even though the toolkit is the 64-bit version).
Hoping that my kernel compile wouldn't exceed 3GB and that be.exe would be compatible with the /LARGEADDRESSAWARE flag, I set that flag on be.exe.
But alas... The compilation quickly consumed the 3GB and the same error occurred...
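For anyone who wants to verify that the flag really took effect, here is a minimal sketch of how the large-address-aware bit can be read (and set) in a PE header. It assumes the standard PE layout: the 4-byte offset of the "PE\0\0" signature is stored at 0x3C, and the 16-bit Characteristics field sits 18 bytes into the COFF header that follows the signature. The function names are my own, and the file path in the usage note is hypothetical. The usual tool for this is editbin /LARGEADDRESSAWARE from Visual Studio; this is only an illustration of what that flag changes.

```python
import struct

# Bit in the COFF Characteristics field that marks an image as
# able to handle addresses above 2 GB.
IMAGE_FILE_LARGE_ADDRESS_AWARE = 0x0020


def characteristics_offset(image: bytes) -> int:
    """Return the byte offset of the COFF Characteristics field."""
    if image[:2] != b"MZ":
        raise ValueError("not an MZ executable")
    # e_lfanew: offset of the PE signature, stored at 0x3C.
    (pe_offset,) = struct.unpack_from("<I", image, 0x3C)
    if image[pe_offset:pe_offset + 4] != b"PE\0\0":
        raise ValueError("PE signature not found")
    # 4 bytes of signature, then Characteristics is at +18 in the COFF header.
    return pe_offset + 4 + 18


def is_large_address_aware(image: bytes) -> bool:
    """True if the large-address-aware bit is set."""
    (flags,) = struct.unpack_from("<H", image, characteristics_offset(image))
    return bool(flags & IMAGE_FILE_LARGE_ADDRESS_AWARE)


def set_large_address_aware(image: bytes) -> bytes:
    """Return a copy of the image with the large-address-aware bit set."""
    offset = characteristics_offset(image)
    (flags,) = struct.unpack_from("<H", image, offset)
    patched = bytearray(image)
    struct.pack_into("<H", patched, offset, flags | IMAGE_FILE_LARGE_ADDRESS_AWARE)
    return bytes(patched)
```

Usage would be something like reading the executable with open(path, "rb").read(), where path is wherever be.exe lives in your toolkit install (work on a backup copy), and passing the bytes to is_large_address_aware. As far as I know, on 64-bit Windows a 32-bit process with this bit set can address up to 4GB, so the flag only raises the ceiling; it does not remove it.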
Now I am stuck... The only chance is to have be.exe compiled as a 64-bit executable, so that it would be able to use all the available RAM (unfortunately, cutting stuff out of the kernel is not a possibility)...
Or is there any workaround that I am missing?
Best regards to all!
I need to compile sm_1x code, so the new LLVM-based compiler won't be used...