I saw the other post like this, but lowering the memory range increments doesn't help in my case.
I ran deviceQuery and my card seems fine (output below). When I then run bandwidthTest I get the out-of-memory error shown below. I've tried various ranges with no joy. Any idea why this doesn't work? The other post also mentioned that on their card you could use memory mapping instead, which was faster, to get around it. Can I do that with my card? The drivers, SDK, etc. all installed fine and everything seemed to compile OK.
Many thanks.
Russ
./bandwidthTest --mode=range --start=1024 --end=102400 --increment=1024
[bandwidthTest] starting…
./bandwidthTest Starting…
Running on…
Device 0: GeForce 8600M GT
Range Mode
bandwidthTest.cu(761) : CUDA Runtime API error 2: out of memory.
[deviceQuery] starting…
./deviceQuery Starting…
CUDA Device Query (Runtime API) version (CUDART static linking)
Found 1 CUDA Capable device(s)
Device 0: “GeForce 8600M GT”
CUDA Driver Version / Runtime Version 4.1 / 4.1
CUDA Capability Major/Minor version number: 1.1
Total amount of global memory: 256 MBytes (268238848 bytes)
( 4) Multiprocessors x ( 8) CUDA Cores/MP: 32 CUDA Cores
GPU Clock Speed: 1.04 GHz
Memory Clock rate: 650.00 Mhz
Memory Bus Width: 128-bit
Max Texture Dimension Size (x,y,z) 1D=(8192), 2D=(65536,32768), 3D=(2048,2048,2048)
Max Layered Texture Size (dim) x layers 1D=(8192) x 512, 2D=(8192,8192) x 512
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 16384 bytes
Total number of registers available per block: 8192
Warp size: 32
Maximum number of threads per block: 512
Maximum sizes of each dimension of a block: 512 x 512 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 1
Maximum memory pitch: 2147483647 bytes
Texture alignment: 256 bytes
Concurrent copy and execution: Yes with 1 copy engine(s)
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Concurrent kernel execution: No
Alignment requirement for Surfaces: Yes
Device has ECC support enabled: No
Device is using TCC driver mode: No
Device supports Unified Addressing (UVA): No
Device PCI Bus ID / PCI location ID: 1 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 4.1, CUDA Runtime Version = 4.1, NumDevs = 1, Device = GeForce 8600M GT
[deviceQuery] test results…
PASSED
exiting in 3 seconds: 3…2…1…done!
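For reference, here is a rough sanity check of the numbers above, as a small Python sketch. It assumes the --start, --end and --increment values are transfer sizes in bytes and that the sweep includes both end points; I have not confirmed either against the bandwidthTest source. The function name range_sizes is made up for illustration. The memory figure is copied from the deviceQuery output.

```python
# Values copied from the bandwidthTest command line and the deviceQuery output.
START_BYTES = 1024
END_BYTES = 102400
INCREMENT_BYTES = 1024
GLOBAL_MEM_BYTES = 268238848  # "Total amount of global memory"


def range_sizes(start, end, increment):
    """Hypothetical helper: transfer sizes in bytes for an inclusive sweep.

    Assumption: range mode steps from start to end (inclusive) by increment.
    """
    return list(range(start, end + 1, increment))


sizes = range_sizes(START_BYTES, END_BYTES, INCREMENT_BYTES)
largest = max(sizes)
fraction = largest / GLOBAL_MEM_BYTES

print(f"number of transfer sizes: {len(sizes)}")
print(f"largest transfer: {largest} bytes ({largest // 1024} KiB)")
print(f"share of reported global memory: {fraction:.4%}")
```

If those assumptions hold, the largest requested transfer is a tiny fraction of the 256 MB the card reports, so the transfer sizes alone probably don't account for the out-of-memory error.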