Hi,
I recently installed a Tesla C2075 on Ubuntu 10.04. The driver and runtime are both CUDA 4.1. The SDK compiles without errors. deviceQuery returns:
[deviceQuery] starting…
bin/linux/release/deviceQuery Starting…
CUDA Device Query (Runtime API) version (CUDART static linking)
Found 1 CUDA Capable device(s)
Device 0: "Tesla C2075"
CUDA Driver Version / Runtime Version 4.1 / 4.1
CUDA Capability Major/Minor version number: 2.0
Total amount of global memory: 5375 MBytes (5636554752 bytes)
(14) Multiprocessors x (32) CUDA Cores/MP: 448 CUDA Cores
GPU Clock Speed: 1.15 GHz
Memory Clock rate: 1566.00 Mhz
Memory Bus Width: 384-bit
L2 Cache Size: 786432 bytes
Max Texture Dimension Size (x,y,z) 1D=(65536), 2D=(65536,65535), 3D=(2048,2048,2048)
Max Layered Texture Size (dim) x layers 1D=(16384) x 2048, 2D=(16384,16384) x 2048
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 32768
Warp size: 32
Maximum number of threads per block: 1024
Maximum sizes of each dimension of a block: 1024 x 1024 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 65535
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and execution: Yes with 2 copy engine(s)
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Concurrent kernel execution: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support enabled: Yes
Device is using TCC driver mode: No
Device supports Unified Addressing (UVA): Yes
Device PCI Bus ID / PCI location ID: 6 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 4.1, CUDA Runtime Version = 4.1, NumDevs = 1, Device = Tesla C2075
[deviceQuery] test results…
PASSED
exiting in 3 seconds: 3…2…1…done!
So far, no problem. I can also see the card using nvidia-smi:
Tue Apr 3 16:25:39 2012
+------------------------------------------------------+
| NVIDIA-SMI 2.285.05   Driver Version: 285.05.33      |
|-------------------------------+----------------------+----------------------+
| Nb.  Name                     | Bus Id        Disp.  | Volatile ECC SB / DB |
| Fan   Temp   Power Usage /Cap | Memory Usage         | GPU Util. Compute M. |
|===============================+======================+======================|
| 0.  Tesla C2075               | 0000:06:00.0  Off    |         0          0 |
| 30%    52 C  P0    80W / 225W |   0%   10MB / 5375MB |   99%     Default    |
|-------------------------------+----------------------+----------------------|
| Compute processes:                                               GPU Memory |
|  GPU  PID     Process name                                       Usage      |
|=============================================================================|
|  No running compute processes found                                         |
+-----------------------------------------------------------------------------+
Now, if I try to run an application, say vectorAdd, nvidia-smi shows this:
Tue Apr 3 16:26:29 2012
+------------------------------------------------------+
| NVIDIA-SMI 2.285.05   Driver Version: 285.05.33      |
|-------------------------------+----------------------+----------------------+
| Nb.  Name                     | Bus Id        Disp.  | Volatile ECC SB / DB |
| Fan   Temp   Power Usage /Cap | Memory Usage         | GPU Util. Compute M. |
|===============================+======================+======================|
| 0.  Tesla C2075               | 0000:06:00.0  Off    |         0          0 |
| 30%    52 C  P12   32W / 225W |   1%   59MB / 5375MB |    0%     Default    |
|-------------------------------+----------------------+----------------------|
| Compute processes:                                               GPU Memory |
|  GPU  PID     Process name                                       Usage      |
|=============================================================================|
|  0.  9361  …A_GPU_Computing_SDK/C/bin/linux/release/vectorAdd    47MB       |
+-----------------------------------------------------------------------------+
Nothing seems to happen on the card: GPU Util. (utilisation, I assume) is stuck at 0% and the program just hangs, with no error message or any other output. The only thing I can see is that one of the CPU cores sits at 100%.
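To narrow down which call blocks, a stripped-down test along the following lines might help. This is only a minimal sketch and not code from the SDK: the `CHECK` macro and the `addOne` kernel are made-up names for illustration, and it assumes the card is device 0. It prints each step before running it, so the last "step:" line shown before the hang points at the blocking call.

```cuda
// Minimal sketch: run each CUDA runtime step separately and report its status.
// The last "step:" line printed before a hang identifies the blocking call.
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper (not part of the SDK): announce the step, run the call,
// and exit with the runtime's error string if the call did not succeed.
#define CHECK(step, call)                                             \
    do {                                                              \
        fprintf(stderr, "step: %s\n", step);                          \
        fflush(stderr);                                               \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "%s failed: %s\n", step,                  \
                    cudaGetErrorString(err_));                        \
            exit(EXIT_FAILURE);                                       \
        }                                                             \
    } while (0)

// Trivial kernel (illustrative): add 1 to every element.
__global__ void addOne(int *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1;
}

int main()
{
    const int n = 256;
    int host[n];
    for (int i = 0; i < n; ++i) host[i] = i;

    int count = 0;
    CHECK("cudaGetDeviceCount", cudaGetDeviceCount(&count));
    fprintf(stderr, "devices found: %d\n", count);
    CHECK("cudaSetDevice", cudaSetDevice(0));  // assumption: the Tesla is device 0

    int *dev = NULL;
    CHECK("cudaMalloc", cudaMalloc((void **)&dev, n * sizeof(int)));
    CHECK("cudaMemcpy host->device",
          cudaMemcpy(dev, host, n * sizeof(int), cudaMemcpyHostToDevice));

    fprintf(stderr, "step: kernel launch\n");
    fflush(stderr);
    addOne<<<(n + 127) / 128, 128>>>(dev, n);
    CHECK("cudaGetLastError", cudaGetLastError());          // launch errors
    CHECK("cudaDeviceSynchronize", cudaDeviceSynchronize()); // execution errors

    CHECK("cudaMemcpy device->host",
          cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost));
    CHECK("cudaFree", cudaFree(dev));

    // Verify the result on the host.
    for (int i = 0; i < n; ++i) {
        if (host[i] != i + 1) {
            fprintf(stderr, "wrong result at index %d\n", i);
            return EXIT_FAILURE;
        }
    }
    printf("all steps completed, result correct\n");
    return 0;
}
```

Saved as, say, `stepcheck.cu` (a hypothetical file name), it should build with `nvcc -o stepcheck stepcheck.cu`. If it stalls at the very first step, the problem is in context creation rather than in vectorAdd itself.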
Any ideas what might be wrong?
Thanks a lot, MW