I’ve a Linux server with Fedora 14 64-bits and a GeForce GTX 680 (with 1536 CUDA cores) and other GPUs.
With the develop driver version 285.05.33, the system doesn’t recognize the GTX 680.
For this, I have needed install the driver version 295.40. With that, the system finally recognizes the GTX 680.
But the problem is that the “deviceQuery” example of the SDK doesn’t recognize the cuda cores of the GTX 680.
This is part of the output:
Device 3: “GeForce GTX 680”
CUDA Driver Version / Runtime Version 4.2 / 4.1
CUDA Capability Major/Minor version number: 3.0
Total amount of global memory: 2048 MBytes (2147287040 bytes) MapSMtoCores SM 3.0 is undefined (please update to the latest SDK)!
MapSMtoCores SM 3.0 is undefined (please update to the latest SDK)!
( 8) Multiprocessors x (-1) CUDA Cores/MP: -8 CUDA Cores
GPU Clock Speed: 0.71 GHz
Memory Clock rate: 3004.00 Mhz
Memory Bus Width: 256-bit
Is like if the Toolkit 4.1 or the SDK 4.1 don’t support the CUDA capability 3.0.
Is possible that this be the problem?
The CUDA API does not give a way for programs to ask how many CUDA cores per multiprocessor are present in a device. As a result, the deviceQuery example has to build in a lookup table to convert number of multiprocessors to number of CUDA cores. This has nothing to do with the toolkit or driver, so you can ignore the warning.
Note that a beta release of CUDA 4.2 was quietly announced on the forums:
You definitely want that toolkit, even though it won’t update the SDK examples.
It is a micromagnetic simulator, but lamentably for now it not works properly with the GeForce GTX 680. We think it is because the last version of the CUDPP library that we used for some calculations of the program maybe is not compatible with the CUDA Capability 3.0.
Until now we have made satisfactory tests with Teslas 1060, 2050, 2070, 2075, 2090 and GeForce GTX 580.
Exactly, is needed install the CUDPP 2.0 to run CUDA programs with the CUDA Toolkit 4.2 and CUDA Capability 3.0
But now, the problem is that the program runs more slowly with the new driver than with the old driver.
With the driver 270.41.19 is much faster than with the driver 285.05.33, 295.40, 295.41 and 295.49.