I compiled the cuda samples. It seems to pass the test, but BLAS related are failing:
./matrixMul
[Matrix Multiply Using CUDA] - Starting...
GPU Device 0: "GeForce GTX 750 Ti" with compute capability 5.0
MatrixA(320,320), MatrixB(640,320)
Computing result using CUDA Kernel...
done
Performance= 183.40 GFlop/s, Time= 0.715 msec, Size= 131072000 Ops, WorkgroupSize= 1024 threads/block
Checking computed result for correctness: Result = PASS
NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
./matrixMulCUBLAS
./matrixMulCUBLAS: error while loading shared libraries: libcublas.so.7.0: cannot open shared object file: No such file or directory
I found libcublas.so.7.0 and related files in two different directories:
>ls -la /usr/local/cuda-7.0/lib64/libcublas*
lrwxrwxrwx 1 root root 16 Feb 18 12:01 /usr/local/cuda-7.0/lib64/libcublas.so -> libcublas.so.7.0
lrwxrwxrwx 1 root root 19 Feb 18 12:01 /usr/local/cuda-7.0/lib64/libcublas.so.7.0 -> libcublas.so.7.0.28
-rwxr-xr-x 1 root root 31160168 Feb 18 12:01 /usr/local/cuda-7.0/lib64/libcublas.so.7.0.28
-rw-r--r-- 1 root root 26032916 Feb 18 12:38 /usr/local/cuda-7.0/lib64/libcublas_device.a
-rw-r--r-- 1 root root 35269768 Feb 18 12:38 /usr/local/cuda-7.0/lib64/libcublas_static.a
> ls -la /usr/local/cuda/lib64/libcublas*
lrwxrwxrwx 1 root root 16 Feb 18 12:01 /usr/local/cuda/lib64/libcublas.so -> libcublas.so.7.0
lrwxrwxrwx 1 root root 19 Feb 18 12:01 /usr/local/cuda/lib64/libcublas.so.7.0 -> libcublas.so.7.0.28
-rwxr-xr-x 1 root root 31160168 Feb 18 12:01 /usr/local/cuda/lib64/libcublas.so.7.0.28
-rw-r--r-- 1 root root 26032916 Feb 18 12:38 /usr/local/cuda/lib64/libcublas_device.a
-rw-r--r-- 1 root root 35269768 Feb 18 12:38 /usr/local/cuda/lib64/libcublas_static.a
> echo $LD_LIBRARY_PATH
/usr/local/cuda-7.0/lib64
> ./matrixMulCUBLAS
./matrixMulCUBLAS: error while loading shared libraries: libcublas.so.7.0: cannot open shared object file: No such file or directory
You’ve pretty much stumped me. I still believe that your LD_LIBRARY_PATH variable is either not correct or not getting picked up properly (the ldd output suggests this), but I can’t see enough of what you’re doing to spot the error, and I can’t think of further questions to ask.
Another possibility is that the libcublas.so.7.0 file is simply the wrong one, e.g. for a different linux distribution or something like that. It seems unlikely to me, however. 32bit vs. 64bit doesn’t seem to be the issue.
An alternative thing you might try is to modify the runtime linker mechanism to incorporate the necessary path to search, without requiring LD_LIBRARY_PATH to be set.