error while loading shared libraries: libcudart.so.4: cannot open shared object file: No such file o
Hi,

I am trying to execute MPI and CUDA code on a cluster. The code works fine on single machine but when I try to execute it on cluster I get error:

error while loading shared libraries: libcudart.so.4: cannot open shared object file: No such file or directory

I checked my PATH and LD_PATH and it looks ok. I have a .bashrc file which contains following entries -

export PATH=$PATH:/usr/local/lib/:/usr/local/lib/openmpi:/usr/local/cuda/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib:/usr/local/lib/openmpi/:/usr/local/cuda/lib/:

All the machines haves same installation of CUDA and OpenMPI.

Can anyone help me with this. This problem is really annoying.

Thanks.
Hi,



I am trying to execute MPI and CUDA code on a cluster. The code works fine on single machine but when I try to execute it on cluster I get error:



error while loading shared libraries: libcudart.so.4: cannot open shared object file: No such file or directory



I checked my PATH and LD_PATH and it looks ok. I have a .bashrc file which contains following entries -



export PATH=$PATH:/usr/local/lib/:/usr/local/lib/openmpi:/usr/local/cuda/bin

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib:/usr/local/lib/openmpi/:/usr/local/cuda/lib/:



All the machines haves same installation of CUDA and OpenMPI.



Can anyone help me with this. This problem is really annoying.



Thanks.

#1
Posted 05/01/2012 08:56 AM   
Hi,
Not sure if it's a typo, but there is a with space in your LD_LIBRARY_PATH definition... Better checking.
Hi,

Not sure if it's a typo, but there is a with space in your LD_LIBRARY_PATH definition... Better checking.

#2
Posted 05/01/2012 09:32 AM   
There is no space. its just a type :)
There is no space. its just a type :)

#3
Posted 05/01/2012 09:48 AM   
I have solved the issue.

Now I use mpirun -x LD_LIBRARY_PATH... to execute the job and everything works like a charm :)
I have solved the issue.



Now I use mpirun -x LD_LIBRARY_PATH... to execute the job and everything works like a charm :)

#4
Posted 05/01/2012 10:04 AM   
[quote name='Ron12' date='01 May 2012 - 04:04 AM' timestamp='1335866647' post='1402958']
I have solved the issue.

Now I use mpirun -x LD_LIBRARY_PATH... to execute the job and everything works like a charm :)
[/quote]

I tried running my MPI + CUDA code by using -x LD_LIBRARY_PATH, but it's still giving me [b]error while loading shared libraries: libcudart.so.3: cannot open shared object file: No such file or directory [/b].
I ran via terminal:[b] mpirun -machinefile machines -x /usr/local/cuda/lib64 -np 6 ./mpicuda[/b]

I have no idea what is wrong with it.

Thanks in advance.
[quote name='Ron12' date='01 May 2012 - 04:04 AM' timestamp='1335866647' post='1402958']

I have solved the issue.



Now I use mpirun -x LD_LIBRARY_PATH... to execute the job and everything works like a charm :)





I tried running my MPI + CUDA code by using -x LD_LIBRARY_PATH, but it's still giving me error while loading shared libraries: libcudart.so.3: cannot open shared object file: No such file or directory .

I ran via terminal: mpirun -machinefile machines -x /usr/local/cuda/lib64 -np 6 ./mpicuda



I have no idea what is wrong with it.



Thanks in advance.

#5
Posted 06/21/2012 04:08 PM   
[quote name='c0mrade' date='21 June 2012 - 10:08 AM' timestamp='1340294932' post='1424660']
I tried running my MPI + CUDA code by using -x LD_LIBRARY_PATH, but it's still giving me [b]error while loading shared libraries: libcudart.so.3: cannot open shared object file: No such file or directory [/b].
I ran via terminal:[b] mpirun -machinefile machines -x /usr/local/cuda/lib64 -np 6 ./mpicuda[/b]

I have no idea what is wrong with it.

Thanks in advance.
[/quote]

My bad, right now it is working fine. [b]mpirun -machinefile machines -x LD_LIBRARY_PATH -np 6 ./mpicuda[/b]
[quote name='c0mrade' date='21 June 2012 - 10:08 AM' timestamp='1340294932' post='1424660']

I tried running my MPI + CUDA code by using -x LD_LIBRARY_PATH, but it's still giving me error while loading shared libraries: libcudart.so.3: cannot open shared object file: No such file or directory .

I ran via terminal: mpirun -machinefile machines -x /usr/local/cuda/lib64 -np 6 ./mpicuda



I have no idea what is wrong with it.



Thanks in advance.





My bad, right now it is working fine. mpirun -machinefile machines -x LD_LIBRARY_PATH -np 6 ./mpicuda

#6
Posted 06/21/2012 08:44 PM   
Scroll To Top