Running an example to check multiple GPUs on multiple nodes using MPI or another approach

Hello friends, how are you?

I have finished my CUDA setup on a Rocks cluster. Each compute node has two GeForce 9500 GT cards, just for testing.

To exercise all the GPUs at once, I'd like to know whether an application or example is available. I think MPI is the best approach, because it's already available on Rocks.
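Something along these lines might serve as a starting point: a minimal sketch, assuming one MPI process per GPU and an MPI-3 library (needed for `MPI_Comm_split_type`). The file name, the `select_device` helper and the `fill` kernel are placeholders made up for illustration, not taken from any existing sample.

```cuda
// hello_multi_gpu.cu (hypothetical file name): one MPI process per GPU.
#include <mpi.h>
#include <cuda_runtime.h>
#include <cstdio>

// Hypothetical helper: map a node-local rank to a device index.
// Returns -1 when the node reports no usable GPU.
int select_device(int local_rank, int device_count) {
    return device_count > 0 ? local_rank % device_count : -1;
}

// Trivial kernel: every thread writes the given value into its slot.
__global__ void fill(int *out, int value) {
    out[threadIdx.x] = value;
}

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int world_rank = 0;
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

    // Assumption: MPI-3. Group the ranks that share a node so that each
    // one gets a node-local rank (0, 1, ...) to choose its GPU with.
    MPI_Comm node_comm;
    MPI_Comm_split_type(MPI_COMM_WORLD, MPI_COMM_TYPE_SHARED, 0,
                        MPI_INFO_NULL, &node_comm);
    int local_rank = 0;
    MPI_Comm_rank(node_comm, &local_rank);

    char host[MPI_MAX_PROCESSOR_NAME];
    int host_len = 0;
    MPI_Get_processor_name(host, &host_len);

    int device_count = 0;
    if (cudaGetDeviceCount(&device_count) != cudaSuccess) {
        device_count = 0;
    }

    int device = select_device(local_rank, device_count);
    if (device < 0 || cudaSetDevice(device) != cudaSuccess) {
        fprintf(stderr, "rank %d on %s: no usable GPU\n", world_rank, host);
        MPI_Abort(MPI_COMM_WORLD, 1);
    }

    // Run the kernel on the selected GPU and copy the result back.
    const int n = 32;
    int host_buf[n];
    int *dev_buf = NULL;
    cudaMalloc((void **)&dev_buf, n * sizeof(int));
    fill<<<1, n>>>(dev_buf, world_rank);
    cudaError_t err = cudaMemcpy(host_buf, dev_buf, n * sizeof(int),
                                 cudaMemcpyDeviceToHost);
    cudaFree(dev_buf);

    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, device);
    bool ok = (err == cudaSuccess) && (host_buf[n - 1] == world_rank);
    printf("rank %d on %s: device %d (%s) %s\n", world_rank, host, device,
           prop.name, ok ? "OK" : "FAILED");

    MPI_Comm_free(&node_comm);
    MPI_Finalize();
    return 0;
}
```

Presumably this would be built with nvcc using the MPI wrapper as the host compiler (for example `nvcc -ccbin mpicxx hello_multi_gpu.cu`) and launched through `mpirun` with two processes per node, though the exact commands depend on the MPI installation.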

Thanks.