I am running an AWS EC2 instance with Volta V100,
running Ubuntu 16.04.
I installed Cuda 9.1 toolkit following all the instructions mentioned in:
$ lspci | grep -i nvidia
00:1e.0 3D controller: NVIDIA Corporation GV100 [Tesla V100 SXM2] (rev a1)
When I run ‘nvprof -h’, it core dumps,
giving me an error like:
$ nvprof -h
*** Error in `nvprof’: free(): invalid pointer: 0x0000000001190920 ***
======= Backtrace: =========
/lib/x86_64-linux-gnu/libc.so.6(+0x777e5)[0x7fadca6337e5]
/lib/x86_64-linux-gnu/libc.so.6(+0x8037a)[0x7fadca63c37a]
/lib/x86_64-linux-gnu/libc.so.6(cfree+0x4c)[0x7fadca64053c]
nvprof[0xa8bbfd]
nvprof[0xaa22e8]
nvprof[0xaa1523]
nvprof(_ZNSt6locale18_S_initialize_onceEv+0x32)[0xaa1226]
/lib/x86_64-linux-gnu/libpthread.so.0(+0xea99)[0x7fadcb0bba99]
nvprof[0xaa0ec5]
nvprof[0xaa127f]
nvprof[0xaa1008]
nvprof[0xa875c0]
nvprof[0xab4362]
nvprof[0xab221b]
nvprof[0xa4a0e4]
nvprof[0xaeb956]
======= Memory map: ========
00400000-00b83000 r-xp 00000000 ca:01 220821 /usr/local/cuda-9.1/bin/nvprof
00d82000-0113f000 rwxp 00782000 ca:01 220821 /usr/local/cuda-9.1/bin/nvprof
I do have cuda toolkit 8.0 on the same machine.
That nvprof runs fine, but does not recognize V100.
/usr/local/cuda-8.0/bin/nvprof cuda_binary
======== Warning: This version of nvprof doesn’t support the underlying device, GPU profiling skipped.
Can someone please let me know how to resolve this issue?