something like "top" to monitor the gpu ?

Hi,

is there something similar to “top” to monitor the graphic adapter and the gpu ?

Thanks in advance.

Bernd

nvidia-smi -i 3 -l -q -d will give you output include a “busy” and “memory used” percentage as well as ECC stats on Fermi Teslas, and other stuff like temps and fanspeeds at 3 second intervals, something like top does:

==============NVSMI LOG==============

Timestamp			: Wed Jan 26 19:51:54 2011

Driver Version			: 260.19.21

GPU 0:

	Product Name		: GeForce GTX 275

	PCI Device/Vendor ID	: 5e610de

	PCI Location ID		: 0:1:0

	Board Serial		: 154552075019

	Display			: Connected

	Temperature		: 56 C

	Fan Speed		: 40%

	Utilization

	    GPU			: 1%

	    Memory		: 3%

GPU 1:

	Product Name		: GeForce GTX 470

	PCI Device/Vendor ID	: 6cd10de

	PCI Location ID		: 0:4:0

	Board Serial		: 6178347089

	Display			: Not connected

	Temperature		: 83 C

	Fan Speed		: 55%

	Utilization

	    GPU			: 0%

	    Memory		: 0%

	Volatile ECC errors	:

	  Single bit		:

	    FB			: 0

	    RF			: 0

	    L1			: 0

	    L2			: 0

	    Total		: 0

	  Double bit		:

	    FB			: 0

	    RF			: 0

	    L1			: 0

	    L2			: 0

	    Total		: 0

GPU busy is actually the percentage of time over the last second the SMs were busy, and the memory utilization is actually the percentage of bandwidth used during the last second. Full memory consumption statistics come with the next release.

Hi,

my output is much less verbose:

pc56846:~ # nvidia-smi -a -q

GPU 0:

    Product Name            : GeForce GTX 460

    PCI ID                  : e2210de

    Temperature             : 40 C

Why do you have much more informations ?

Bernd

You need the -d option, I think, otherwise I am not sure.

pc56846:~ # nvidia-smi -d -q

GPU 0:

    Product Name            : GeForce GTX 460

    PCI ID                  : e2210de

    Temperature             : 39 C

Unfortunaly the same.

Bernd

What driver and CUDA version are you using. I think this only works with CUDA 3.2.

CUDA 3.1 and driver 256.40 (or 256.53), currently i don’t know exactly.

Bernd

Hi,

you are right. CUDA Toolkit 3.2 Downloads | NVIDIA Developer says

“New NVIDIA System Management Interface (nvidia-smi) support for reporting % GPU busy, and several GPU performance counters”

Bernd

I second that request.

Let us say multiple GPU APPs are running, the “TOP” utility shows only the CPU usage of the processes.
Even when a GPU App is sleeping, the kernels spawned by it could be actively running on the GPU.
So, what “Top” shows is completely irrelevant for us.

It would be very good to see what actual kernel is running on the GPU. Only the driver can tell that.
If NVIDIA can tell us info on

  1. How many STREAMS are present?
  2. PID or THREAD ID associated with each STREAM
  3. Pending Kernel Launches on each queue (would be good to show other operations like memcopy as well)
  4. Name of GPU Kernel(s) that is(are) currently running

Thanks
Best Regards,
Sarnath

Here is something I wrote to sort of address the need for a top like tool for NVIDIA GPUs:

For Linux, I use this HTOP like tool that I wrote myself. It monitors and gives an overview of the GPU temperature as well as the core / VRAM / PCI-E & memory bus usage. It does not monitor what’s running on the GPU though. [url]https://github.com/mountassir/gmonitor[/url]

Try out nvitop, a monitor based on NVML Python bindings nvidia-ml-py:

On Linux:

On Windows:

Tree-view: