CUDA Programming and Performance

Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2
Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2

38 Replies

18,839 Views

sWienke

5 years ago

"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery
"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery

19 Replies

103,349 Views

eglion9517530

4 years ago

reasons why splitting large kernel to smaller one lower perfromance
reasons why splitting large kernel to smaller one lower perfromance

0 Replies

12 Views

iliak

1 hour ago

Tegra K1 MatVec Multiplication Benchmark Revision (Zero Copy vs Unified Memory)
Tegra K1 MatVec Multiplication Benchmark Revision (Zero Copy vs Unified Memory)

3 Replies

91 Views

Carl-Dieter

3 days ago

How to find the maximum number in an array in GPU and CPU and calculate the time for the processes.
How to find the maximum number in an array in GPU and CPU and calculate the time for the processes.

0 Replies

26 Views

SSK1231243

23 hours ago

compiling ffmpeg with nvec support fails to apply patch
compiling ffmpeg with nvec support fails to apply patch

4 Replies

329 Views

Patricio_Vidal

2 months ago

Overlap and Add/Sum Library
Overlap and Add/Sum Library

0 Replies

37 Views

luisgo

2 days ago

Unable to achieve concurrency in kernel launches
Unable to achieve concurrency in kernel launches

2 Replies

50 Views

mdotali

2 days ago

Cuda Code crashes - memory problems?
Cuda Code crashes - memory problems?

3 Replies

75 Views

EllaPropella

2 days ago

How can I change the configuration of compilation mode to 64 bit in Nsight?
How can I change the configuration of compilation mode to 64 bit in Nsight?

3 Replies

151 Views

ytao573

2 years ago

Unexplained low gld_efficiency
Unexplained low gld_efficiency

0 Replies

41 Views

ttart398

2 days ago

Problem probably in cuFFT
Problem probably in cuFFT

5 Replies

397 Views

luisgo

1 year ago

tex1D and tex1Dfetch to C
tex1D and tex1Dfetch to C

1 Replies

45 Views

ZhiKaanZhi

3 days ago

Managed memory vs cudaHostAlloc - TK1
Managed memory vs cudaHostAlloc - TK1

9 Replies

1,190 Views

Milliarde

2 years ago

Jetson TK1 performance bottleneck
Jetson TK1 performance bottleneck

4 Replies

86 Views

mdotali

4 days ago

Labeling (connected component labeling) in cuda
Labeling (connected component labeling) in cuda

3 Replies

87 Views

saai

5 days ago

Create single .ptx (or .cubin) file from multiple .cu sources
Create single .ptx (or .cubin) file from multiple .cu sources

2 Replies

127 Views

Piane_Ramso

1 week ago

GPU utilization of *completed* processes?
GPU utilization of *completed* processes?

1 Replies

67 Views

alexishuxley

5 days ago

cuda memory usage in debug(with GDB),debug(without GDB) and release differ, extra 2GB usage in relea
cuda memory usage in debug(with GDB),debug(without GDB) and release differ, extra 2GB usage in relea

11 Replies

375 Views

iliak

2 weeks ago

nvcc and nested class issue
nvcc and nested class issue

2 Replies

218 Views

cesc1

5 months ago

Create Topic