CUDA Programming and Performance

Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2
Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2

38 Replies

24,185 Views

sWienke

6 years ago

"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery
"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery

19 Replies

114,723 Views

eglion9517530

4 years ago

Can a GPU Speed-up this Use-Case?
Can a GPU Speed-up this Use-Case?

1 Replies

24 Views

bscully

3 hours ago

Final word on Titan X and TCC?
Final word on Titan X and TCC?

15 Replies

3,132 Views

Ailleur

7 months ago

Row buffer size of DRAM in GPU
Row buffer size of DRAM in GPU

0 Replies

26 Views

iamkaka

15 hours ago

cusolverDnCgesvd performance vs MKL
cusolverDnCgesvd performance vs MKL

20 Replies

1,833 Views

mh1

12 months ago

timers on GPU and CPu
timers on GPU and CPu

0 Replies

26 Views

Manalo

19 hours ago

undefined reference to `cutStartTimer'
undefined reference to `cutStartTimer'

3 Replies

22 Views

SaiAutukuri

21 hours ago

Read vectors with power-law distributed frequency, any cache optimization we can do?
Read vectors with power-law distributed frequency, any cache optimization we can do?

0 Replies

25 Views

wtan

22 hours ago

nvcc & clang 7 (no typo here)
nvcc & clang 7 (no typo here)

31 Replies

12,068 Views

savage309

7 months ago

__restrict__ - where must I have it?
__restrict__ - where must I have it?

1 Replies

86 Views

kalle1

3 days ago

Cache model and replacement policies for GPU memory
Cache model and replacement policies for GPU memory

0 Replies

38 Views

iamkaka

2 days ago

How to set all elements in a CUDA array to be zeros?
How to set all elements in a CUDA array to be zeros?

3 Replies

2,576 Views

ScottWang

3 years ago

Conditional operations -- when to noop and when not to
Conditional operations -- when to noop and when not to

0 Replies

54 Views

kalle1

3 days ago

Cuda 7.5 give a 30% performance loss vs cuda 6.5
Cuda 7.5 give a 30% performance loss vs cuda 6.5

31 Replies

4,055 Views

sp_

8 months ago

Bypassing cache while running a benchmark
Bypassing cache while running a benchmark

1 Replies

67 Views

SaiAutukuri

3 days ago

Only 1 GPU used
Only 1 GPU used

2 Replies

110 Views

bwana

6 days ago

Error: __shared__ variables cannot have external linkage
Error: __shared__ variables cannot have external linkage

1 Replies

59 Views

AlexBotev

4 days ago

Google gpucc vs. Nvidia nvcc?
Google gpucc vs. Nvidia nvcc?

8 Replies

1,909 Views

Boric_Tan

3 months ago

How to decrease cudaMemcpy time
How to decrease cudaMemcpy time

1 Replies

109 Views

IIIID

4 days ago

Create Topic