CUDA Programming and Performance

Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2
Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2

38 Replies

24,202 Views

sWienke

6 years ago

"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery
"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery

19 Replies

114,830 Views

eglion9517530

4 years ago

GPU Pro Tip: CUDA 7 Streams Simplify Concurrency
GPU Pro Tip: CUDA 7 Streams Simplify Concurrency

0 Replies

1 Views

layert

1 minute ago

Device function pointers: Is it possible to use them in a useful way?
Device function pointers: Is it possible to use them in a useful way?

12 Replies

130 Views

dgrat

2 days ago

2D CUDA convolution
2D CUDA convolution

3 Replies

1,195 Views

DoctorG

1 year ago

Pinned memory limit
Pinned memory limit

16 Replies

1,012 Views

Cui

7 months ago

Cache model and replacement policies for GPU memory
Cache model and replacement policies for GPU memory

1 Replies

70 Views

iamkaka

4 days ago

Insight on performance of GTX 480 for LIB benchmark
Insight on performance of GTX 480 for LIB benchmark

1 Replies

42 Views

SaiAutukuri

2 days ago

Can a GPU Speed-up this Use-Case?
Can a GPU Speed-up this Use-Case?

1 Replies

52 Views

bscully

2 days ago

Final word on Titan X and TCC?
Final word on Titan X and TCC?

15 Replies

3,151 Views

Ailleur

7 months ago

Row buffer size of DRAM in GPU
Row buffer size of DRAM in GPU

0 Replies

31 Views

iamkaka

2 days ago

cusolverDnCgesvd performance vs MKL
cusolverDnCgesvd performance vs MKL

20 Replies

1,849 Views

mh1

12 months ago

timers on GPU and CPu
timers on GPU and CPu

0 Replies

35 Views

Manalo

3 days ago

undefined reference to `cutStartTimer'
undefined reference to `cutStartTimer'

3 Replies

28 Views

SaiAutukuri

3 days ago

Read vectors with power-law distributed frequency, any cache optimization we can do?
Read vectors with power-law distributed frequency, any cache optimization we can do?

0 Replies

28 Views

wtan

3 days ago

nvcc & clang 7 (no typo here)
nvcc & clang 7 (no typo here)

31 Replies

12,171 Views

savage309

7 months ago

__restrict__ - where must I have it?
__restrict__ - where must I have it?

1 Replies

92 Views

kalle1

5 days ago

How to set all elements in a CUDA array to be zeros?
How to set all elements in a CUDA array to be zeros?

3 Replies

2,592 Views

ScottWang

3 years ago

Conditional operations -- when to noop and when not to
Conditional operations -- when to noop and when not to

0 Replies

58 Views

kalle1

4 days ago

Cuda 7.5 give a 30% performance loss vs cuda 6.5
Cuda 7.5 give a 30% performance loss vs cuda 6.5

31 Replies

4,100 Views

sp_

8 months ago

Create Topic