CUDA Programming and Performance

Tesla Compute Cluster driver released non-display driver for 64-bit Windows Server 08/08 R2

38 Replies

26,030 Views

sWienke

7 years ago

"Display driver stopped responding and has recovered" WDDM Timeout Detection and Recovery

19 Replies

133,364 Views

eglion9517530

5 years ago

Printf inside kernel

4 Replies

21,067 Views

Valmass

6 years ago

How FP32 and FP16 units are implemented in GP100 GPUs

2 Replies

17 Views

Varun1312

4 hours ago

Object segmentation with CUDA-Memory requirements

6 Replies

69 Views

Alex1980

4 days ago

CUDA Prefix Scan

1 Reply

0 Views

merveu

4 hours ago

Dispatch Kernel Overhead (OpenCL)

1 Reply

21 Views

PabloBot

7 hours ago

CUDA compatibility with 1050Ti

2 Replies

26 Views

alokshankarpa

13 hours ago

Issues with multiple contexts on M60s in Azure

3 Replies

101 Views

DomF

22 hours ago

Why am I getting better performance with per column vs per row for matrix addition?

1 Reply

55 Views

Phaino

1 day ago

Is it safe to use different versions of cuda runtime within a single process?

0 Replies

41 Views

gagra

2 days ago

Reading globaltimer register or calling clock/clock64 in loop prevent concurrent kernel execution?

17 Replies

225 Views

MingYang4

1 week ago

Some problem about Synchronize CPU and GPU

0 Replies

56 Views

willy810473

2 days ago

Driver for GTX 1080 Ti

19 Replies

620 Views

JohnWatts

1 week ago

A more accurate and faster implementation of powf()

2 Replies

52 Views

njuffa

4 days ago

copy result to host question

5 Replies

142 Views

s002wjh

5 days ago

Unexpected rounding behavior in HFMA

5 Replies

153 Views

ptillet

5 days ago

Problem about cudaCreateChannelDesc

3 Replies

61 Views

archernzy

5 days ago

Question about kernel granularity

5 Replies

180 Views

grynet

2 weeks ago

Issue with addition of shared memory and thread indexing

0 Replies

63 Views

Flemin

5 days ago
