txbob
 

Recent Topics

No Recent Topics


Profile

Signed up on Feb 2011
Last seen on Oct 23 2017
Started 3 topics
5017 total posts

Recent Posts

How much faster are atomicAdd() operations to __shared__ on ...

It depends on various factors. See figure 3 here: https://devblogs.nvidia.com/parallelforall/gpu ...

11 hours ago on CUDA Programming and Performance

NCCL 2.0 download page is down

I didn't have any trouble accessing this page: https://developer.nvidia.com/nccl/nccl-download ...

20 hours ago on GPU-Accelerated Libraries

leak in cublasCreate + cublasDestroy

That doesn't look like a leak to me. The library has overhead which will be incurred the first time ...

20 hours ago on GPU-Accelerated Libraries

nvidia-smi topo SOC

The Dell R740 is a Skylake CPU system. Skylake processors from Intel introduced the possibility for mul ...

2 days ago on CUDA Setup and Installation

kernel index bug?

At around 16 million you'll reach the limit of what can reliably be stored in a float quantity, if y ...

3 days ago on CUDA Programming and Performance
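
The post preview above is truncated, so the following is only a minimal sketch of the general point, assuming the index in question is held in a 32-bit float. A single-precision float has a 24-bit significand, so every integer up to 2^24 = 16,777,216 is stored exactly, but above that only some integers are representable. The helper name to_float32 is hypothetical and simply round-trips a value through IEEE-754 single precision using Python's standard struct module; it is not taken from the original thread.

```python
import struct

def to_float32(value: int) -> float:
    """Round-trip an integer through IEEE-754 single precision (hypothetical helper)."""
    # Packing with format "f" rounds the value to the nearest 32-bit float.
    return struct.unpack("f", struct.pack("f", value))[0]

LIMIT = 2 ** 24  # 16,777,216: largest value up to which every integer is exact in float32

if __name__ == "__main__":
    for i in (LIMIT - 1, LIMIT, LIMIT + 1, LIMIT + 2):
        stored = int(to_float32(i))
        status = "exact" if stored == i else "changed"
        print(f"{i} stored as {stored} ({status})")
```

Here 16,777,217 is the first integer that does not survive the round trip: it is stored as 16,777,216, so two adjacent indices collapse to the same value. Keeping indices in an integer type avoids the problem.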