Nvidia Pascal TITAN Xp, TITAN X, GeForce GTX 1080 Ti, GTX 1080, GTX 1070, GTX 1060, GTX 1050 & G...
Does SM 6.1 have the same updated Unified Memory system features of SM 6.0, like system-wide addressing and coherent page faulting? The P100 whitepaper and [url=https://devblogs.nvidia.com/parallelforall/inside-pascal/]Mark Harris's Pascal introduction[/url] carefully call these P100 features, not Pascal's.
Does SM 6.1 have the same updated Unified Memory system features of SM 6.0, like system-wide addressing and coherent page faulting? The P100 whitepaper and Mark Harris's Pascal introduction carefully call these P100 features, not Pascal's.

Posted 06/20/2016 03:43 AM   
https://forum.beyond3d.com/threads/nvidia-pascal-announcement.57763/page-70#post-1925450 "So I got an email from NVIDIA this morning. We can finally lay the question of FP16 execution to rest once and for all. GP104 has a single, dedicated FP16x2 core per SM. The FP32 cores cannot execute FP16x2. This is basically identical to how NVIDIA does FP64, except GP104 has more FP64 units (4 per SM). This is where the 1/128 instruction rate comes from, and since it's capable of executing 2 FP16 ops in a vec2, the resulting 1/64 FLOP rate. This also means that it takes 32 clocks to actually execute a single instruction of a single warp."
https://forum.beyond3d.com/threads/nvidia-pascal-announcement.57763/page-70#post-1925450

"So I got an email from NVIDIA this morning. We can finally lay the question of FP16 execution to rest once and for all.

GP104 has a single, dedicated FP16x2 core per SM. The FP32 cores cannot execute FP16x2.

This is basically identical to how NVIDIA does FP64, except GP104 has more FP64 units (4 per SM). This is where the 1/128 instruction rate comes from, and since it's capable of executing 2 FP16 ops in a vec2, the resulting 1/64 FLOP rate. This also means that it takes 32 clocks to actually execute a single instruction of a single warp."

Posted 06/23/2016 02:07 AM   
Are you able to switch to Pstate P0 the 1070 and 1080 in a cuda app ? I was using a small trick on the 9xx with application clocks (setting them to the highest mem clock was switching the card to P0) Its a problem because P2 mem clock is reduced (and not user editable)
Are you able to switch to Pstate P0 the 1070 and 1080 in a cuda app ? I was using a small trick on the 9xx with application clocks (setting them to the highest mem clock was switching the card to P0)

Its a problem because P2 mem clock is reduced (and not user editable)

Posted 06/23/2016 06:27 PM   
http://www.geforce.com/hardware/10series/geforce-gtx-1060 http://www.anandtech.com/show/10474/nvidia-announces-geforce-gtx-1060-july-19 GTX 1060 with 6GB GDDR5, 192bit memory bus, out on July 19th.

Posted 07/09/2016 05:53 AM   
I am securing a 1070 (MSI GeForce GTX 1070 DirectX 12 GTX 1070 GAMING X 8G 8GB) in the coming week and will run code/tests/commands people are interested in. Start posting up things you would like to see Windows 7/ubuntu 16.04 available. I will create a custom thread by mid week 7/14/16
I am securing a 1070 (MSI GeForce GTX 1070 DirectX 12 GTX 1070 GAMING X 8G 8GB) in the coming week and will run code/tests/commands people are interested in. Start posting up things you would like to see

Windows 7/ubuntu 16.04 available.
I will create a custom thread by mid week 7/14/16

Posted 07/10/2016 07:34 PM   
http://www.nvidia.com/download/driverResults.aspx/105037/en-us http://www.nvidia.com/download/driverResults.aspx/105033/en-us GeForce 368.81 WHQL driver for GTX 1060. http://www.nvidia.com/download/driverResults.aspx/105343/en-us GeForce 367.35 Linux driver for 1060.
http://www.geforce.com/whats-new/articles/nvidia-titan-x-pascal-available-august-2nd http://www.geforce.com/hardware/10series/titan-x-pascal https://blogs.nvidia.com/blog/2016/07/21/titan-x/ https://blogs.nvidia.com/blog/2016/07/21/titan-x-deep-learning/ Pascal TITAN X with 12GB GDDR5X, 480GB/s of bandwidth.
Gadzooks! I don't think anyone saw the TITAN X coming! So why is it called TITAN X and not: [list] [.]TITAN P[/.] [.]TITAN X′[/.] [.]TITAN X²[/.] [.]TITAN XX[/.] [.]TITAN XXX[/.] [.]TITAN YOLO[/.] [.]TITAN MMXVI[/.] [.]TITAN $1200[/.] [.]TITAN ¯\_(ツ)_/¯[/.] [/list]
Gadzooks!

I don't think anyone saw the TITAN X coming!

So why is it called TITAN X and not:
  • TITAN P
  • TITAN X′
  • TITAN X²
  • TITAN XX
  • TITAN XXX
  • TITAN YOLO
  • TITAN MMXVI
  • TITAN $1200
  • TITAN ¯\_(ツ)_/¯

Posted 07/22/2016 05:48 AM   
Uhh yes, why not call the Titans with the initial of the architecture? Titan K, Titan M, Titan P, Titan V... that would also make the naming closer to the Quadro cards (K6000, M6000) and avoid the messy confusion we will get now. More important question though: How many FP16 units per SM are in GP102? 15.3 B transistors on the P100 (due to FP16v2? Here also the HBM and NVLINK interfaces have to be taken into account) 11.0 B transistors on the GP102 (52%, probably just cache, memory interface and additional SMs) 7.2 B transistros on the GP104 4.4 B transistors on the GP106 Transistor count suggests the GP102 is approx. GP104+GP106 big (the math adds up). I guess this would suggest also one FP16 unit per SM on the Titan X? Was nVidia not able to put 24 GB of GDDR5X on that card for that price tag?
Uhh yes, why not call the Titans with the initial of the architecture?
Titan K, Titan M, Titan P, Titan V...
that would also make the naming closer to the Quadro cards (K6000, M6000) and avoid the messy confusion we will get now.

More important question though: How many FP16 units per SM are in GP102?
15.3 B transistors on the P100 (due to FP16v2? Here also the HBM and NVLINK interfaces have to be taken into account)
11.0 B transistors on the GP102 (52%, probably just cache, memory interface and additional SMs)
7.2 B transistros on the GP104
4.4 B transistors on the GP106

Transistor count suggests the GP102 is approx. GP104+GP106 big (the math adds up).

I guess this would suggest also one FP16 unit per SM on the Titan X?

Was nVidia not able to put 24 GB of GDDR5X on that card for that price tag?

Posted 07/22/2016 12:00 PM   
Titan "10" :)
Titan "10" :)

Posted 07/22/2016 12:09 PM   
i'm think that gp102 has the same 6.1 arch and 3840 alus. but yields are yet low, so they disabled two SMs. as process will mature, i think it will be renamed to Ti, and real titan arrives - with all 3840 alus, 24 GB and $1000 price
i'm think that gp102 has the same 6.1 arch and 3840 alus. but yields are yet low, so they disabled two SMs. as process will mature, i think it will be renamed to Ti, and real titan arrives - with all 3840 alus, 24 GB and $1000 price

Posted 07/22/2016 12:27 PM   
Thanks @NVE, so we have for the GTX1080 - GP104 FP64 TFLOP = 1/32 FP32 TFLOP ! FP16 TFLOP = 1/64 FP32 TFLOP ! ... now again the question. This time : How will it look like for Titan X / GP102 ?
Thanks @NVE, so we have for the GTX1080 - GP104

FP64 TFLOP = 1/32 FP32 TFLOP !
FP16 TFLOP = 1/64 FP32 TFLOP !

... now again the question. This time : How will it look like for Titan X / GP102 ?

Posted 07/22/2016 01:07 PM   
@SvenMeyer Probably the same, seems like GP102 is SM61. Here, nVidia only talks about FP32 and INT8 performance, suggesting this is just an upscaled GP104: https://blogs.nvidia.com/blog/2016/07/21/titan-x/ Since they mention deep learning, I'm sure they would have mentioned fast FP16 if it would exist. For comparison Titan X has 44 TOP INT8, 1080 has 33 TOP INT8
@SvenMeyer
Probably the same, seems like GP102 is SM61.
Here, nVidia only talks about FP32 and INT8 performance, suggesting this is just an upscaled GP104:
https://blogs.nvidia.com/blog/2016/07/21/titan-x/
Since they mention deep learning, I'm sure they would have mentioned fast FP16 if it would exist.

For comparison Titan X has 44 TOP INT8, 1080 has 33 TOP INT8

Posted 07/22/2016 01:16 PM   
I'm calculating 4-5 GFLOPS/W less than the 1080, so unless you need more memory/memory bandwidth, it's not a big step.
I'm calculating 4-5 GFLOPS/W less than the 1080, so unless you need more memory/memory bandwidth, it's not a big step.

Posted 07/22/2016 02:12 PM   
http://www.nvidia.com/download/driverResults.aspx/105851/en-us http://www.nvidia.com/download/driverResults.aspx/105847/en-us Nvidia GeForce 372.54 WHQL driver released.

Posted 08/16/2016 06:17 AM   
Scroll To Top

Add Reply