nVidia Pascal X80

Message boards : Graphics cards (GPUs) : nVidia Pascal X80

Jim1348

Message 43166 - Posted: 9 Apr 2016, 19:55:28 UTC - in response to Message 43165.  

There was a comment a while ago on the POEM forum that it was not clear whether the HBM2 stacked memory had the fine-grained address ability (if that is the term) required for optimum performance for their work. Whether that applies to GPUGrid I don't know, but I would not jump into the lake without checking for rocks first.

skgiven (Volunteer moderator, Volunteer tester)
Message 43167 - Posted: 9 Apr 2016, 23:58:23 UTC - in response to Message 43166.  

Almost certain it will be GDDR5@8GHz on the gaming cards.
Agree that GP104/6... will not be that similar to GP100: DP just isn't needed on mid-range to high-end gaming cards, never mind entry-level cards. Would be cautious about performance though; who knows what bottlenecks there will be.

Jozef J
Message 43169 - Posted: 10 Apr 2016, 10:57:48 UTC

http://www.ebay.com/sch/i.html?_from=R40&_sacat=0&_nkw=980Ti&_sop=15

It has started, and it will continue for some time: plenty of Nvidia 980Ti cards are for sale at good prices, but the import charges to Europe ruin the deal. There are also a lot of good servers for CPU-only crunching for sale on eBay in the USA, but again the import charges to Europe are crazy.
http://www.ebay.com/sch/i.html?_from=R40&_sacat=0&_dcat=11211&Number%2520of%2520Processors=8&_nkw=server&_sop=16

nanoprobe
Message 43181 - Posted: 12 Apr 2016, 16:01:23 UTC - in response to Message 43076.  

Wonder what they will cost also.

Your first born. 🙀

eXaPower
Message 43235 - Posted: 22 Apr 2016, 13:07:19 UTC

For those interested:

GP100 Pascal architecture Whitepaper is now available.

Accessing and downloading the whitepaper from Nvidia's website requires registration (even for already registered developer accounts).




Jozef J
Message 43236 - Posted: 22 Apr 2016, 18:33:38 UTC


Wrend
Message 43237 - Posted: 22 Apr 2016, 20:38:28 UTC

Nice. Looks like my Titan Blacks may have finally found a worthy replacement... XD

Looks like we're maybe getting some decent double-precision (FP64) capabilities again too?

Just started doing some tests on DP using MilkyWay@Home to see if it can make use of it well enough on my Titan Black cards. (Sorry, bit of a break from GPUGrid.)
My BOINC Cruncher, Minecraft Multiserver, Mobile Device Mainframe, and Home Entertainment System/Workstation: http://www.overclock.net/lists/display/view/id/4678036#


eXaPower
Message 43320 - Posted: 7 May 2016, 14:02:41 UTC

GEFORCE GTX 1080 ($599 MSRP) Available May 27:
7.2 billion transistors
314mm² die
2560 NVIDIA CUDA Cores
1607 (MHz) Base Clock
1733 (MHz) Boost Clock
GPU Boost 3.0
8.5 (FP32) TeraFLOPs
180W reference TDP; 100% power limit (real TDP) likely 225W+
(1) 8-pin power connector

Memory Specs:
10 Gbps Memory Speed
8 GB GDDR5X
256-bit Interface Width
320 (GB/sec) Bandwidth
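
The bandwidth figure follows directly from the memory speed and interface width listed above. A minimal sketch of that arithmetic (the function name is made up for illustration and is not part of any Nvidia tool):

```python
def memory_bandwidth_gb_s(data_rate_gbps: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s: per-pin data rate times bus width, converted to bytes."""
    return data_rate_gbps * bus_width_bits / 8


# Values from the spec list above: 10 Gbps GDDR5X on a 256-bit interface.
gtx1080_bandwidth = memory_bandwidth_gb_s(10, 256)
print(gtx1080_bandwidth)  # 320.0
```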

Geforce GTX 1070
$379 MSRP
Available June 10


Last night's Pascal 1080 unveil demo was running at 2.1GHz on air, similar to Maxwell's LN2-cooled clocks.
A 2.1GHz+ boost on Pascal will be what 1.5GHz is on Maxwell (for a 24/7 OC).

Early adopters can help GPUGRID beta test a new ACEMD app - Count me in.
Once general public availability (June~July) is secured, when will the project announce a new CUDA phase?
Will Pascal be a repeat of GK110 initial ACEMD production difficulties?

Titan X / GTX 980ti is now EOL/DOA.

Maxwell's real TDP's (power limit):
GTX980ti = 325~350W
GTX980 = 250~275W
GTX970 = 220~240W
GTX960 = 160~180W
GTX750 = 60~75W



Retvari Zoltan
Message 43321 - Posted: 7 May 2016, 15:08:19 UTC - in response to Message 43320.  
Last modified: 7 May 2016, 15:13:09 UTC

Now that's the product we are waiting for.
If it's not faster by its newer architecture, it should have 1540MHz GPU clock to achieve the performance of an overclocked GTX980Ti (@1400MHz).
So by its standard boost clock it should be ~12.5% faster; but if the 2.1GHz is true, then it should be ~36.4% faster than a GTX980Ti@1400MHz while consuming only ~2/3 of the electricity.
I thought that this chip would have 3072 CUDA cores, so it has only the 5/6 of what I've expected, but it will be enough to top the GTX980Ti as it has higher clocks.
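
The percentages above can be reproduced with a short calculation. This is only a sketch under the same assumption the post makes (no per-core architectural gain, so throughput scales with cores times clock). The GTX980Ti core count of 2816 is not stated in the thread; it is taken from Nvidia's published specification, and the function names are illustrative.

```python
GTX980TI_CORES = 2816  # assumption: Nvidia's published spec, not quoted in this thread
GTX1080_CORES = 2560   # from the GTX 1080 spec list posted earlier


def equivalent_clock_mhz(ref_cores: int, ref_clock_mhz: float, new_cores: int) -> float:
    """Clock the new chip needs so that cores * clock matches the reference card."""
    return ref_cores * ref_clock_mhz / new_cores


def speedup_percent(clock_mhz: float, break_even_mhz: float) -> float:
    """How much faster (in %) a given clock is than the break-even clock."""
    return (clock_mhz / break_even_mhz - 1) * 100


break_even = equivalent_clock_mhz(GTX980TI_CORES, 1400, GTX1080_CORES)
print(break_even)                                    # 1540.0
print(round(speedup_percent(1733, break_even), 1))   # 12.5
print(round(speedup_percent(2100, break_even), 1))   # 36.4
```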

Will Pascal be a repeat of GK110 initial ACEMD production difficulties?
As happened at the release of every previous GPU generation, I expect that there will be some difficulties. (The present app won't work with the new cards.)

eXaPower
Message 43326 - Posted: 8 May 2016, 14:46:24 UTC - in response to Message 43321.  
Last modified: 8 May 2016, 14:57:50 UTC

If it's not faster by its newer architecture, it should have 1540MHz GPU clock to achieve the performance of an overclocked GTX980Ti (@1400MHz).
So by its standard boost clock it should be ~12.5% faster; but if the 2.1GHz is true, then it should be ~36.4% faster than a GTX980Ti@1400MHz while consuming only ~2/3 of the electricity.

2.1GHz is true - check out Nvidia's blog. The mid-tier 16nm GP104 die is going to destroy the 3rd gen 28nm big die (GM200) in performance/watt.
28nm big die generations: GK110a > GK110b (GK210) > GM200
There's no doubting that the GTX980ti is a strong 32-bit chip, but with all that heat it's tough to cool a dense, overclocked setup.
IMHO: water cooling was really the only way to go for GM200, and even water-cooled it still has a 350W power limit when running a monster program.

I thought that this chip would have 3072 CUDA cores, so it has only the 5/6 of what I've expected, but it will be enough to top the GTX980Ti as it has higher clocks.

3072 CUDA cores is possible as a 2nd or 3rd gen mid-tier 16nm part (GP204 or GV104 Volta) with a slightly bigger die - similar to the 4th (or is it 5th?) gen 28nm GM204 (398mm²) vs. the 1st/2nd gen 28nm GK104 (294mm²). (The 3rd gen mid/low tier 28nm part is GM107.)
Maybe the big die GP100 becomes a Geforce with >3000 CUDA cores - or Volta might be the first (Geforce) big die?
Either way, the GPU performance/watt advancements are impressive compared to the last couple of CPU generations.

A non-Pascal question: does your GM200 hit 1.5GHz stable on any projects, and what's the highest (stable) OC for ACEMD?
I've always thought 1.5GHz on GM200 was possible for ACEMD. I've run my GM204s @1.5GHz since I got them. (A lot of Maxwell boards push the 1.5GHz boundary.)

FYI: To find any Maxwell (and Pascal) PCB (BIOS) power limit, run PrimeGrid's OCL4 n=20/21/22 Genefer (created by developer Yves Gallot) or the SiSoftware CUDA scientific benchmark while overclocked.

skgiven (Volunteer moderator, Volunteer tester)
Message 43346 - Posted: 10 May 2016, 7:55:36 UTC - in response to Message 43326.  
Last modified: 10 May 2016, 7:58:07 UTC

2560 NVIDIA CUDA Cores would be a nice step up from 2048 (GTX980), which is primarily what we should be comparing a 1080 with. 8GB of GDDR5X also doubles the 980's 4GB of GDDR5. The 1733 (MHz) Boost Clock might actually boost to ~1850MHz without any tweaking. If it outperforms the GTX980Ti for throughput then even better, but even if it doesn't it's highly likely to be better in terms of performance/Watt.

For the app to work here it will probably need to be recompiled with the latest CUDA Toolkit. ACEMD is a complex app, unlike some others which might work straight out of the box or with only a little bit of work. If the cards don't initially work here they might work elsewhere. The worst case scenario is that GPUGrid has to wait on a new CUDA Toolkit and then doesn't have the time over the summer holidays to redevelop and test the app. The best case scenario is that a fully functional CUDA Toolkit is available on launch day, and Gianni and Matt are available, up to speed with any CUDA advances/changes, and get a Pascal card to test with.

eXaPower
Message 43651 - Posted: 30 May 2016, 11:33:14 UTC
Last modified: 30 May 2016, 11:34:42 UTC

https://xdevs.com/guide/pascal_oc/

Many Geforce GTX 1080 (gamer) reviews recently confirmed an average performance increase of ~30% versus a stock GTX980ti.
One OCed GTX1080's typical performance = 2.3 GTX970s
(As of now, no reviewers have published a Folding@home or SiSoftware scientific CUDA benchmark.)

The fastest GTX1080 SiSoftware single-precision scientific CUDA result as of today:
http://ranker.sisoftware.net/show_device.php?

Total Benchmark score 877.78GFLOPS
GEMM (General Matrix Multiply) 4259.25GFLOPS
FFT (Fast Fourier Transform) 180.90GFLOPS
N-Body (N Body Simulation) 3962.22GFLOPS

My GTX970 at 1602MHz for reference (the fastest GTX970 in the SiSoftware database):
Total benchmark score 666.70GFLOPS
GEMM (General Matrix Multiply) 2844.11GFLOPS
FFT (Fast Fourier Transform) 156.28GFLOPS
N-Body (N Body Simulation) 2684.82GFLOPS
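
To put the two result sets side by side, the per-test ratios can be computed directly from the figures quoted above. This is just a sketch of the comparison; the dictionary names are illustrative, and the GTX1080's clock for that database entry is not given in the post, so the ratios say nothing about per-clock efficiency.

```python
# GFLOPS figures copied from the two SiSoftware results quoted above.
gtx1080 = {"Total": 877.78, "GEMM": 4259.25, "FFT": 180.90, "N-Body": 3962.22}
gtx970_1602mhz = {"Total": 666.70, "GEMM": 2844.11, "FFT": 156.28, "N-Body": 2684.82}

# Ratio > 1 means the GTX1080 result is faster on that test.
ratios = {name: gtx1080[name] / gtx970_1602mhz[name] for name in gtx1080}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

On these numbers the GTX1080 leads by roughly a third overall, with the smallest gap on FFT and the largest on GEMM and N-Body.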

GTX1080 is a 314mm² die with (4) GPCs of 640 CUDA cores each / 2560 CUDA cores / 20 SMs * 128 CUDA cores per SM / (80) 32-core blocks / 160 dispatch units / 80 warp schedulers / 640 ld/store units / 640 SFU / 160 TMU / 64 ROP / 2MB 2nd level cache / 48-112KB 1st level cache with 96KB shared memory for each SM / 65536 32-bit registers per SM, at 16384 per 32-core block.

The GP104 SM core structure is the same as Maxwell's SMM (4*32-core blocks, 128 CUDA cores per SM), while GP100 has 64 CUDA cores / 32 DP units per SM. The GP104 256-bit memory interface is split into 8*32-bit rather than 4*64-bit controllers. 8 ROPs are tied to each of Pascal's 32-bit partitions. GM200/204 have 16 ROPs for every 64-bit controller.

Each GP104 GPC has 5 SMs (640 CUDA cores). A GP100 GPC amounts to the same 640 (10 SMs * 64 CUDA cores).
The GP104 DP:SP ratio is Maxwell's 1/32: (1) DP core per 32-core SP block (4 DP per 128-core SM).
GP104 FP16 will not have double the FP32 output as on GP100. GP104 has the same 1:1 FP32/FP16 ratio as Maxwell.
The new 4.0 polymorph engine / SIP video encode / decode / display / memory compression cache(s) / etc. were redesigned. Pascal also has other enhancements (see reviews).
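
The unit counts quoted above are all products of a few per-level figures, which makes them easy to cross-check. The sketch below does only that: the class and field names are invented for illustration, and the numbers are the ones from this post, not values read from hardware or a driver.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass(frozen=True)
class Gp104Layout:
    """GP104 per-level unit counts as quoted in the post above."""
    gpcs: int = 4
    sms_per_gpc: int = 5
    blocks_per_sm: int = 4
    cores_per_block: int = 32
    dp_units_per_block: int = 1
    memory_controllers: int = 8
    bits_per_controller: int = 32
    rops_per_controller: int = 8

    @property
    def sms(self) -> int:
        return self.gpcs * self.sms_per_gpc

    @property
    def blocks(self) -> int:
        return self.sms * self.blocks_per_sm

    @property
    def cuda_cores(self) -> int:
        return self.blocks * self.cores_per_block

    @property
    def dp_to_sp_ratio(self) -> Fraction:
        return Fraction(self.blocks * self.dp_units_per_block, self.cuda_cores)

    @property
    def bus_width_bits(self) -> int:
        return self.memory_controllers * self.bits_per_controller

    @property
    def rops(self) -> int:
        return self.memory_controllers * self.rops_per_controller


gp104 = Gp104Layout()
print(gp104.sms, gp104.blocks, gp104.cuda_cores)  # 20 80 2560
print(gp104.dp_to_sp_ratio)                        # 1/32
print(gp104.bus_width_bits, gp104.rops)            # 256 64
```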

GTX1070 specs were also confirmed - it's cut down more than Maxwell's 970 was from the 980 (1664 vs. 2048 cores: 3 SMMs, 384 CUDA cores). The GTX1070 has 1920 CUDA cores vs. the GTX1080's 2560. That's a whole GPC's worth (640 cores) cut: 5 SMs of 128 cores each. An OCed GTX1070 won't be able to match stock GTX1080 FLOPs even if the GTX1070 is OCed beyond a stable 2100MHz. Currently, a GTX970 at 1.5GHz equals a stock GTX980 running WUs, with no lag or choke points.
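
The GTX1070 vs. GTX1080 claim can be checked with the usual peak-throughput formula (2 FLOPs per core per clock, one fused multiply-add). This is a sketch with assumptions: "stock" is taken to mean the 1733MHz rated boost clock from the spec list earlier in the thread, real cards may boost higher, and these are theoretical peaks rather than measured results. The function name is illustrative.

```python
def peak_fp32_tflops(cuda_cores: int, clock_mhz: float) -> float:
    """Theoretical peak FP32 rate: 2 FLOPs (one fused multiply-add) per core per clock."""
    return 2 * cuda_cores * clock_mhz / 1e6


gtx1080_stock = peak_fp32_tflops(2560, 1733)  # assumption: stock = rated boost clock
gtx1070_oc = peak_fp32_tflops(1920, 2100)

# Clock a 1920-core GTX1070 would need to equal the GTX1080 at 1733MHz.
match_clock_mhz = 2560 * 1733 / 1920

print(round(gtx1080_stock, 2))  # 8.87
print(round(gtx1070_oc, 2))     # 8.06
print(round(match_clock_mhz))   # 2311
```

On paper the GTX1070 would need roughly 2.3GHz to draw level, which is beyond the ~2.1GHz overclocks discussed in this thread.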

From the looks of most reviews, the Pascal 24/7 compute OC scaling sweet spot is ~2100MHz. Once above 2100MHz, Pascal cores lose some steam. (Maxwell's is 1500 ~ 1550 ~ 1600MHz.)

http://videocardz.com/60547/comparison-of-custom-geforce-gtx-1080-pcbs

Zotac's 16VRM + 3Mem (I think it's really 20VRM + 2AUX + 2Mem = 24 total phases) has over 3 times the power delivery of a reference (5+1) model. If any crunchers find this card available, please link it here. Most reports indicate the Zotac PGF (with OC+ microcontroller) is specific to Asian markets. It's possible that the AMP Extreme GTX1080 will get the PGF PCB in the USA, though that's not confirmed (no official word as of yet).

(The PGF is a great card to cool passively (uni-block and no radiator fans) if one has a (hard-line) water cooled system with 240mm+ radiator(s).
IMO - numerous fans are not required for radiators.)

A Zotac Extreme GTX 970 13 phase with OC+ module (70.7% GPU-Z ASIC rating) and an EVGA 8 phase (81.1% ASIC rating), both at >1.5GHz, eat most GTX 980s for breakfast over at POEM and PrimeGrid Genefer. Prime numbers are helpful in biomedical research.

Phases matter: at stock voltages and similar core temps, the EVGA's power consumption is 240W (1451MHz) computing PrimeGrid n=21/22 Genefer WUs; the Zotac's is 210W (1481MHz). Genefer is certainly the most demanding (Maxwell) app available on the BOINC platform, with the lowest stable overclock, by a gap of about 100MHz. My 24/7 electric usage and bill went up compared to CUDA6.5 ACEMD. These (2) 970s have a 10-15W difference computing ACEMD WUs at 1.5GHz.

davebodger
Message 43767 - Posted: 11 Jun 2016, 15:34:54 UTC

Just tried my nice shiny new Gigabyte GTX1080 on GPUGRID and I just got Computation Errors on the two WUs I downloaded. :-(
Asteroids was the same but Collatz works OK, so I know it's not the card.
I've turned off the project now (Allow no new tasks) until you say it's OK for me to try again. I don't want to waste WUs or time, as I presume you need to adapt for Pascal?

Retvari Zoltan
Message 43770 - Posted: 11 Jun 2016, 18:45:16 UTC - in response to Message 43767.  
Last modified: 11 Jun 2016, 18:46:44 UTC

Just tried my nice shiny new Gigabyte GTX1080 on GPUGRID and I just got Computation Errors on the two WUs I downloaded. :-(
Asteroids was the same but Collatz works OK, so I know it's not the card.
I've turned off the project now (Allow no new tasks) until you say it's OK for me to try again. I don't want to waste WUs or time, as I presume you need to adapt for Pascal?
Exactly. See this post:
GDF wrote:
HI,
we expect great performance from the GTX1080 but at the moment we don't have any.
As soon as we have them, we need to recompile the code for them and check it.

At the moment, the app will crash on any new pascal gpu.

gdf

peeticek_LubosKrutek
Message 43772 - Posted: 12 Jun 2016, 5:55:34 UTC - in response to Message 43767.  

What times are you getting for Collatz tasks? RAC?
Did you try any other projects besides GPUGrid and Asteroids?
If yes, what were the results?

Thanks

wiyosaya
Message 44022 - Posted: 20 Jul 2016, 18:34:22 UTC

For anyone interested, AnandTech published compute benchmark results for the consumer Founders Edition cards. http://www.anandtech.com/show/10325/the-nvidia-geforce-gtx-1080-and-1070-founders-edition-review/28



©2025 Universitat Pompeu Fabra