Message boards :
Graphics cards (GPUs) :
nVidia Pascal X80
Message board moderation
Previous · 1 · 2
| Author | Message |
|---|---|
|
Send message Joined: 28 Jul 12 Posts: 819 Credit: 1,591,285,971 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
There was a comment a while ago on the POEM forum that it was not clear whether the HBM2 stacked memory had the fine-grained address ability (if that is the term) required for optimum performance for their work. Whether that applies to GPUGrid I don't know, but I would not jump into the lake without checking for rocks first. |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Almost a cert it will be GDDR5@8GHz on the gaming cards. Agree that GP104/6... will not be that similar to GP100:- DP just isn't needed on mid-range to high end gaming cards, never mind entry level cards. Would be cautious about performance though - who knows what bottlenecks there will be. FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 7 Jun 12 Posts: 112 Credit: 1,140,895,172 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
http://www.ebay.com/sch/i.html?_from=R40&_sacat=0&_nkw=980Ti&_sop=15 Now it began and still going to be in the future very much nvidia 980Ti for sale at a good price. but import charges to europe.. destroy all. It is also a lot of good servers on ebay for only Cpu crunch in the usa for sale. but import charges in europe are crazy. http://www.ebay.com/sch/i.html?_from=R40&_sacat=0&_dcat=11211&Number%2520of%2520Processors=8&_nkw=server&_sop=16 |
|
Send message Joined: 26 Feb 12 Posts: 184 Credit: 222,376,233 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Wonder what they will cost also. Your first born. 🙀 |
|
Send message Joined: 25 Sep 13 Posts: 293 Credit: 1,897,601,978 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
For those interested: GP100 Pascal architecture Whitepaper is now available. Whitepaper access and download from Nvidia's website requires registration (including already registered developer accounts.) |
|
Send message Joined: 7 Jun 12 Posts: 112 Credit: 1,140,895,172 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
|
|
Send message Joined: 9 Nov 12 Posts: 51 Credit: 522,101,722 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]()
|
Nice. Looks like my Titan Blacks may have finally found a worthy replacement... XD Looks like we're maybe getting some decent double precision, 64FP capabilities again too? Just started doing some tests on DP using MilkyWay@Home to see if it can make use of it well enough on my Titan Black cards. (Sorry, bit of a break from GPUGrid.) My BOINC Cruncher, Minecraft Multiserver, Mobile Device Mainframe, and Home Entertainment System/Workstation: http://www.overclock.net/lists/display/view/id/4678036#
|
|
Send message Joined: 25 Sep 13 Posts: 293 Credit: 1,897,601,978 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
GEFORCE GTX 1080 ($599 MSRB) Available May 27: 7.2 billion transistors Last night's Pascal 1080 unveil demo was running at 2.1MHz on air - similar to Maxwell L2N cooled clocks. Pascal +2.1MHz boost will be Maxwell's 1.5GHz. (For 24/7 OC) Early adopters can help GPUGRID beta test a new ACEMD app - Count me in. Once general pubic availability (June~July) is secured when does the Project announce a new CUDA phase? Will Pascal be a repeat of GK110 initial ACEMD production difficulties? Titan X / GTX 980ti is now EOL/DOA. Maxwell's real TDP's (power limit): GTX980ti = 325~350W GTX980 = 250~275W GTX970 = 220~240W GTX960 = 160~180W GTX750 = 60~75W |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Now that's the product we are waiting for. If it's not faster by its newer architecture, it should have 1540MHz GPU clock to achieve the performance of an overclocked GTX980Ti (@1400MHz). So by its standard boost clock it should be ~12.5% faster; but if the 2.1GHz is true, then it should be ~36.4% faster than a GTX980Ti@1400MHz while consuming only ~2/3 of the electricity. I thought that this chip would have 3072 CUDA cores, so it has only the 5/6 of what I've expected, but it will be enough to top the GTX980Ti as it has higher clocks. Will Pascal be a repeat of GK110 initial ACEMD production difficulties?As it happened at the release of every previous GPU generation I expect that there will be some difficulties. (The present app won't work with the new cards) |
|
Send message Joined: 25 Sep 13 Posts: 293 Credit: 1,897,601,978 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
If it's not faster by its newer architecture, it should have 1540MHz GPU clock to achieve the performance of an overclocked GTX980Ti (@1400MHz). 2.1GHz is true - check out Nvidia's blog. GP104 Mid-tier 16nm die going to destroy the 3rd gen 28nm big die (GM200) performance/watt. 28nm Big die generations: GK110a > GK110b (GK210) > GM200 There's no doubting that GTX980ti is a strong 32bit chip but with all that heat it's tough cooling a dense setup OCed. IMHO: Water cooling GM200 was really only way to go - WC still has a 350W power limit running a monster program. I thought that this chip would have 3072 CUDA cores, so it has only the 5/6 of what I've expected, but it will be enough to top the GTX980Ti as it has higher clocks. 3072 CUDA is possible as a 2nd or 3rd gen mid-tier 16nm (GP204 or GV104 Volta) part with a slightly bigger die - similar to 4th or is it 5th gen 28nm GM204 (398mm²) .vs. 1st/2nd gen 28nm GK104 (294mm²). (3rd gen mid/low tier 28nm is GM107) Maybe the big die GP100 becomes a Geforce >3000CUDA - or Volta might be the first (Geforce) Big die? Either way the GPU performance/watt advancements is impressive compared to last couple of CPU generations. A non-Pascal question: does you're GM200 hit 1.5GHz stable on any projects and what's the highest (stable) OC for ACEMD? I've always thought GM200 1.5GHz ACEMD was possible. I've ran GM204 @1.5GHz since having them. (A lot Maxwell boards push the 1.5GHz boundary.) FYI: To find any Maxwell and (Pascal) PCB (BIOS) power limit - run Primegrid's OCL4 n=20/21/22 Genefer created by (Developer Yves Gallot) or Sisoftware CUDA scientist benchmark OCed. |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
2560 NVIDIA CUDA Cores would be a nice step up from 2048 (GTX980), which is primarily what we should be comparing a 1080 with. 8GB DDR5 also doubles the 980's 4GB DDR5. The 1733 (MHz) Boost Clock might actually Boost to ~1850MHz without any tweaking. If it outperforms the GTX980Ti for throughput then even better, but even if it doesn't it's highly likely to be better in terms of performance/Watt. For the app to work here it will probably need to be recompiled with the latest CUDA Tool Kit. ACEMD is a complex app unlike some others which might work straight out of the box or with only a little bit of work. If the cards don't initially work here they might work elsewhere. Worst case scenario is that GPUGrid has to wait on a new CUDA tool kit and then don't have the time over the summer holidays to redevelop and test the app. Best case scenario is that a fully functional CUDA took kit is available on launch day and Gianni and Matt are available and up to speed with any CUDA advances/changes and get a Pascal to test with. FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 25 Sep 13 Posts: 293 Credit: 1,897,601,978 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
https://xdevs.com/guide/pascal_oc/ Many Geforce GTX 1080 (Gamer) review(s) recently confirmed an average performance increase of ~30% verse stock GTX980ti. (1) OCed GTX1080 typical performance = (2.3) GTX970 (As of now: No reviewers have published a folding@home or sisoftware scientific CUDA benchmark. ) GTX1080 fastest sisoftware single precision scientific CUDA result as of today: http://ranker.sisoftware.net/show_device.php? Total Benchmark score 877.78GFLOPS GEMM (General Matrix Multiply) 4259.25GFLOPS FFT (Fast Fourier Transform) 180.90GFLOPS N-Body (N Body Simulation) 3962.22GFLOPS My GTX970 at 1602MHz for reference (fastest GTX970 in the si software database): Total benchmark score 666.70GFLOPS GEMM (General Matrix Multiply) 2844.11GFLOPS FFT (Fast Fourier Transform) 156.28GFLOPS N-Body (N Body Simulation) 2684.82GFLOPS GTX1080 is 314^mm die with 640CUDA per (4) GPC / 2560CUDA / 20SM * 128CUDA per / (80) 32CUDA blocks / 160 dispatch units / 80 warp schedulers / 640 ld/store units / 640 SFU / 160TMU / 64ROP / 2MB 2nd level cache / 48-112KB 1st level cache with 96KB shared memory for each SM / 65536 [32]bit registers per SM at 16384 per 32CUDA block) GP104 (SM) core structure is as Maxwell SMM (4*32c blocks 128CUDA per SM) while GP100 has (64CUDA / 32DP per SM). GP104 256bit memory interface is split into 8*32bit rather than 4*64bit controllers. 8 ROP are tied to Pascal's 32bit partition. GM200/204 have 16 ROP for every 64bit controller. Each GP104 GPC has 5 SM's (640CUDA). GP100 GPC amounts to the same 640 in (10 sm * 64CUDA) The GP104 DP:SP ratio is (Maxwell's) 1/32. (1) DP core per 32SP block (4DP per 128c SMP) GP104 (FP16) will not have double the FP32 output as GP100. GP104 same as Maxwell's 1:1 FP32/FP16 ratio. The new 4.0 polymorph engine / SIP video encode / decode / display / memory compression cache(s) / etc. redesigned. Pascal also other enhancements. (see reviews) TX1070 specs were also confirmed - it's cut down more than Maxwell's (970/980) 1664 .vs. 2048 [3] SMM 384CUDA. GTX1070 has 1920CUDA .vs. the GTX1080 2560CUDA. That's (640core) GPC worth of cut - 5 (128C) SMP's. OCed GTX1070 won't be able to match stock GTX1080 FLOPs even if the GTX1070 OCed beyond a stable 2100MHz. Current Performance on GTX970 at 1.5GHz equals the stock GTX980 running WU's with no lag or choke points. From looks of most reviews - Pascal 24/7 compute OC scaling sweet spot is ~2100. Once above 2100MHz Pascal cores lose some steam. (Maxwell's 1500 ~ 1550 ~ 1600MHz) http://videocardz.com/60547/comparison-of-custom-geforce-gtx-1080-pcbs Zotac's 16VRM + 3Mem (I think it's really 20VRM + 2AUX + 2Mem = 24 total phases) has over 3 times the power delivery compared to a (reference 5+1) model. If (any) crunchers find this card available - please link it here. Most reports indicate Zotac PGF (with OC+ microcontroller) is specific only to Asia markets. It's possible that AMP Extreme GTX1080 will get the PGF PCB in the USA though not confirmed. (no official word as of yet.) (PGF a great card to cool passively (uni-block and no radiator fans) if one has a (hard-line) water cooled system with a +240mm radiator(s). IMO - numerous fans are not required for radiators.) A Zotac Extreme GTX 970 13 phase with OC+ module (70.7% GPU-Z ASIC rating) and EVGA 8 phase (81.1% ASIC rating) both at >1.5GHz eat most GTX 980 for breakfast over at POEM and PrimeGrid Genefer. Prime numbers are helpful in Bio-medical research. Phases matter: at stock voltages and similar core temps - the EVGA power consumption is 240W (1451MHz) computing Primegrid n=21/22 Genefer WU. The Zotac 210W (1481MHz). Genefer certainly the most powerful (Maxwell) app with (lowest stable overclock) available on the BOINC platform by a -100MHz twin-turbo gap. My 24/7 electric usage and bill went up compared to CUDA6.5 ACEMD. These (2) 970's have a 10-15W difference computing ACEMD WU at 1.5GHz. |
|
Send message Joined: 30 Jul 11 Posts: 2 Credit: 7,052,262 RAC: 0 Level ![]() Scientific publications
|
Just tried my nice shiny new Gigabyte GTX1080 on GPUGRID and I just got Computation Errors on the two wu's I downloaded. :-( Astroids was the same but Collatz works OK, so I know it's not the card. I've turned off the project now (Allow no new taks) until you say it's OK for me to try again, I don't want to waste wu's or time, as I presume you need to adapt for Pascal? |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Just tried my nice shiny new Gigabyte GTX1080 on GPUGRID and I just got Computation Errors on the two wu's I downloaded. :-(Exactly. See this post: GDF wrote: HI, |
|
Send message Joined: 30 Nov 08 Posts: 7 Credit: 62,377,145 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
What time for collatz tasks are you getting? Rac? Did you try anither projects except the gougrid and asteroids? If yes, ehat results? Thanks |
|
Send message Joined: 22 Nov 09 Posts: 114 Credit: 589,114,683 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
For anyone interested, AnandTech published Compute benchmark results for the consumer founders edition cards. http://www.anandtech.com/show/10325/the-nvidia-geforce-gtx-1080-and-1070-founders-edition-review/28 |
©2025 Universitat Pompeu Fabra