Changes to scheduling policy

Message boards : News : Changes to scheduling policy
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38292 - Posted: 4 Oct 2014, 14:21:28 UTC - in response to Message 38290.  

biodoc - it's on acemdbeta and short now. Please test it!

Matt


Ok, will do. Finishing up a windows cuda 6.5 long WU in about an hour.

Thanks!
ID: 38292 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38293 - Posted: 4 Oct 2014, 14:28:09 UTC - in response to Message 38289.  

There'll be a CUDA 6.5 app for linux later today.


Will it work for my 780Ti or is it exclusive for the 980/970?
ID: 38293 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38295 - Posted: 4 Oct 2014, 18:10:43 UTC

I've revised the scheduling policy rules in the original post.
ID: 38295 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Betting Slip

Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38300 - Posted: 5 Oct 2014, 7:36:25 UTC - in response to Message 38295.  

I havr a GTX460 which has not had work for over a day (long)
Radio Caroline, the world's most famous offshore pirate radio station.
Great music since April 1964. Support Radio Caroline Team -
Radio Caroline
ID: 38300 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38318 - Posted: 6 Oct 2014, 12:27:50 UTC - in response to Message 38300.  
Last modified: 6 Oct 2014, 12:29:52 UTC

Hi

no more WUs (Short and Long)

GtX 750ti - Linux - nv-343.22
GTX 780 - Linux - nv-343.22
GTX 780ti - Linux - nv-343.22
GTX 660ti - Linux - nv-343.22
GTX 760 - Linux - nv-343.22
ID: 38318 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38319 - Posted: 6 Oct 2014, 12:32:37 UTC - in response to Message 38318.  

Veneto,

Is your BOINC client new enough to be reporting the driver version number?

Matt
ID: 38319 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38322 - Posted: 6 Oct 2014, 14:46:05 UTC - in response to Message 38318.  

sabayonino, I see that you are getting work now.

Matt
ID: 38322 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38326 - Posted: 6 Oct 2014, 17:28:19 UTC - in response to Message 38322.  
Last modified: 6 Oct 2014, 17:29:12 UTC

sabayonino, I see that you are getting work now.

Matt


Now all my gpus are crunching :) (cuda65)

so my boinc client version is 7.2.42 for all hosts with gpu

maybe it was a temporary problem :)

tnx
ID: 38326 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
valterc

Send message
Joined: 21 Jun 10
Posts: 21
Credit: 10,863,141,443
RAC: 3,504
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38337 - Posted: 7 Oct 2014, 10:23:00 UTC - in response to Message 38326.  

I have more or less the same issues on this host http://www.gpugrid.net/results.php?hostid=178360

Boinc 7.2.42
Ubuntu 14.04.1 LTS (GNU/Linux 3.13.0-36-generic x86_64)

mar 07 ott 2014 11:53:33 CEST | | CUDA: NVIDIA GPU 0: GeForce GTX 780 Ti (driver version unknown, CUDA version 6.5, compute capability 3.5, 3072MB, 2987MB available, 6022 GFLOPS peak)
mar 07 ott 2014 11:53:33 CEST | | OpenCL: NVIDIA GPU 0: GeForce GTX 780 Ti (driver version 343.13, device version OpenCL 1.1 CUDA, 3072MB, 2987MB available, 6022 GFLOPS peak)


I'm able to get short workunits *only*, no matter what I try
ID: 38337 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38339 - Posted: 7 Oct 2014, 11:19:20 UTC - in response to Message 38337.  
Last modified: 7 Oct 2014, 11:21:13 UTC

Hi Valterc


as reported here

only shorts are available
ID: 38339 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38340 - Posted: 7 Oct 2014, 12:18:24 UTC - in response to Message 38339.  

The Linux cuda65 app is on long now.

Matt
ID: 38340 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Hype

Send message
Joined: 21 Nov 11
Posts: 10
Credit: 8,509,903
RAC: 0
Level
Ser
Scientific publications
wat
Message 38414 - Posted: 11 Oct 2014, 21:37:27 UTC

Hello,

unfortunately I'm getting computation errors most of the time.
I've got two GTX 570 with 2.5 GB VRAM each, newest driver 344.11.
Doesn't matter if I'm in SLI or not.
Other GPU projects like SETI, Einstein or Asteroids run fine.
Is there anything I can do?
ID: 38414 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38415 - Posted: 11 Oct 2014, 22:41:56 UTC - in response to Message 38414.  
Last modified: 11 Oct 2014, 22:45:41 UTC

unfortunately I'm getting computation errors most of the time.

If you take a look into your tasks details, you could see the reason for those errors:
# The simulation has become unstable. Terminating to avoid lock-up (1)

This error is a sign of an unstable GPU. The root of this instability can be various:
- Too high GPU temperature (above 80°C - so this is not for you)
- Too low GPU voltage for the given GPU clock
- Too high GPU clock for the given GPU voltage (e.g. an aging GPU could not run even at factory settings)
- Too high GDDR5 frequency
- Insufficient, low quality or (nearly) broken PSU
- Too high transient resistance on the PCIe power connectors (usually caused by Molex->PCIe converters), or on the two 12V pins of the 24-pin MB power connector

I've got two GTX 570 with 2.5 GB VRAM each, newest driver 344.11.

This card has twice as much memory chips as a standard GTX570 has, so perhaps the GPU can't drive the memory data lanes that fast.

Doesn't matter if I'm in SLI or not.

SLI is usually a source of random errors.

Other GPU projects like SETI, Einstein or Asteroids run fine.

Other GPU projects has obsolete GPU applications built on older CUDA versions, while GPUGrid uses the latest (CUDA6.5 at the moment), therefore other projects couldn't stress the GPU as much as the GPUGrid client does.
The "GPU usage" measurement is misleading.

Is there anything I can do?

Check all power connectors in your PC for burnt ones.
Lower the GPU clock by 100MHz steps until it gets stable, if it doesn't work then try again by lowering the GDDR5 frequency by 100MHz steps.
If your GPU gets stable by lowering the GPU clock at some point, you can try to raise the GPU clock by 10-20MHz steps, while it doesn't cause these "simulation became unstable" messages, then increase the GPU voltage by 12.5mV, and repeat increasing the clock while the GPU doesn't get hot.
Beware of that different GPUGrid batches stressing the GPU differently, so if there's no stability headroom in your settings, some harder workunits could fail.
ID: 38415 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Hype

Send message
Joined: 21 Nov 11
Posts: 10
Credit: 8,509,903
RAC: 0
Level
Ser
Scientific publications
wat
Message 38510 - Posted: 14 Oct 2014, 17:26:35 UTC

Thank you very much for the detailed information.
I checked the system and everything looks fine.
I lowered the clocks from 732 mhz to 650 mhz, but had 3 driver crashes while processing 2 short run WUs.
However, no computation errors, both completed successfully.
ID: 38510 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Hype

Send message
Joined: 21 Nov 11
Posts: 10
Credit: 8,509,903
RAC: 0
Level
Ser
Scientific publications
wat
Message 38513 - Posted: 14 Oct 2014, 18:31:42 UTC

And the next WU crashed again at about 10% :-(
ID: 38513 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : News : Changes to scheduling policy

©2025 Universitat Pompeu Fabra