All of a sudden not getting work

STE\/E

Message 4992 - Posted: 28 Dec 2008, 10:39:42 UTC

And, just think, not THAT long ago we were wasting all those GPU cycles completely 24/7 ... now we are wasting them only SOME of the time ... in other words, the cup is really half full ...


Look @ it this way Paul, for some people the Cup is never full or even half full, if they had full Caches they would just find something else to Whine about ... :)
Profile Beyond
Message 5005 - Posted: 28 Dec 2008, 14:35:28 UTC - in response to Message 4988.  

The pentagram goes on the ceiling, a ring of salt on the floor:D

Oh man, no wonder that one didn't work!
Profile Beyond
Message 5007 - Posted: 28 Dec 2008, 14:45:19 UTC - in response to Message 4990.  

You say "set to DL 3 days of work". Do you mean the cache is set for 3 days? GPUGRID has short turnaround times and typically wants work back within 3-4 days, so that might be the reason why you can't get any (as well as BOINC 6.4.5 not working properly). Try setting the cache a bit lower, to say 2 days; that way BOINC should think it can complete the work in time.

I tried both longer and shorter caching, neither worked at the time. It was a server problem yesterday AM. Those who had work in their caches probably didn't notice it but many had the no work problem. I don't use v6.4.5, it's highly flawed (they need to take the message saying to use v6.4.5 off the front page). At the time I tried v6.3.21, v6.4.1 and v6.4.2. None of them worked during the server problem. Things seem to be working again (at least here) and I've received 3 WUs since the server started working more or less properly again.
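
The cache-versus-deadline reasoning in the quoted suggestion can be sketched as a simple check. This is only an illustration under a strong simplifying assumption (a new task waits behind a full cache, then runs once); `fits_deadline` and the numbers below are hypothetical and are not BOINC's actual work-fetch logic. As noted above, it did not explain the server-side outage either.

```python
def fits_deadline(cache_days: float, task_runtime_days: float, deadline_days: float) -> bool:
    """Return True if a newly fetched task could plausibly finish before its deadline.

    Assumption (hypothetical model): the task waits behind a full cache of
    `cache_days` of work and then runs for `task_runtime_days`.
    """
    return cache_days + task_runtime_days <= deadline_days


# Hypothetical numbers: a task of about 8 hours with a 3-day deadline.
runtime_days = 8 / 24
print(fits_deadline(3.0, runtime_days, 3.0))  # 3-day cache -> False
print(fits_deadline(2.0, runtime_days, 3.0))  # 2-day cache -> True
```

With these made-up figures a 3-day cache leaves no room before a 3-day deadline, while a 2-day cache does.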
Profile mike047

Message 5011 - Posted: 28 Dec 2008, 15:01:48 UTC - in response to Message 5007.  

I use v6.4.5 on Ubuntu and a 1.5 day cache. The system has worked as it is supposed to for the second day in a row. When work is completed, it gets one more unit. No intervention on my part.

Of course, now that I have said that....everything will fall apart.:D

Need more salt.

mike
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Message 5012 - Posted: 28 Dec 2008, 15:05:10 UTC

- I'm sure the devs know the message "no cell blabla" doesn't fit.. but since the real reason is not known yet they didn't make it more specific. Might be good to at least include one for "really no gpu work available", though!

- yesterday was a pure server problem, the server logs have been sent to the BOINC devs.. let's keep our fingers crossed they find the bug

- for everyone who accidentally painted the pentagram on the floor: you can still use it, no need to rip off your parquet just yet! Do the dance upside down and mount the salt to the ceiling, just be careful not to choke yourself while blowing the horn.

MrS
Scanning for our furry friends since Jan 2002
Profile Paul D. Buck

Message 5018 - Posted: 28 Dec 2008, 16:17:26 UTC - in response to Message 4992.  

And, just think, not THAT long ago we were wasting all those GPU cycles completely 24/7 ... now we are wasting them only SOME of the time ... in other words, the cup is really half full ...


Look @ it this way Paul, for some people the Cup is never full or even half full, if they had full Caches they would just find something else to Whine about ... :)

Like me complaining at Rosetta that 4 tasks died with a File lock problem ... I have since, now, run one task to completion and have another with an hour to go ...

Ah well, if not one thing it is another ...
Alain Maes

Message 5032 - Posted: 29 Dec 2008, 9:32:13 UTC

My last manual update now dates back to two days ago when I managed to get 4 WUs. Since then work fetch works again automatically, my cache being kept at 3 WUs almost continuously with one running. Great! And just in time for me since I am leaving in an hour for a week to visit my parents.

Just one remark though. I noticed in BOINCVIEW that my waiting WUs have a completion time of 11 days and almost 16 hours. As soon as a WU starts it immediately jumps to a completion time of 7 hr 50 min, which is BTW pretty accurate for my GTX260.

Kind regards and happy crunching.

Alain
Profile Paul D. Buck

Message 5059 - Posted: 30 Dec 2008, 1:28:51 UTC

I got the no-work message from a call at 12/29/2008 4:59:50 (California time), and I think the situation was caused THIS TIME by having one task in my queue that was rated at 30+ hours. That estimate comes from having that 2-plus-day task the other day, with the client still thinking that the computer has a 9800 GT rather than the new GTX 280 ... I am 36 minutes in, 9.8% done, and the estimate is dropping rapidly from 30 hours; it is currently at 26:06 and falling.

*SO*, what that tells me is that the "generic" message we are getting for all sorts of cases is probably related to the scheduler thinking that I have plenty to do and there is no need to give me more work ... my request asked for 56,124 seconds of work, my queue is set to 0.50 days extra work, I have 6 projects on the computer with two out of work (Pirates and LHC) with work from Cosmology and WCG, I don't know why Malaria is dry ...

So, I have 9 tasks in progress, one GPU Grid, 4 WCG, and 4 Cosmology... I have 7 cosmology queued and 4 additional WCG ... runtimes span 2 hours remaining to 20 hours to complete from cold start ...

Thumbnail look seems like I only have about half a day's work in hand ... though I may be just on the other side at 0.6 days in hand...

I need a nap ... it is likely I will be back up in a couple hours and I will report if anything changes ...

Oh, when I looked there were 700 plus tasks in the feeder queue ...
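
As a rough cross-check on the progress figures mentioned in this post (36 minutes elapsed, 9.8% done), the remaining time can be extrapolated linearly. This is only a sketch, assuming progress is roughly linear over the task, which may not hold in practice; `extrapolate_hours` is a hypothetical helper, not anything the BOINC client provides.

```python
def extrapolate_hours(elapsed_minutes: float, fraction_done: float) -> tuple:
    """Linearly extrapolate (total_hours, remaining_hours) from progress so far.

    Assumption: the task progresses at a constant rate.
    """
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    total_hours = elapsed_minutes / fraction_done / 60
    remaining_hours = total_hours - elapsed_minutes / 60
    return total_hours, remaining_hours


total, remaining = extrapolate_hours(36, 0.098)
print(f"projected total: {total:.1f} h, remaining: {remaining:.1f} h")
# prints "projected total: 6.1 h, remaining: 5.5 h"
```

Under that assumption the task would finish in roughly 6 hours, far below the 26+ hours the client was still showing, which fits the idea that an inflated estimate made the queue look fuller than it was.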
Profile Kokomiko
Message 5061 - Posted: 30 Dec 2008, 2:21:01 UTC

That's similar to the situation here. To get new work I have to stop the other projects. I think it's a problem with how the scheduler handles the high-priority mode for the core that feeds the GPU in the background, hidden from the user. I only get new work if all other tasks are stopped and the system makes a call for more than 250,000. Running here: CPDN (over 800 hours), PrimeGrid and MilkyWay, with work for the 0.5-day work cache; LHC and Pirates are calling for work without success.
Profile Paul D. Buck

Message 5069 - Posted: 30 Dec 2008, 8:34:10 UTC

OK, up from my nap. With one task nearing completion, I asked for more work and got two tasks.

Yes there are issues with the work fetch policy in that it has not been generalized. Worse ... well, I posted another thread where you can see the discussion as it stands at the moment...

But, yes, there are issues and the current policy as coded in the BOINC Manager may let the CUDA resource go idle ... oops ... :)
Profile [AF>Libristes>Jip] Elgrande71
Message 5075 - Posted: 30 Dec 2008, 15:02:55 UTC

I always had these messages :
Full-atom molecular dynamics for Cell processor is not available for your type of computer.
Full-atom molecular dynamics on Cell processor is not available for your type of computer.

They irritate me a lot now. I tried reattaching the project and all of the available options, but nothing solves the problem.
These messages appeared on hosts 6362, 5716 and 15576.
I am really tired of this situation.
Fix it rapidly please.
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Message 5077 - Posted: 30 Dec 2008, 15:49:19 UTC - in response to Message 5075.  

This is really a problem for us as well. There was a change of scheduler policy a couple of weeks ago in the BOINC server which is buggy. Please post on the BOINC dev forums as well. The holiday period is slowing them down in fixing it.


gdf
Profile Paul D. Buck

Message 5080 - Posted: 30 Dec 2008, 16:13:57 UTC

I had the problem for a couple hours this morning but just got two more (whew!) ...

Are you feeding PS3 and Nvidia systems off of the same feeder?

I wonder if the pull rate is such that the feeder gets "clogged" with tasks for the other platform ...

Other projects have had issues like this ... in the dark past ...
Profile Stefan Ledwina
Message 5083 - Posted: 30 Dec 2008, 16:44:32 UTC - in response to Message 5080.  

Looking at the server status it seems there's only one feeder for all platforms...

pixelicious.at - my little photoblog
Profile Nognlite

Message 5086 - Posted: 30 Dec 2008, 16:50:39 UTC

No work again. Second day in a row. (As my 280's sit idly by contemplating their existence!!)

Pat
lllvette
Message 5089 - Posted: 30 Dec 2008, 17:58:26 UTC

When is this "Atom Not available " Problem Going to be fixed? It Has been going on for about 2 weeks now.... Is Any one addressing this problem? I have tried 6.4.2, 6.5.0 versions, Reset project several times , Reloaded drivers Etc..and nothing works... Please help... Thanks.
Profile Stefan Ledwina
Message 5094 - Posted: 30 Dec 2008, 18:59:43 UTC - in response to Message 5089.  

"lllvette" wrote:
When is this "Atom Not available " Problem Going to be fixed? It Has been going on for about 2 weeks now.... Is Any one addressing this problem? I have tried 6.4.2, 6.5.0 versions, Reset project several times , Reloaded drivers Etc..and nothing works... Please help... Thanks.


Since you are posting in the right thread, could you please look at the last posts? ;) Especially the one 4 posts before yours...

The project admin is aware of the problem but cannot fix it, because there's something wrong with the server software and the project is also waiting for a fix from Berkeley...


"GDF" wrote:
This is really a problem for us as well. There was a change of scheduler policy a couple of weeks ago in the BOINC server which is buggy. Please post on the BOINC dev forums as well. The holiday period is slowing them down in fixing it.


gdf




pixelicious.at - my little photoblog
JAMC

Message 5099 - Posted: 30 Dec 2008, 21:43:04 UTC

Not able to get any new WU's even with manual priming...5 machines... old "Atom Not Available" message...server problems???
Profile Kokomiko
Message 5100 - Posted: 30 Dec 2008, 22:18:19 UTC

Same here, got the last WU at 4:44 UTC; now 3 machines will run dry in the next 4 to 8 hours.
STE\/E

Message 5101 - Posted: 30 Dec 2008, 22:19:42 UTC - in response to Message 5089.  

When is this "Atom Not available " Problem Going to be fixed? It Has been going on for about 2 weeks now.... Is Any one addressing this problem? I have tried 6.4.2, 6.5.0 versions, Reset project several times , Reloaded drivers Etc..and nothing works... Please help... Thanks.


Nothing you can do about it lllvette, some of my Boxes are out too and I've switched them to 4&1 from 3&1 so 1 Core doesn't sit Idle waiting for another GPU Wu.

It wouldn't be so bad though if the Project would change that 24 Hour Wait Period before it tries to contact the Server again, so we didn't have to Manually Contact every so often to try & pick up a Wu ...
©2025 Universitat Pompeu Fabra