Any changes to project since March30 ?

Message boards : Graphics cards (GPUs) : Any changes to project since March30 ?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8192 - Posted: 4 Apr 2009, 23:46:07 UTC

MarkJ reported a hanging task similar to what you see / have seen and it also happened on his box with 2 GPUs.. could mean something.

MrS
Scanning for our furry friends since Jan 2002
ID: 8192 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8194 - Posted: 5 Apr 2009, 0:04:56 UTC

@ Paul: Did you restart BOINC?

I did that on mine and that seemed to get them moving. Failing that try rebooting the machine.
BOINC blog
ID: 8194 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 8201 - Posted: 5 Apr 2009, 7:26:25 UTC - in response to Message 8194.  
Last modified: 5 Apr 2009, 8:23:21 UTC

@ Paul: Did you restart BOINC?

I did that on mine and that seemed to get them moving. Failing that try rebooting the machine.

I did a restart, though not a cold start, between "sets" of these odd tasks. But the second pair just completed this afternoon.

With NO change to the system, and no restart, I have 4 in flight with all of them on track to complete in the normal 6 hours, for example the one at 25% done is 1:33 hours with 7 hours to run ...

Oh, now that is interesting ... one GPU task is in "Waiting" ... now why is that ...

Even more interesting is the fact that the task suspended has a deadline sooner than the task that is running ... Hmmm, two tasks where the time to complete is INCREASING 3 seconds for each runtime second ...

Ok, I just shot two of those tasks and down leveled to 6.5.0 to see what will happen (also did a re-boot) ...

{edit, add}

Anyone else having this issue, would you please note which version of BOINC are you using? This *MAY* be an issue with 6.6.x, assuming we were all running 6.6.15+ when we saw this issue. If we were all running 6.6.x, the next question is if you are running into this problem in a system with multiple GPU "cores" (more than one GPU task in progress at the same time.

I have a suspicion as to what might be happening and need this information to make a more coherent report to the Alpha mailing list ...

I have been watching since I down leveled to 6.5.0 and I don't seem to have the odd behavior of slowly increasing time to completion ...

I will also note that this increased time may NOT be correctly reflected in the data shown on the task page of GPU Grid after you report the task ...
ID: 8201 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8202 - Posted: 5 Apr 2009, 9:30:52 UTC

I see the following from the Home page news:

A new batch of workunits out
April 3, 2009
We have just submitted a new batch of workunits which improve further the accuracy of the free energy calculations


Maybe this has something to do with it?
BOINC blog
ID: 8202 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 8216 - Posted: 5 Apr 2009, 15:49:04 UTC - in response to Message 8202.  

I see the following from the Home page news:

A new batch of workunits out
April 3, 2009
We have just submitted a new batch of workunits which improve further the accuracy of the free energy calculations


Maybe this has something to do with it?

I don't think so, but it could...

The reason I don't think so is that I changed my version BACK down to 6.5.0 and the issue seems to have gone away. I have this suspicion as to what else may have happened. It could be one, or the other, or both ... but, when I first asked this question, GDF seemed to indicate that this was not the issue ...

At any rate ... if no one else has the issue, then it was just me, and in that case I would have to ascribe it to 6.6.20 ...
ID: 8216 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8242 - Posted: 6 Apr 2009, 12:22:35 UTC

I picked up a GIANNI wu which locked up at 13%. Was sitting there when I got home tonight. No movement in %, just the times counting up. I restarted boinc which got it going again (now past 20%), but seems to be rather unusual. Might be a feature in boinc 6.6.20 which that machine is now running.

Its here, but I have yet to upload it. There goes my bonus credits.
BOINC blog
ID: 8242 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8252 - Posted: 6 Apr 2009, 17:01:30 UTC - in response to Message 8242.  

Occasionally lock-ups happen. Might be nice if the app could detect that it's not progressing any more and could initiate a restart of itself. But I guess this is not trivial to implement.. as a task which is stuck doesn't execute its code any more.

MrS
Scanning for our furry friends since Jan 2002
ID: 8252 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 8260 - Posted: 6 Apr 2009, 18:57:50 UTC - in response to Message 8242.  

I picked up a GIANNI wu which locked up at 13%. Was sitting there when I got home tonight. No movement in %, just the times counting up. I restarted boinc which got it going again (now past 20%), but seems to be rather unusual. Might be a feature in boinc 6.6.20 which that machine is now running.

Its here, but I have yet to upload it. There goes my bonus credits.

The new version is supposed to be detecting stalled tasks that increase in time but not percentage... but like all new features, I don't know if it works yet or not ...
ID: 8260 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8262 - Posted: 6 Apr 2009, 20:20:35 UTC - in response to Message 8260.  

The new version of BOINC, not GPU-Grid itself?
Would be a cool feature, if it really worked. They'd have to take normal runtime / update intervals into account, though. Otherwise they might prevent very slow computers or huge WUs from running at all, because they seem to be stalled.

MrS
Scanning for our furry friends since Jan 2002
ID: 8262 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8263 - Posted: 6 Apr 2009, 21:31:40 UTC - in response to Message 8262.  

The new version of BOINC, not GPU-Grid itself?
Would be a cool feature, if it really worked. They'd have to take normal runtime / update intervals into account, though. Otherwise they might prevent very slow computers or huge WUs from running at all, because they seem to be stalled.

MrS


Supposedly in BOINC. But its only meant to work for cpu-intensive tasks, so it doesn't work with GPUgrid.

That GIANNI task locked up again at 87& this time. I have posted about it on the boinc alpha mailing list. Restarted BOINC and its progressing again now.
BOINC blog
ID: 8263 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 8264 - Posted: 6 Apr 2009, 21:46:29 UTC - in response to Message 8263.  

The new version of BOINC, not GPU-Grid itself?
Would be a cool feature, if it really worked. They'd have to take normal runtime / update intervals into account, though. Otherwise they might prevent very slow computers or huge WUs from running at all, because they seem to be stalled.

MrS


Supposedly in BOINC. But its only meant to work for cpu-intensive tasks, so it doesn't work with GPUgrid.

According to the glossy advertising it is supposed to work with both ...

But, I have serious doubts about 6.6.20 for other reasons ...
ID: 8264 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
The_Bad_Penguin

Send message
Joined: 16 Dec 07
Posts: 23
Credit: 9,286,325
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwatwat
Message 8814 - Posted: 24 Apr 2009, 2:06:25 UTC

Away for almost three weeks at F@H.

I'm going to give this a shot again...
ID: 8814 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Graphics cards (GPUs) : Any changes to project since March30 ?

©2025 Universitat Pompeu Fabra