Advanced search

Message boards : Graphics cards (GPUs) : DCF

Author Message
Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4329 - Posted: 14 Dec 2008 | 17:01:58 UTC

The server should be now ignoring dcf for the time being.
I will reuse it in a couple of weeks when things are stabilized.
Let me know if the situation improves.

gdf

OK

It has started to drop from the 100.

Last check shows DCF at 99.011805. I will leave my host 6133 untouched, so as it can be monitored to see if it corrects back to near 1. Although it will be slow as it takes 16 hours per task. I will check again before I leave on vacation, in about 5 days, then it has 10 days to run untouched.

Followup. DCF still dropping.

Went to 98 and now down to:
Task duration correction factor 85.950111

To completion dropping too, it was some 785 hours maybe when DCF was 98 to 100,

Last task started with 128 hours to completion estimate.

I still do not get another task when one finishes, only way is to suspend other running projects which will force the schedule to request work, then I can resume the other projects. I assume once the DCF drops lower this behavior will stop.
____________
Alpha Tester ~~ BOINCin since 10-Apr-2004 (2.28) ~~~ Join team USA

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 422,354,314
RAC: 938,897
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4335 - Posted: 14 Dec 2008 | 18:32:10 UTC - in response to Message 4329.


I still do not get another task when one finishes, only way is to suspend other running projects which will force the schedule to request work, then I can resume the other projects. I assume once the DCF drops lower this behavior will stop.


I manually reset the project after I ran out of work recently. This changed the DCF back to 1, but I still had one or two "High Priority" tasks and had to suspend other project to get work. After those two finished, I now have normal priority GPUGrid WU's processing and it keeps 4 WU's in my queue (2 processing on my 2 8800GT's and 2 waiting). So, I would expect it would do the same for you, eventually.

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4336 - Posted: 14 Dec 2008 | 18:58:22 UTC

The DCF is again goof up on my PC. After this WU 161943 the next WU 162890 have again to run with a destroyed DCF of 60.830372 and a estimated runtime of over 80 hours. I'm really one of the hardest while testing software, but the actual situation is annoying. Looks like it's the wrong way to play with the DCF to control the behavior of he GPU work priority.
____________

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4340 - Posted: 14 Dec 2008 | 20:10:37 UTC - in response to Message 4336.

Anderson says that resetting project will reset the dcf.

In the meanwhile, I have returned to previous values, with factor two corrections applied. I will slowly move the server to the correct values.

gdf

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 422,354,314
RAC: 938,897
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4379 - Posted: 16 Dec 2008 | 5:13:33 UTC - in response to Message 4336.

The DCF is again goof up on my PC.


My DCF is ~0.7 after going back to 1.00 after a manual reset. However, my estimate times for GPUGrid WU's are back in the 100's of hours (starting at about 200 hours). This is after they went back down to approx. normal.

I have several other non-GPU WU's crunching, with High Priority as well. Not sure if the DCF is completely to blame. I now have 6 tasks running (5 in High Priority, 1 of those in GPUGrid, with another GPUGrid not running High Priority).

I know I could set the processor usage to 99% to get down to at least 5 active WU's (both GPUGrid and not), but don't want to have to apply a fix that will need to be changes when everything returns to normal.

It appears that I always have 1 GPUGrid tasks running High Priority and the other not. It will not fetch new work. All this starting after I "upgraded" to client 6.4.5. I'll let it run, only manually suspending other tasks as necessary to get new work for GPUGrid. I'd prefer not to babysit Boinc, though.

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4382 - Posted: 16 Dec 2008 | 12:40:47 UTC

My DCF on this host has jumped back up to 100.

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,173,502,885
RAC: 27,150,888
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 4384 - Posted: 16 Dec 2008 | 13:00:34 UTC

I reset several Box's this morning, when the new Wu's Downloaded 2 of them showed 22:00:00 Hours to completion (which is still about 4 times what it should be) & 2 showed 279:00:00 Hours to completion. I'm about 1/2 thru the first Wu & when I checked my DCF in the client_state.xml file it showed only 1.000000 for a DCF Value.

So I take it the DCF Value in the .xml file won't change until after the first Wu is finished, also if the Value was reset to 1.000000 why do 2 of the Wu's still show 279 Hours to completion when they Downloaded ... ???

Profile JStateson
Avatar
Send message
Joined: 31 Oct 08
Posts: 186
Credit: 3,408,332,333
RAC: 809,220
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4386 - Posted: 16 Dec 2008 | 13:17:41 UTC

hmm - I have been seeing this message quote:
"Message from server:(wont finish in time) BOINC runs 97.4% of time, computation enabled 100.0% of that" and cant get any WU's.

I went and reset the project and immediately gpugrid downloaded two tasks. Time to complete is just under 14 hours, about right.

Not sure that is going on and that error message does not make a lot of sense. I always have computation enabled 100% and BOINC runs all the time.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4389 - Posted: 16 Dec 2008 | 13:36:32 UTC - in response to Message 4386.

With the current update of the server everything should more or less work.
However, the server still cannot discriminate between an 8800 or GTX280, so the WU estimates are tuned on a 8800. Soon, the server will improve is estimate on the speed of your GPU.

gdf

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,173,502,885
RAC: 27,150,888
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 4390 - Posted: 16 Dec 2008 | 13:40:41 UTC - in response to Message 4386.

hmm - I have been seeing this message quote:
"Message from server:(wont finish in time) BOINC runs 97.4% of time, computation enabled 100.0% of that" and cant get any WU's.

I went and reset the project and immediately gpugrid downloaded two tasks. Time to complete is just under 14 hours, about right.

Not sure that is going on and that error message does not make a lot of sense. I always have computation enabled 100% and BOINC runs all the time.


I have several Box's dead in the water at the moment because the Server won'r give them work, but it'll try in 24 hours again "rolls eyes" ... ;)

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4391 - Posted: 16 Dec 2008 | 14:36:26 UTC - in response to Message 4390.

.......... I'm seeing some improvement - last couple of WUs came down with a 30hour ETA.
GDF - it would be nice if you could summarise the project's current position & advice to crunchers. In one place it would be good to have your thoughts on downgrading the client - or not, manually adjusting the DCF, hanging in there....etc.

P.

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,173,502,885
RAC: 27,150,888
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 4392 - Posted: 16 Dec 2008 | 14:42:13 UTC - in response to Message 4391.
Last modified: 16 Dec 2008 | 15:04:51 UTC

.......... I'm seeing some improvement - last couple of WUs came down with a 30hour ETA.
GDF - it would be nice if you could summarise the project's current position & advice to crunchers. In one place it would be good to have your thoughts on downgrading the client - or not, manually adjusting the DCF, hanging in there....etc.

P.


It doesn't seem to matter if you Downgrade the Client which I did on all my Box's last night, or if you manually adjust the DCF, or if you Re-Set the project, the Wu's still come with hap-hazardly To Completion Times.

I can Reset the Project & get 4 new Wu's & 1 will be at 5 hours 1 at 22 hours and maybe 2 at 279 hours. I'm still getting some as high as 27,000 hours even after a Reset of the Project.

BOINCView has been even messing up the last few days on me & I think it may be some something to do with the high To Completion Times, could be wrong on that though ...

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 422,354,314
RAC: 938,897
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4393 - Posted: 16 Dec 2008 | 15:14:56 UTC - in response to Message 4391.

I'm seeing some improvement - last couple of WUs came down with a 30hour ETA. - Burdett


None of my WU's (GPUGRid or otherwise) are running High Priority now. It's been a little see-saw over the past 48-72 hours, but everything appears normal again now. I have two WU's crunching and two pending, 1 with a 5 hr estimate and 1 with a 21 hour estimate. Now, I have 8800GT's and those estimates are both low and high, but certainly much better than the 1000's of hours I was seeing before.

It doesn't seem to matter if you Downgrade the Client which I did on all my Box's last night, or if you manually adjust the DCF, or if you Re-Set the project, the Wu's still come with hap-hazardly To Completion Times. - Poorboy


I agree. After my manual reset, I did see an improvement but it eventually went back to high estimated times. I didn't do anything at that point (besides suspending other tasks to get GPUGrid work) and it appears that everything is OK for the moment.

frankhagen
Send message
Joined: 18 Sep 08
Posts: 65
Credit: 3,037,414
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 4397 - Posted: 16 Dec 2008 | 16:27:49 UTC - in response to Message 4393.

about time for a new highscore-thread?

latest one started with 1190 hours on an 260..

frankhagen
Send message
Joined: 18 Sep 08
Posts: 65
Credit: 3,037,414
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 4398 - Posted: 16 Dec 2008 | 18:30:13 UTC - in response to Message 4397.

next one: 18300 hours - rising like mad..

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 422,354,314
RAC: 938,897
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4400 - Posted: 16 Dec 2008 | 18:47:29 UTC

My latest completed WU was back to the shorter ones: Last WU.

2435 credit vs. 3232 credit

Is there any correlation between this issue with high estimated times / high priority tasks and the smaller WU's? Once it reverted back to the "3232" WU's 2 days ago, my rig appear to stabilize and work correctly. That is until it started process the "2435" WU's again.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4401 - Posted: 16 Dec 2008 | 18:47:35 UTC - in response to Message 4398.

Which workunit name?

gdf

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4404 - Posted: 16 Dec 2008 | 19:16:18 UTC - in response to Message 4401.

... I'm back to 'work not available for your type of computer' message - followed by the 24hour 'sod off' back off.
As it now appears we have to manage GPUGrid manually - could we remove (or double) the 1 WU per CPU limit - I can get through 8 WUs in 24 hours or so on one of my hosts ... if I could load up, the 24hours backoff could pass without idle time. Or just reduce the 24 backoff to something more useful - 6 hours?



P.

frankhagen
Send message
Joined: 18 Sep 08
Posts: 65
Credit: 3,037,414
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 4405 - Posted: 16 Dec 2008 | 21:15:20 UTC - in response to Message 4401.

Which workunit name?

gdf


WU:124466 currently @ 39% showing 6725 hours remaining.

host-id is 16930

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4406 - Posted: 16 Dec 2008 | 21:19:47 UTC

OK, Mine dropped again from 100 to 'Task duration correction factor 91.193244'

This will be my last post. I have some more last minute preparaions for vacation tonight, then I'm off tomorrow for 12 days. MErry Christmas to all, see you all later, much later.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4409 - Posted: 17 Dec 2008 | 9:56:32 UTC - in response to Message 4405.

You should reset project. Your DCF is 88.

gdf

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4633 - Posted: 20 Dec 2008 | 19:59:03 UTC
Last modified: 20 Dec 2008 | 20:03:17 UTC

... and there must still be an error in the workunits, that's messed up the DCF. After reporting this workunit 170459 my DCF goes up to 7.478452 and the estimated time of the next WU jumps from the regular 6:28 up to over 60 hours, the task fall in high prio mode and blocks the load of another workunit.
____________

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4652 - Posted: 21 Dec 2008 | 2:07:19 UTC - in response to Message 4633.

... and there must still be an error in the workunits, that's messed up the DCF. After reporting this workunit 170459 my DCF goes up to 7.478452 and the estimated time of the next WU jumps from the regular 6:28 up to over 60 hours, the task fall in high prio mode and blocks the load of another workunit.


Try to restart project. `Full restart` with detaching and deleting all files which will be not removed after detaching.
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4655 - Posted: 21 Dec 2008 | 5:42:55 UTC - in response to Message 4652.


Try to restart project. `Full restart` with detaching and deleting all files which will be not removed after detaching.


I know other ways ... it's only a hint, that there still a problem exist.

____________

Post to thread

Message boards : Graphics cards (GPUs) : DCF

//