Advanced search

Message boards : Graphics cards (GPUs) : 6.4.4 is out for all platforms

Author Message
Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4193 - Posted: 7 Dec 2008 | 8:06:41 UTC

Continuation from http://www.gpugrid.net/forum_thread.php?id=539&nowrap=true#4192

I'm told the behavior in 6.4.3 is correct.

It has been changed because the goal is to have GPU jobs run at top speed without people having to change their #CPUs preference. So if there is a GPU job running, the scheduler will avoid saturating the available CPU's (otherwise the GPU job runs slow).


I hope they changed it only for Windows and not for Linux, otherwise people would end up having one core idle if they use Linux...

I'll give 6.4.4 a try on a Linux box...



They changed it also for the Linux BOINC Clients...
Great! Windows users can now run GPUGRID at full speed because the app gets a full core, but Linux users have now one idle CPU core, because the Linux GPUGRID app doesn't need a full core...

Now Linux users will have to write a cc_config.xml with ncpus+1 again to use all their CPU cores...
____________

pixelicious.at - my little photoblog

Profile The Gas Giant
Avatar
Send message
Joined: 20 Sep 08
Posts: 54
Credit: 607,157
RAC: 0
Level
Gly
Scientific publications
watwatwatwat
Message 4194 - Posted: 7 Dec 2008 | 10:22:51 UTC

It's still not working quite right.

I upgrade to 6.4.4, it runs benchmarks and starts 3 tasks on my quad core with the %processors at 99%. I change it to 100% and 5 tasks start. I close BOINC and restart and it starts 4 tasks while at 100% of cpu's.....

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4195 - Posted: 7 Dec 2008 | 10:27:59 UTC - in response to Message 4194.

On Windows?

With BOINC 6.4.4 on Vista 64bit and 100% of CPUs I have three CPU tasks and one GPU task running. Like it should work now...

Can you have a look what happens after the next task switch?
____________

pixelicious.at - my little photoblog

Profile The Gas Giant
Avatar
Send message
Joined: 20 Sep 08
Posts: 54
Credit: 607,157
RAC: 0
Level
Gly
Scientific publications
watwatwatwat
Message 4196 - Posted: 7 Dec 2008 | 10:49:49 UTC

XP. On task switch with 100% of CPUs it is running 5 tasks.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4197 - Posted: 7 Dec 2008 | 11:20:49 UTC - in response to Message 4193.

Continuation from http://www.gpugrid.net/forum_thread.php?id=539&nowrap=true#4192

I'm told the behavior in 6.4.3 is correct.

It has been changed because the goal is to have GPU jobs run at top speed without people having to change their #CPUs preference. So if there is a GPU job running, the scheduler will avoid saturating the available CPU's (otherwise the GPU job runs slow).


I hope they changed it only for Windows and not for Linux, otherwise people would end up having one core idle if they use Linux...

I'll give 6.4.4 a try on a Linux box...



They changed it also for the Linux BOINC Clients...
Great! Windows users can now run GPUGRID at full speed because the app gets a full core, but Linux users have now one idle CPU core, because the Linux GPUGRID app doesn't need a full core...

Now Linux users will have to write a cc_config.xml with ncpus+1 again to use all their CPU cores...


That's strange! It seems like they decided this "0.9 CPU 1.0 CUDA" sharing doesn't work and dropped the concept completely? Otherwise the project could just have set 1.0 CPU 1.0 CUDA and they could have achieved the same effect.

With the second method they could implement some way to set different CPU values for different OSes. You could also give the Win / Linux BOINC clients different behaviour, but then everything might get screwed up as soon as there's a second GPU project!

This really has to be set by the project and not by BOINC.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4199 - Posted: 7 Dec 2008 | 11:45:34 UTC - in response to Message 4197.
Last modified: 7 Dec 2008 | 11:53:34 UTC

Could you post that to the BOINC alpha email list? Maybe they'll listen to you... ;)

I really hope the Synedoche guys will do a better job...
____________

pixelicious.at - my little photoblog

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4200 - Posted: 7 Dec 2008 | 16:59:45 UTC - in response to Message 4197.

Continuation from http://www.gpugrid.net/forum_thread.php?id=539&nowrap=true#4192

I'm told the behavior in 6.4.3 is correct.

It has been changed because the goal is to have GPU jobs run at top speed without people having to change their #CPUs preference. So if there is a GPU job running, the scheduler will avoid saturating the available CPU's (otherwise the GPU job runs slow).


I hope they changed it only for Windows and not for Linux, otherwise people would end up having one core idle if they use Linux...

I'll give 6.4.4 a try on a Linux box...



They changed it also for the Linux BOINC Clients...
Great! Windows users can now run GPUGRID at full speed because the app gets a full core, but Linux users have now one idle CPU core, because the Linux GPUGRID app doesn't need a full core...

Now Linux users will have to write a cc_config.xml with ncpus+1 again to use all their CPU cores...


That's strange! It seems like they decided this "0.9 CPU 1.0 CUDA" sharing doesn't work and dropped the concept completely? Otherwise the project could just have set 1.0 CPU 1.0 CUDA and they could have achieved the same effect.

With the second method they could implement some way to set different CPU values for different OSes. You could also give the Win / Linux BOINC clients different behaviour, but then everything might get screwed up as soon as there's a second GPU project!

This really has to be set by the project and not by BOINC.

MrS

Well yes and no.

When a project sets CPU=1, the task is run at 'low' system priority.
When a project sets CPU<1, the task is run at a higher priority, for windows it is run as Normal. Windows has 6 steps, low to Realtime with Normal being two steps above Low.

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4201 - Posted: 7 Dec 2008 | 17:25:27 UTC - in response to Message 4200.

Yea, that's ok, but there wasn't a difference between low and normal priority with the acemd tasks...
____________

pixelicious.at - my little photoblog

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4202 - Posted: 7 Dec 2008 | 17:28:31 UTC - in response to Message 4199.

Could you post that to the BOINC alpha email list? Maybe they'll listen to you... ;)

I really hope the Synedoche guys will do a better job...

I thought of that.

Can you tell me what cpu (core) percentange you show as being used and the system priority for both windows and linux.

On Windows XP I get about 30-40% CPU (which is 70-80% of one core) at 'Normal'

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4203 - Posted: 7 Dec 2008 | 17:31:27 UTC - in response to Message 4194.

It's still not working quite right.

I upgrade to 6.4.4, it runs benchmarks and starts 3 tasks on my quad core with the %processors at 99%. I change it to 100% and 5 tasks start. I close BOINC and restart and it starts 4 tasks while at 100% of cpu's.....

Running 5 tasks could be normal if it determined some task was in deadline trouble.

Profile The Gas Giant
Avatar
Send message
Joined: 20 Sep 08
Posts: 54
Credit: 607,157
RAC: 0
Level
Gly
Scientific publications
watwatwatwat
Message 4206 - Posted: 7 Dec 2008 | 19:09:47 UTC - in response to Message 4203.

It's still not working quite right.

I upgrade to 6.4.4, it runs benchmarks and starts 3 tasks on my quad core with the %processors at 99%. I change it to 100% and 5 tasks start. I close BOINC and restart and it starts 4 tasks while at 100% of cpu's.....

Running 5 tasks could be normal if it determined some task was in deadline trouble.

There were no tasks in deadline trouble and if there were, stopping and restarting BOINC should not change the number of tasks running. Just some strange behaviour.

Profile Aardvark
Avatar
Send message
Joined: 27 Nov 08
Posts: 28
Credit: 82,362,324
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 4209 - Posted: 7 Dec 2008 | 19:51:06 UTC

Hi

I've downloaded "boinc_6.4.4_windows_x86_64". Getting error "Error reading setup initialisation file" when I try to install 6.4.4. I'm running Vista premium 64 bit.
Anybody else with this issue ?

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4210 - Posted: 7 Dec 2008 | 21:31:14 UTC - in response to Message 4209.

Nope... I also installed it on Vista Premium 64 bit without an error...

Maybe it's only a corrupt installation file because something happened during the download? Have you tried to download the installer again?
____________

pixelicious.at - my little photoblog

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4211 - Posted: 7 Dec 2008 | 21:33:08 UTC - in response to Message 4202.

Could you post that to the BOINC alpha email list? Maybe they'll listen to you... ;)

I really hope the Synedoche guys will do a better job...

I thought of that.

Can you tell me what cpu (core) percentange you show as being used and the system priority for both windows and linux.

On Windows XP I get about 30-40% CPU (which is 70-80% of one core) at 'Normal'


I'll try to do that tomorrow...
But I just read your email you sent to BOINC alpha, you explained the problem pretty good!
____________

pixelicious.at - my little photoblog

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4213 - Posted: 7 Dec 2008 | 22:20:16 UTC - in response to Message 4211.
Last modified: 7 Dec 2008 | 22:23:13 UTC

Could you post that to the BOINC alpha email list? Maybe they'll listen to you... ;)

I really hope the Synedoche guys will do a better job...

I thought of that.

Can you tell me what cpu (core) percentange you show as being used and the system priority for both windows and linux.

On Windows XP I get about 30-40% CPU (which is 70-80% of one core) at 'Normal'

I'll try to do that tomorrow...
But I just read your email you sent to BOINC alpha, you explained the problem pretty good!

And the reply is:
"david" wrote:
I checked in the following change:
- client: tweak CPU scheduling policy. When there's a coproc job:
Windows: don't saturate CPUs
Unix: saturate CPUs

of course now you have to wait for 6.4.5, but everybody should be happy then.

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 288,617,006
RAC: 1,969,263
Level
Asn
Scientific publications
watwatwatwatwatwatwatwat
Message 4216 - Posted: 7 Dec 2008 | 23:24:50 UTC - in response to Message 4213.

Could you post that to the BOINC alpha email list? Maybe they'll listen to you... ;)

I really hope the Synedoche guys will do a better job...

I thought of that.

Can you tell me what cpu (core) percentange you show as being used and the system priority for both windows and linux.

On Windows XP I get about 30-40% CPU (which is 70-80% of one core) at 'Normal'

I'll try to do that tomorrow...
But I just read your email you sent to BOINC alpha, you explained the problem pretty good!

And the reply is:
"david" wrote:
I checked in the following change:
- client: tweak CPU scheduling policy. When there's a coproc job:
Windows: don't saturate CPUs
Unix: saturate CPUs

of course now you have to wait for 6.4.5, but everybody should be happy then.


Yup, I already read it...
It sure will be the best for now with GPUGRID, but like ETA already said - who knows what happens if another GPU projects starts which maybe doesn't need the CPU for their calculations on Windows. Or maybe the other way round - if it also needs the GPU for calculations on Linux... ;)
____________

pixelicious.at - my little photoblog

Profile Aardvark
Avatar
Send message
Joined: 27 Nov 08
Posts: 28
Credit: 82,362,324
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 4218 - Posted: 8 Dec 2008 | 0:39:51 UTC - in response to Message 4210.

Tried download again tonight, now installed & running. Must have been a Gremlin. I'd already tried a second dowload after the first had failed.

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4223 - Posted: 8 Dec 2008 | 14:45:44 UTC

I already had trouble with 6.4.4

It ran, one after the other two GPURID tasks, and then they were both ready to report. No attempt to download more work. so the GPU was idle. I had to suspend the cpu only project to force the cleint to start polling projects for work, it then got 1 task for GPUGRID.net and began to run. This is not desired behavior. And it should not even have happened as the two projects running, had equal resource shares and with two cores, that should pretty much keep 1 task for each project running. 6.4.3 and 6.4.2 and before worked better.

I do not know why the decision to change the scheduling policy was made.

Lets hope the future fix applied is a fix.

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4224 - Posted: 8 Dec 2008 | 16:09:59 UTC

Today I took delivery of a new graphic card for my second machine. Having installed Boinc 6.4.4 over the weekend all I had to do was to get the new drivers installed and upgraded to the 178.24 level and re-connect the second machine to GPU Grid. One GPU task downloaded and it kicked one existing task out. After a minute or so that task restarted. Since then the GPU task and 4 Cosmology tasks have been happily sharing that machine.

Meanwhile on the first machine the GPU task is sharing with 3 Cosmology tasks.

All the Cosmology tasks on both machines are running high priority (they shouldn't be but Boinc seems to be calculating the work buffer when a GPU task is running the same as if an ordinary project was running).

On the first machine the GPU task is also running high priority because the GPU task will miss its deadline by a few hours.

It looks to me as if the GPU task status is the key to getting 3 or 4 other tasks running, on Windows at least.

Machine 1 : XFX Geforce 8600GTS

Machine 2 : Asus Geforce ENGTX260

Both machines : Vista Home Premium 32 bit, Boinc 6.4.4, Coda 2.0, Nvidia 178.24

Phoneman1

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4225 - Posted: 8 Dec 2008 | 18:03:55 UTC - in response to Message 4223.

I already had trouble with 6.4.4

[...]


Same here. When I'm back home, my big cards (GTX280, 2 x GTX260²) are idle and also by manual update they call 0 seconds work. So I downgraded again to 6.4.2.

____________

Profile [AF>HFR>RR] Jim PROFIT
Send message
Joined: 3 Jun 07
Posts: 107
Credit: 31,331,137
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwat
Message 4226 - Posted: 8 Dec 2008 | 20:30:32 UTC

I have trouble with this version and the previous one.

On computers with 2 GPU cards, if this is the same model of GPU card, with the previous version on a quad cores, i had 3 CPU task running and 2 GPU task.
On computer with one GPU card, sometimes i had 4 CPU task and 1 GPU, or 3 CPU task and 1 GPU task.

So with the new version 6.4.3 or 6.4.4, i have on all computers, one task per CPU and GPU. So one quad, 2 CPU task and 2 GPU task, if i have 2 GPU cards.

But, if i have 2 different GPU cards, at one moment, i have to wait that the last task was finished before download another WU. So the more powerful card is waiting.

I downgraded to 6.4.2, because i don't understand why the Task duration correction factor is set to 100 by default.
I understand you want to have full power of the GPU cards, but there is some problems with this configuration.

I can't be monitoring all the time my computers.
I let my computers ran with the 6.4.4 since Sunday morning, and i don't see any improvements with this situation.

Jim PROFIT

Profile Wassertropfen
Avatar
Send message
Joined: 14 Aug 08
Posts: 15
Credit: 13,774,919
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4227 - Posted: 9 Dec 2008 | 12:23:12 UTC - in response to Message 4226.

6.4.4 works fine. I had to change the Task duration correction to 1.
____________
Constant dripping wears away the stone. :)

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4228 - Posted: 9 Dec 2008 | 16:40:46 UTC - in response to Message 4227.

6.4.4 works fine. I had to change the Task duration correction to 1.

After Jim's psot, I looked and found that too, my DCF was set to 100.000000

I reset it to 1.000000 but don't know if it helped. However there are still small problems in 6.4.4 which have been corrected and 6.4.5 will be out soon. Hopefully the fix for methods of running under windows and lixux will work correctly for both.

JAMC
Send message
Joined: 16 Nov 08
Posts: 28
Credit: 12,688,454
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 4229 - Posted: 9 Dec 2008 | 16:56:22 UTC

How do you change the DCF? (XP Home, still with 6.3.21)

Profile [AF>HFR>RR] Jim PROFIT
Send message
Joined: 3 Jun 07
Posts: 107
Credit: 31,331,137
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwat
Message 4230 - Posted: 9 Dec 2008 | 17:32:07 UTC

The problem with DCF is that when you change it, the value will be change when you upload/download a WU.

So you have to change the value every day!!

You will find this value in the file "client_state.xml".


Jim PROFIT

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4233 - Posted: 9 Dec 2008 | 23:28:16 UTC - in response to Message 4230.

The problem with DCF is that when you change it, the value will be change when you upload/download a WU.

So you have to change the value every day!!

You will find this value in the file "client_state.xml".


Jim PROFIT

That is because this version has a bug in it, calculating the value wrong or the estiamate is off so the actual time it takes is longer than the estimate, so it jacks up the DCF factor. I set mine to 1 yesterday, and after 1 run, it was at 99 something. I reset it again to 1 and installed 6.4.5 to see if anything changes or improves.

Profile [AF>HFR>RR] Jim PROFIT
Send message
Joined: 3 Jun 07
Posts: 107
Credit: 31,331,137
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwat
Message 4234 - Posted: 10 Dec 2008 | 11:29:44 UTC - in response to Message 4233.
Last modified: 10 Dec 2008 | 11:30:39 UTC

It's more than a bug!
I try the 6.4.5 this night and get the same problem.

This morning i got a computer with 2 GPU, only one GPU was crunching, and the DCF for this computer was set to 100, not by me.
I thought this problem is just on computer with 2 or more GPU cards.

So i downgraded to 6.4.2, because on computer with 2 GPU, the DCF stay all the time to the value of 100. Btu when i downgraded, i have tho manually change this value in the file or the DCF is not change even after one or more WU was calculated!

Jim PROFIT

Post to thread

Message boards : Graphics cards (GPUs) : 6.4.4 is out for all platforms

//