New task on long queue, significantly longer than traditional tasks

Message boards : News : New task on long queue, significantly longer than traditional tasks
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Profile nate

Send message
Joined: 6 Jun 11
Posts: 124
Credit: 2,928,865
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 23939 - Posted: 13 Mar 2012, 20:56:10 UTC - in response to Message 23938.  

I'm just starting on a FAX4 and while my gpu idled for few seconds I saw in GPU-Z that the memory used was 150mb for Win7.
Now with the card loaded with FAX4 it's at 1232mb so that's a 1082megabytes usage for FAX4. Memory controller load is at 28-29% with 800mhz memory clock (gpu-z reading)

One thing I do NOT like is the gpu load. With other long runs and other projects the gpu load is anything between 94-99%. Now it's at 86-88% ?!
CPU is 2500K@4500Mhz
I have swan_sync=0 and only three cpu tasks running along with gpugrid.
Do I really have to put my cpu at higher clocks or what?
This is the task http://www.gpugrid.net/workunit.php?wuid=3264669



That's a pretty solid overclock already, so I wouldn't push it unless you really want to. Or if you are using LN2 to cool your cores ;).

These tasks seem to utilize less percent of the GPU than others, and I'm not sure why that is at the moment. I'm not sure if it's an issue with dividing the task between the CPU and GPU or something else (memory management, maybe?). I'll try to get back to you. There may not be anything we or you can do. We'll see.
ID: 23939 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Spatzthecat

Send message
Joined: 26 Nov 09
Posts: 33
Credit: 1,282,387,913
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 23942 - Posted: 13 Mar 2012, 21:43:54 UTC - in response to Message 23939.  

You seen to have found the happy medium (sweet spot) nate.

I am crunching a FAX4 wu on Win7 i7 990X @ 3876MHz and a GTX570, similar GPU load as Lagittaja 86-88%, memory controller load quite a bit less @ 16% and only using 690MB memory. I have swan_sync=0 and only two cpu tasks running along with gpugrid.

It is looking like approx 13hrs 40mins to complete.

Does the usages seem right to you or is there a way to improve on this?
ID: 23942 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Lagittaja

Send message
Joined: 1 Nov 10
Posts: 6
Credit: 4,539,537
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwat
Message 23944 - Posted: 13 Mar 2012, 22:48:49 UTC - in response to Message 23939.  
Last modified: 13 Mar 2012, 22:51:12 UTC

I'm just starting on a FAX4 and while my gpu idled for few seconds I saw in GPU-Z that the memory used was 150mb for Win7.
Now with the card loaded with FAX4 it's at 1232mb so that's a 1082megabytes usage for FAX4. Memory controller load is at 28-29% with 800mhz memory clock (gpu-z reading)

One thing I do NOT like is the gpu load. With other long runs and other projects the gpu load is anything between 94-99%. Now it's at 86-88% ?!
CPU is 2500K@4500Mhz
I have swan_sync=0 and only three cpu tasks running along with gpugrid.
Do I really have to put my cpu at higher clocks or what?
This is the task http://www.gpugrid.net/workunit.php?wuid=3264669



That's a pretty solid overclock already, so I wouldn't push it unless you really want to. Or if you are using LN2 to cool your cores ;).

These tasks seem to utilize less percent of the GPU than others, and I'm not sure why that is at the moment. I'm not sure if it's an issue with dividing the task between the CPU and GPU or something else (memory management, maybe?). I'll try to get back to you. There may not be anything we or you can do. We'll see.


Temps aren't an issue with running only three cpu tasks :)
4500Mhz I can go with 1.272v and with 4800Mhz I can go with 1.376v.

But getting to my point: Well I did raise the 2500K to 4800Mhz and the gpu load didn't get affected, still in the 86-88% ball park so it is not a cpu bottleneck for sure despite the fact that the gpu process in task mananager is showing as 25% usage as in it's hogging the free core completely I gave it with swan_sync.
Funny thing is that the gpu load seems to vary a little bit. First I see that it's jumping around between 86-88%, then suddenly it's holding steady at 90-91% and after a while it goes back to jumping between 86-88%.
Otherwise great wu's :thumb:
ID: 23944 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Simba123

Send message
Joined: 5 Dec 11
Posts: 147
Credit: 69,970,684
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 23945 - Posted: 13 Mar 2012, 23:27:09 UTC
Last modified: 13 Mar 2012, 23:30:56 UTC

Thanks Nate :)
those changes should make things a little easier and more consistent.

As for the GPU usage, anything above 90% is good IMHO. 95% would be excellent.

I have no idea about the coding used, but it appears to be adaptable to the workunit on the fly. As in, usage of the GPU can change throughout the run. I too have noticed that usage can change from 80-88% during the run.
Also noticed that the time taken to complete a % can change throughout the run as well.
My last unit took significantly more time to process the last 25% of the unit than it did to do the other 75%(when observed in 25% blocks)

I think 12 hours or so is an optimum run-time. That lets us run machines overnight and complete a task so we can turn them off before we go to work. Also they are not long enough that it is too annoying if a unit fails close to the end. You would have heard my scream from Australia had that unit of mine crashed at 22+ hours of runtime :/
ID: 23945 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
candido

Send message
Joined: 12 Jun 11
Posts: 12
Credit: 150,069,999
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 23946 - Posted: 14 Mar 2012, 0:02:52 UTC - in response to Message 23945.  

just lost one after 10 hours running
aaarrrrrggggghhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh!
well, no harm done
and it was my falt (messing with overclocking and knowing nothing about that, just going to return it to default settings and leave it like that)
ID: 23946 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Michael Kingsford Gray
Avatar

Send message
Joined: 28 Nov 11
Posts: 21
Credit: 121,646,463
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwat
Message 23951 - Posted: 14 Mar 2012, 5:26:48 UTC - in response to Message 23915.  

That's around half what I expected (1420MB).
...So is it the operating system, driver, or app?

Problem solved!
I was relying on the windows side-bar widgets (one for each card) which reports half of the memory usage that GPU-Z does.

The widgets are broken.
Damn, I feel like a newbie now!
ID: 23951 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile nate

Send message
Joined: 6 Jun 11
Posts: 124
Credit: 2,928,865
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 23957 - Posted: 14 Mar 2012, 13:36:47 UTC
Last modified: 14 Mar 2012, 13:37:48 UTC

Spatzthecat: That does seem on the slow side for 570. For these tasks 10-11 hours seems about average (for that card). Windows has slightly lower performance than Linux, so that might explain part of it. Further, looks like you are running on your display card, which will also take away some performance if you are using the system to do other stuff. I'm not sure about other factors.

Lagittaja: Thanks for the info. Still looking into it, but it seems that a script added to the simulation may be the cause. Some simulations don't have them, but this one required it. We try to optimize the scripts as much as possible but they are interpreted at runtime and cause slowdowns. Many functions, but not all, are implemented in C/C++ to help out with speed.

Simba123:
I think 12 hours or so is an optimum run-time. That lets us run machines overnight and complete a task so we can turn them off before we go to work. Also they are not long enough that it is too annoying if a unit fails close to the end.
Thanks for the comment. This was one of the additional reasons for making them shorter. I noticed a few people saying something along those lines. Some people only crunch part-time, and we don't want to lose their contribution simply because they can't run steady 24/7. We may still be a little on the long side in that respect, so we'll consider that for the next big batch that goes to the long queue.

Michael Kingsford Gray: Glad to hear you got it figured out.
ID: 23957 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ritterm
Avatar

Send message
Joined: 31 Jul 09
Posts: 88
Credit: 244,413,897
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 23958 - Posted: 14 Mar 2012, 14:33:10 UTC

My stock C2Q/GTX570 finished this FAX4 in about 14.5 hours with SWAN_SYNC=0 and 1 dedicated CPU. Almost 3 times as long to finish as a typical NATHAN_CB1 task for only twice the credit? :-( [ ;-)]
ID: 23958 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mikkie

Send message
Joined: 19 Apr 11
Posts: 4
Credit: 3,779,371
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 23960 - Posted: 14 Mar 2012, 15:27:49 UTC

I'm leaving this project. My card could handle those CB1 ones. They took about 12/13 hrs (my limit of computer use) on my 460SE card.

But these FAX things are too long for me and because of the time limitation I also ran out of any bonus points with these wu's. I am not interested in your short stuff.

For the fast cards and 24/7 people, 'good luck' with these FAX batch(es.) Its time to change to a other project. Bye.
ID: 23960 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
JLConawayII

Send message
Joined: 31 May 10
Posts: 48
Credit: 28,893,779
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 23962 - Posted: 14 Mar 2012, 20:17:53 UTC - in response to Message 23960.  

I'm leaving this project. My card could handle those CB1 ones. They took about 12/13 hrs (my limit of computer use) on my 460SE card.

But these FAX things are too long for me and because of the time limitation I also ran out of any bonus points with these wu's. I am not interested in your short stuff.

For the fast cards and 24/7 people, 'good luck' with these FAX batch(es.) Its time to change to a other project. Bye.



Wow. I guess this project is done for then. Time to shut down the servers, Mikkie is leaving.
ID: 23962 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
noname

Send message
Joined: 26 Aug 08
Posts: 4
Credit: 14,438,740
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwat
Message 23966 - Posted: 14 Mar 2012, 22:13:23 UTC - in response to Message 23960.  

I'm leaving this project. My card could handle those CB1 ones. They took about 12/13 hrs (my limit of computer use) on my 460SE card.

But these FAX things are too long for me and because of the time limitation I also ran out of any bonus points with these wu's. I am not interested in your short stuff.

For the fast cards and 24/7 people, 'good luck' with these FAX batch(es.) Its time to change to a other project. Bye.

I have gtx295 and gtx265, old NATHAN task were perfect for this card (runtime under 9h), but new FAX3 does run for 56hours. Sorry guys, too long. So I switched back to normal tasks and gues what? A739-TONI... task is @73% after 12h30min and aproximatley 4h30min is left to finish this task. I want to crunch this project and I dont think I have slow cards, but is it that much of a problem to make a smaller task? That 8h NATHAN tasks were ideal, even old FAX tasks with runtime 12h on my hardware wehe acceptable. Normal tasks for 18hours???
ID: 23966 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 23969 - Posted: 15 Mar 2012, 10:52:50 UTC - in response to Message 23966.  

FAX3 has been replaced with FAX4, which is ~2/3rds as long.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 23969 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Michael Kingsford Gray
Avatar

Send message
Joined: 28 Nov 11
Posts: 21
Credit: 121,646,463
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwat
Message 23983 - Posted: 16 Mar 2012, 9:33:27 UTC - in response to Message 23969.  

It appears that both the FAX3 and FAX4 tasks have exhausted, potentially rendering nagging gripes moot.
ID: 23983 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 23984 - Posted: 16 Mar 2012, 11:09:16 UTC

Still getting FAX4's & the occasional FAX3 here.
ID: 23984 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 23985 - Posted: 16 Mar 2012, 11:31:32 UTC - in response to Message 23983.  

There are a couple of non-FAX tasks in the queue, NATHAN_CB1 and IBUCH_xxxTRYP, which could have went your way. Nate's 'Long' CB1 tasks are the sort that run faster than Kashif's 'Normal length' HIV tasks, so we are not quite gripe-free:

I4R7-NATHAN_CB1_1-88-125-RND9588_0 3227378 91249 4 Mar 2012 | 9:08:19 UTC 4 Mar 2012 | 17:18:52 UTC Completed and validated 16,961.14 16,768.88 35,811.00 Long runs (8-12 hours on fastest card) v6.15 (cuda31)
277-KASHIF_HIVPR_cl_ba1-28-100-RND4840_0 3226987 115641 4 Mar 2012 | 7:43:11 UTC 4 Mar 2012 | 14:36:23 UTC Completed and validated 18,222.69 18,194.16 10,552.50 ACEMD2: GPU molecular dynamics v6.15 (cuda31)

Fortunately Ignasi's TRYP tasks are bringing balance to the GeForce,

358-IBUCH_metTRYP1-0-2-RND2279_2 3270731 91249 15 Mar 2012 | 23:51:50 UTC 16 Mar 2012 | 8:25:54 UTC Completed and validated 25,956.25 8,342.53 35,400.00 Long runs (8-12 hours on fastest card) v6.16 (cuda31)


FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 23985 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bruce G

Send message
Joined: 16 Mar 12
Posts: 2
Credit: 52,236,725
RAC: 0
Level
Thr
Scientific publications
watwatwatwat
Message 23989 - Posted: 16 Mar 2012, 14:45:59 UTC
Last modified: 16 Mar 2012, 14:49:05 UTC

I'm new and I've got 53 hours to do and it's also not crunching but saying the computer is in use I have an i5 processor and a Cuda GEFORCE 410 mobile and latest drivers, any suggestions?

Bruce
ID: 23989 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bruce G

Send message
Joined: 16 Mar 12
Posts: 2
Credit: 52,236,725
RAC: 0
Level
Thr
Scientific publications
watwatwatwat
Message 23993 - Posted: 16 Mar 2012, 16:58:42 UTC - in response to Message 23989.  

I've gone on to join World Community grid.
ID: 23993 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 23997 - Posted: 16 Mar 2012, 19:41:20 UTC - in response to Message 23993.  

I'm new and I've got 53 hours to do and it's also not crunching but saying the computer is in use I have an i5 processor and a Cuda GEFORCE 410 mobile and latest drivers, any suggestions?

Your GeForce 410 mobile is too slow for this project.

I've gone on to join World Community grid.

So you figure it out by yourself. At WCG your GPU will be useful.
ID: 23997 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 24000 - Posted: 16 Mar 2012, 22:32:46 UTC - in response to Message 23997.  
Last modified: 16 Mar 2012, 22:37:35 UTC

it's also not crunching but saying the computer is in use

You would have needed to select 'Use GPU while computer is in use', from Boinc Manager, to run GPU tasks while using the system, but as Zoltan said, your GPU is not up to running GPUGrid tasks. While WCG has just started some GPU Beta testing, your GeForce 410M would not be up to crunching their either, but WCG has plenty of good CPU projects. GPUGrid does not run CPU tasks. Perhaps your GPU is of use at Einstein, though I think they are winding up a run.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 24000 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Simba123

Send message
Joined: 5 Dec 11
Posts: 147
Credit: 69,970,684
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 24004 - Posted: 17 Mar 2012, 7:43:15 UTC

ouch.
just picked up
http://www.gpugrid.net/result.php?resultid=5123672
and it's showing a time to complete of over 29 hours. :(

I'll give it an hour to see if that time comes down to below 24 hours.
If it doesn't, I'll abort it.
ID: 24004 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : News : New task on long queue, significantly longer than traditional tasks

©2025 Universitat Pompeu Fabra