New acemdlong workunits

Message boards : Number crunching : New acemdlong workunits
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 20633 - Posted: 8 Mar 2011, 15:29:34 UTC
Last modified: 8 Mar 2011, 19:31:22 UTC

Dears, we are starting to take full advantage of the "long" queue. I've sent a batch of 700 WUs named "AB*", which should be 8-12h on the fastest cards.

I started a (still preliminary) batch of experiments to study the behavior of a protein likely implied in Alzheimer's disease.

By the way, if you happen to see "MDIO warnings" on redefined parameters, they are harmless.
ID: 20633 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20634 - Posted: 8 Mar 2011, 20:14:05 UTC - in response to Message 20633.  
Last modified: 8 Mar 2011, 20:15:28 UTC

I've sent a batch of 700 WUs named "AB*", which should be 8-12h on the fastest cards.

I must have faster-than-fastest cards then, since these "AB*" WUs are completing around 17000 sec (4h 43m) on my GTX 580 @ 850MHz, and 19000 sec (5h 17m) on my GTX 480 @ 800MHz.
ID: 20634 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ross*

Send message
Joined: 6 May 09
Posts: 34
Credit: 443,507,669
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20635 - Posted: 8 Mar 2011, 22:23:52 UTC - in response to Message 20634.  

[ I've sent a batch of 700 WUs named "AB*", which should be 8-12h on the fastest cards]
Hi
When can I get some more? 17000sec for 11k credits standard, 27000secs for the long [7.5hrs] for the long ones 57k credits. I love them.
Cheers
Ross*
ID: 20635 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bedrich Hajek

Send message
Joined: 28 Mar 09
Posts: 490
Credit: 11,731,645,728
RAC: 57
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20636 - Posted: 9 Mar 2011, 0:04:02 UTC

I received 4 of these units so far. Three ran on my windows 7 machine with GTX 480 cards, with no problem. One ran on my windows xp machine with GTX 285 card. This one gave me a lot of problems. It caused my computer to shut down numerous times. I had to reboot the machine again and again, but the work unit kept running. I repaired my register, did a check disk, ran a virus scan and malware scan. I even under clocked the video card. Nothing. The unit finally finished sucessfully. Since I never had this problem before and the subsequent unit is working fine, I would discount the video card going bad. This unit must have had some issues.
ID: 20636 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 20637 - Posted: 9 Mar 2011, 19:32:55 UTC - in response to Message 20634.  
Last modified: 9 Mar 2011, 19:46:04 UTC

Thanks guys for the data. We are still tweaking WU sizes and credit factors. When things settle, do expect 8-12h and a credit factor of 1.5-2 (in addition to the one for fast return, if any). Maybe I'm optimizing WUs too much ;)

Concerning machine crashes, sorry to hear. BTW, I normally run the WUs on 275s in addition to Fermis.
ID: 20637 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile dataman
Avatar

Send message
Joined: 18 Sep 08
Posts: 36
Credit: 100,352,867
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 20638 - Posted: 9 Mar 2011, 19:52:34 UTC

Just wantedd to say I do like the new long wu's. I have completed 16 with no problems at all. Most run in 15 hours on GTX260's. I have no runtime yet for GTX460's. Great job folks! Keep up the good work.
Cheers!

ID: 20638 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ben

Send message
Joined: 27 Oct 08
Posts: 19
Credit: 22,658,253
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwat
Message 20656 - Posted: 14 Mar 2011, 10:24:41 UTC - in response to Message 20638.  

i got a acemdlong and its running at 32 hours long, im surprised at how long its taking, is it me or what, help please.
ID: 20656 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile nenym

Send message
Joined: 31 Mar 09
Posts: 137
Credit: 1,429,587,071
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20659 - Posted: 14 Mar 2011, 11:19:44 UTC
Last modified: 14 Mar 2011, 11:30:13 UTC

A have crunched several "AB" WU's, run time about 9:52; XP 64bit + GTX560Ti factory OC to 900MHz, driver 26726, host ID 31329. Used features: Swan_Sync=0, free core out of Boinc, priority of ACEMD & ACEMDLONG precess set to normal. GPU load about 97%.
ID: 20659 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20663 - Posted: 14 Mar 2011, 18:54:36 UTC - in response to Message 20659.  
Last modified: 14 Mar 2011, 19:09:15 UTC

Hi Ben.
Your longtask times seem to be all over the place; 19K, 22K, 55K, 36K.
I see a fairly constant 26000sec +/- 5% on my GTX470's. Some of this may be explained by different task types, but 19K and 22K for two TONI_AB1 tasks is quite far apart and 55K for a TONI_SMDTRYP7LONG10 sounds a bit long, I ran one and it was only slightly longer (under 28Ksec).

As you have a 500 series Fermi, I think it might be throttling itself (dropping speed if it gets too warm). If not perhaps your Boinc settings or system usage is causing such high variation in times.

Note that when you are using swan_sync and configure Boinc to only use 99% of the CPU cores (threads in your case), the GPUGrid task will use a full thread so your CPU usage will remain at 100% overall. After you set swan_sync you should restart the system. Might be an idea to check the last checkpoints of your tasks first.

Check the temperature of your system and GPU especially. You should be using temp controlling software to keep the card at a reasonable temperature (60-70deg). I suggest MSI Afterburner, as this allows you to configure the fan speed acording to the temperature (it increases as the temperature rises, and decreases when it should). A simpler but less controllable program is EVGA Precision. Both can be downloaded from freeware sites such as http://majorgeeks.com/ - restart required.
ID: 20663 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20676 - Posted: 16 Mar 2011, 18:51:26 UTC - in response to Message 20663.  

This IBUCH_2_mutEGFR task type is taking about 12h on a GTX470 (XPx86, swan_sync, free i7 thread). I think this is a good size given that it just creeps into the estimated 8-12h range and the card is at the lower end of the top Fermi cards.
ID: 20676 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ignasi

Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 20682 - Posted: 17 Mar 2011, 10:48:00 UTC - in response to Message 20676.  

Hey,

skgiven is right.
All IBUCH_?_??EGFR_110315 should take about that. They are the same length and size. 52K atoms for 2,500,000 simulation steps which is a simulation of 10 nanoseconds.

Many thanks for running acemdlong guys!

ignasi
ID: 20682 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
palmss

Send message
Joined: 28 Aug 08
Posts: 7
Credit: 60,897,550
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20708 - Posted: 18 Mar 2011, 21:45:13 UTC

Hi, on IBUCH EGFR my gpu is used at only 40% and it take almost 18h too finish a WU,it seems really long, on other long WU my gpu take 9h too finish a WU . Is this normal ? I have a GTX 570 thanks.
ID: 20708 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile nenym

Send message
Joined: 31 Mar 09
Posts: 137
Credit: 1,429,587,071
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20710 - Posted: 18 Mar 2011, 22:40:50 UTC - in response to Message 20708.  

Without Swan_Sync enviroment variable and free CPU core yes.
ID: 20710 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20712 - Posted: 19 Mar 2011, 0:12:44 UTC - in response to Message 20710.  
Last modified: 22 Mar 2011, 16:11:28 UTC

As nenym said, use swan_sync=0 and free up a CPU core in Boin Manager.

For how to see the FAQ: Best configurations for GPUGRID thread.

Some of the IBUCH tasks tend to be a bit less resource intensive than the other tasks, but 40% utilization is too low, so improve your settings.

One other thing, Vista and Win7 tend to be about 11% slower, due to the Windows Display Driver Model (WDDM), than XP; and that's after the system is optimized for GPUGrid crunching.
ID: 20712 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
palmss

Send message
Joined: 28 Aug 08
Posts: 7
Credit: 60,897,550
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20714 - Posted: 19 Mar 2011, 3:20:18 UTC

Thank you very much for your support, I added the variable SWAN_SYNC and I effectively increase the load of the GPU to 62%.
ID: 20714 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20722 - Posted: 19 Mar 2011, 20:51:19 UTC - in response to Message 20714.  

Did you also configure Boinc to only use 50% of the CPU?
I get 83 to 84% GPU utilization on my GTX470s when running the long IBUCH tasks.
ID: 20722 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Snow Crash

Send message
Joined: 4 Apr 09
Posts: 450
Credit: 539,316,349
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20821 - Posted: 30 Mar 2011, 23:03:57 UTC
Last modified: 30 Mar 2011, 23:06:46 UTC

Points per second are very very different between the different long WU types.
These numbers come from a GTX295 on WinXP with a slight OC of shaders running at 1368 and were all returned within 24 hours. CPU = i7-920 @3.8 GHZ HT = ON, 2 threads free.

TONI_KKAL3 = .538 points per second
IBUCH_2_mutEGFR = .927 points per second point
TONI_AB1 = 1.292 points per second

elapsed times are also very different, you could make the KKALs twice as long again :-)

TONI_KKAL3 = ~7 hours
IBUCH_2_mutEGFR = ~19 hours
TONI_AB1 = ~12.5 hours
Thanks - Steve
ID: 20821 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Pepo
Avatar

Send message
Joined: 25 Dec 07
Posts: 19
Credit: 53,126
RAC: 0
Level

Scientific publications
watwat
Message 20826 - Posted: 31 Mar 2011, 14:36:22 UTC
Last modified: 31 Mar 2011, 14:45:42 UTC

I've got one acemdlong task on nVidia G105M - I've noticed the task after having run for 5 hours and reaching (IIRC) less than 3%, BOINC promised to surely finish it in less than a couple of weeks (with 4 days due-time) :-D

So I've suspended the task until I'll find this thread...

[edit]After resuming the task a few hours later, it crashed weirdly (error 0x3, <message>"System cannot find the given path"</message> + "MDIO ERROR: cannot open file 'restart.coor'").

I've updated my prefs to exclude "ACEMD for long runs". But, could the server try to estimate, whether the target machine is at all capable to finish a given task until deadline?

Peter
ID: 20826 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 20829 - Posted: 31 Mar 2011, 19:03:53 UTC - in response to Message 20826.  

The G105M is not powerful enough to run GPUGrid tasks.
It only has 8 shaders, you need about 100.
ID: 20829 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Pepo
Avatar

Send message
Joined: 25 Dec 07
Posts: 19
Credit: 53,126
RAC: 0
Level

Scientific publications
watwat
Message 20838 - Posted: 1 Apr 2011, 12:24:45 UTC - in response to Message 20829.  
Last modified: 1 Apr 2011, 12:32:57 UTC

The G105M is not powerful enough to run GPUGrid tasks.
It only has 8 shaders, you need about 100.

I'd say, bad wording: not powerful enough to keep given deadline. Sure, therefore I've already excluded such tasks from mu prefs.

But still, as the CPUs do not receive tasks, which will supposedly not be done in-time, (why) does the same not (yet) happen for other resources (GPUs in this case)? BOINC knows the GPU's speed estimate (in my case it's merely "39 GFLOPS peak", I just can not see it anywhere in cleartext in sched_request_www.gpugrid.net.xml), project knows the task's FLOPs cost...

Or is this just to be implemented yet? I may stand corrected...

Peter
ID: 20838 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : New acemdlong workunits

©2025 Universitat Pompeu Fabra