Large scale experiment: MDAD

Message boards : News : Large scale experiment: MDAD
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 8 · Next

AuthorMessage
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 53462 - Posted: 25 Jan 2020, 10:07:33 UTC
Last modified: 25 Jan 2020, 10:08:22 UTC

We are starting a new large-scale experiment. There will be plenty of workunits, whose very first batch is currently being sent. Run times should be around 6h but with a lot of variability. They are very heterogeneous so please don't worry for failures.

Thanks! -Toni
ID: 53462 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Nick Name

Send message
Joined: 3 Sep 13
Posts: 53
Credit: 1,533,531,731
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 53465 - Posted: 25 Jan 2020, 15:08:17 UTC - in response to Message 53462.  

These are running just fine on both Windows and Linux so far. I haven't seen any run times near six hours yet. I also see that the Linux app is loading the GPU much higher than the Windows app is, about double.
Team USA forum | Team USA page
Join us and #crunchforcures. We are now also folding:join team ID 236370!
ID: 53465 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Azmodes

Send message
Joined: 7 Jan 17
Posts: 34
Credit: 1,371,429,518
RAC: 0
Level
Met
Scientific publications
watwatwat
Message 53466 - Posted: 25 Jan 2020, 15:12:36 UTC

Almost 20 tasks validated so far, but I have also had two WUs end in an error after a few seconds, on two different hosts so far:
<core_client_version>7.9.3</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)</message>
<stderr_txt>
12:23:14 (10594): wrapper (7.7.26016): starting
12:23:14 (10594): wrapper (7.7.26016): starting
12:23:14 (10594): wrapper: running acemd3 (--boinc input --device 1)
ERROR: /home/user/conda/conda-bld/acemd3_1570536635323/work/src/mdsim/forcefield.cpp line 174: Cannot index the parameter files with the topology file
12:23:15 (10594): acemd3 exited; CPU time 0.081577
12:23:15 (10594): app exit status: 0x9e
12:23:15 (10594): called boinc_finish(195)

</stderr_txt>
]]>


Also, while my Linux machines get a GPU core load of 90-100%, the Windows ones aren't doing so great (one thread is set aside for each task in the client and swan_sync is on). Sub-90, sometimes around 80 and the worst I've seen is an RTX 2080 at 70% max.
ID: 53466 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STARBASEn
Avatar

Send message
Joined: 17 Feb 09
Posts: 91
Credit: 1,603,303,394
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 53467 - Posted: 25 Jan 2020, 16:01:24 UTC

Cool, got all 3 Linux NV cards happily crunching away at about 95% gpu usage.
ID: 53467 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ServicEnginIC
Avatar

Send message
Joined: 24 Sep 10
Posts: 588
Credit: 11,396,036,510
RAC: 11,719,261
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53469 - Posted: 25 Jan 2020, 16:28:16 UTC - in response to Message 53462.  

I'm glad to process Science again! (please, note capital letter for this)
Thank so much to all GPUGrid's Team (please, see above)
ID: 53469 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John C MacAlister

Send message
Joined: 17 Feb 13
Posts: 181
Credit: 144,871,276
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 53471 - Posted: 25 Jan 2020, 17:21:57 UTC

What is the object of the research?
John
ID: 53471 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 769,520
Level
Trp
Scientific publications
watwatwat
Message 53472 - Posted: 25 Jan 2020, 18:29:14 UTC

Toni, Glad to get the work.
Is there any plan to upgrade your server or internet speed???
ID: 53472 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 769,520
Level
Trp
Scientific publications
watwatwat
Message 53473 - Posted: 25 Jan 2020, 18:31:24 UTC

Toni, Glad to get the work.
Is there any plan to upgrade your server or internet speed???
ID: 53473 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Remanco

Send message
Joined: 4 Mar 13
Posts: 3
Credit: 30,169,077
RAC: 0
Level
Val
Scientific publications
watwatwat
Message 53478 - Posted: 25 Jan 2020, 21:12:51 UTC - in response to Message 53471.  

What is the object of the research?


Yes, can we have a bit more info on what we crunch?

Thanks!

Sylvain


ID: 53478 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Miklos M.

Send message
Joined: 16 Jun 12
Posts: 17
Credit: 292,288,806
RAC: 0
Level
Asn
Scientific publications
watwatwatwat
Message 53480 - Posted: 25 Jan 2020, 22:47:50 UTC

Trying to get some tasks and so far no luck. Am I doing it wrong?

Thanks
ID: 53480 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers

Send message
Joined: 13 Dec 17
Posts: 1400
Credit: 8,616,046,190
RAC: 8,556,950
Level
Tyr
Scientific publications
watwatwatwatwat
Message 53481 - Posted: 25 Jan 2020, 22:49:24 UTC - in response to Message 53480.  

Trying to get some tasks and so far no luck. Am I doing it wrong?

Thanks

Do you have acemd3 application selected in your project preferences?
ID: 53481 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Nick Name

Send message
Joined: 3 Sep 13
Posts: 53
Credit: 1,533,531,731
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 53482 - Posted: 25 Jan 2020, 23:13:09 UTC

ID: 53482 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
davidBAM

Send message
Joined: 17 Sep 18
Posts: 11
Credit: 1,857,385,729
RAC: 0
Level
His
Scientific publications
watwatwat
Message 53486 - Posted: 25 Jan 2020, 23:50:02 UTC

Is the policy still to reduce credits on work not uploaded within 24hrs of issue?
ID: 53486 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53488 - Posted: 26 Jan 2020, 0:13:25 UTC - in response to Message 53482.  

Out of work already! LOL

+ 1
ID: 53488 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 193,866
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53489 - Posted: 26 Jan 2020, 0:47:15 UTC - in response to Message 53482.  

Out of work already! LOL
I think it was just the warm-up. Every batch of Toni queued yesterday consisted only a single step, it's no wonder that they didn't last longer.
ID: 53489 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 193,866
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 53490 - Posted: 26 Jan 2020, 1:31:46 UTC - in response to Message 53486.  

Is the policy still to reduce credits on work not uploaded within 24hrs of issue?
Yes. But it's actually a +50% bonus for less than 24h, or +25% for less than 48h.
ID: 53490 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
davidBAM

Send message
Joined: 17 Sep 18
Posts: 11
Credit: 1,857,385,729
RAC: 0
Level
His
Scientific publications
watwatwat
Message 53491 - Posted: 26 Jan 2020, 1:44:50 UTC - in response to Message 53490.  

Thank you. Great job on the new WU incidentally.

As they are much shorter, could I ask that you please allow download of more than 2 WU per GPU
ID: 53491 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers

Send message
Joined: 13 Dec 17
Posts: 1400
Credit: 8,616,046,190
RAC: 8,556,950
Level
Tyr
Scientific publications
watwatwatwatwat
Message 53492 - Posted: 26 Jan 2020, 3:05:17 UTC

I believe the limit is 16 per host. That is what I got on my 3 hosts. After that I received the "you have reached the limit of tasks in progress message"
ID: 53492 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
davidBAM

Send message
Joined: 17 Sep 18
Posts: 11
Credit: 1,857,385,729
RAC: 0
Level
His
Scientific publications
watwatwat
Message 53493 - Posted: 26 Jan 2020, 4:01:18 UTC - in response to Message 53492.  

Thank you. Perhaps I'll see that once WU become freely available :-)
ID: 53493 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Erich56

Send message
Joined: 1 Jan 15
Posts: 1162
Credit: 12,205,098,501
RAC: 9,135,494
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwat
Message 53495 - Posted: 26 Jan 2020, 6:19:53 UTC - in response to Message 53492.  

As they are much shorter, could I ask that you please allow download of more than 2 WU per GPU

I believe the limit is 16 per host. That is what I got on my 3 hosts. After that I received the "you have reached the limit of tasks in progress message"

I guess what was meant in the first above cited posting was to increase the limit of tasks per GPU that can be downloaded at a time.
So far, this figure was (and still seems to be) 2.

When talking about 16 tasks per host (in the second of the above postings), I guess this was the total number of tasks that were downloaded NOT at a time, but within a certain time frame yesterday, provided a given GPU was fast enough.
My various hosts got only up to about 10 tasks each, and that was it. No more downloads since late night.



ID: 53495 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · 3 · 4 . . . 8 · Next

Message boards : News : Large scale experiment: MDAD

©2025 Universitat Pompeu Fabra