Major SNAFU in Effect

Message boards : Number crunching : Major SNAFU in Effect
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
mmonnin

Send message
Joined: 2 Jul 16
Posts: 338
Credit: 7,987,341,558
RAC: 213
Level
Tyr
Scientific publications
watwatwatwatwat
Message 51889 - Posted: 20 May 2019, 0:07:34 UTC
Last modified: 20 May 2019, 0:09:35 UTC

I'm trying to to think of projects that use it. Going through project folders it looks like DrugDiscovery CPU Goofy, MindModeling and CAS used it. DHEP, Gerasium, Moo, SRBase, Enigma, YoYo and Yafu are active projects that have a wrapper in the exe name. Some Yoyo ECM tasks can use like 8GB but I think thats the data as its limited to certain types. But nothing like LHC Atlas using 10gb for the other projects. VBox apps are huge because its an entire image.

It seems like most GPUGrid crunching is done in Windows as the stats have only gone down from about 600m to 400m per day.
ID: 51889 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1419
Credit: 9,119,446,190
RAC: 731
Level
Tyr
Scientific publications
watwatwatwatwat
Message 51890 - Posted: 20 May 2019, 2:31:55 UTC

That still shows the Linux hosts responsible for 1/3 of the total credit. And since the percentage of Linux hosts is 37% compared to 54% for Windows hosts, the Linux hosts are showing a greater percentage of higher production hosts compared to Windows hosts.

It would benefit the project to return the Linux hosts to participation.
ID: 51890 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 351
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 51891 - Posted: 20 May 2019, 11:24:17 UTC - in response to Message 51890.  

It would benefit the project to return the Linux hosts to participation.

Which is why the PM which got Toni's attention had the subject line

Research being delayed - Linux apps broken

:-)
ID: 51891 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile bcavnaugh

Send message
Joined: 8 Nov 13
Posts: 56
Credit: 1,002,640,163
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 52084 - Posted: 14 Jun 2019, 1:02:09 UTC

Been a while, and news?
ID: 52084 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52426 - Posted: 8 Aug 2019, 20:27:39 UTC

Now the license of the Windows app has expired.
I have the feeling that this project is more important for us than for the GPUGrid team, if there's such an entity at all.
ID: 52426 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile JStateson
Avatar

Send message
Joined: 31 Oct 08
Posts: 186
Credit: 3,578,903,157
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52427 - Posted: 8 Aug 2019, 21:52:42 UTC - in response to Message 52426.  

Now the license of the Windows app has expired.
I have the feeling that this project is more important for us than for the GPUGrid team, if there's such an entity at all.


August is the vacation month in Italy. Looking at the "about" I don't see a lot of diversity. Probably took off a week to get their heads out of the quantum clouds and socialize with opposite sex.
ID: 52427 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
w1hue

Send message
Joined: 28 Sep 09
Posts: 21
Credit: 471,394,734
RAC: 56
Level
Gln
Scientific publications
watwatwatwatwatwatwat
Message 52429 - Posted: 9 Aug 2019, 1:06:31 UTC

August is vacation month in Italy . . .

Most likely most take off the whole month . . . not just a week.
ID: 52429 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52430 - Posted: 9 Aug 2019, 1:51:07 UTC

They are in Spain, so I always figured they would head to Majorca. No one ever denied it at any rate.
ID: 52430 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile robertmiles

Send message
Joined: 16 Apr 09
Posts: 503
Credit: 769,991,668
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52431 - Posted: 9 Aug 2019, 12:03:56 UTC - in response to Message 51792.  

Same here, of course. But I haven't seen anyone from the project around here for a while. Is anyone at home?

It looks to me like the two main researchers are about to get a flood of workunits that failed due to all of the tasks giving an error. If so, they will have to notify the programmer or programmers, and start an effort to fix the problem. If they're able to read and write in English, they'll then have little worthwhile to do other than tell us what happened, and when they expect a fix.
ID: 52431 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
wolfman1360

Send message
Joined: 19 Feb 17
Posts: 5
Credit: 36,563,552
RAC: 0
Level
Val
Scientific publications
wat
Message 52705 - Posted: 23 Sep 2019, 20:54:15 UTC - in response to Message 51786.  

Am I to assume this has been fixed and I can add my Linux machine here? Or are there no WUs for Linux as of yet?
I know I'm crunching okay under Windows...
ID: 52705 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52706 - Posted: 23 Sep 2019, 21:48:12 UTC - in response to Message 52705.  
Last modified: 23 Sep 2019, 21:49:43 UTC

Am I to assume this has been fixed and I can add my Linux machine here?
It's been fixed, thoguh only the Windows app is released to the production line.
You can add your Linux machine, but it will receive only beta test tasks for a while.

Or are there no WUs for Linux as of yet?
The workunits are common, but the new Linux app will be put into the production line only when the new Windows app is working as it should be.

I know I'm crunching okay under Windows...
Me too.
ID: 52706 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1419
Credit: 9,119,446,190
RAC: 731
Level
Tyr
Scientific publications
watwatwatwatwat
Message 52708 - Posted: 23 Sep 2019, 23:53:39 UTC - in response to Message 52706.  

Am I to assume this has been fixed and I can add my Linux machine here?
It's been fixed, thoguh only the Windows app is released to the production line.
You can add your Linux machine, but it will receive only beta test tasks for a while.

Or are there no WUs for Linux as of yet?
The workunits are common, but the new Linux app will be put into the production line only when the new Windows app is working as it should be.

I know I'm crunching okay under Windows...
Me too.

I am receiving non-Toni test tasks today for my Linux host. Looks like normal project work.
https://www.gpugrid.net/result.php?resultid=21405079
https://www.gpugrid.net/result.php?resultid=21405557
https://www.gpugrid.net/result.php?resultid=21405187
https://www.gpugrid.net/result.php?resultid=21405090
ID: 52708 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 351
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52712 - Posted: 24 Sep 2019, 7:34:51 UTC - in response to Message 52708.  

I am receiving non-Toni test tasks today for my Linux host. Looks like normal project work.
https://www.gpugrid.net/result.php?resultid=21405079
https://www.gpugrid.net/result.php?resultid=21405557
https://www.gpugrid.net/result.php?resultid=21405187
https://www.gpugrid.net/result.php?resultid=21405090

Yes, 'Application version: New version of ACEMD v2.06 (cuda100)' is the new normal.
ID: 52712 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 0
Level
Trp
Scientific publications
watwatwat
Message 52714 - Posted: 24 Sep 2019, 13:58:59 UTC

Being in check-in mode for months has got me so confused. I thought Toni asked not to run acemd3 on Linux as that's not what she needs to test. Or are we now good to go on Linux WUs???
ID: 52714 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52715 - Posted: 24 Sep 2019, 14:17:21 UTC - in response to Message 52714.  

I thought Toni asked not to run acemd3 on Linux as that's not what she needs to test.

Yes, that is what he said. I am just surprised that they send them to Linux machines at all. Can't they block them?
ID: 52715 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 52716 - Posted: 24 Sep 2019, 17:44:05 UTC - in response to Message 52712.  

I am receiving non-Toni test tasks today for my Linux host. Looks like normal project work.
Yes, 'Application version: New version of ACEMD v2.06 (cuda100)' is the new normal.
I received such tasks too. These are from the short queue. (Which is epmty now, though).
I think Toni put some workunits from the short queue to the "New version of ACEMD" queue from time to time to serve as a bit longer test.
ID: 52716 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jul 16
Posts: 338
Credit: 7,987,341,558
RAC: 213
Level
Tyr
Scientific publications
watwatwatwatwat
Message 52717 - Posted: 24 Sep 2019, 19:57:42 UTC

I've received only 1 since he's said that. If admins only want Windows hosts to receive the tasks then they could always depreciate the Linux app.
ID: 52717 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rod4x4

Send message
Joined: 4 Aug 14
Posts: 266
Credit: 2,219,935,054
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 52723 - Posted: 26 Sep 2019, 0:52:23 UTC - in response to Message 52716.  

I received such tasks too. These are from the short queue. (Which is epmty now, though).
I think Toni put some workunits from the short queue to the "New version of ACEMD" queue from time to time to serve as a bit longer test.

Agreed.
My Windows hosts do not process from the short queue, only from the long queue and test queue.
I am receiving ADRIA short work units from the test queue. This would seem to indicate ADRIA is becoming familiar with creating ACEMD3 work units.
We are getting closer to full release of ACEMD3!
ID: 52723 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3

Message boards : Number crunching : Major SNAFU in Effect

©2025 Universitat Pompeu Fabra