ATM: Free Energy Calculations new application

Message boards : Number crunching : ATM: Free Energy Calculations new application
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Ian&Steve C.

Send message
Joined: 21 Feb 20
Posts: 1116
Credit: 40,839,470,595
RAC: 6,423
Level
Trp
Scientific publications
wat
Message 59973 - Posted: 24 Feb 2023, 13:27:38 UTC - in response to Message 59971.  

keep in mind these are bata tasks, and the batch being sent NOW are not necessarily the same as the batch sent last week and wont be the same as whatever is sent sometime in the future, until they get all the bugs worked out. the tasks last week basically ran with no perceived use of the GPU or CPU, so what were they doing? who knows. no official word from the project about these tasks at all. I wasn't willing to let the GPU/CPU be occupied for hours on end with the task spinning it's wheels when they could be doing something more useful.

maybe this current batch has been tweaked from last week and thats why they are working OK, for those that have completed this latest batch, did they have any meaningful use of the GPU or CPU? it also seems this batch was released with a new Windows application (they were Linux only before) for testing.

_______________________

Well, most of us know that Abouh reads every word written on these threads and without much song and dance, makes changes. He is the Only Admin on all the projects who diligently attend. Maybe, quite possibly. No arguments with your tweaking statement.


that's great and all, but abouh is not the researcher working with this application. Abouh deals with the research with the Python RL tasks.

These ATM tasks look to be being run by Raimis.

(the researcher names are in the filenames of the WUs)

ID: 59973 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ian&Steve C.

Send message
Joined: 21 Feb 20
Posts: 1116
Credit: 40,839,470,595
RAC: 6,423
Level
Trp
Scientific publications
wat
Message 59974 - Posted: 24 Feb 2023, 13:30:55 UTC

https://gpugrid.net/result.php?resultid=33321222

ran for 10+hours, failed due to file size limit after an otherwise successful computation.

:(
ID: 59974 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Erich56

Send message
Joined: 1 Jan 15
Posts: 1166
Credit: 12,260,898,501
RAC: 1
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwat
Message 59975 - Posted: 24 Feb 2023, 14:39:26 UTC - in response to Message 59974.  

... failed due to file size limit
:(

I am just trying to remember with which other application we've had the same problem some time ago - last year or 2 years ago ???
ID: 59975 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ian&Steve C.

Send message
Joined: 21 Feb 20
Posts: 1116
Credit: 40,839,470,595
RAC: 6,423
Level
Trp
Scientific publications
wat
Message 59976 - Posted: 24 Feb 2023, 14:51:34 UTC - in response to Message 59975.  

... failed due to file size limit
:(

I am just trying to remember with which other application we've had the same problem some time ago - last year or 2 years ago ???


it's happened a few times in the past with acemd3 tasks.

see here from July 2021: https://www.gpugrid.net/forum_thread.php?id=5239#57117
ID: 59976 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 0
Level
Trp
Scientific publications
watwatwat
Message 59977 - Posted: 24 Feb 2023, 18:19:21 UTC

Yea, I got my first ATM checkpoint :-)
Now my list of ATM ULs are stuck.
ID: 59977 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ian&Steve C.

Send message
Joined: 21 Feb 20
Posts: 1116
Credit: 40,839,470,595
RAC: 6,423
Level
Trp
Scientific publications
wat
Message 59978 - Posted: 24 Feb 2023, 18:43:59 UTC - in response to Message 59977.  

Yea, I got my first ATM checkpoint :-)
Now my list of ATM ULs are stuck.


the uploads are nearly 700MB in size, and likely the same problem from my link that we saw over a year ago. their server can't accept something that big, I don't think they ever figured out how to adjust the settings of their file server and just tried to keep the file sizes below the limit, which they seem to have forgotten about. nothing you do will get them to upload.

I've disabled ATM until they get it together with them.
ID: 59978 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile ServicEnginIC
Avatar

Send message
Joined: 24 Sep 10
Posts: 592
Credit: 11,972,186,510
RAC: 1,447
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 59979 - Posted: 24 Feb 2023, 21:20:56 UTC - in response to Message 59978.  

On past chance, I bet and lost.
Currently, I'm only processing ACEMD tasks, when available. I happened to catch one this morning.
ID: 59979 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 0
Level
Trp
Scientific publications
watwatwat
Message 59980 - Posted: 24 Feb 2023, 23:20:56 UTC

GDF, Should I Abort these 12 completed ATM WUs that won't upload or is there a reasonable chance you'll fix it?
ID: 59980 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
zombie67 [MM]

Send message
Joined: 16 Jul 07
Posts: 209
Credit: 5,496,860,456
RAC: 12,111
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 59981 - Posted: 25 Feb 2023, 1:18:39 UTC

Well, I just achieved my 100 hours, which was my 1st priority. I will abort and reset (if necessary) the completed tasks I have. If/when the project gets its act together, I'll be back.
Reno, NV
Team: SETI.USA
ID: 59981 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
gemini8
Avatar

Send message
Joined: 3 Jul 16
Posts: 31
Credit: 2,248,809,169
RAC: 0
Level
Phe
Scientific publications
watwat
Message 59986 - Posted: 26 Feb 2023, 11:04:13 UTC

For me it's just this:
So 26 Feb 2023 11:57:00 CET | GPUGRID | Started upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0
So 26 Feb 2023 11:57:02 CET | GPUGRID | Backing off 04:12:16 on upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0
So 26 Feb 2023 11:57:19 CET | GPUGRID | Started upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0
So 26 Feb 2023 11:57:22 CET | GPUGRID | Backing off 05:10:06 on upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0

No message about the size, just about backing off.
Hooray!
- - - - - - - - - -
Greetings, Jens
ID: 59986 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
FritzB

Send message
Joined: 7 Apr 15
Posts: 17
Credit: 2,978,057,945
RAC: 73
Level
Phe
Scientific publications
wat
Message 59988 - Posted: 26 Feb 2023, 11:56:45 UTC - in response to Message 59986.  

I just aborted the upload (not the workunit) and then it was reported as valid.

https://www.gpugrid.net/results.php?hostid=604029
ID: 59988 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
gemini8
Avatar

Send message
Joined: 3 Jul 16
Posts: 31
Credit: 2,248,809,169
RAC: 0
Level
Phe
Scientific publications
watwat
Message 59989 - Posted: 26 Feb 2023, 13:55:33 UTC - in response to Message 59988.  

I just aborted the upload (not the workunit) and then it was reported as valid.

https://www.gpugrid.net/results.php?hostid=604029

Indeed, this worked out for me as well.
But is there a result that can be used?
- - - - - - - - - -
Greetings, Jens
ID: 59989 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ian&Steve C.

Send message
Joined: 21 Feb 20
Posts: 1116
Credit: 40,839,470,595
RAC: 6,423
Level
Trp
Scientific publications
wat
Message 59990 - Posted: 26 Feb 2023, 14:03:51 UTC - in response to Message 59986.  

For me it's just this:
So 26 Feb 2023 11:57:00 CET | GPUGRID | Started upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0
So 26 Feb 2023 11:57:02 CET | GPUGRID | Backing off 04:12:16 on upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0
So 26 Feb 2023 11:57:19 CET | GPUGRID | Started upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0
So 26 Feb 2023 11:57:22 CET | GPUGRID | Backing off 05:10:06 on upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0

No message about the size, just about backing off.
Hooray!


There won’t be any message about why it failed until you enable debugging messages. See the previous link I posted about when this issues happened 1.5 years ago.
ID: 59990 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
kksplace

Send message
Joined: 4 Mar 18
Posts: 53
Credit: 2,815,476,011
RAC: 0
Level
Phe
Scientific publications
wat
Message 59991 - Posted: 26 Feb 2023, 14:12:44 UTC - in response to Message 59988.  

I just aborted the upload (not the workunit) and then it was reported as valid.


Partially successful for me. I attempted with two of these and one ended up as "Upload failed" while the other "Completed and validated".
ID: 59991 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
fzs600

Send message
Joined: 14 Nov 10
Posts: 4
Credit: 1,497,192,557
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 59992 - Posted: 26 Feb 2023, 15:57:47 UTC - in response to Message 59988.  

I just aborted the upload (not the workunit) and then it was reported as valid.

https://www.gpugrid.net/results.php?hostid=604029

Indeed, this worked out for me as well.
ID: 59992 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey

Send message
Joined: 2 Jan 09
Posts: 303
Credit: 7,321,800,090
RAC: 330
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 59993 - Posted: 26 Feb 2023, 16:27:45 UTC - in response to Message 59992.  

I just aborted the upload (not the workunit) and then it was reported as valid.

https://www.gpugrid.net/results.php?hostid=604029

Indeed, this worked out for me as well.


It worked on multiple pc's for me too
ID: 59993 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Speedy

Send message
Joined: 19 Aug 07
Posts: 46
Credit: 45,339,082
RAC: 46
Level
Val
Scientific publications
watwatwatwatwatwatwat
Message 60008 - Posted: 4 Mar 2023, 3:36:01 UTC - in response to Message 59755.  

task ran to completion in about an hour. but hit an error and threw it all away because the file size is too big.

upload failure: <file_xfer_error>
  <file_name>T11_4-RAIMIS_TEST_ATM-0-1-RND7054_2_0</file_name>
  <error_code>-131 (file size too big)</error_code>
</file_xfer_error>


what a waste.

No it's not a waste in my opinion because you found something out. You found that "the file size was too big" so it can be corrected so it doesn't happen again hopefully. :-)
ID: 60008 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Erich56

Send message
Joined: 1 Jan 15
Posts: 1166
Credit: 12,260,898,501
RAC: 1
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwat
Message 60010 - Posted: 4 Mar 2023, 6:31:15 UTC

this now is a topic also on this thread:
https://www.gpugrid.net/forum_thread.php?id=5379
which has been opened by the developer Quico
ID: 60010 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Magiceye04

Send message
Joined: 1 Apr 09
Posts: 24
Credit: 67,905,687
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwat
Message 60178 - Posted: 25 Mar 2023, 13:28:56 UTC

How can I get ATM ?
Serverstatus tells me, there are more then hundred WUs ready to send at the moment.
Boinc Manager tells me:
Sa 25 Mär 2023 14:20:07 CET | GPUGRID | No tasks are available for ATM: Free energy calculations of protein-ligand binding

The PC is running with Ubuntu 20LTS, Geforce1070ti and driver 470.16
ID: 60178 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Ian&Steve C.

Send message
Joined: 21 Feb 20
Posts: 1116
Credit: 40,839,470,595
RAC: 6,423
Level
Trp
Scientific publications
wat
Message 60179 - Posted: 25 Mar 2023, 13:38:05 UTC - in response to Message 60178.  

you need to enable beta/test applications in your project preferences
ID: 60179 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Number crunching : ATM: Free Energy Calculations new application

©2025 Universitat Pompeu Fabra