Message boards :
Number crunching :
ATM: Free Energy Calculations new application
Message board moderation
Previous · 1 · 2 · 3 · Next
| Author | Message |
|---|---|
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
keep in mind these are bata tasks, and the batch being sent NOW are not necessarily the same as the batch sent last week and wont be the same as whatever is sent sometime in the future, until they get all the bugs worked out. the tasks last week basically ran with no perceived use of the GPU or CPU, so what were they doing? who knows. no official word from the project about these tasks at all. I wasn't willing to let the GPU/CPU be occupied for hours on end with the task spinning it's wheels when they could be doing something more useful. that's great and all, but abouh is not the researcher working with this application. Abouh deals with the research with the Python RL tasks. These ATM tasks look to be being run by Raimis. (the researcher names are in the filenames of the WUs)
|
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
https://gpugrid.net/result.php?resultid=33321222 ran for 10+hours, failed due to file size limit after an otherwise successful computation. :(
|
|
Send message Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
... failed due to file size limit I am just trying to remember with which other application we've had the same problem some time ago - last year or 2 years ago ??? |
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
... failed due to file size limit it's happened a few times in the past with acemd3 tasks. see here from July 2021: https://www.gpugrid.net/forum_thread.php?id=5239#57117
|
|
Send message Joined: 12 Jul 17 Posts: 404 Credit: 17,408,899,587 RAC: 0 Level ![]() Scientific publications ![]() ![]()
|
Yea, I got my first ATM checkpoint :-) Now my list of ATM ULs are stuck. |
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
Yea, I got my first ATM checkpoint :-) the uploads are nearly 700MB in size, and likely the same problem from my link that we saw over a year ago. their server can't accept something that big, I don't think they ever figured out how to adjust the settings of their file server and just tried to keep the file sizes below the limit, which they seem to have forgotten about. nothing you do will get them to upload. I've disabled ATM until they get it together with them.
|
ServicEnginICSend message Joined: 24 Sep 10 Posts: 592 Credit: 11,972,186,510 RAC: 1,447 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
On past chance, I bet and lost. Currently, I'm only processing ACEMD tasks, when available. I happened to catch one this morning. |
|
Send message Joined: 12 Jul 17 Posts: 404 Credit: 17,408,899,587 RAC: 0 Level ![]() Scientific publications ![]() ![]()
|
GDF, Should I Abort these 12 completed ATM WUs that won't upload or is there a reasonable chance you'll fix it? |
|
Send message Joined: 16 Jul 07 Posts: 209 Credit: 5,496,860,456 RAC: 12,111 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Well, I just achieved my 100 hours, which was my 1st priority. I will abort and reset (if necessary) the completed tasks I have. If/when the project gets its act together, I'll be back. Reno, NV Team: SETI.USA |
|
Send message Joined: 3 Jul 16 Posts: 31 Credit: 2,248,809,169 RAC: 0 Level ![]() Scientific publications ![]()
|
For me it's just this: So 26 Feb 2023 11:57:00 CET | GPUGRID | Started upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0 So 26 Feb 2023 11:57:02 CET | GPUGRID | Backing off 04:12:16 on upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0 So 26 Feb 2023 11:57:19 CET | GPUGRID | Started upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0 So 26 Feb 2023 11:57:22 CET | GPUGRID | Backing off 05:10:06 on upload of TL9_55-RAIMIS_TEST_ATM-0-1-RND1804_0_0 No message about the size, just about backing off. Hooray! - - - - - - - - - - Greetings, Jens |
|
Send message Joined: 7 Apr 15 Posts: 17 Credit: 2,978,057,945 RAC: 73 Level ![]() Scientific publications
|
I just aborted the upload (not the workunit) and then it was reported as valid. https://www.gpugrid.net/results.php?hostid=604029 |
|
Send message Joined: 3 Jul 16 Posts: 31 Credit: 2,248,809,169 RAC: 0 Level ![]() Scientific publications ![]()
|
I just aborted the upload (not the workunit) and then it was reported as valid. Indeed, this worked out for me as well. But is there a result that can be used? - - - - - - - - - - Greetings, Jens |
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
For me it's just this: There won’t be any message about why it failed until you enable debugging messages. See the previous link I posted about when this issues happened 1.5 years ago.
|
|
Send message Joined: 4 Mar 18 Posts: 53 Credit: 2,815,476,011 RAC: 0 Level ![]() Scientific publications
|
I just aborted the upload (not the workunit) and then it was reported as valid. Partially successful for me. I attempted with two of these and one ended up as "Upload failed" while the other "Completed and validated". |
|
Send message Joined: 14 Nov 10 Posts: 4 Credit: 1,497,192,557 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I just aborted the upload (not the workunit) and then it was reported as valid. Indeed, this worked out for me as well. |
|
Send message Joined: 2 Jan 09 Posts: 303 Credit: 7,321,800,090 RAC: 330 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I just aborted the upload (not the workunit) and then it was reported as valid. It worked on multiple pc's for me too |
|
Send message Joined: 19 Aug 07 Posts: 46 Credit: 45,339,082 RAC: 46 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
task ran to completion in about an hour. but hit an error and threw it all away because the file size is too big. No it's not a waste in my opinion because you found something out. You found that "the file size was too big" so it can be corrected so it doesn't happen again hopefully. :-) |
|
Send message Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
this now is a topic also on this thread: https://www.gpugrid.net/forum_thread.php?id=5379 which has been opened by the developer Quico |
|
Send message Joined: 1 Apr 09 Posts: 24 Credit: 67,905,687 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]()
|
How can I get ATM ? Serverstatus tells me, there are more then hundred WUs ready to send at the moment. Boinc Manager tells me: Sa 25 Mär 2023 14:20:07 CET | GPUGRID | No tasks are available for ATM: Free energy calculations of protein-ligand binding The PC is running with Ubuntu 20LTS, Geforce1070ti and driver 470.16 |
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
you need to enable beta/test applications in your project preferences
|
©2025 Universitat Pompeu Fabra