Full-atom molecular dynamics for Cell processor 5.03

Message boards : Number crunching : Full-atom molecular dynamics for Cell processor 5.03
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile Bender10
Avatar

Send message
Joined: 3 Dec 07
Posts: 167
Credit: 8,368,897
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 932 - Posted: 3 Mar 2008, 23:36:45 UTC
Last modified: 3 Mar 2008, 23:45:30 UTC

HOLY Crap!!!

That TIM3 wu (#15017), finally finished successfully. 45.6 hours.

Here is the Task data (again) http://www.ps3grid.net/result.php?resultid=19218.

At least it finished....

It shows a message (error?) in the middle of the stderr out file.

<core_client_version>5.10.6</core_client_version>
<![CDATA[
<stderr_txt>
ENC
# number of SPEs used 6
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
.....
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
FILE_LOCK::unlock(): close failed.: Bad file descriptor
called boinc_finish
ENC
# number of SPEs used 6
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
.....
</stderr_txt>

EDIT: It almost looks like it ran twice the normal time due to an error at the end of a normal run??



Consciousness: That annoying time between naps......

Experience is a wonderful thing: it enables you to recognize a mistake every time you repeat it.
ID: 932 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 934 - Posted: 4 Mar 2008, 9:35:07 UTC - in response to Message 932.  

HOLY Crap!!!

That TIM3 wu (#15017), finally finished successfully. 45.6 hours.

Here is the Task data (again) http://www.ps3grid.net/result.php?resultid=19218.

At least it finished....

It shows a message (error?) in the middle of the stderr out file.

5.10.6

ENC
# number of SPEs used 6
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
.....
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
FILE_LOCK::unlock(): close failed.: Bad file descriptor
called boinc_finish
ENC
# number of SPEs used 6
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
.....


EDIT: It almost looks like it ran twice the normal time due to an error at the end of a normal run??


I will check on this. Thanks for letting it run.
ID: 934 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 935 - Posted: 4 Mar 2008, 9:42:44 UTC - in response to Message 934.  

HOLY Crap!!!

That TIM3 wu (#15017), finally finished successfully. 45.6 hours.

Here is the Task data (again) http://www.ps3grid.net/result.php?resultid=19218.

At least it finished....

It shows a message (error?) in the middle of the stderr out file.

5.10.6

ENC
# number of SPEs used 6
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
.....
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
FILE_LOCK::unlock(): close failed.: Bad file descriptor
called boinc_finish
ENC
# number of SPEs used 6
B no 0 ./restart.coor ./restart.vel
B no 0 ./restart.coor ./restart.vel
.....


EDIT: It almost looks like it ran twice the normal time due to an error at the end of a normal run??


I will check on this. Thanks for letting it run.



YES. There is a problem. Restart does not work properly. If you run it continuously it should work, but if you suspend it, it starts from the beginning. I will patch it now.
ID: 935 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile UL1

Send message
Joined: 16 Sep 07
Posts: 56
Credit: 35,013,195
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 939 - Posted: 5 Mar 2008, 19:58:56 UTC

WU14776 ran for about 56 hours with zero % progress and 24 hours of remaining time, before I restarted it. But again only the calculation time is running and the other two indicatoirs seem to be stuck. Should I abort this WU? Deadline is nearing...
ID: 939 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 940 - Posted: 6 Mar 2008, 8:39:24 UTC - in response to Message 939.  

WU14776 ran for about 56 hours with zero % progress and 24 hours of remaining time, before I restarted it. But again only the calculation time is running and the other two indicatoirs seem to be stuck. Should I abort this WU? Deadline is nearing...


Yes please. See the thread on workunits *TIM3*.
g
ID: 940 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Full-atom molecular dynamics for Cell processor 5.03

©2025 Universitat Pompeu Fabra