Message boards :
Graphics cards (GPUs) :
Error units PAOLA
Message board moderation
| Author | Message |
|---|---|
[AF>WildWildWest] Al TarfSend message Joined: 22 Oct 10 Posts: 6 Credit: 10,043,483 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
For three days, all units go to error (paola), where does the problem? I specify that the units (IBUCH) calculated without problems! Sorry for my English, I write it with google translation. thank you My pc: Athlon II X3, GTX 550ti, ubuntu. |
dskagcommunitySend message Joined: 28 Apr 11 Posts: 463 Credit: 958,266,958 RAC: 31 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Ah im not the only one with this problem ^^ is it only with app 42 like me? DSKAG Austria Research Team: http://www.research.dskag.at
|
[AF>WildWildWest] Al TarfSend message Joined: 22 Oct 10 Posts: 6 Credit: 10,043,483 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Hi, yes, cuda 42 PAOLA. My graphics card is compatible with CUDA 4.2, I do not understand? |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I've checked the error message of these 2HDQ_43_9-PAOLA_2HDQ workunits in their stderr outpuf file, and I've find out that every one of them on every host I've checked resulted in ERROR: Failed to parse input file. So I've came to the conclusion that the source of this error is not on your side. This batch of 2HDQ_43_9-PAOLA_2HDQ workunits are messed up somehow. |
[AF>WildWildWest] Al TarfSend message Joined: 22 Oct 10 Posts: 6 Credit: 10,043,483 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Oh okay, thank you for your answer! |
[AF>WildWildWest] Al TarfSend message Joined: 22 Oct 10 Posts: 6 Credit: 10,043,483 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
it looks like it arranges, I have a unit (PAOLA) that runs without error for 27 minutes. |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
it looks like it arranges, I have a unit (PAOLA) that runs without error for 27 minutes. This workunit comes from a different batch: 3EKO_8_10-PAOLA_3EKO_8LIGs |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
There's definitely problems with the PAOLA_2HDQ batch. These tasks all fail on all cards no matter which app is used (3.1 or 4.2). Fortunately they all fail after a few seconds:
<core_client_version>7.0.24</core_client_version> <![CDATA[ <message> process exited with code 98 (0x62, -158) </message> <stderr_txt> ERROR: file mdsim.cpp line 167: Failed to parse input file 20:12:31 (5458): called boinc_finish </stderr_txt> ]]>
FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 9 May 12 Posts: 16 Credit: 8,100 RAC: 0 Level ![]() Scientific publications
|
Hi dear volunteers, There was such a stupid error in the input file :p, I fixed it and now I am going to submit again the system (500 WU) on acemd short (12500 credits for WU), the group is 2HDQbis. thanks for your patience and your computing time :D Paola |
|
Send message Joined: 8 Mar 12 Posts: 411 Credit: 2,083,882,218 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
This is not a parse file error, but this WU has failed on 2 other rigs. One was with 295 so maybe their end, but the other was with a 470 301.42 driver. 3EKO_36_9_step_13_20_7-PAOLA_ADAPT-10-20-RND9221 <core_client_version>7.0.25</core_client_version> <![CDATA[ <message> The system cannot find the path specified. (0x3) - exit code 3 (0x3) </message> <stderr_txt> MDIO: cannot open file "restart.coor" SWAN : FATAL : Cuda driver error 999 in file 'swanlibnv2.cpp' in line 1574. Assertion failed: a, file swanlibnv2.cpp, line 59 Cheers |
dskagcommunitySend message Joined: 28 Apr 11 Posts: 463 Credit: 958,266,958 RAC: 31 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
There are still so much errors that wasting energy on resent failed workunits :( DSKAG Austria Research Team: http://www.research.dskag.at
|
|
Send message Joined: 8 Mar 12 Posts: 411 Credit: 2,083,882,218 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Gotta take the good with the bad. This is the only WU that has crashed for me. Things happen. Cheers. |
Raptures RiotSend message Joined: 30 Apr 11 Posts: 6 Credit: 220,588,795 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Agreed to take the bad with the good. Just had a 1E2I fail after 7 hours. I get occassional driver crashes and permanently frozen displays with the 4.2's. All 4.2's seem 'touch and go' regardless of the author. Any further refinements would be much appreciated. |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Try this post or this post, among others, in the "FAQ - Why does my run fail? Some answers." FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
[AF>WildWildWest] Al TarfSend message Joined: 22 Oct 10 Posts: 6 Credit: 10,043,483 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I have error with new PAOLA units "1H46 RNP" ? |
|
Send message Joined: 19 Aug 07 Posts: 46 Credit: 45,339,082 RAC: 34 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
I had 1H46_19_9-PAOLA_1H46_RNP-6-100-RND5172_1 fail with <core_client_version>6.10.60</core_client_version> <![CDATA[ <message> The system cannot find the path specified. (0x3) - exit code 3 (0x3) </message> <stderr_txt> MDIO: cannot open file "restart.coor" SWAN : FATAL : Cuda driver error 999 in file 'swanlibnv2.cpp' in line 1574. Assertion failed: a, file swanlibnv2.cpp, line 59 This application has requested the Runtime to terminate it in an unusual way. Please contact the application's support team for more information. </stderr_txt> ]]> Ran for 19.29 minutes |
[AF>EDLS]GuLSend message Joined: 7 Jan 09 Posts: 3 Credit: 160,687,223 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Hello, Congratulations to all the team for your job. All actual units (PAOLA) are exiting with error code 247, see 5747509 for instance. Stderr output All this units are using cuda42. The only cuda31 unit I got was going fine until I've done a mistake, using the gpu at the same time. My card is a GTX260, on a freshly installed fedora 17 system, with NVIDIA driver 304.32 and cuda Toolkit 4.2.9. I have followed the procedure at http://doc.fedora-fr.org/wiki/Cuda. The GTK toolkit is working fine and primegrid also. What are this errors due to ? Thank you for your help |
StoneagemanSend message Joined: 25 May 09 Posts: 224 Credit: 34,057,374,498 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
200 series cards do not work with cuda 4.2 tasks under Linux! |
[AF>EDLS]GuLSend message Joined: 7 Jan 09 Posts: 3 Credit: 160,687,223 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Ok, thanks for the answer. It this case is there a way to have only cuda31 units ? Cheers |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Use an older driver, to prevent getting the CUDA4.2 app: Uninstall the present driver completely. Find something pre-CUDA 4.2 (or whatever the dll's actually are). Probably around 265 to 285 should be good. Install these. Reset project and you should only get the 3.1app and thus 3.1tasks. FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
©2025 Universitat Pompeu Fabra