All tasks failed with Exit status 195 (0xc3) EXIT_CHILD_FAILED
| Author | Message |
|---|---|
|
Joined: 30 Jun 14 Posts: 153 Credit: 129,654,684 RAC: 0
|
"Disk space limits can be solved by tweaking BOINC's limits." OK, thanks... fixed. |
|
Joined: 30 Jun 14 Posts: 153 Credit: 129,654,684 RAC: 0
|
New problem... I stopped last night with 98% done and about an hour and a half to go on the task. I did all the normal shutdown procedures: suspend all computing, shut down the client, exit the program. When I restarted this morning the task had gone to hell: time to finish 159 days, 2% done, and time remaining counting UP, not down. CPU time: 6d 11:39:36; CPU time since checkpoint: 00:14:10; elapsed time: 3d 06:14:26; estimated time remaining: 159d 17:47:48; fraction done: 2.000%. Now, after several restarts, the time remaining goes down, but it is still 159 days. I had another task that was also close to done, but the server considered it timed out. I guess I missed the deadline. I'll let this task run for a bit longer, but to me it looks all messed up. I don't see anything wrong in stderr or boinc_task_state. |
|
Joined: 30 Jun 14 Posts: 153 Credit: 129,654,684 RAC: 0
|
Different question: when looking at the CPU% in the BoincTasks program, why do I see 197% and 131% CPU usage? Is that just how these tasks work? I thought the CPU was for control and guidance only. This almost looks like it is processing as well. |
|
Joined: 13 Dec 17 Posts: 1423 Credit: 9,189,196,190 RAC: 1,326,743
|
It is normal for tasks to temporarily revert to 2% completion upon restart. But within just a few minutes they jump back to the completion percentage they had reached when they were stopped, and then continue on to the finish. At least that is what they always do on all my Linux hosts. But I have seen similar comments from others running Windows. Probably best not to chance stopping them on Windows. The application does in fact use the cpu. Quite a bit, in fact. The task will jump back and forth from running on the cpu to a quick spurt on the gpu and then back to the cpu. The tasks spawn 32 individual python processes on the cpu, so you are really using more than 100% of a single cpu core. That is what BoincTasks is detecting and showing. From Message 59980: "The reason Reinforcement Learning agents do not currently use the whole potential of the cards is because the interactions between the AI agent and the simulated environment are performed on CPU while the agent "learning" process is the one that uses the GPU intermittently." |
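As a rough cross-check of the multi-process explanation, the counters quoted earlier in the thread (CPU time 6d 11:39:36, elapsed time 3d 06:14:26) can be divided to get the average number of cores the task kept busy. This is only a sketch: the helper name `parse_boinc_duration` is made up for illustration and is not part of BOINC or BoincTasks.

```python
import re


def parse_boinc_duration(text: str) -> int:
    """Convert a duration such as '6d 11:39:36' or '00:14:10' to seconds.

    Hypothetical helper for illustration only; not a BOINC API.
    """
    match = re.fullmatch(r"(?:(\d+)d\s+)?(\d+):(\d{2}):(\d{2})", text.strip())
    if match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    # The day group is optional, so it may be None.
    days, hours, minutes, seconds = (int(g) if g else 0 for g in match.groups())
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


# Figures copied from the task properties posted earlier in the thread.
cpu_seconds = parse_boinc_duration("6d 11:39:36")
elapsed_seconds = parse_boinc_duration("3d 06:14:26")

# CPU time summed over all processes, divided by wall-clock time.
average_cores = cpu_seconds / elapsed_seconds
print(f"average CPU usage: {average_cores:.0%}")
```

A ratio close to 2 would be in line with the 197% that BoincTasks reported, since CPU time is summed over every process the task spawns.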
|
Joined: 22 Oct 10 Posts: 42 Credit: 1,758,800,315 RAC: 40,420
|
The failure rate on the GPU tasks has reached the point where I feel it is a waste to even try to explain the processes of the failures: 97 out of 101 tasks have failed on either a GTX 1060 or an RTX 3080, and I aborted the RTX task after it wasted 5+ days of running time, exceeded the return time limit, and still had double-digit days remaining. The three tasks that succeeded used only about 1800 to 3500 seconds of run time. My patience has expired and I am terminating tasking on Grid for a couple of weeks or so; perhaps the problem can be solved using internal GPUs. Added comment: Just for the hell of it, I downloaded a new task just now on the GTX 1060 machine and the initial time to compute was shown as 30 DAYS; OH SURE!!! This does not constitute a sound confidence builder. Billy Ewell 1931 (Yes, my year of birth) |
|
Joined: 13 Dec 17 Posts: 1423 Credit: 9,189,196,190 RAC: 1,326,743
|
Sorry to see you go. The estimated time to complete values can be completely ignored at GPUGrid. BOINC has no mechanism for computing the time remaining for tasks of this dual cpu-gpu nature, so it cannot estimate the time to complete correctly. On modern gpus of at least Pascal generation, the tasks complete well within the standard 5 day deadline. Typical compute times are around 20 minutes to 12 hours. Windows needs to be set up correctly, however, to run these tasks properly: the Windows pagefile size needs to be increased to around 35-50 GB for the tasks to run and finish properly. |
|
Joined: 30 Jun 14 Posts: 153 Credit: 129,654,684 RAC: 0
|
Billy, scroll down this thread a bit. There is a post where Keith gives some upper and lower limits to the page file size. This cleared things up for me really fast. I run a 1080 and a 1050 and once I did the page file setting I have never had an error on either card. Run time is about 3 days on these cards, but I am sharing them with Folding At Home, so that might slow things down a bit. |
|
Joined: 13 Dec 17 Posts: 1423 Credit: 9,189,196,190 RAC: 1,326,743
|
Thanks for the confirmation Greg that the Python tasks CAN in fact be properly run to completion well within their deadlines AS LONG as Windows is configured correctly. Glad to hear you are successfully processing this new work and contributing to cutting edge science. |
|
Joined: 18 Jul 13 Posts: 79 Credit: 218,028,292 RAC: 180,230
|
Try installing BOINC on Rocky Linux 8 in VMware Workstation Player. It is free for home use. |
|
Joined: 30 Jun 14 Posts: 153 Credit: 129,654,684 RAC: 0
|
"Thanks for the confirmation Greg that the Python tasks CAN in fact be properly run to completion well within their deadlines AS LONG as Windows is configured correctly." Just chugging along now. Once that swap space issue was taken care of, no problems. This is a Win10 machine with AMD Ryzen. |
God is Love, Jesus proves it. ... Joined: 23 Mar 15 Posts: 1 Credit: 21,695,263 RAC: 0
|
Adria, please fix the bug in your WUs: error code 195. |
|
Joined: 11 Jul 09 Posts: 1639 Credit: 10,159,968,649 RAC: 2
|
"(unknown error) - exit code 195 (0xc3)" A GeForce GTX 1660 Ti should be OK: check your drivers. |
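The two numbers in that message are the same value written in decimal and in hexadecimal. A small sketch of the conversion follows; the one-entry lookup table is only an illustration built from this thread's title and is not a complete list of BOINC exit codes.

```python
# Name taken from the thread title; illustrative only, not a full BOINC table.
KNOWN_EXIT_CODES = {195: "EXIT_CHILD_FAILED"}


def describe_exit_code(code: int) -> str:
    """Format an exit status as 'decimal (hex) NAME'."""
    name = KNOWN_EXIT_CODES.get(code, "unknown")
    # '#04x' renders the value as lowercase hex with a '0x' prefix.
    return f"{code} ({code:#04x}) {name}"


print(describe_exit_code(195))
```

The code only says that a child process of the task failed; the reason still has to be read from the task's stderr output.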
|
Joined: 22 Oct 10 Posts: 42 Credit: 1,758,800,315 RAC: 40,420
|
Keith: Thanks for the input, but I am personally cautious about changing items for fear I will screw up what I cannot fix. Here are the current page file settings, on automatic; I have changed nothing so far. As currently specified: minimum allowed 16 MB; recommended 4957 MB; currently allocated 45056 MB. As I understand it, the suggestion is that I untick the automatic setting and set the minimum to 35 and the others to ?????. Awaiting your reply. Bill |
|
Joined: 18 Jul 13 Posts: 79 Credit: 218,028,292 RAC: 180,230
|
Try setting it to 51200. |
|
Joined: 13 Dec 17 Posts: 1423 Credit: 9,189,196,190 RAC: 1,326,743
|
"Keith: Thanks for the input but I am personally cautious in changing items for fear I will screw up what I cannot fix." Those settings pages are denominated in MB, not GB, and GB is the scale the Python tasks need. So you need to multiply your 35 by 1000, in other words 35000 MB. |
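The GB-to-MB arithmetic can be sketched as follows. Both conventions appear in this thread: 35000 MB is 35 x 1000, while the 51200 suggested earlier is 50 x 1024. The function name is made up for illustration, and nothing here changes any Windows setting; the resulting numbers are typed by hand into the virtual memory dialog.

```python
def gb_to_mb(gigabytes: int, binary: bool = False) -> int:
    """Convert a size in GB to MB.

    binary=False uses 1 GB = 1000 MB (the "times 1000" rule of thumb);
    binary=True uses 1 GB = 1024 MB.
    """
    return gigabytes * (1024 if binary else 1000)


initial_mb = gb_to_mb(35)               # lower bound of the 35-50 GB advice
maximum_mb = gb_to_mb(50, binary=True)  # upper bound, binary convention
print(f"Initial size: {initial_mb} MB, maximum size: {maximum_mb} MB")
```

Either convention lands inside the 35-50 GB window recommended in this thread, so the choice between them should not matter in practice.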
|
Joined: 22 Oct 10 Posts: 42 Credit: 1,758,800,315 RAC: 40,420
|
Keith Myers and Kotenok2000: Since I reset the pagefile to the recommended values I have processed bunches of tasks without a skip. Thanks for the great advice. BET The bottom number is 35000 MB and the top is 51200 MB. It would seem practical to me for the admins/techs to publish the pagefile criteria in such a way that all contributors will find it easy to find the instructions and likewise easy to modify their machines. |
|
Joined: 13 Dec 17 Posts: 1423 Credit: 9,189,196,190 RAC: 1,326,743
|
I just made a post in the FAQ section about the pagefile mod needed for Python tasks. Just need an admin to make it sticky. |
|
Joined: 14 Feb 20 Posts: 16 Credit: 27,395,983 RAC: 0
|
If the GPUGrid project is willing to ask for and accept the in-kind donations of people's GPU time, then GPUGrid has an obligation to do what they can to resolve problematic tasks and code. If WUs require mods to the defaults in config files, etc., people should NOT have to hunt around in forum posts to glean a solution. BOINC Manager does have a Notices tab, and it is negligent of GPUGrid not to post needed instructions there, or at least a direct link to the specific forum post, for resolution... in particular when the problem is not an isolated issue limited to just a few PCs. Other projects DO extend the courtesy of communicating via the Notices tab. LLP, PhD, Prof. Engr. |
©2026 Universitat Pompeu Fabra