Tasks returning compute error

Message boards : Number crunching : Tasks returning compute error
Message board moderation

To post messages, you must log in.

AuthorMessage
Michael E.

Send message
Joined: 15 Aug 19
Posts: 7
Credit: 27,732,011
RAC: 41,631
Level
Val
Scientific publications
wat
Message 62552 - Posted: 12 Jan 2026, 23:51:20 UTC
Last modified: 12 Jan 2026, 23:52:48 UTC

I received some work units (tasks) today but most of them failed with a "error while computing" error.

The error occurs usually within 1-2 minutes and one task went 7-8 minutes. Another task is still running (80% done),

Error encountered on two Windows 11 PCs both running BOINC v 8.2.8 with different NVidia GPUs.

The first is an older PC where a task is still running: https://gpugrid.net/gpugrid/show_host_detail.php?hostid=648899 with 3 tasks failed and 1 still processing (type is ATM: Free energy calculations of protein...)

The other PC is a newer PC: https://gpugrid.net/gpugrid/show_host_detail.php?hostid=638123 with 3 tasks failed.
ID: 62552 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Steve Dodd

Send message
Joined: 26 Dec 08
Posts: 19
Credit: 4,622,334,506
RAC: 167,146
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 62553 - Posted: 13 Jan 2026, 0:00:25 UTC

Having same issue.
ID: 62553 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1423
Credit: 9,186,946,190
RAC: 1,288,374
Level
Tyr
Scientific publications
watwatwatwatwat
Message 62554 - Posted: 13 Jan 2026, 7:17:55 UTC - in response to Message 62553.  

It usually takes the researchers a few small batches of tasks to sort out the proper configuration parameters. Only when the small batches are mostly successful do they give a larger, longer lasting batch.
ID: 62554 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Michael E.

Send message
Joined: 15 Aug 19
Posts: 7
Credit: 27,732,011
RAC: 41,631
Level
Val
Scientific publications
wat
Message 62555 - Posted: 13 Jan 2026, 14:55:18 UTC - in response to Message 62554.  

Thank you Keith! Glad to see progress and glad to help.

The first task did complete successfully. A second task last night failed after multiple hours.
ID: 62555 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
SpekAal

Send message
Joined: 12 Aug 25
Posts: 2
Credit: 55,500,000
RAC: 541,603
Level
Thr
Scientific publications
wat
Message 62556 - Posted: 15 Jan 2026, 9:22:19 UTC

Slot 0 was already occupied by another program, so it was aborted after 7 minutes.
Werkeenheid 31550481

WARNING: The script pyaml.exe is installed in 'C:\ProgramData\BOINC\slots\0\Scripts' which is not on PATH
ID: 62556 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1423
Credit: 9,186,946,190
RAC: 1,288,374
Level
Tyr
Scientific publications
watwatwatwatwat
Message 62557 - Posted: 16 Jan 2026, 2:27:13 UTC - in response to Message 62556.  

Yes, the tasks are much harder to setup correctly on Windows. OTOH, in Linux it is much easier to get the configuration right because of the way that the OS sets up support files and applications. Still doesn't help you when the researcher plainly forgets to include a needed research in the task package and then attempts to use it when it isn't there. At least most of those fail fast, less than a minute. The irksome ones are the ones that seem to be running correctly but eventually hits a NaN error after many hours. Again, expect to have many errors for the first batch of work that gets sent out.
ID: 62557 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Erich56

Send message
Joined: 1 Jan 15
Posts: 1168
Credit: 12,311,898,501
RAC: 331,341
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwat
Message 62558 - Posted: 16 Jan 2026, 12:00:18 UTC - in response to Message 62557.  
Last modified: 16 Jan 2026, 12:00:35 UTC

...The irksome ones are the ones that seem to be running correctly but eventually hits a NaN error after many hours...
I have had exactly those quite frequently within the past 2 days :-(

https://gpugrid.net/gpugrid/results.php?userid=125700&offset=40&show_names=0&state=0&appid=
ID: 62558 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
SpekAal

Send message
Joined: 12 Aug 25
Posts: 2
Credit: 55,500,000
RAC: 541,603
Level
Thr
Scientific publications
wat
Message 62562 - Posted: 18 Jan 2026, 22:14:16 UTC

It's a shame that all the p38_A15_A13 etc. don't even reach 1%, 17 succeeded, 22 failed, too bad it was always going smoothly.
ID: 62562 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Erich56

Send message
Joined: 1 Jan 15
Posts: 1168
Credit: 12,311,898,501
RAC: 331,341
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwat
Message 62563 - Posted: 19 Jan 2026, 8:10:07 UTC - in response to Message 62562.  

It's a shame that all the p38_A15_A13 etc. don't even reach 1%, 17 succeeded, 22 failed, too bad it was always going smoothly.
as I already wrote in the GPUGRID Discord channel: I am surprised that obviously, before issuing a new batch, none of the tasks are being tested before being sent out.
ID: 62563 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Tasks returning compute error

©2026 Universitat Pompeu Fabra