Advanced search

Message boards : Graphics cards (GPUs) : 75% tasks failed with EXIT_CHILD_FAILED

Author Message
[AF>EDLS]zOU
Send message
Joined: 14 May 15
Posts: 1
Credit: 47,810,937
RAC: 0
Level
Val
Scientific publications
watwatwatwat
Message 54280 - Posted: 9 Apr 2020 | 5:21:52 UTC

Hello,

I'm running GPU grid on a couple of systems and recently a LOT of my tasks are failing.

State: All (1274) · Valid (307) · Invalid (0) · Error (958)

All on this host: http://www.gpugrid.net/show_host_detail.php?hostid=500099

Antivirus reports no specific activity, Windows is up-to-date, drivers are up-to-date, card are not manually overclocked. (other GPU projects work fine)

Toni
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: 9 Dec 08
Posts: 988
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 54281 - Posted: 9 Apr 2020 | 10:08:31 UTC - in response to Message 54280.

There is nothing obviously wrong. I'm afraid your card may be failing, or factory overclocked (not expert).

Aurum
Avatar
Send message
Joined: 12 Jul 17
Posts: 295
Credit: 10,242,918,856
RAC: 27,145
Level
Trp
Scientific publications
watwatwat
Message 54284 - Posted: 9 Apr 2020 | 19:52:17 UTC

The WU I looked at failed for 2 others as well before it completed.

# Engine failed: Particle coordinate is nan

{nan = not a number}

I don't see anything. Sometimes you just need to reboot. I run Linux and I always set a 16 GB cache. Yours is only 2 MB but maybe Win10 dynamically sets it. Are you running more than one WU per GPU? I recommend only running one WU per GPU. I have an FX-6300 and it's fine. I like to leave a CPU thread open for overhead etc on all my computers.

Post to thread

Message boards : Graphics cards (GPUs) : 75% tasks failed with EXIT_CHILD_FAILED