Message boards :
Number crunching :
GPUGrid causes blue screen.
Message board moderation
| Author | Message |
|---|---|
|
Send message Joined: 8 Dec 12 Posts: 23 Credit: 182,017,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Hello. Today GPUGrid has just started to crash my system. Basically the system loads fine, until BOINC begins and the tasks start, then computer rapidly descends into blue screen city with the error caused by file nvlddmkm.sys I went into safe mode and disabled BOINC on start up, and ran all other tasks other than the ones on GPUGrid, they all completed fine. Is it possible the current tasks (or one of them) is a little buggy? I've managed to max out my cards (both) on games, and the cards themselves are responding and working fine, when either or both are maxed out, except if its being used by GPUGrid. I was doing tasks 14147186 & 14146656, which are now aborted (for perhaps obvious reasons) I assume someone in the know can track down those tasks and identify any potential bum ups? I've been running GPUGrid (and BOINC and lots of other projects) for years now, this is the first instance of this. |
|
Send message Joined: 8 Dec 12 Posts: 23 Credit: 182,017,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
This is incorrect, those are the two waiting to be done i cancelled, the ones in progress were 14146005 & 14143996 |
|
Send message Joined: 2 Sep 12 Posts: 16 Credit: 609,890,687 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Typically this kind of problem is related to temp or power. These tasks can really push a GPU. When I first started, I had power and temperature problems. It's hard to tell absent any details on your system. If your temp is fine, and you're feed it sufficient power...then you might completely uninstall and reinstall your driver. I had to do that myself after a recent update. -MichaelMac |
|
Send message Joined: 8 Dec 12 Posts: 23 Credit: 182,017,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Well as part of my debugging, i got some games to run on both cards, and max it out at 100%, then i loaded two games and forced them onto the two seperate cards, giving both of them an 80 to 100% average load, nothing bad happened. So I assume power wise and card wise, everything is going fine. As for the system: Windows 7 Intel i5-4690 (4x 3.5GHz) 550W PSU 2X GFX Cards, nVidia GeForce GTX650Ti & GT 640 (both 2GB variants) 8GB system RAM As mentioned before, I've been running GPUGrid on this system for a while with no problems prior. The error occurs at run time, as soon as the BOINC manager loads, so the temperatures never really get above 35'C anyway. |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Your problem could be caused by insufficient power, so I would check all power connectors (MB+CPU+GPU) and then take one of the GPUs out, and test the system with GPUGrid. I would do a file system check and then I would try these steps. BTW the GPUGrid app uses different parts of the GPU than a game does, so you couldn't validate your system for the GPUGrid app by testing it with games. What is the exact type (and manufacturer) of your PSU? What is its efficiency rating (80+, Bronze, Silver, Gold, Platinum)? |
|
Send message Joined: 8 Dec 12 Posts: 23 Credit: 182,017,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
PSU rating is 80$ efficiency all stress tests I have done find no problem, psu provided the power needed, but when gpugrid goes it, all goes to pot. Anyone got any idea what exactly is the problem. I'd love to keep supporting the cause |
|
Send message Joined: 26 Jun 09 Posts: 815 Credit: 1,470,385,294 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Could be several issues. Can be the driver (I never use the latest driver), can be the WU. I see it are SDOER's where you have the problems. So perhaps that WU and your specific set-up don't play nice together. Or it could be another program on the back that causes this after running good for a long time. I had that once and it took months before I found that out. PCAngel was causing me the blue screens as soon BOINC started running. You can try to take one card out and let it run with only one card. Let a WU finish. Then test the other card. If it are SDOER's, wait to get another WU and see what happens. Does it run fine with one card put the second one back in and test again. No joy then revert to another stable driver i.e. 347.88 If all no luck then post back here and we can have a look again. Hope this helps a bit. Greetings from TJ |
|
Send message Joined: 2 Sep 12 Posts: 16 Credit: 609,890,687 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I notice that you're running version 350.12 of the NVidia driver. I just finished debugging really strange errors using that version of the driver. I couldn't finish a wu, but the errors (when I got them) were unknown. Granted, I was running on Win XP, but it may be related to your problem. Get DDU at http://www.guru3d.com/files-details/display-driver-uninstaller-download.html, this is a "complete" uninstaller. I uninstalled the driver using Windows. I rebooted in Safe mode. I ran DDU and did a uninstall. I then installed version 344.75 of the driver. Everything worked great again after that. -MichaelMac |
|
Send message Joined: 11 Oct 08 Posts: 1127 Credit: 1,901,927,545 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
.. hmm... Did you have a power outage recently? Because there is a bug where ... if GPUGrid tasks are stopped abruptly from a power outage, then when they restart, they TDR infinitely until they BSOD. Maybe you could try restarting with GPU computing suspended, then aborting problematic tasks, then seeing if new ones produce the same problem or not? Or, if you already did this, can you tell us if the problems happened just on those 2 tasks? |
|
Send message Joined: 8 Dec 12 Posts: 23 Credit: 182,017,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Hm. That might be it then! My dog knocked the PCs power cable out before this occured. I thought nothing of it at the time. Perhaps that is the cause? It's all working as normal now, on all 3 of my machines. If that is the cause, then nasty little glitch there. Hope it can be resolved. |
|
Send message Joined: 6 Feb 10 Posts: 38 Credit: 274,204,838 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]()
|
I had the same problem. Was running on Windows 7. I installed Windows 8.1 Pro and running so far ok. I think windows needed a refresh. So far so good. |
|
Send message Joined: 6 Feb 10 Posts: 38 Credit: 274,204,838 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]()
|
Nope. Problem is back, but without blue screen. Just freezes and UPS starts beeping because of overload. Connected directly to the wall and same problem, it freezes. Always when the work units are reaching the 80% or above mark. Only happens with GPUGRID work units running. Using driver 350.12 Any ideas? |
|
Send message Joined: 26 Jun 09 Posts: 815 Credit: 1,470,385,294 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Temperature of the room and temperature of the GPU(s)? Greetings from TJ |
|
Send message Joined: 2 Sep 12 Posts: 16 Credit: 609,890,687 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
yeah...I had similar problems with driver 350.12. I went back a revision. I describe in my post, above, what I did. -MichaelMac |
|
Send message Joined: 6 Feb 10 Posts: 38 Credit: 274,204,838 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]()
|
Okay. I am doing that now. Thanks don't think heat is an issue. Using GPUZ and all okay. Remember,it was doing it at 80% and above on the work unit progress. Will check and report back. |
|
Send message Joined: 11 Oct 08 Posts: 1127 Credit: 1,901,927,545 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I believe the problems the original poster, "Redirect Left", had, were caused by a GPUGrid's inability to properly start tasks after a power interruption. I believe that other problems that are unrelated to a power interruption, should probably be in their own thread, maybe. |
|
Send message Joined: 26 Jun 09 Posts: 815 Credit: 1,470,385,294 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I believe the problems the original poster, "Redirect Left", had, were caused by a GPUGrid's inability to properly start tasks after a power interruption. If I read the first post then I do not read anything about a power interruption. Greetings from TJ |
|
Send message Joined: 6 Feb 10 Posts: 38 Credit: 274,204,838 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]()
|
I did have some power interruptions or spikes and the computer does shut down even with the UPS. But I don't think it's power related to the house because all electric clocks are still working okay, not blinking. It's like if the computer spiked and then the UPS beeps as if something had happened and then you get the blue screen. That was with windows 7. With Windows 8.1 Pro, I heard the UPS beeping and it had an overload emblem on it and the screen had freezed. I rebooted and started up Boinc again and as soon as the work units started, the freeze and UPS beeping again. So I uninstalled Boinc and changed the driver to the one I mentioned. I am waiting to see when the work unit reaches 80% or more if it does it again. I am using a MSI X87 gamer MB and six gpus with X1 USB powered risers. wasn't having any trouble until yesterday. If I use milkyway@home or seti@home, this does not happen ever, only gpugrid. Will let you know later on what happens. Thanks for the responses!! |
|
Send message Joined: 8 Dec 12 Posts: 23 Credit: 182,017,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I believe the problems the original poster, "Redirect Left", had, were caused by a GPUGrid's inability to properly start tasks after a power interruption This is correct. Reinstalling or downgrading drivers didn't resolve the issue. The only resolution I found was to start in safe mode and disable the bad tasks, as starting normally BOINC + tasks loaded before i had the chance to terminate them and prevent the bluescreen. I'd suggest the problems I have seen after my reply are unrelated. |
|
Send message Joined: 11 Oct 08 Posts: 1127 Credit: 1,901,927,545 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I believe the problems the original poster, "Redirect Left", had, were caused by a GPUGrid's inability to properly start tasks after a power interruption. You'll have to read more than the first post, I'm afraid. https://www.gpugrid.net/forum_thread.php?id=4082&nowrap=true#41037 |
©2026 Universitat Pompeu Fabra