Message boards :
Number crunching :
Simulation has become unstable
Message board moderation
| Author | Message |
|---|---|
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
369x-SANTI_MAR422cap310-8-32-RND5608_0 80 (0x50) Unknown error number http://www.gpugrid.net/result.php?resultid=7785880 |
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
369x-SANTI_MAR422cap310-8-32-RND5608_1 -97 (0xffffffffffffff9f) Unknown error number http://www.gpugrid.net/result.php?resultid=7788789 |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Your GPU is too hot. Use MSI Afterburner (or similar) to control the fans and keep the GPU cool (ideally below 70C). When the GPU is >80C failures are common. Your temps would not keep rising if you had a fan profile or specific fan setting in place. FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
thx for your Information. Installed now the Afterburner. Hope it will help :-) |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
You should be able to see the GPU temperature drop as you increase the fan speed. FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I had no problems in the past, neither with long term, Seti or Einstein The fans are dust free. Driver version 334.89 |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
|
|
Send message Joined: 16 Mar 11 Posts: 509 Credit: 179,005,236 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I had no problems in the past, neither That doesn't mean you won't have trouble here, if that's what you're saying. BOINC <<--- credit whores, pedants, alien hunters |
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
The Specifications for the GTX 760: Thermal and Power Specs(in C): 97 C Maximum GPU Temperature http://www.geforce.com/hardware/desktop-gpus/geforce-gtx-760/specifications The normal Temperature under heavy load 80 / 81 C |
|
Send message Joined: 16 Mar 11 Posts: 509 Credit: 179,005,236 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
So what's yer point? You think the tech specs prove that the years of experience skgiven has is wrong? Is that it? Do the tech specs tell you how to crunch GPUgrid? Your GPU is running too hot and don't bother saying you don't have the problem at other projects because those projects don't work the GPU as hard as this project does. BOINC <<--- credit whores, pedants, alien hunters |
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
As English is not my native language, feel free to interpret whatever you want in my words. Anyway, have a nice day. Killersocke -- |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
As English is not my native language, feel free to interpret whatever you want in my words. Did you set a fan curve or static rate? Did this improve things? FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 27 Nov 13 Posts: 4 Credit: 10,253,081 RAC: 0 Level ![]() Scientific publications ![]()
|
I have had 3 tasks with this result - "The simulation has become unstable". http://www.gpugrid.net/result.php?resultid=7790657 http://www.gpugrid.net/result.php?resultid=7788022 http://www.gpugrid.net/result.php?resultid=7776520 I have no problems with my GPU running other BOINC projects, normal usage, or stress tests. The error message implies, to me at least, that the program has found a problem with the simulation and has decided to terminate it. I have no problems with that as it is just as valid a result as completing the simulation. What I do object to is not getting credit for the several hours of work done for each task when there is no apparent fault with my PC. So I've stopped GPUGRID for the moment at least. Bob |
|
Send message Joined: 13 Apr 13 Posts: 61 Credit: 726,605,417 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
BobMALCS, I have come to the conclusion the GPUGrid is that absolute toughest thing I can through at my GPU cards. I would guess that a decision has been made to make the software a little less tolerant that we are normally use to. In a game, a bad colored pixel, a dropped frame, seeing through the wrong layer for example are annoying, but do not crash. As we have seen from Win7, it has another layer added to the video drivers which makes it crash a whole lot less. The bad part is that Win7 just crunches slower than XP and Linux. Of course XP and Linux are stable too. At least with XP, when I pushed the edge too hard on the GPU cards, I would crash the whole thing. With that, I increase my voltages to run as fast as I can, but still keep the card at <=72C with the noise I can tolerate from the fans. Those squirrel cage fans are a little unbearable for me when they are at 85%, so I slow the overclock a bit to what I can bear (especially in summer). When I finally find the sweet balance, for not getting failed work units from GPU Grid...I also get no other GPU failures for any other software/games I use/play. Maybe not the best way for things, but it is a bit of a fun challenge. :) |
|
Send message Joined: 16 Mar 11 Posts: 509 Credit: 179,005,236 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
What I do object to is not getting credit for the several hours of work done for each task when there is no apparent fault with my PC. At some projects errored tasks are valuable because it tells the researchers what does not work. It's not that way at this project. Here errored tasks provide no such info therefore they have no value at all. Probably the only thing wrong with your PC is that it's not configured the way it needs to be configured to run GPUgrid tasks error free. If you configure it properly it will run GPUgrid tasks error free (unless there are defective or failing components in the computer). GPU tasks from other projects don't work the GPU as hard as tasks from this project do. To run tasks from this project you need to tune your configuration carefully and make sure your machine is cooling adequately. I know it's hard to imagine that a GPU that runs other projects OK might not be configured properly for this project but it's true. In the last 30 days I've seen several crunchers with the same complaint as you who have accepted the fact they need to tune carefully and now they're running error free. Those who can't accept it are either crunching elsewhere or returning more errors than they need to. BOINC <<--- credit whores, pedants, alien hunters |
|
Send message Joined: 27 Nov 13 Posts: 4 Credit: 10,253,081 RAC: 0 Level ![]() Scientific publications ![]()
|
Thanks for the comments. However my GT760 is overclocked and runs very nicely within its limits. So rather than spending hours trying to tweak it to run GPUGrid and potentially upset a very stable setup I'll just leave GPUGrid well alone. |
|
Send message Joined: 18 Oct 13 Posts: 53 Credit: 406,647,419 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
How to read here I'm not alone with this problem: http://www.gpugrid.net/forum_thread.php?id=3621#35242 So I've stopped GPUGRID for the moment at least. |
|
Send message Joined: 16 Mar 11 Posts: 509 Credit: 179,005,236 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
angemessen BOINC <<--- credit whores, pedants, alien hunters |
skgivenSend message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thanks for the comments. However my GT760 is overclocked and runs very nicely within its limits. So rather than spending hours trying to tweak it to run GPUGrid and potentially upset a very stable setup I'll just leave GPUGrid well alone. Clearly your GT760 does not run very nicely! It takes minutes to adjust GPU frequencies and temperatures. FAQ's HOW TO: - Opt out of Beta Tests - Ask for Help |
|
Send message Joined: 27 Nov 13 Posts: 4 Credit: 10,253,081 RAC: 0 Level ![]() Scientific publications ![]()
|
skgiven. First my setup. Intel Core i5 Quad 2500K (Sandy Bridge) OC to 4.33GHz. GTX760 OC to 1097Mhz and 1.2V I use a small program TThtottle to prevent BOINC tasks driving the CPU temp above 75C and the GPU temp above 70C. Quite often the CPU reaches 75C but it is rare for the GPU to reach 70C. Of all the programs and BOINC projects I run only GPUGRID fails in any way so as far as I am concerned my system runs nicely. From what I have read it seems that the only option I have to get GPUGRID to run is to downclock the GPU. If I did that then it will impact the other BOINC projects adversely. So I will not downclock the GPU. It is, as you say, simple to change the various GPU parameters. However, it takes experience to do it properly. I suspect most people running GPUGRID are not experts nor wish to become one. Also, what you left unspoken, is that it takes many hours of checking, over several GPUGRID tasks, to ensure that the changes have the desired effect and also that other projects and programs still run as well as before. One final point. It is quite possible to write programs, either deliberately or by accident, that drive a component beyond its safe limits or seriously affect other programs. It looks to me that GPUGRID, in a perfectly reasonable desire for efficiency, is reaching or has reached that point. It is now up to the programmer to seriously consider what and how they are coding. In summary, getting GPUGRID tasks to run without error is not quite as simple as you make out. I have no doubt others will, perhaps vehemently, disagree with me. But this is my opinion just as you have yours. |
©2026 Universitat Pompeu Fabra