Simulation has become unstable

Message boards : Number crunching : Simulation has become unstable
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35111 - Posted: 18 Feb 2014, 11:51:23 UTC
Last modified: 18 Feb 2014, 11:54:57 UTC

369x-SANTI_MAR422cap310-8-32-RND5608_0
80 (0x50) Unknown error number

http://www.gpugrid.net/result.php?resultid=7785880
ID: 35111 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35112 - Posted: 18 Feb 2014, 11:56:10 UTC

369x-SANTI_MAR422cap310-8-32-RND5608_1
-97 (0xffffffffffffff9f) Unknown error number

http://www.gpugrid.net/result.php?resultid=7788789
ID: 35112 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35122 - Posted: 18 Feb 2014, 17:26:10 UTC - in response to Message 35112.  

Your GPU is too hot. Use MSI Afterburner (or similar) to control the fans and keep the GPU cool (ideally below 70C). When the GPU is >80C failures are common. Your temps would not keep rising if you had a fan profile or specific fan setting in place.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 35122 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35125 - Posted: 18 Feb 2014, 18:27:32 UTC - in response to Message 35122.  

thx for your Information.
Installed now the Afterburner.
Hope it will help :-)
ID: 35125 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35126 - Posted: 18 Feb 2014, 19:27:06 UTC - in response to Message 35125.  

You should be able to see the GPU temperature drop as you increase the fan speed.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 35126 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35130 - Posted: 18 Feb 2014, 21:35:08 UTC - in response to Message 35126.  

I had no problems in the past, neither
with long term, Seti or Einstein
The fans are dust free.
Driver version 334.89
ID: 35130 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35134 - Posted: 18 Feb 2014, 22:27:54 UTC - in response to Message 35130.  

I think I'm developing a twitch.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 35134 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35138 - Posted: 19 Feb 2014, 0:01:28 UTC - in response to Message 35130.  

I had no problems in the past, neither
with long term, Seti or Einstein
The fans are dust free.
Driver version 334.89


That doesn't mean you won't have trouble here, if that's what you're saying.

BOINC <<--- credit whores, pedants, alien hunters
ID: 35138 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35143 - Posted: 19 Feb 2014, 9:38:49 UTC - in response to Message 35138.  

The Specifications
for the GTX 760:
Thermal and Power Specs(in C):
97 C Maximum GPU Temperature
http://www.geforce.com/hardware/desktop-gpus/geforce-gtx-760/specifications

The normal Temperature under heavy load 80 / 81 C


ID: 35143 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35152 - Posted: 19 Feb 2014, 16:45:30 UTC - in response to Message 35143.  

So what's yer point? You think the tech specs prove that the years of experience skgiven has is wrong? Is that it? Do the tech specs tell you how to crunch GPUgrid?

Your GPU is running too hot and don't bother saying you don't have the problem at other projects because those projects don't work the GPU as hard as this project does.


BOINC <<--- credit whores, pedants, alien hunters
ID: 35152 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35155 - Posted: 19 Feb 2014, 18:03:37 UTC

As English is not my native language, feel free to interpret whatever you want in my words.
Anyway, have a nice day.

Killersocke --
ID: 35155 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35167 - Posted: 20 Feb 2014, 0:43:42 UTC - in response to Message 35155.  

As English is not my native language, feel free to interpret whatever you want in my words.
Anyway, have a nice day.

Killersocke --

Did you set a fan curve or static rate? Did this improve things?
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 35167 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BobMALCS

Send message
Joined: 27 Nov 13
Posts: 4
Credit: 10,253,081
RAC: 0
Level
Pro
Scientific publications
watwat
Message 35192 - Posted: 21 Feb 2014, 0:16:15 UTC

I have had 3 tasks with this result - "The simulation has become unstable".

http://www.gpugrid.net/result.php?resultid=7790657
http://www.gpugrid.net/result.php?resultid=7788022
http://www.gpugrid.net/result.php?resultid=7776520

I have no problems with my GPU running other BOINC projects, normal usage, or stress tests.

The error message implies, to me at least, that the program has found a problem with the simulation and has decided to terminate it. I have no problems with that as it is just as valid a result as completing the simulation.

What I do object to is not getting credit for the several hours of work done for each task when there is no apparent fault with my PC.

So I've stopped GPUGRID for the moment at least.

Bob
ID: 35192 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jeremy Zimmerman

Send message
Joined: 13 Apr 13
Posts: 61
Credit: 726,605,417
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwat
Message 35193 - Posted: 21 Feb 2014, 0:48:20 UTC - in response to Message 35192.  

BobMALCS,

I have come to the conclusion the GPUGrid is that absolute toughest thing I can through at my GPU cards. I would guess that a decision has been made to make the software a little less tolerant that we are normally use to. In a game, a bad colored pixel, a dropped frame, seeing through the wrong layer for example are annoying, but do not crash. As we have seen from Win7, it has another layer added to the video drivers which makes it crash a whole lot less. The bad part is that Win7 just crunches slower than XP and Linux. Of course XP and Linux are stable too. At least with XP, when I pushed the edge too hard on the GPU cards, I would crash the whole thing.

With that, I increase my voltages to run as fast as I can, but still keep the card at <=72C with the noise I can tolerate from the fans. Those squirrel cage fans are a little unbearable for me when they are at 85%, so I slow the overclock a bit to what I can bear (especially in summer). When I finally find the sweet balance, for not getting failed work units from GPU Grid...I also get no other GPU failures for any other software/games I use/play.

Maybe not the best way for things, but it is a bit of a fun challenge. :)
ID: 35193 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35196 - Posted: 21 Feb 2014, 2:44:57 UTC - in response to Message 35192.  

What I do object to is not getting credit for the several hours of work done for each task when there is no apparent fault with my PC.


At some projects errored tasks are valuable because it tells the researchers what does not work. It's not that way at this project. Here errored tasks provide no such info therefore they have no value at all.

Probably the only thing wrong with your PC is that it's not configured the way it needs to be configured to run GPUgrid tasks error free. If you configure it properly it will run GPUgrid tasks error free (unless there are defective or failing components in the computer). GPU tasks from other projects don't work the GPU as hard as tasks from this project do. To run tasks from this project you need to tune your configuration carefully and make sure your machine is cooling adequately.

I know it's hard to imagine that a GPU that runs other projects OK might not be configured properly for this project but it's true. In the last 30 days I've seen several crunchers with the same complaint as you who have accepted the fact they need to tune carefully and now they're running error free. Those who can't accept it are either crunching elsewhere or returning more errors than they need to.

BOINC <<--- credit whores, pedants, alien hunters
ID: 35196 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BobMALCS

Send message
Joined: 27 Nov 13
Posts: 4
Credit: 10,253,081
RAC: 0
Level
Pro
Scientific publications
watwat
Message 35203 - Posted: 21 Feb 2014, 13:16:30 UTC - in response to Message 35196.  

Thanks for the comments. However my GT760 is overclocked and runs very nicely within its limits. So rather than spending hours trying to tweak it to run GPUGrid and potentially upset a very stable setup I'll just leave GPUGrid well alone.
ID: 35203 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Killersocke

Send message
Joined: 18 Oct 13
Posts: 53
Credit: 406,647,419
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35243 - Posted: 23 Feb 2014, 2:38:45 UTC

How to read here I'm not alone with this problem:

http://www.gpugrid.net/forum_thread.php?id=3621#35242

So I've stopped GPUGRID for the moment at least.
ID: 35243 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35245 - Posted: 23 Feb 2014, 3:38:26 UTC - in response to Message 35243.  
Last modified: 23 Feb 2014, 3:58:44 UTC

angemessen
BOINC <<--- credit whores, pedants, alien hunters
ID: 35245 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 35260 - Posted: 23 Feb 2014, 11:16:08 UTC - in response to Message 35203.  

Thanks for the comments. However my GT760 is overclocked and runs very nicely within its limits. So rather than spending hours trying to tweak it to run GPUGrid and potentially upset a very stable setup I'll just leave GPUGrid well alone.

Clearly your GT760 does not run very nicely! It takes minutes to adjust GPU frequencies and temperatures.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 35260 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
BobMALCS

Send message
Joined: 27 Nov 13
Posts: 4
Credit: 10,253,081
RAC: 0
Level
Pro
Scientific publications
watwat
Message 35271 - Posted: 23 Feb 2014, 13:52:58 UTC - in response to Message 35260.  

skgiven.

First my setup.
Intel Core i5 Quad 2500K (Sandy Bridge) OC to 4.33GHz.
GTX760 OC to 1097Mhz and 1.2V

I use a small program TThtottle to prevent BOINC tasks driving the CPU temp above 75C and the GPU temp above 70C. Quite often the CPU reaches 75C but it is rare for the GPU to reach 70C.

Of all the programs and BOINC projects I run only GPUGRID fails in any way so as far as I am concerned my system runs nicely.

From what I have read it seems that the only option I have to get GPUGRID to run is to downclock the GPU. If I did that then it will impact the other BOINC projects adversely. So I will not downclock the GPU.

It is, as you say, simple to change the various GPU parameters. However, it takes experience to do it properly. I suspect most people running GPUGRID are not experts nor wish to become one. Also, what you left unspoken, is that it takes many hours of checking, over several GPUGRID tasks, to ensure that the changes have the desired effect and also that other projects and programs still run as well as before.

One final point. It is quite possible to write programs, either deliberately or by accident, that drive a component beyond its safe limits or seriously affect other programs. It looks to me that GPUGRID, in a perfectly reasonable desire for efficiency, is reaching or has reached that point. It is now up to the programmer to seriously consider what and how they are coding.

In summary, getting GPUGRID tasks to run without error is not quite as simple as you make out.

I have no doubt others will, perhaps vehemently, disagree with me. But this is my opinion just as you have yours.
ID: 35271 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Simulation has become unstable

©2026 Universitat Pompeu Fabra