SANTI Errors

Message boards : Number crunching : SANTI Errors
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Profile dskagcommunity
Avatar

Send message
Joined: 28 Apr 11
Posts: 463
Credit: 958,266,958
RAC: 31
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34478 - Posted: 25 Dec 2013, 18:06:37 UTC

Wow they have really harder requirements on that SANTI Batch it seems. I run one successfully with 50mV overvoltage on the 560ti 384core. Until now, all cards run with +25mV and computed successfully with this setting, Santis too. But only this card needs more. But +50mV needs serious cooling.. im nearly at full fanspeed and over 80degress with open case and no extra heating in the flat.
DSKAG Austria Research Team: http://www.research.dskag.at



ID: 34478 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34481 - Posted: 25 Dec 2013, 20:02:54 UTC - in response to Message 34478.  

Over 80C is too hot for me. I like mine 70C max.

BOINC <<--- credit whores, pedants, alien hunters
ID: 34481 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Beyond
Avatar

Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34516 - Posted: 30 Dec 2013, 16:16:36 UTC - in response to Message 34478.  

Wow they have really harder requirements on that SANTI Batch it seems.

That's what I'm thinking too. Had another one that caused the machine to reboot continuously until I caught it. This time a SANTI_MARwtcap. The only way to stop the cycle is to abort the WU. Lowered the clocks yet again and the next one is running fine. In fact that machine is currently showing 20 valid and the 1 error WU that caused constant bluescreens. NVIDIA GeForce GTX 650 Ti (1024MB) driver: 331.82.
ID: 34516 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile dskagcommunity
Avatar

Send message
Joined: 28 Apr 11
Posts: 463
Credit: 958,266,958
RAC: 31
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34518 - Posted: 30 Dec 2013, 20:14:05 UTC

Locked again on a workunit, i stopped again with this card on gpugrid and changed it back to einstein :/ will try again in one or two weeks.
DSKAG Austria Research Team: http://www.research.dskag.at



ID: 34518 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34566 - Posted: 3 Jan 2014, 0:44:20 UTC

After 17 days a Santi resulted in one 660 to down clock. And only 2 errors in 19 days. But today my rig with to 660's was booted when I found it. After logging in it booted immediately when BOINC started a few times, so I went to Windows in safe mode where I have some more time to abort the task. As I didn't know which one, I aborted both. Its now happily crunching again.
Greetings from TJ
ID: 34566 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
John C MacAlister

Send message
Joined: 17 Feb 13
Posts: 181
Credit: 144,871,276
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 34568 - Posted: 3 Jan 2014, 4:37:58 UTC
Last modified: 3 Jan 2014, 4:38:39 UTC

Too many errors for me......stopping here.
Task
click for details
Show names Work unit
click for details Sent Time reported
or deadline
explain Status Run time
(sec) CPU time
(sec) Credit Application
7614815 5043955 2 Jan 2014 | 20:22:26 UTC 3 Jan 2014 | 3:05:04 UTC Error while computing 23,648.57 20,821.78 --- Long runs (8-12 hours on fastest card) v8.14 (cuda42)
7614651 5043020 2 Jan 2014 | 21:42:39 UTC 3 Jan 2014 | 4:35:29 UTC Aborted by user 20,147.56 20,074.19 --- Long runs (8-12 hours on fastest card) v8.14 (cuda42)
7613690 5043565 2 Jan 2014 | 16:43:59 UTC 2 Jan 2014 | 23:04:48 UTC Completed and validated 22,184.66 18,799.57 20,550.00 Short runs (2-3 hours on fastest card) v8.15 (cuda42)
7613447 5043367 2 Jan 2014 | 16:43:59 UTC 2 Jan 2014 | 20:22:26 UTC Error while computing 12,559.07 12,065.65 --- Short runs (2-3 hours on fastest card) v8.15 (cuda42)
7612704 5042731 2 Jan 2014 | 4:29:25 UTC 2 Jan 2014 | 13:59:13 UTC Completed and validated 33,819.15 17,980.88 20,550.00 Short runs (2-3 hours on fastest card) v8.15 (cuda42)
7612537 5040031 2 Jan 2014 | 4:20:27 UTC 2 Jan 2014 | 4:29:25 UTC Error while computing 274.96 127.56 --- Short runs (2-3 hours on fastest card) v8.15 (cuda42)
ID: 34568 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34571 - Posted: 3 Jan 2014, 19:02:10 UTC

Just now I found my rig with two 660's frozen. No Idea when it happened, even ctrl-alt-del didn't work. After booting immediately message that the graphics drives has crashed and recovered, three times in a row and then it booted itself again. After three attempts I got BOINC to stop. I am now installing the latest beta driver but that should not be necessary and it ran for more then a month with the 331.82 driver.

I have not a lot of joy with the 660 since I bought two of them in summer.
I don't like these Santi's.
Greetings from TJ
ID: 34571 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34578 - Posted: 4 Jan 2014, 20:20:15 UTC
Last modified: 4 Jan 2014, 20:25:20 UTC

Again I found my rig with two 660's frozen.
After 5 boots I managed to get rid of all the Santi's and switch over to LR only.

It is or an AMD CPU has problems with Santi or GPU's lower tan 7XX have problems with Santi's or a combination of the two has problems with Santi's.
I guess none of the above as there a re many AMD rigs and many more rigs with 660 that does there terrible Santi's. But my RAC is rapidly falling this way.

Edit: another Santi crash, so will become 7 boots eventually. I start to hate these Santi's!

Edit 2: Yes I got two Nathan's on the 660's, so I can go sleep this night and not watch my system continuously.
Greetings from TJ
ID: 34578 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34579 - Posted: 4 Jan 2014, 20:51:17 UTC - in response to Message 34578.  

TJ, you should lower the GPU frequency of those 660s, or increase the GPU voltage by 12mV.
ID: 34579 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34580 - Posted: 4 Jan 2014, 22:34:01 UTC - in response to Message 34579.  

Boost the voltage and produce more heat or get crunching on Linux. Take a look at my results. I have errors but 99% of those are tasks I aborted because I played with stuff and ended up with too many tasks in my cache or other reasons. I have two 670 and one 660Ti on Linux and they almost never crash SANTI tasks. I'm running the stock clock speeds and if I keep the temps below 70C the clock boost thing kicks in regularly. They hardly ever crash on any task and if they do the OS doesn't hang, BOINC continues running, another GPUgrid task downloads and starts and life carries on.

BOINC <<--- credit whores, pedants, alien hunters
ID: 34580 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34581 - Posted: 5 Jan 2014, 1:27:26 UTC - in response to Message 34578.  

Again I found my rig with two 660's frozen.
After 5 boots I managed to get rid of all the Santi's and switch over to LR only.

It is or an AMD CPU has problems with Santi or GPU's lower tan 7XX have problems with Santi's or a combination of the two has problems with Santi's.
I guess none of the above as there a re many AMD rigs and many more rigs with 660 that does there terrible Santi's. But my RAC is rapidly falling this way.

I have had those problems too. The main reason seems to be that the 660s were bumping up against their power limit, causing them to be starved for current on the tough portions of the hardest work units. Increasing the power limit to 110% by using Nvidia Inspector has largely solved the problem for me on the two cards (a Zotac and a Gigabyte) that I now use for GPUGrid, without the need for any other changes:
http://www.gpugrid.net/results.php?hostid=159002&offset=0&show_names=1&state=0&appid=

But they are often overclocked too much at the factory for the work here, and on another of my Zotac 660s I also have had to reduce the clocks a little (GPU clock from 993 MHz to 950 MHz, and memory clock from 3004 to 2804 MHz) and also bump up the core voltage (from 1.162 to 1.175 volts). For some reason on the Zotacs the software control utilites (such as Nvidia Inspector or MSI Afterburner) do not work to change the voltage, and I had to modify the BIOS with Kepler BIOS Tweaker, and then flash it into the video card with nvflash. You can use GPU-Z to first make a copy of your present BIOS that you then modify (keep a copy of your old BIOS as a backup). If you don't want to deal with that, just reduce the clock frequency, first on the GPU clock and then on the memory clock if necessary, until it is stable.

ID: 34581 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34582 - Posted: 5 Jan 2014, 15:35:30 UTC

Hi Guys,

Good advise. I lowered the clocks per Zoltans advise and will see what happens.
If not stable I will increase the voltage little.
Problem is that the rig was stable for 17 days continuously, that is wondering me.

I let them run at factory settings, at first months ago. They are both form EVGA. Their maximal fan speed is 74%.
Greetings from TJ
ID: 34582 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
flashawk

Send message
Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 34583 - Posted: 5 Jan 2014, 19:21:26 UTC

You can start a snowball effect with system freezes and BSOD's if you don't run checkdisk after getting those errors, too many orphaned files or wrong time stamps and such just causes more and more problems. If you don't do that, the errors your getting now could be related to it.

Your cracking me up TJ, you think your BSOD's and freezes are because you have an AMD system? I think the drought were having in California is because of the Intel CPU in my laptop (gotta put on my tin foil hat).
ID: 34583 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34613 - Posted: 9 Jan 2014, 19:25:24 UTC

No BSOD but again a frozen system. This time by a Noelia on my 660's rig. I did manage to get the clocks down with Pricison X from EVGA, but after a while they boost automatically again. Trying to do it with MSI Afterburner, shows only one card, there is nowhere I click that I can see the settings of my second card.
Well from August these 660's are trouble some to me, so I will buy a second 780Ti as it, even in Windows7, does the same as two 660's is less time to replace the 660's.
Greetings from TJ
ID: 34613 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34614 - Posted: 9 Jan 2014, 19:48:39 UTC - in response to Message 34613.  
Last modified: 9 Jan 2014, 19:49:30 UTC

In afterburner, click Settings (bottom right corner of the left pane) and then you can change the GPU under the General Tab.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 34614 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Dagorath

Send message
Joined: 16 Mar 11
Posts: 509
Credit: 179,005,236
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34617 - Posted: 10 Jan 2014, 5:02:01 UTC - in response to Message 34433.  

History quiz: Does anyone remember when MS ballyhooed long and loudly that they had solved the "black screen of death"? Remember the solution?


They made the color a user configurable option then told everyone if they get a black screen of death it's their own damn fault.

BOINC <<--- credit whores, pedants, alien hunters
ID: 34617 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34618 - Posted: 10 Jan 2014, 15:42:28 UTC - in response to Message 34614.  

In afterburner, click Settings (bottom right corner of the left pane) and then you can change the GPU under the General Tab.

Thanks skgiven, found it and used it!
Greetings from TJ
ID: 34618 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
David Autumns

Send message
Joined: 10 Jul 10
Posts: 1
Credit: 327,425,754
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34644 - Posted: 13 Jan 2014, 20:37:03 UTC

Just had to move my 560ti onto the short runs

Even got the Dyson out expecting a GPU full of fluff but no it's the Work Unit's

Just 2 successful long runs since 24th Dec


There's a problem with the current batch. I'll just have to be patient.

Maybe this time next week


Dave
ID: 34644 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34645 - Posted: 14 Jan 2014, 2:18:28 UTC
Last modified: 14 Jan 2014, 2:26:36 UTC

Your clocks are too high. Try 1644Mhz for the processor & 2004Mhz for the memory.

'GPUgrid stresses the parts other projects can't reach'
ID: 34645 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Betting Slip

Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 34646 - Posted: 14 Jan 2014, 6:55:47 UTC

Probably would be useful if people having a problem with a self overclocked card would return it to stock clocks before reporting problems with WU.
ID: 34646 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Number crunching : SANTI Errors

©2025 Universitat Pompeu Fabra