Long run Santi worse than Noelia

Message boards : Number crunching : Long run Santi worse than Noelia

TJ

Message 31723 - Posted: 23 Jul 2013, 17:19:19 UTC

I have had 4 LRs from Santi error in a row, after 32000, 34000 and 40600 seconds.
But there are more errors from Santi LRs: most wingmen have errors too, and some already have more than 4. If I compare this with my error rate on Noelia WUs, the Noelia WUs are not bad at all. I would like Noelia WUs now, since after two days of continuous crunching during a heatwave no LR has finished normally.
This is no fun at all.
Greetings from TJ

Carlesa25

Message 31724 - Posted: 23 Jul 2013, 18:12:56 UTC - in response to Message 31723.  

Hello: I see your GTX770 performance is very low; it took more than twice as long as mine.

The difference between Windows and Linux does not account for that; it is at most about ±5%. Looking at your PC, I also see that the CPU performance is low, which may be holding back the fast GPU.

werdwerdus

Message 31725 - Posted: 23 Jul 2013, 21:54:17 UTC

You are correct. Looking at my BoincTasks history log for all my machines combined, quite a few (at least 27) SANTI units have errored in the past few days, except that all of my errored units are the short SANTI runs. It is not just a single machine: it happened on all machines, on Windows XP as well as Windows 7.
XtremeSystems.org - #1 Team in GPUGrid

5pot

Message 31726 - Posted: 24 Jul 2013, 0:34:37 UTC

So it's the short ones that are the problem? Because I'm not experiencing any issues with the long runs.

dskagcommunity

Message 31727 - Posted: 24 Jul 2013, 6:12:01 UTC

I don't have issues either, on both short and long Santis.
DSKAG Austria Research Team: http://www.research.dskag.at

TJ

Message 31728 - Posted: 24 Jul 2013, 7:32:32 UTC

My last LR that finished is from 21 July. I see that all LR from Santi and Nathan only use around 25% of memory, while that was 48% a few weeks ago.
Greetings from TJ

nanoprobe

Message 31733 - Posted: 24 Jul 2013, 15:43:50 UTC - in response to Message 31723.  

> I have had 4 LRs from Santi error in a row, after 32000, 34000 and 40600 seconds.
> But there are more errors from Santi LRs: most wingmen have errors too, and some already have more than 4. If I compare this with my error rate on Noelia WUs, the Noelia WUs are not bad at all. I would like Noelia WUs now, since after two days of continuous crunching during a heatwave no LR has finished normally.
> This is no fun at all.

Same problem with SANTI_RAP74.
http://www.gpugrid.net/forum_thread.php?id=3426

TJ

Message 31738 - Posted: 25 Jul 2013, 9:55:55 UTC

Looking closely at the systems, I see that after a Santi SR finishes, with or without error, the GPU is downclocked and I need to reboot to get the clock running at around 1024 MHz again. That's no problem when I am at the PCs, but from home I cannot reboot my rigs at work.
This explains why some of my SRs run so long.

As I have more errors than valid results, and the heatwave is now official in the Netherlands and continuing, I will power some rigs down for the time being.
Greetings from TJ

Beyond

Message 31739 - Posted: 25 Jul 2013, 15:02:02 UTC

To provide some counterpoint, my machines have completed 18 (and 3 more nearing completion) SANTI long WUs with no failures, no problems at all. This includes GTX 670, GTX 560, GTX 650 Ti and even GTX 460/768MB GPUs. The ones that are showstoppers here are the NOELIA RUN WUs, which won't RUN properly on anything but the GTX 670.

Retvari Zoltan

Message 31740 - Posted: 25 Jul 2013, 16:47:26 UTC - in response to Message 31739.  

> To provide some counterpoint, my machines have completed 18 (and 3 more nearing completion) SANTI long WUs with no failures, no problems at all.

My hosts did the same.

> This includes GTX 670, GTX 560, GTX 650 Ti and even GTX 460/768MB GPUs.

My hosts have GTX670s and GTX680s at the moment.

> The ones that are showstoppers here are the NOELIA RUN WUs, which won't RUN properly on anything but the GTX 670.

I've crunched a couple of those NOELIAs properly on my GTX480.
I'm quite sure that the GTX 660 Ti is capable of crunching these workunits, but it needs an older driver (v307.90~v314.22) to do that.

Beyond

Message 31741 - Posted: 25 Jul 2013, 18:15:39 UTC - in response to Message 31740.  
Last modified: 25 Jul 2013, 18:17:01 UTC

>> The ones that are showstoppers here are the NOELIA RUN WUs, which won't RUN properly on anything but the GTX 670.
>
> I've crunched a couple of those NOELIAs properly on my GTX480.
> I'm quite sure that the GTX 660 Ti is capable of crunching these workunits, but it needs an older driver (v307.90~v314.22) to do that.

They also need > 1GB of GPU RAM to run at reasonable efficiency. Luckily the SANTI WUs run fine in 768MB.

nanoprobe

Message 31745 - Posted: 26 Jul 2013, 1:43:42 UTC
Last modified: 26 Jul 2013, 1:44:12 UTC

> I've crunched a couple of those NOELIAs properly on my GTX480.
> I'm quite sure that the GTX 660 Ti is capable of crunching these workunits, but it needs an older driver (v307.90~v314.22) to do that.

Strange how different hardware/OS systems handle things. My 660 Ti crunches NOELIA and NATHAN tasks without any problems with the 320.18 drivers and XP, but it chokes on the SANTI_RAP74 tasks. Maybe I should drop down to the 314.22 driver and see if the SANTI_RAP74 tasks play nicer.

Beyond

Message 31750 - Posted: 26 Jul 2013, 17:11:39 UTC - in response to Message 31745.  

> Strange how different hardware/OS systems handle things. My 660 Ti crunches NOELIA and NATHAN tasks without any problems with the 320.18 drivers and XP, but it chokes on the SANTI_RAP74 tasks. Maybe I should drop down to the 314.22 driver and see if the SANTI_RAP74 tasks play nicer.

My GTX 670, GTX 560, GTX 650 Ti and GTX 460/768MB GPUs are now at 22 SANTI_RAP74 WUs completed with no errors. I'm using 310.90 drivers in Win7-64. I tried a newer driver a while back on 1 box and it started erroring some NOELIA WUs. I switched back to 310.90 and the errors went away. The only WUs that are a problem are the NOELIA_RUN, but that's because they need > 1GB memory or they slow to a crawl.

TJ

Message 31794 - Posted: 30 Jul 2013, 5:25:07 UTC

I again have 3 Santi errors in a row on my GTX660 with 314.22 drivers, each after 60-70% of the WU had finished. I have set this host to SRs in the hope of finishing WUs without error. All these errors are not helping science.
Greetings from TJ

Beyond

Message 31797 - Posted: 30 Jul 2013, 14:37:45 UTC - in response to Message 31794.  

> I again have 3 Santi errors in a row on my GTX660 with 314.22 drivers, each after 60-70% of the WU had finished. I have set this host to SRs in the hope of finishing WUs without error. All these errors are not helping science.

Have you tried the 310.90 drivers? My GTX 670, GTX 560, GTX 650 Ti and GTX 460/768MB GPUs are now at 44 SANTI_RAP74 WUs completed with no errors. I'm using 310.90 drivers (like your machine, running Win7-64).

Looking at your GTX 660:

http://www.gpugrid.net/results.php?hostid=153284&offset=0&show_names=1&state=0&appid=

It looks like a lot of NATHANs and NOELIAs are erroring too. If it's not OCed, try slowing it down below stock specs or move it to a known good machine. If it still errors, RMA it.

TJ

Message 31798 - Posted: 30 Jul 2013, 15:08:55 UTC - in response to Message 31797.  

>> I again have 3 Santi errors in a row on my GTX660 with 314.22 drivers, each after 60-70% of the WU had finished. I have set this host to SRs in the hope of finishing WUs without error. All these errors are not helping science.
>
> Have you tried the 310.90 drivers? My GTX 670, GTX 560, GTX 650 Ti and GTX 460/768MB GPUs are now at 44 SANTI_RAP74 WUs completed with no errors. I'm using 310.90 drivers (like your machine, running Win7-64).
>
> Looking at your GTX 660:
>
> http://www.gpugrid.net/results.php?hostid=153284&offset=0&show_names=1&state=0&appid=
>
> It looks like a lot of NATHANs and NOELIAs are erroring too. If it's not OCed, try slowing it down below stock specs or move it to a known good machine. If it still errors, RMA it.

No, I haven't tried the 310.90 driver; I will do that.
There are not a lot of Noelias or Nathans with errors; most are Santis, and they error after a long run time.
The rig it is in now is the best option: it's in the coldest room and did rather well a few weeks ago. It could be the heat, but the card is at 64°C. It's not OCed, and I have it at the clock speed given in the specs on the box.
Greetings from TJ

dskagcommunity

Message 31799 - Posted: 30 Jul 2013, 15:56:13 UTC
Last modified: 30 Jul 2013, 15:59:39 UTC

The specs on the box are not always good for all tasks from GPUGrid. None of my cards would run error-free at box specs ^^ Since Fermi and Kepler, changes are sometimes needed. The 2xx and older cards all ran at original specs a long time ago ^^
DSKAG Austria Research Team: http://www.research.dskag.at

nanoprobe

Message 31813 - Posted: 4 Aug 2013, 13:02:56 UTC

Looks like switching to the 310.90 drivers may have solved my error issue with the SANTI_RAP74 WUs. Just completed 1 with no errors. Have 1 now running and 1 more in cache. Fingers crossed.

http://www.gpugrid.net/result.php?resultid=7104934

TJ

Message 31821 - Posted: 4 Aug 2013, 23:24:01 UTC - in response to Message 31813.  

> Looks like switching to the 310.90 drivers may have solved my error issue with the SANTI_RAP74 WUs. Just completed 1 with no errors. Have 1 now running and 1 more in cache. Fingers crossed.
>
> http://www.gpugrid.net/result.php?resultid=7104934

I went back to the 310.90 drivers too for the 660, but I still get errors on the SRs. One LR Santi did finish, as did Nathan and Noelia WUs.

My AMD PC with a 770 and driver 320.49 has had no errors yet and has done all types of tasks.
It looks to me as if the 660 and Santi WUs need a special setup. The PC is now running a Santi_bax1 and I see the kernel times slightly rising. When a Noelia ran this morning, the kernel times were very flat, almost not visible.
Greetings from TJ

nanoprobe

Message 31822 - Posted: 4 Aug 2013, 23:52:25 UTC - in response to Message 31821.  
Last modified: 4 Aug 2013, 23:53:25 UTC

>> Looks like switching to the 310.90 drivers may have solved my error issue with the WUs. Just completed 1 with no errors. Have 1 now running and 1 more in cache. Fingers crossed.
>>
>> http://www.gpugrid.net/result.php?resultid=7104934
>
> I went back to the 310.90 drivers too for the 660, but I still get errors on the SRs. One LR Santi did finish, as did Nathan and Noelia WUs.
>
> My AMD PC with a 770 and driver 320.49 has had no errors yet and has done all types of tasks.
> It looks to me as if the 660 and Santi WUs need a special setup. The PC is now running a Santi_bax1 and I see the kernel times slightly rising. When a Noelia ran this morning, the kernel times were very flat, almost not visible.

My card is a 660 Ti and it finished a second SANTI_RAP74 task without error. One thing I've noticed is that with the 320 driver my GPU load was at 97%. With the 310 driver it's at 95%. Maybe that extra 2% load had something to do with the errors on the 320 driver. Just guessing.

©2025 Universitat Pompeu Fabra