gtx295 returning nearly constant errors

Message boards : Graphics cards (GPUs) : gtx295 returning nearly constant errors
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10126 - Posted: 24 May 2009, 10:33:29 UTC - in response to Message 10120.  

You're totally right, seeing the 2nd GPU is not your problem.

And CUDA memtest.. well, would have been nice if it detected something. But memory doesn't fail that often over time - either it's broken from the beginning or it's the chip. Since your card worked fine and then started throwing errors, I' rather suppose the 2nd chip got damaged / decayed excessively.

72°C sounds fine.. but still, I'd test with the fan speed set to 100% (and leave the room). If that helps you have a clear case for RMA. If not I'd run the regular 3D stability tests I suggested above. If they fail you also have a clear case for RMA.

MrS
Scanning for our furry friends since Jan 2002
ID: 10126 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10137 - Posted: 24 May 2009, 19:46:47 UTC - in response to Message 10126.  

As a by note to fixing this maybe it means something who knows but under windows 7 my gtx 295 shows only 1 gpu to boinc no matter how i configure it. 2 monitors plugged in desktops extended and not. Any driver or version all only let boinc see the single gpu.

Just had a fun afternoon anyway playing quickly with W7 :)



ID: 10137 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10146 - Posted: 24 May 2009, 23:11:48 UTC - in response to Message 10137.  

As a by note to fixing this maybe it means something who knows but under windows 7 my gtx 295 shows only 1 gpu to boinc no matter how i configure it. 2 monitors plugged in desktops extended and not. Any driver or version all only let boinc see the single gpu.

Just had a fun afternoon anyway playing quickly with W7 :)

SLI is off?

I don't know how W7 handles SLI but there is an internal SLI mode on the 295 cards IIRC, XP Does not do that ... or at least none of my installs have done that (yet) ... then again, I have not had the courage to try Vista or W7 ...

THough I seem to increasingly hit the upper bound in memory which in XP is 2.something G ...
ID: 10146 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
popandbob

Send message
Joined: 18 Jul 07
Posts: 67
Credit: 43,351,724
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10150 - Posted: 25 May 2009, 6:05:57 UTC - in response to Message 10120.  

My system doesn't need to be tricked into "seeing 2 gpu's" there are 2 or it couldn't be failing or starting on the second one.


I meant working on 2 GPU's.
What I've seen elsewhere is that the second GPU being the only one to throw errors is the Drivers/CUDA are not happy with the config. This is why I've been suggesting what I have. I'm sorry if I sounded like a broken record.
ID: 10150 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10151 - Posted: 25 May 2009, 6:48:04 UTC - in response to Message 10150.  

Well, I have 3 systems all with 2 or more GPUs running and my error rate is vanishingly small. I mean in almost all cases of error it has turned out to be a bad task or some event where I have caused an issue.

To put it another way, I don't see a problem here ...
ID: 10151 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10152 - Posted: 25 May 2009, 8:31:16 UTC - in response to Message 10151.  

how can you not see a problem with the 2nd gpu always erroring w/u Paul. Just because your system is not erroring constantly then all systems are ok? i dont understand that statement.I had the same errors as you to start, the occasional w/u here and there. Then bang, all w/u that 2nd gpu does fail everytime.

Look through my tasks and try to find the last completed w/u on gpu 1.


Popandbob. I salute you for your suggestions i dont mean to be grumpy honest. I can only see the one way of changing sli config. There is no bios option for it and nvidia control panel just has the 2 important ones, SLI and Physx.

I'm nearly back to where i started again with no light at end of tunnel yet but am about to try fresh instal of 64 vista and 182.50 drivers first. Just doing the last updates before i instal boinc again.



ID: 10152 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10159 - Posted: 25 May 2009, 13:29:30 UTC - in response to Message 10152.  
Last modified: 25 May 2009, 13:40:06 UTC

OK i'm 2hours 46 minutes into running 2 tasks on it.
723126
723205
Fingers crossed and they both finish so i can sleep easy.

May sound silly but its always failed by now...........

The difference.........

Turned Aero interface off........

I'm just running on the vista basic theme so all the fancy graphic card utilising bumf is off

Could it be that simple for my problem!
ID: 10159 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10164 - Posted: 25 May 2009, 15:58:59 UTC - in response to Message 10152.  

how can you not see a problem with the 2nd gpu always erroring w/u Paul. Just because your system is not erroring constantly then all systems are ok? i dont understand that statement.I had the same errors as you to start, the occasional w/u here and there. Then bang, all w/u that 2nd gpu does fail everytime.

Here, as in my house ... I guess I was not clear ... obviously...

I did not say that there was not a problem THERE ...

The way I interpreted the earlier posts was that all systems with 2 or more GPUs are seeing errors on the second GPU. I was saying that I don't see that problem, nothing more or less ...

Sudden changes like what you report sound to me more like a failure of the card than anything else. Do you have other cards? I mean, can you swap things around?
ID: 10164 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Liuqyn

Send message
Joined: 16 May 08
Posts: 5
Credit: 68,721,860
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10185 - Posted: 26 May 2009, 11:43:49 UTC - in response to Message 10146.  

As a by note to fixing this maybe it means something who knows but under windows 7 my gtx 295 shows only 1 gpu to boinc no matter how i configure it. 2 monitors plugged in desktops extended and not. Any driver or version all only let boinc see the single gpu.

Just had a fun afternoon anyway playing quickly with W7 :)

SLI is off?

I don't know how W7 handles SLI but there is an internal SLI mode on the 295 cards IIRC, XP Does not do that ... or at least none of my installs have done that (yet) ... then again, I have not had the courage to try Vista or W7 ...

THough I seem to increasingly hit the upper bound in memory which in XP is 2.something G ...


I tried W7 with 2 GTX285's and could only get boinc to see one card regardless of what settings I made.
Has anyone gotten W7 to use more then one GPU?
ID: 10185 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Edboard
Avatar

Send message
Joined: 24 Sep 08
Posts: 72
Credit: 12,410,275
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10192 - Posted: 26 May 2009, 14:41:15 UTC - in response to Message 10081.  
Last modified: 26 May 2009, 14:47:00 UTC

I'm sure its not the card.
I see the new host (37017)
currently 4 errors all on the second GPU of the 295.
Nothing yet from the first GPU...
Sounding more like a config issue as there would be errors from the first GPU as well if something was wrong...

Bob


I have a PC with a gtx295 and a 8800GT. The GPUs assignation is:

GPU#0: gtx295
GPU#1: 8800GT
GPU#2: gtx295

I get many erros with the WUs that are processed in GPU#1. I thought it was dued to mixed GPU chips (g92 and gt200). I changed the 8800GT with a gtx280 with this new GPU assignation:

GPU#0: gtx280
GPU#1: gtx295
GPU#2: gtx295

The same, many WUs being processed on GPU#1 give error. Note that GPU#1 is a different graphic card in each case.

I thought it could be the mobo. I installed the gtx295 in another PC (with only one PCIe for graphics). The GPUs assignation was:

GPU#0: gtx295
GPU#1: gtx295

The same: many WUs being processed in GPU#1 give error...

Windows Vista Home Premium 32 bits and Nvidia Driver 182.50 in boht PCs.

Now I have the PC with gtx295 + 8800GT configured so It only process WUs in the gtx295 (GPU#0 and GPU#2) (the GPU#1 -> 8800GT not processing) and I have NO problem (no errors) but I have one Graphic Card wasted...
ID: 10192 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10202 - Posted: 26 May 2009, 19:42:45 UTC - in response to Message 10192.  

Its in a parcel at the post office atm :) will update as it pans out
ID: 10202 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 10209 - Posted: 26 May 2009, 21:18:43 UTC - in response to Message 10192.  

Edboard,

oh.. what a ****** mess! What does that mean for other people? Take a close look at which GPU errors out. Otherwise.. it can't be a general problem, as some people are running fine. That makes me point to the combination of OS and driver. Would you mind checking one of the 185 series drivers?

MrS
Scanning for our furry friends since Jan 2002
ID: 10209 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10228 - Posted: 27 May 2009, 8:26:07 UTC - in response to Message 10209.  

My problem started very suddenly and nothing i did changed anything in the slightest, drivers, O/S, whole different pc which has none of the problems this card gave me and i still get the same result.

Before this started the GTX295 was running both w/u with just the occasional fail on either core so either something is wrong enough with the gpu to fail w/u which it does in 2 different systems or who knows. the other system i tried it in is running 2 w/u on its own without a problem until i put this card in then it also fails on GPU 1.
ID: 10228 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Edboard
Avatar

Send message
Joined: 24 Sep 08
Posts: 72
Credit: 12,410,275
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 10230 - Posted: 27 May 2009, 13:14:56 UTC - in response to Message 10209.  
Last modified: 27 May 2009, 13:26:55 UTC

I want to state with facts that many WUs processed in any GPU assigned #1 fails, mainly with the newest units.

I begun to have this kind of problems some time ago with Folding. But NO with GpuGrid/SETI. Now I have this problem with GpuGrid too. SETI works fine.
ID: 10230 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Joe

Send message
Joined: 1 Sep 08
Posts: 37
Credit: 5,864,088
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 10365 - Posted: 1 Jun 2009, 19:35:49 UTC - in response to Message 10230.  

It's Gainward card??? I have ONE GTX 295 from Gainward - one GPU works fine GPUGrid and the other crashed after a few seconds. But its ok for short WUs like SaH. And this not a problem from XP, Vista, Linux or the driver/Boinc version or an unused monitorport, it's the card... Perhaps you can chance it to another...
ID: 10365 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile nutcase
Avatar

Send message
Joined: 16 Oct 08
Posts: 7
Credit: 5,348,057
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 10369 - Posted: 1 Jun 2009, 21:34:19 UTC

ok, you are getting the same errors as I was.

I tried mine in 2 computers (with different OS's) and errored on both.

I suspected I had a Bad card but I can run benchmarks with no errors.

ID: 10369 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10377 - Posted: 2 Jun 2009, 14:30:44 UTC - in response to Message 10369.  

I'm just waiting for the argument with customer support when they say they can find nothing wrong. I have a feeling they wont like my answer that its not fit for the purpose i bought it for, namely gpugrid.

Cant be long now till they get back in touch with me, they have had it since Thursday afternoon :(
ID: 10377 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pharrg

Send message
Joined: 12 Jan 09
Posts: 36
Credit: 1,075,543
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 10380 - Posted: 2 Jun 2009, 15:52:12 UTC
Last modified: 2 Jun 2009, 15:53:13 UTC

Check what kind of slots you have your cards plugged into on the motherboard. Not all PCI-E slots are the same. Some run at x16, some at X8, some at X4, etc. I searched hard when shopping for my motherboard to find one that had 3 full speed x16 slots and that was also capable for triple SLI (though you need SLI disabled when running CUDA like GPUgrid). Then, I made sure to use the x16 slots for my cards. Oh, and the x16 slots have to be spaced correctly too since the newer GTX200 series cards are double thick, or they won't fit. Then, of course, the case has to have room for that setup. Check for motherboard BIOS and chipset driver updates as well. Anyway, just a thought.
ID: 10380 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Gamekiller
Avatar

Send message
Joined: 17 Jan 09
Posts: 11
Credit: 191,851,704
RAC: 0
Level
Ile
Scientific publications
watwatwatwatwatwat
Message 10385 - Posted: 2 Jun 2009, 18:10:18 UTC - in response to Message 10380.  

I'm getting a ton of errors with my gtx 295 as well. =[
I just started gpugrid, and this doesn't make me very happy.
I gave up on using the 2nd gpu and just tried using the first.... and the 1st gpu errors out everything... some within second and some after 20 minutes.
My card is not overclocked, and the temps are in a very good range.

Win7, 185.55, 6.6.31

Annoying to say the least.
ID: 10385 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile madas91

Send message
Joined: 22 Apr 09
Posts: 21
Credit: 8,119,831
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 10466 - Posted: 9 Jun 2009, 19:11:21 UTC - in response to Message 10385.  

The product GX-191-AS on RMA RMA****** has been replaced, this will be sent on a new order with order number *******.

This will be despatched as soon as possible, subject to stock availability.

Well thats that sorted....they tested the card and the 2nd gpu was indeed faulty. It was ok for a bit then the fault showed under load.

Pharg my slots are both PCIe 2.0 16x so that wasn't the problem.

My system ran ok for 2 weeks till the fault showed itself. There where no system/driver changes to blame between working and not working.

Thanks to you all for keeping me sane while i waited for their decision.

:) cant wait to take my emergency EAX800 XT out and slide a 295 back in. It will be a lot quieter believe it or not :)
ID: 10466 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Graphics cards (GPUs) : gtx295 returning nearly constant errors

©2025 Universitat Pompeu Fabra