Continual computing errors

Message boards : Graphics cards (GPUs) : Continual computing errors
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Administrator

Send message
Joined: 25 Jan 09
Posts: 1
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 5978 - Posted: 25 Jan 2009, 9:45:08 UTC

Hi there.
I have overclocked the shaders on my 9500GT card but I keep getting computing errors. I have now reset my card back to default values but the errors are still occuring. To date I think its up to around 20. Anyone have any ideas how I might fix this. I have tried a reset of the project and a detach but nothing works yet.
Thanks.

Eric
ID: 5978 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5980 - Posted: 25 Jan 2009, 10:40:32 UTC - in response to Message 5978.  

Hi there.
I have overclocked the shaders on my 9500GT card but I keep getting computing errors. I have now reset my card back to default values but the errors are still occuring. To date I think its up to around 20. Anyone have any ideas how I might fix this. I have tried a reset of the project and a detach but nothing works yet.
Thanks.

Eric


Check the fan and the air path around the card to make sure that it can be cooled. Use one of the monitroing tools to get the card temps ...

ONe other thing to try is to run a few SaH tasks to see if they complete (though you will have to wait for them to validate and be paired up with a wingman) ...

These are the first things that come to my mind ... weak as it is ...
ID: 5980 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Eric

Send message
Joined: 17 Nov 08
Posts: 13
Credit: 15,272,287
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5994 - Posted: 25 Jan 2009, 15:14:39 UTC - in response to Message 5980.  

Thanks for the info. I found the fans and air vents were ok. I then did a complete re install of my operating system as it was unstable. Re did all the drivers and re did Bionc. All is fine now.
Once again thanks for repling.
Eric
ID: 5994 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 6010 - Posted: 25 Jan 2009, 22:11:29 UTC - in response to Message 5994.  

Thanks for the info. I found the fans and air vents were ok. I then did a complete re install of my operating system as it was unstable. Re did all the drivers and re did Bionc. All is fine now.
Once again thanks for repling.
Eric


Eric,

It is what we are here for ... I help you ... others are trying to help me with my issues ... all to the betterment of the universe ... :)

One of the lessons I learned when I was writing documentation for BOINC is none of us knows it all ... or can do it all ... there is always more to learn ...

Just because I can't get Linux beat into submission at the moment does not make me an idiot, nor does your difficulties ... I am just glad that you found the problem. *MY* experience with windows is that I would have to do a clean install about every 6 months to keep the systems running stably. When I only run BOINC and don't use the system for much of anything else it seems to last longer ... YMMV ...
ID: 6010 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile (_KoDAk_)
Avatar

Send message
Joined: 18 Oct 08
Posts: 43
Credit: 6,924,807
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 6303 - Posted: 1 Feb 2009, 22:03:30 UTC

http://www.gpugrid.net/result.php?resultid=269806
ID: 6303 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile (_KoDAk_)
Avatar

Send message
Joined: 18 Oct 08
Posts: 43
Credit: 6,924,807
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 8200 - Posted: 5 Apr 2009, 7:12:04 UTC

Incorrect function. (0x1) - exit code 1 (0x1)

http://www.gpugrid.net/result.php?resultid=489157
ID: 8200 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8248 - Posted: 6 Apr 2009, 16:40:13 UTC

Kodak, what are you trying to say? Do you have "Continual computing errors", as the thread title implies? I can only see one error for the host you linked to. And I see his 9600GSO is overclocked quite a bit, so an error every nwo and then might well be within expectations.

MrS
Scanning for our furry friends since Jan 2002
ID: 8248 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile (_KoDAk_)
Avatar

Send message
Joined: 18 Oct 08
Posts: 43
Credit: 6,924,807
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 8766 - Posted: 23 Apr 2009, 8:17:08 UTC

IN LAST 2 DAY MANY errors
in new WU old WU is ok
and after
23.04.2009 11:11:44 GPUGRID Message from server: No work sent
23.04.2009 11:11:44 GPUGRID Message from server: (reached daily quota of 4 results)
23.04.2009 11:11:44 GPUGRID Message from server: (Project has no jobs available)
(((

ID: 8766 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Michael Goetz
Avatar

Send message
Joined: 2 Mar 09
Posts: 124
Credit: 124,873,744
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 8767 - Posted: 23 Apr 2009, 8:45:19 UTC - in response to Message 8248.  

Kodak, what are you trying to say? Do you have "Continual computing errors", as the thread title implies? I can only see one error for the host you linked to. And I see his 9600GSO is overclocked quite a bit, so an error every nwo and then might well be within expectations.

MrS


I think he's talking about his other computer, which is throwing a ton of errors. Here's one: http://www.gpugrid.net/result.php?resultid=569330


Want to find one of the largest known primes? Try PrimeGrid. Or help cure disease at WCG.

ID: 8767 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8798 - Posted: 23 Apr 2009, 19:32:48 UTC

Kodak, your 9600GSO is overclocked by ~350 MHz. If you run such a high OC and at some point it starts to fail the first thing you should try is to lower the OC and see if it helps.

MrS
Scanning for our furry friends since Jan 2002
ID: 8798 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile (_KoDAk_)
Avatar

Send message
Joined: 18 Oct 08
Posts: 43
Credit: 6,924,807
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 8816 - Posted: 24 Apr 2009, 4:42:43 UTC
Last modified: 24 Apr 2009, 5:38:37 UTC

i about http://www.gpugrid.net/show_host_detail.php?hostid=31714
in it only one overclocked by shaders from 13xx to 1734
second is asus top shaders =1674 OC to 1734
today low OC to 1674 same errors(((
cards not hot ~ 56-60 C
--info
1s WU run's ok ( no more WU) -> update
+3 WU
start 2nd run->ok BUT
start's 3rd wu run-> fail OND start's 4s wu run-> fail !!!
and return run 2nd wu and ok
Whay start's 3 and 4 (deadline is almost same!!!) ??????

and ?
what better use 2 GPU in one PC or 1gpu+pc +1gpu+pc ?????
ID: 8816 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Michael Goetz
Avatar

Send message
Joined: 2 Mar 09
Posts: 124
Credit: 124,873,744
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwat
Message 8825 - Posted: 24 Apr 2009, 11:18:32 UTC - in response to Message 8816.  

what better use 2 GPU in one PC or 1gpu+pc +1gpu+pc ?????


That's hard to say, but right now, I'd go with 1+1 and 1+1. GPU computing is relatively new to BOINC, and the BOINC scheduling software is far from perfect. It seems to have issues with propper scheduling when there's more than one GPU (especially when there's different GPUs in the same computer).

Once the scheduling issues are eventually resolved, things might change. But for right now, I'd put each GPU in a separate computer. It's also easier on the power supplies and the cooling (summer is coming, after all.)
ID: 8825 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile (_KoDAk_)
Avatar

Send message
Joined: 18 Oct 08
Posts: 43
Credit: 6,924,807
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 8884 - Posted: 25 Apr 2009, 8:45:26 UTC

1s WU run's ok ( no more WU) -> update
+3 WU
start 2nd run->ok BUT
start's 3rd wu run-> fail OND start's 4s wu run-> fail !!!
and return run 2nd wu and ok
Whay start's 3 and 4 (deadline is almost same!!!) ??????

IT is only 2xGPU (
ID: 8884 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 8913 - Posted: 25 Apr 2009, 13:57:57 UTC

You started at a reported shader clock of ~1730 MHz, then you went to ~1715 MHz and still get errors and now you're running 1693 and 1700 MHz and still get errors.

Do you know that the clock speed on current nVidia GPUs is nto continous (i.e. 1 MHz steps), but discrete with much larger steps? For the shader the step size is about 54 MHz (can't remember the exact value) and changes smaller than this likely don't change anything. Most tools (also the GPU-Grid task output) only report the requested clock speed, but not the real one.

So back off to 1600 MHz shader or so and see if it helps. Also don't forget the chip and memory clocks.. if they're also overclocked you should reduce them as well. It could also be a too tight OC on the CPU and/or memory.

MrS
Scanning for our furry friends since Jan 2002
ID: 8913 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile (_KoDAk_)
Avatar

Send message
Joined: 18 Oct 08
Posts: 43
Credit: 6,924,807
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwat
Message 8958 - Posted: 26 Apr 2009, 17:09:18 UTC

OC only shader \ chip and memory - is default

will be fine work
9800GTX+ VS 250GTS ? (it is only diff name , shader - 128 )
ID: 8958 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
showa

Send message
Joined: 2 Mar 09
Posts: 28
Credit: 4,975,808
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 8972 - Posted: 27 Apr 2009, 6:27:01 UTC

Hi. I have an overclocked 9800GTX which was running very smoothly. Until 3/4 days ago, when I behgan to obtain a long list of "computation error". Nothing has changed when that problem has begun (for example, I haven't installed any new driver).
This is a typical error:
<core_client_version>6.6.20</core_client_version>
<![CDATA[
<message>
- exit code 1073741845 (0x40000015)
</message>
<stderr_txt>
Failed to set low-cpu sync mode
# Using CUDA device 0
# Device 0: "Device Emulation (CPU)"
# Clock rate: 1350000 kilohertz
# Total amount of global memory: -1 bytes
# Number of multiprocessors: 16
# Number of cores: 128
Cuda error in file '..\cuda/cutil.h' in line 968 : initialization error.
Memory usage: host: bytes device: bytes
Assertion failed: 0, file ..\cuda/cutil.h, line 968

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.

</stderr_txt>
]]>

Can you help me?
Thank you in advance.
ID: 8972 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ignasi

Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 8975 - Posted: 27 Apr 2009, 7:19:41 UTC - in response to Message 8972.  

You are running in device emulation, no card is being used:

# Using CUDA device 0
# Device 0: "Device Emulation (CPU)"

Your last success result 566035 shows right config:

# Using CUDA device 0
# Device 0: "GeForce 9800 GTX/9800 GTX+"

Re-install?

i
ID: 8975 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
showa

Send message
Joined: 2 Mar 09
Posts: 28
Credit: 4,975,808
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 8976 - Posted: 27 Apr 2009, 7:46:34 UTC - in response to Message 8975.  

I have to reinstall the video drivers, or BOINC? Or both?
Thank for your answer.
ID: 8976 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 9023 - Posted: 27 Apr 2009, 21:31:03 UTC - in response to Message 8958.  

will be fine work
9800GTX+ VS 250GTS ? (it is only diff name , shader - 128 )


What are you trying to say?

9800GTX+ and GTS250 are the same speed, but the GTS250 can have a lower power consumption.

MrS
Scanning for our furry friends since Jan 2002
ID: 9023 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ignasi

Send message
Joined: 10 Apr 08
Posts: 254
Credit: 16,836,000
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 9027 - Posted: 27 Apr 2009, 21:42:24 UTC - in response to Message 8976.  

I have to reinstall the video drivers, or BOINC? Or both?
Thank for your answer.


I'd try BOINC first, try with version 6.5.0 though...
Although folks here in the forum may give you better idea on what client version is "safer".

i
ID: 9027 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Graphics cards (GPUs) : Continual computing errors

©2025 Universitat Pompeu Fabra