Message boards :
Graphics cards (GPUs) :
Continual computing errors
Message board moderation
| Author | Message |
|---|---|
|
Send message Joined: 25 Jan 09 Posts: 1 Credit: 0 RAC: 0 Level ![]() Scientific publications
|
Hi there. I have overclocked the shaders on my 9500GT card but I keep getting computing errors. I have now reset my card back to default values but the errors are still occuring. To date I think its up to around 20. Anyone have any ideas how I might fix this. I have tried a reset of the project and a detach but nothing works yet. Thanks. Eric |
Paul D. BuckSend message Joined: 9 Jun 08 Posts: 1050 Credit: 37,321,185 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Hi there. Check the fan and the air path around the card to make sure that it can be cooled. Use one of the monitroing tools to get the card temps ... ONe other thing to try is to run a few SaH tasks to see if they complete (though you will have to wait for them to validate and be paired up with a wingman) ... These are the first things that come to my mind ... weak as it is ... |
|
Send message Joined: 17 Nov 08 Posts: 13 Credit: 15,272,287 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Thanks for the info. I found the fans and air vents were ok. I then did a complete re install of my operating system as it was unstable. Re did all the drivers and re did Bionc. All is fine now. Once again thanks for repling. Eric |
Paul D. BuckSend message Joined: 9 Jun 08 Posts: 1050 Credit: 37,321,185 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Thanks for the info. I found the fans and air vents were ok. I then did a complete re install of my operating system as it was unstable. Re did all the drivers and re did Bionc. All is fine now. Eric, It is what we are here for ... I help you ... others are trying to help me with my issues ... all to the betterment of the universe ... :) One of the lessons I learned when I was writing documentation for BOINC is none of us knows it all ... or can do it all ... there is always more to learn ... Just because I can't get Linux beat into submission at the moment does not make me an idiot, nor does your difficulties ... I am just glad that you found the problem. *MY* experience with windows is that I would have to do a clean install about every 6 months to keep the systems running stably. When I only run BOINC and don't use the system for much of anything else it seems to last longer ... YMMV ... |
(_KoDAk_)Send message Joined: 18 Oct 08 Posts: 43 Credit: 6,924,807 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
http://www.gpugrid.net/result.php?resultid=269806 |
(_KoDAk_)Send message Joined: 18 Oct 08 Posts: 43 Credit: 6,924,807 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Incorrect function. (0x1) - exit code 1 (0x1) http://www.gpugrid.net/result.php?resultid=489157 |
|
Send message Joined: 17 Aug 08 Posts: 2705 Credit: 1,311,122,549 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Kodak, what are you trying to say? Do you have "Continual computing errors", as the thread title implies? I can only see one error for the host you linked to. And I see his 9600GSO is overclocked quite a bit, so an error every nwo and then might well be within expectations. MrS Scanning for our furry friends since Jan 2002 |
(_KoDAk_)Send message Joined: 18 Oct 08 Posts: 43 Credit: 6,924,807 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
IN LAST 2 DAY MANY errors in new WU old WU is ok and after 23.04.2009 11:11:44 GPUGRID Message from server: No work sent 23.04.2009 11:11:44 GPUGRID Message from server: (reached daily quota of 4 results) 23.04.2009 11:11:44 GPUGRID Message from server: (Project has no jobs available) ((( |
Michael GoetzSend message Joined: 2 Mar 09 Posts: 124 Credit: 124,873,744 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Kodak, what are you trying to say? Do you have "Continual computing errors", as the thread title implies? I can only see one error for the host you linked to. And I see his 9600GSO is overclocked quite a bit, so an error every nwo and then might well be within expectations. I think he's talking about his other computer, which is throwing a ton of errors. Here's one: http://www.gpugrid.net/result.php?resultid=569330 Want to find one of the largest known primes? Try PrimeGrid. Or help cure disease at WCG.
|
|
Send message Joined: 17 Aug 08 Posts: 2705 Credit: 1,311,122,549 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Kodak, your 9600GSO is overclocked by ~350 MHz. If you run such a high OC and at some point it starts to fail the first thing you should try is to lower the OC and see if it helps. MrS Scanning for our furry friends since Jan 2002 |
(_KoDAk_)Send message Joined: 18 Oct 08 Posts: 43 Credit: 6,924,807 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
i about http://www.gpugrid.net/show_host_detail.php?hostid=31714 in it only one overclocked by shaders from 13xx to 1734 second is asus top shaders =1674 OC to 1734 today low OC to 1674 same errors((( cards not hot ~ 56-60 C --info 1s WU run's ok ( no more WU) -> update +3 WU start 2nd run->ok BUT start's 3rd wu run-> fail OND start's 4s wu run-> fail !!! and return run 2nd wu and ok Whay start's 3 and 4 (deadline is almost same!!!) ?????? and ? what better use 2 GPU in one PC or 1gpu+pc +1gpu+pc ????? |
Michael GoetzSend message Joined: 2 Mar 09 Posts: 124 Credit: 124,873,744 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
what better use 2 GPU in one PC or 1gpu+pc +1gpu+pc ????? That's hard to say, but right now, I'd go with 1+1 and 1+1. GPU computing is relatively new to BOINC, and the BOINC scheduling software is far from perfect. It seems to have issues with propper scheduling when there's more than one GPU (especially when there's different GPUs in the same computer). Once the scheduling issues are eventually resolved, things might change. But for right now, I'd put each GPU in a separate computer. It's also easier on the power supplies and the cooling (summer is coming, after all.) |
(_KoDAk_)Send message Joined: 18 Oct 08 Posts: 43 Credit: 6,924,807 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
1s WU run's ok ( no more WU) -> update +3 WU start 2nd run->ok BUT start's 3rd wu run-> fail OND start's 4s wu run-> fail !!! and return run 2nd wu and ok Whay start's 3 and 4 (deadline is almost same!!!) ?????? IT is only 2xGPU ( |
|
Send message Joined: 17 Aug 08 Posts: 2705 Credit: 1,311,122,549 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
You started at a reported shader clock of ~1730 MHz, then you went to ~1715 MHz and still get errors and now you're running 1693 and 1700 MHz and still get errors. Do you know that the clock speed on current nVidia GPUs is nto continous (i.e. 1 MHz steps), but discrete with much larger steps? For the shader the step size is about 54 MHz (can't remember the exact value) and changes smaller than this likely don't change anything. Most tools (also the GPU-Grid task output) only report the requested clock speed, but not the real one. So back off to 1600 MHz shader or so and see if it helps. Also don't forget the chip and memory clocks.. if they're also overclocked you should reduce them as well. It could also be a too tight OC on the CPU and/or memory. MrS Scanning for our furry friends since Jan 2002 |
(_KoDAk_)Send message Joined: 18 Oct 08 Posts: 43 Credit: 6,924,807 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
OC only shader \ chip and memory - is default will be fine work 9800GTX+ VS 250GTS ? (it is only diff name , shader - 128 ) |
|
Send message Joined: 2 Mar 09 Posts: 28 Credit: 4,975,808 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Hi. I have an overclocked 9800GTX which was running very smoothly. Until 3/4 days ago, when I behgan to obtain a long list of "computation error". Nothing has changed when that problem has begun (for example, I haven't installed any new driver). This is a typical error: <core_client_version>6.6.20</core_client_version> <![CDATA[ <message> - exit code 1073741845 (0x40000015) </message> <stderr_txt> Failed to set low-cpu sync mode # Using CUDA device 0 # Device 0: "Device Emulation (CPU)" # Clock rate: 1350000 kilohertz # Total amount of global memory: -1 bytes # Number of multiprocessors: 16 # Number of cores: 128 Cuda error in file '..\cuda/cutil.h' in line 968 : initialization error. Memory usage: host: bytes device: bytes Assertion failed: 0, file ..\cuda/cutil.h, line 968 This application has requested the Runtime to terminate it in an unusual way. Please contact the application's support team for more information. </stderr_txt> ]]> Can you help me? Thank you in advance.
|
|
Send message Joined: 10 Apr 08 Posts: 254 Credit: 16,836,000 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
You are running in device emulation, no card is being used: # Using CUDA device 0 # Device 0: "Device Emulation (CPU)" Your last success result 566035 shows right config: # Using CUDA device 0 # Device 0: "GeForce 9800 GTX/9800 GTX+" Re-install? i |
|
Send message Joined: 2 Mar 09 Posts: 28 Credit: 4,975,808 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I have to reinstall the video drivers, or BOINC? Or both? Thank for your answer.
|
|
Send message Joined: 17 Aug 08 Posts: 2705 Credit: 1,311,122,549 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
will be fine work What are you trying to say? 9800GTX+ and GTS250 are the same speed, but the GTS250 can have a lower power consumption. MrS Scanning for our furry friends since Jan 2002 |
|
Send message Joined: 10 Apr 08 Posts: 254 Credit: 16,836,000 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
I have to reinstall the video drivers, or BOINC? Or both? I'd try BOINC first, try with version 6.5.0 though... Although folks here in the forum may give you better idea on what client version is "safer". i |
©2025 Universitat Pompeu Fabra