6 Errors Today [Problems with "KASHIF_HIVPR" and "IBUCH_KID"-WUs]

Message boards : Graphics cards (GPUs) : 6 Errors Today [Problems with "KASHIF_HIVPR" and "IBUCH_KID"-WUs]
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 9553 - Posted: 9 May 2009, 15:20:44 UTC

@Zydor: I don't see "signal 11" in either my latest results or yours.

@Paul: that's the third of these tasks to fail on a G200 card. But the circumstances were slightly unusual, so I'm not sure if it means anything.

@all: ouch, 2 more errors for me:

- "30-KASHIF_HIVPR_dim_ba3-4-100-RND0655_0" - seems "normal"
- "p2690000-IBUCH_pYIpYVkp01_0705-2-10-RND1281_1" - not normal

The second task registered only 3 s of CPU time, so it may have failed while the driver was still restarting.

MrS
Scanning for our furry friends since Jan 2002
ID: 9553 · Rating: 0
Bymark
Joined: 23 Feb 09
Posts: 30
Credit: 5,897,921
RAC: 0
Level
Ser
Message 9559 - Posted: 9 May 2009, 16:27:50 UTC - in response to Message 9525.  
Last modified: 9 May 2009, 16:50:40 UTC

I have a big problem with my new ASUS 260:

hostid=35303

I downgraded all drivers and am now waiting to get more tasks.
"reached daily quota of 4 results" heh ;)
Any suggestions? SETI GPU tasks are working fine.


The ones crashing on that machine are not the suspect WUs that they have now stopped issuing; those crashing on that machine usually run fine. He also has a 260, which is outside the problems; it's the lower cards that did have issues in the past. Something else lurketh. No idea what, personally; over to the gurus for that.

Regards
Zy


I now have exactly the same drivers, BOINC version, etc. as on my fine-working ati 260. I'm still waiting for new WUs. SETI is working fine and the power supply is the same (550 W), so everything should be identical. Maybe it's a hardware problem, but then I don't understand why SETI GPU tasks run without failure. Running one SETI GPU task:

SETI account for the same computer

Hardware monitor
-----------------------------------------------------

AMD Athlon 64 X2 5600+ hardware monitor

Temperature sensor 0 33°C (91°F) [0x149] (Core #0)
Temperature sensor 1 38°C (99°F) [0x15A] (Core #1)

Dump hardware monitor

Hardware monitor
-----------------------------------------------------

GeForce GTX 260 hardware monitor

Temperature sensor 0 71°C (159°F) [0x47] (GPU Core)
"Silakka"
Hello from Turku > Åbo.
ID: 9559 · Rating: 0
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 9560 - Posted: 9 May 2009, 18:08:31 UTC - in response to Message 9559.  

Well, you also got >6 errors a day, but your problem is totally unrelated to what is being discussed in this thread. It might help to ask in a separate thread if you need further assistance. Do 3DMark and/or FurMark run on your card? SETI stresses the hardware less than GPU-Grid.

MrS
Scanning for our furry friends since Jan 2002
ID: 9560 · Rating: 0
[AF] Profanateur
Joined: 25 Oct 08
Posts: 42
Credit: 42,812,268
RAC: 0
Level
Val
Message 9562 - Posted: 9 May 2009, 19:03:31 UTC - in response to Message 9560.  

And what about my pbs with drivers other than 182.5?
ID: 9562 · Rating: 0
Aardvark
Joined: 27 Nov 08
Posts: 28
Credit: 82,362,324
RAC: 0
Level
Thr
Message 9567 - Posted: 10 May 2009, 0:34:22 UTC

Success on 52-KASHIF_HIVPR_mon_ba3-7-100-RND3244_0. 64 bit Vista, 9800 GX2 (Not O/C), client 6.6.20 & 182.85 driver.
ID: 9567 · Rating: 0
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 9578 - Posted: 10 May 2009, 8:17:23 UTC

Aardvark, so far the "KASHIF_HIVPR_mon" WUs have also been fine on my machine. Thanks for the info; it seems these are indeed not the troublemakers.

Profanateur, if I remember correctly you have a separate thread about your problem elsewhere. And since all WUs error on your machine, you are facing a different problem from the one discussed here. I think I wrote some suggestions in that other thread.. well, I hope. At least I wanted to write something ;)
What do you mean by pbs?

MrS
Scanning for our furry friends since Jan 2002
ID: 9578 · Rating: 0
[AF] Profanateur
Joined: 25 Oct 08
Posts: 42
Credit: 42,812,268
RAC: 0
Level
Val
Message 9581 - Posted: 10 May 2009, 8:49:47 UTC

pbs = problems = failures.
Sorry, but I'm French.
ID: 9581 · Rating: 0
boincwoman

Joined: 9 May 09
Posts: 1
Credit: 2,096,817
RAC: 0
Level
Ala
Message 9585 - Posted: 10 May 2009, 11:31:08 UTC

I'm new here.

I have errors with these:

75-IBUCH_HIVPR_mon_ba8-4-100-RND5234 id: 451357

100-KASHIF_HIVPR_n1_for_ba4-4-100-RND3172 id: 448737

Shuttle XPC
Vista Enterprise 64-bit, 2 GB RAM
AMD Opteron 2.4 GHz, model 180
GeForce 9400 GT, 1 GB RAM, newly bought
BOINC 6.6.20

ComputerID: 35365

The Boincwoman
ID: 9585 · Rating: 0
refla

Joined: 12 Feb 09
Posts: 9
Credit: 385,357
RAC: 0
Level

Message 9586 - Posted: 10 May 2009, 12:12:50 UTC - in response to Message 9530.  
Last modified: 10 May 2009, 12:15:06 UTC

xp/32 + 9600GT@181.20 + BOINC6.4.5 cannot survive!
ID: 9586 · Rating: 0
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 9588 - Posted: 10 May 2009, 12:25:02 UTC - in response to Message 9586.  

Refla, not sure what you mean. You only have successful WUs and others which are listed as "aborted by user". Sure, they can't survive if you abort them ;)

Boincwoman, your machine has not completed any WUs so far, so I'm not sure we can attribute your failure of the "IBUCH_HIVPR" WU to the error discussed here. If your card is passively cooled it may be overheating (check with GPU-Z and report temperatures). Otherwise your setup should be fine.
However, the card is very slow: it has 16 shaders ("stream processors"), whereas at least 50 are officially recommended (FAQ). You'll have trouble meeting the GPU-Grid deadlines, and you may want to take a look at SETI for your GPU.

MrS
Scanning for our furry friends since Jan 2002
ID: 9588 · Rating: 0
[AF] Profanateur
Joined: 25 Oct 08
Posts: 42
Credit: 42,812,268
RAC: 0
Level
Val
Message 9605 - Posted: 10 May 2009, 20:18:45 UTC

Errors today:
10/05/2009 10:53:19 GPUGRID Output file p1760000-IBUCH_pYIpYVkp01_0705-4-10-RND5135_0_1 for task p1760000-IBUCH_pYIpYVkp01_0705-4-10-RND5135_0 absent
10/05/2009 16:56:28 GPUGRID Output file p2750000-IBUCH_pYIpYVkp01_0705-4-10-RND5064_1_1 for task p2750000-IBUCH_pYIpYVkp01_0705-4-10-RND5064_1 absent
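
For anyone collecting these from a longer message log, here is a minimal Python sketch that pulls the task name out of lines like the two above. The line format is assumed from those two examples only, and `ABSENT_RE` and `absent_tasks` are hypothetical names for illustration, not part of BOINC.

```python
import re

# Pattern assumed from the two log lines quoted above:
# "<date> <time> <project> Output file <file> for task <task> absent"
ABSENT_RE = re.compile(
    r"^(?P<date>\d{2}/\d{2}/\d{4}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<project>\S+) Output file (?P<file>\S+) "
    r"for task (?P<task>\S+) absent$"
)


def absent_tasks(lines):
    """Return the task names from every 'Output file ... absent' line."""
    tasks = []
    for line in lines:
        match = ABSENT_RE.match(line.strip())
        if match:
            tasks.append(match.group("task"))
    return tasks


log = [
    "10/05/2009 10:53:19 GPUGRID Output file "
    "p1760000-IBUCH_pYIpYVkp01_0705-4-10-RND5135_0_1 for task "
    "p1760000-IBUCH_pYIpYVkp01_0705-4-10-RND5135_0 absent",
    "10/05/2009 16:56:28 GPUGRID Output file "
    "p2750000-IBUCH_pYIpYVkp01_0705-4-10-RND5064_1_1 for task "
    "p2750000-IBUCH_pYIpYVkp01_0705-4-10-RND5064_1 absent",
]
failed = absent_tasks(log)
```

Lines that do not match the assumed format are skipped silently, so other log messages can be left in the input.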
ID: 9605 · Rating: 0
refla

Joined: 12 Feb 09
Posts: 9
Credit: 385,357
RAC: 0
Level

Message 9608 - Posted: 10 May 2009, 21:11:16 UTC - in response to Message 9588.  

ETA:

I aborted them because the WUs' progress had not advanced in a long time (more than 1 hour at least). The situation did not change even after I rebooted my computer.

After 2 WUs, I assume that if the last number in the task name is greater than zero, it is probably a bad WU.
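
That rule of thumb can be written down as a short Python sketch. It assumes task names end in `_<number>`, as all the full names quoted in this thread do; what that number means on the server side is not confirmed here, and `trailing_index` and `matches_rule` are hypothetical helper names.

```python
import re


def trailing_index(task_name: str) -> int:
    """Return the number after the last underscore of a task name."""
    match = re.search(r"_(\d+)$", task_name)
    if match is None:
        raise ValueError(f"no trailing _<number> in {task_name!r}")
    return int(match.group(1))


def matches_rule(task_name: str) -> bool:
    """True if the last number in the task name is greater than zero."""
    return trailing_index(task_name) > 0


names = [
    "30-KASHIF_HIVPR_dim_ba3-4-100-RND0655_0",
    "p2690000-IBUCH_pYIpYVkp01_0705-2-10-RND1281_1",
]
flagged = [name for name in names if matches_rule(name)]
```

This only flags names that fit the rule; it says nothing about whether the WU itself is actually bad.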

Details in http://www.gpugrid.net/forum_thread.php?id=1041

My English is not very good; I hope you can understand what I mean.

:)
ID: 9608 · Rating: 0
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 9613 - Posted: 10 May 2009, 21:48:37 UTC

Profanateur,

your problem is not related to what is being discussed here. A great many of your WUs error out, which is different from the "KASHIF_HIVPR" and "IBUCH_KID" issue. You actually completed some, so your software should be fine.

However, you are running a very new driver and two overclocked cards, which are very different. All of these or their combination could lead to problems. I suggest you start a new thread (instead of posting a little in different threads), write down your current config (software versions, clocks, GPU temperatures) and then change some parameters, document the changes and see if it helps. By that I mean

- run only 1 of the cards to see if one is broken
- reduce all clocks to standard values
- run other stability tests
- try well-tested drivers like 182.50 or 182.08
- maybe more

If you do that we (or you yourself ;) should be able to get you going.

Regards,
MrS
Scanning for our furry friends since Jan 2002
ID: 9613 · Rating: 0
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 9615 - Posted: 10 May 2009, 21:57:31 UTC - in response to Message 9608.  

refla,

that's strange. You're running 6.4.5, so you shouldn't be affected by the slow-6.6.20 bug. Also, most of your canceled WUs may belong to the critical "KASHIF_HIVPR" and "IBUCH_KID" series, but some were "IBUCH_pYIpYVkp01", which have not been reported to fail massively.
Furthermore, your WUs are crunched just fine on G200-based cards, whereas no G9x returned any of them. Sorry, I don't know what this means.
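
To check which series a batch of failed tasks belongs to, a minimal Python sketch like the following may help. It assumes the series name is the second hyphen-separated field of the task name, which holds for the names quoted in this thread; `SUSPECT_PREFIXES`, `series_of`, `in_suspect_series` and `tally` are hypothetical names. A prefix match is deliberately coarse: it flags every WU of a series, even the sub-types that run fine.

```python
from collections import Counter

# Series prefixes named in the thread title (assumed to be prefixes).
SUSPECT_PREFIXES = ("KASHIF_HIVPR", "IBUCH_KID")


def series_of(task_name: str) -> str:
    """Return the second hyphen-separated field, e.g. 'KASHIF_HIVPR_dim_ba3'."""
    parts = task_name.split("-")
    return parts[1] if len(parts) > 1 else task_name


def in_suspect_series(task_name: str) -> bool:
    """True if the series starts with one of the suspect prefixes."""
    return series_of(task_name).startswith(SUSPECT_PREFIXES)


def tally(task_names):
    """Count tasks per series."""
    return Counter(series_of(name) for name in task_names)


failed = [
    "30-KASHIF_HIVPR_dim_ba3-4-100-RND0655_0",
    "p2690000-IBUCH_pYIpYVkp01_0705-2-10-RND1281_1",
    "p1760000-IBUCH_pYIpYVkp01_0705-4-10-RND5135_0",
]
counts = tally(failed)
```

With the three names above, `counts` groups the two "IBUCH_pYIpYVkp01_0705" tasks together and keeps the KASHIF task separate.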

MrS
Scanning for our furry friends since Jan 2002
ID: 9615 · Rating: 0
refla

Joined: 12 Feb 09
Posts: 9
Credit: 385,357
RAC: 0
Level

Message 9627 - Posted: 11 May 2009, 3:55:09 UTC - in response to Message 9615.  

ETA,

please tell me how to avoid or recover from the case where a WU's progress freezes.

You can see that I'm not the only one who has run into this. Before I aborted them, other GPUGrid users had done the same.
ID: 9627 · Rating: 0
MarkJ
Volunteer moderator
Volunteer tester

Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Message 9630 - Posted: 11 May 2009, 10:12:05 UTC - in response to Message 9627.  

ETA,

please tell me how to avoid or recover from the case where a WU's progress freezes.

You can see that I'm not the only one who has run into this. Before I aborted them, other GPUGrid users had done the same.


@refla:

I would suggest you switch to BOINC 6.6.23.

Your driver version is not shown, but as ETA said above, I would suggest the 182.50 drivers, as they seem to be reliable.
BOINC blog
ID: 9630 · Rating: 0
palmss

Joined: 28 Aug 08
Posts: 7
Credit: 60,897,550
RAC: 0
Level
Thr
Message 9631 - Posted: 11 May 2009, 10:41:28 UTC

Hi,
I have another error (Kernel [nb_k] failed in file 'nb.cu' in line 202 : unknown error) on a new type of WU: http://www.gpugrid.net/result.php?resultid=645509
ID: 9631 · Rating: 0
MarkJ
Volunteer moderator
Volunteer tester

Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Message 9632 - Posted: 11 May 2009, 11:08:39 UTC - in response to Message 9631.  

Hi,
I have another error (Kernel [nb_k] failed in file 'nb.cu' in line 202 : unknown error) on a new type of WU: http://www.gpugrid.net/result.php?resultid=645509


What driver version are you using?
BOINC blog
ID: 9632 · Rating: 0
mike047
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Message 9633 - Posted: 11 May 2009, 11:23:09 UTC
Last modified: 11 May 2009, 11:23:33 UTC

Have the "EVIL" work units been disabled or deleted?

I have stopped work on 8 of my cards (250s and below). The two 260s are doing OK.
mike
ID: 9633 · Rating: 0
Zydor
Joined: 8 Feb 09
Posts: 252
Credit: 1,309,451
RAC: 0
Level
Ala
Message 9635 - Posted: 11 May 2009, 12:25:39 UTC - in response to Message 9633.  
Last modified: 11 May 2009, 12:27:45 UTC

Yes, they stopped issuing the suspect ones on Saturday. Not all KASHIFs are suspect: there are several types of KASHIF WUs, and it was only one particular type that was giving grief.

See http://www.gpugrid.net/forum_thread.php?id=1034&nowrap=true#9506

Regards
Zy
ID: 9635 · Rating: 0

©2025 Universitat Pompeu Fabra