New NATHAN_KID WUs on long

Message boards : News : New NATHAN_KID WUs on long
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30714 - Posted: 7 Jun 2013, 18:41:17 UTC

Klepel, as you say you've been running these Nathan KIDc22 WUs just fine before in 59ks. Check if the GPU clock is still up to where it should be. After driver resets it sometimes stays too low without throwing an error at GPU-Grid. In this case a reboot would help.

MrS
Scanning for our furry friends since Jan 2002
ID: 30714 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Vagelis Giannadakis

Send message
Joined: 5 May 13
Posts: 187
Credit: 349,254,454
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 30721 - Posted: 7 Jun 2013, 19:40:22 UTC - in response to Message 30708.  

The computer it failed on was a titan (which cannot run these WU's):
    Coprocessors NVIDIA GeForce GTX TITAN (4095MB) driver: 314.22


Good catch! It seems I was too quick to blame the WU.

ID: 30721 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30723 - Posted: 7 Jun 2013, 20:44:13 UTC - in response to Message 30711.  

Oh, and I just noticed the up-load file does have a size of 107.88 MB (so more or less the double as before)


Sounds too long. Maybe wrong compression.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 30723 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,792,731,008
RAC: 124,733
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30727 - Posted: 8 Jun 2013, 0:02:01 UTC

The strange WU is up-loaded: http://www.gpugrid.net/result.php?resultid=6930954

All seems normal. Only that it took nearly twice as long: 101,479.54 s
Stderr output: 57186.919 s

The next WU crashed, because of a electricity cut.

I will inform about the next WU after it's finished.
ID: 30727 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
tomba

Send message
Joined: 21 Feb 09
Posts: 497
Credit: 700,690,702
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30732 - Posted: 8 Jun 2013, 14:39:36 UTC

I've completed 18 NATHAN_KIDc22s, nine of the SODcharge variety for 134000 credits each, and nine of the RND variety for 167500 credits each EXCEPT...

The RND that completed this morning, well within 24 hours, gave 111700 credits:

http://www.gpugrid.net/result.php?resultid=6933145

That was a surprise!
ID: 30732 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30734 - Posted: 8 Jun 2013, 15:23:17 UTC - in response to Message 30732.  
Last modified: 8 Jun 2013, 15:23:32 UTC

Your 2nd wingman handed the WU in after it got sent to you, but before you could send your in. In this case the bonus credit doesn't trigger for both of you, sadly an old problem.

MrS
Scanning for our furry friends since Jan 2002
ID: 30734 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
tomba

Send message
Joined: 21 Feb 09
Posts: 497
Credit: 700,690,702
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30735 - Posted: 8 Jun 2013, 15:42:25 UTC - in response to Message 30734.  

Your 2nd wingman handed the WU in after it got sent to you, but before you could send your in. In this case the bonus credit doesn't trigger for both of you, sadly an old problem. MrS

Ah! I did not notice I had a wingman. I had thought GPUGrid's quorum was one. Why did it get sent to me before the wingman's try had run out of time?
ID: 30735 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30736 - Posted: 8 Jun 2013, 16:46:25 UTC - in response to Message 30735.  

It's a known issue with the credit system at GPUGrid. Basically it happens only when a WU is resent and after being resent is returned by the first recipient and validates. Subsequent to the validation bonuses are not awarded. It's a fairly rare event but apparently too difficult to fix.

No Bonus for finishing within 24 hours
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 30736 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30745 - Posted: 9 Jun 2013, 12:33:06 UTC - in response to Message 30735.  

The WU was sent to the wingman first, but he didn't return it within a few days. At some point GPU-Grid needs the results to generate the next WU in this string / time iteration. So the WU was sent to you (not sure if this was before the deadline for the wingman or not), but the other guy evetually handed the result in. This is why GPU-Grid is not suitable for low-end hardware and why the deadline is rather short.

MrS
Scanning for our furry friends since Jan 2002
ID: 30745 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile nate

Send message
Joined: 6 Jun 11
Posts: 124
Credit: 2,928,865
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 30773 - Posted: 11 Jun 2013, 14:40:51 UTC

FYI, there are two new batches of simulations that are among the same research project I just sent out yesterday and today. Names are NATHAN_KIDc22_full and NATHAN_KIDc22_noPhos.
ID: 30773 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
flashawk

Send message
Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 30775 - Posted: 11 Jun 2013, 15:03:20 UTC

Boy, this place is turning in to you're own private Idaho, nobody else wants to run work units? Not that there's anything wrong with yours.
ID: 30775 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Stefan
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 5 Mar 13
Posts: 348
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 30777 - Posted: 11 Jun 2013, 17:13:03 UTC - in response to Message 30775.  

It's the circumstances. We are missing people to conferences, paternity and also summer is coming. So these weeks might end up being a bit unstable, but there is definitely more stuff to simulate! (We just need to get the people back :D)
ID: 30777 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
tomba

Send message
Joined: 21 Feb 09
Posts: 497
Credit: 700,690,702
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30779 - Posted: 11 Jun 2013, 17:57:30 UTC

After 23 successful Nathan_KIDc22s, I just got an error on my first noPhos, after 8h29m:

http://www.gpugrid.net/result.php?resultid=6944425

ID: 30779 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 30784 - Posted: 11 Jun 2013, 23:06:06 UTC

Just finished my first noPhos. Looks like no problemo.

http://www.gpugrid.net/result.php?resultid=6944390
ID: 30784 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile dskagcommunity
Avatar

Send message
Joined: 28 Apr 11
Posts: 462
Credit: 949,416,958
RAC: 67,995
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30786 - Posted: 12 Jun 2013, 10:20:03 UTC

Yes 5 phos done, no problems.
DSKAG Austria Research Team: http://www.research.dskag.at



ID: 30786 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
FoldingNator

Send message
Joined: 1 Dec 12
Posts: 24
Credit: 60,122,950
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 30794 - Posted: 12 Jun 2013, 16:29:25 UTC
Last modified: 12 Jun 2013, 16:31:26 UTC

I was working on two Nathan Phos WU's. After 16 hours and 99,617% done freezes my computer.

@#$%^&*@#$%^&@#$%^&#$%^& :'( not funny!!! :@

Why not? because it isn't the first time, no it is the third week have those problems. Only with GPUGRID for what I know... and it is very annoying.
From anger I've after restart all GPUGRID tasks aborted in BOINC, that is what you see at the task/client status now... -_-'


Last WU: http://www.gpugrid.net/workunit.php?wuid=4514859
Second last WU: http://www.gpugrid.net/workunit.php?wuid=4514763[/url]
ID: 30794 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
flashawk

Send message
Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 30795 - Posted: 12 Jun 2013, 17:16:41 UTC - in response to Message 30794.  

Why do you guys hide you're computers? I've never been able to figure that out, I haven't had any lockups in a while on any of my 4 rigs, couldn't say what the problem is with yours. I disabled my onboard USB 3.0 and all my issue's went away.
ID: 30795 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30797 - Posted: 12 Jun 2013, 18:36:57 UTC - in response to Message 30795.  

I disabled my onboard USB 3.0 and all my issue's went away.

Do you have an Eltron chip? They had serious driver issues last year, leading to unstable transfer rates, devices dropping and unstable computers. Drivers from this spring are fine, though (using it myself.. guess why I know about the problems).

MrS
Scanning for our furry friends since Jan 2002
ID: 30797 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30798 - Posted: 12 Jun 2013, 18:38:26 UTC - in response to Message 30794.  

I was working on two Nathan Phos WU's. After 16 hours and 99,617% done freezes my computer.

What do you mean by that, did you run 2 WUs concurrently? If so be aware that you're alpha testing that functionality. It's not suggested or approved by the project in any way, so do it at your own risk.

MrS
Scanning for our furry friends since Jan 2002
ID: 30798 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 30802 - Posted: 12 Jun 2013, 19:15:04 UTC - in response to Message 30798.  
Last modified: 12 Jun 2013, 20:07:03 UTC

Going by this post he has two GPU's.

Also,
Coprocessors [2] NVIDIA GeForce GTX 570 (2559MB) driver: 314.22

Outcome Computation error
Client state Aborted by user
Exit status 203 (0xcb) EXIT_ABORTED_VIA_GUI

(a process exit code used by the core client/app). More commonly found at CPU projects such as ABC, Asteroids, Physics@home, Constellation, Quake-Catcher...

I92R9-NATHAN_KIDc22_noPhos-1-10-RND9632_0 4514859 11 Jun 2013 | 23:11:29 UTC 12 Jun 2013 | 16:20:04 UTC Aborted by user 56,038.62 10,735.35 --- Long runs (8-12 hours on fastest card) v6.18 (cuda42)
I93R9-NATHAN_KIDc22_noPhos-1-10-RND7759_0 4514763 11 Jun 2013 | 23:11:29 UTC 12 Jun 2013 | 16:20:04 UTC Aborted by user 55,998.47 10,639.65 --- Long runs (8-12 hours on fastest card) v6.18 (cuda42)

http://www.gpugrid.net/show_host_detail.php?hostid=152224

The NATHAN_KIDc22_noPhos WU's take ~12h on a GTX660. Some finish under 12h on GTX570's on Linux but others are ~13.6h on W7.
Either 2 GPU's running slow (most likely) or one GPU running 2 WU's fast until they crash (less likely). Quite a few failures on that system though,
http://www.gpugrid.net/results.php?hostid=152224&offset=0&show_names=1&state=0&appid=
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 30802 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : News : New NATHAN_KID WUs on long

©2025 Universitat Pompeu Fabra