Advanced search

Message boards : Number crunching : WU's aborted by project

Author Message
Profile [VENETO] sabayonino
Send message
Joined: 4 Apr 10
Posts: 50
Credit: 645,641,596
RAC: 5,071
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 33718 - Posted: 2 Nov 2013 | 11:55:04 UTC
Last modified: 2 Nov 2013 | 11:56:43 UTC

Hi .

I've many WUs witho

<core_client_version>7.2.0</core_client_version>
<![CDATA[
<message>
aborted by project - no longer usable
</message>
<stderr_txt>

</stderr_txt>
]]>


many of these after few hours computation

any idea ?

TJ
Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 33723 - Posted: 2 Nov 2013 | 15:53:16 UTC - in response to Message 33718.

If you have your "max additional work buffer" set high, you will get a lot of tasks. If however others get the same tasks, as you not have been reported after a day or two, as return it sooner, than sometimes they are cancelled by the server. This could be one reason for what you a re seeing, but absolutely not the only reason.
____________
Greetings from TJ

Profile [VENETO] sabayonino
Send message
Joined: 4 Apr 10
Posts: 50
Credit: 645,641,596
RAC: 5,071
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 33725 - Posted: 2 Nov 2013 | 16:12:19 UTC - in response to Message 33723.
Last modified: 2 Nov 2013 | 16:12:47 UTC

If you have your "max additional work buffer" set high, you will get a lot of tasks. If however others get the same tasks, as you not have been reported after a day or two, as return it sooner, than sometimes they are cancelled by the server. This could be one reason for what you a re seeing, but absolutely not the only reason.


Hi . tnx for the answre.

max work buffer is set to 0.5 for all machines.

deadline for the all WUs was 6th November ...

a job was deleted but "wingman" still cranching the same WU

(now I deselect the Long-Run apps ...)

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 33726 - Posted: 2 Nov 2013 | 16:14:28 UTC

You're really getting quite a few of them on all your hosts. Several per host, to be more specific.. which compares to 0 on my machine. There's a certain tendency with most of them being "SANTI_baxbim2", but there's also a Noelia short run and a "NOELIA_INS1P" long run.

It's got nothing to do with your cache setting, it happens even a few hours after you recieve a WU.

The weirdest thing is that the WUs get resent just normally, so the message "aborted by project, don't need anymore" seems to be wrong. Unless something unexpected happened from the project side I could only vaguely speculate that your BOINC 7.2.0 might screw something up. The version number suggest the first public beta release of a the new 7.2. branch, which could have introduced new bugs along new functionality.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 33761 - Posted: 3 Nov 2013 | 21:59:07 UTC - in response to Message 33726.

Seems to be some confusion as to why it errored, crashed or got aborted. Maybe a bad description by Boinc?

Name I324-SANTI_baxbim2-16-32-RND4772_0
Workunit 4892140
Created 1 Nov 2013 | 16:13:36 UTC
Sent 1 Nov 2013 | 19:42:09 UTC
Received 1 Nov 2013 | 21:53:33 UTC
Server state Over
Outcome Computation error
Client state Aborted by user
Exit status 202 (0xca) EXIT_ABORTED_BY_PROJECT
Computer ID 161414
Report deadline 6 Nov 2013 | 19:42:09 UTC
Run time 7,707.59
CPU time 6,221.09
Validate state Invalid
Credit 0.00
Application version Long runs (8-12 hours on fastest card) v8.03 (cuda55)
Stderr output

<core_client_version>7.2.0</core_client_version>
<![CDATA[
<message>
aborted by project - no longer usable
</message>
<stderr_txt>
SWAN : FATAL : Cuda driver error 715 in file 'swanlibnv2.cpp' in line 1803.

</stderr_txt>
]]>

The same WU completed on a different Linux system, using a newer 7.2.xx client.

Operating System Linux
3.10-3-amd64
BOINC version 7.2.22

Maybe worth updating the client (finish or suspend GPUGrid work first though).
How many systems do you have and are you detaching and reattaching?
____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

Profile [VENETO] sabayonino
Send message
Joined: 4 Apr 10
Posts: 50
Credit: 645,641,596
RAC: 5,071
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 33774 - Posted: 4 Nov 2013 | 17:22:19 UTC
Last modified: 4 Nov 2013 | 17:23:30 UTC

"Aborted by user"

I killed some Wus for my own reasons...

but last WUs seem work fine

"Aborted by project" was from 1th to 2th November ....

Post to thread

Message boards : Number crunching : WU's aborted by project

//