All Gerard WUs erroring

Message boards : Number crunching : All Gerard WUs erroring
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Stephen Farrell

Send message
Joined: 3 Nov 14
Posts: 10
Credit: 57,322,675
RAC: 0
Level
Thr
Scientific publications
wat
Message 42573 - Posted: 8 Jan 2016, 11:26:53 UTC

Hi,

I was wondering if others are still having this problem as the issue still persists on both my Linux boxes.

ID: 42573 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
captainjack

Send message
Joined: 9 May 13
Posts: 171
Credit: 4,610,046,466
RAC: 193,605
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42574 - Posted: 8 Jan 2016, 13:31:18 UTC

Yep, the GPUGRID tasks are still not processing on my Linux boxes.

But my backup project is getting quite a bit of work done.
ID: 42574 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Stephen Farrell

Send message
Joined: 3 Nov 14
Posts: 10
Credit: 57,322,675
RAC: 0
Level
Thr
Scientific publications
wat
Message 42576 - Posted: 8 Jan 2016, 16:09:52 UTC

Okay, thanks for the update. I guess I'll just add a backup project myself until the issue is resolved.

Cheers.

ID: 42576 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Trotador

Send message
Joined: 25 Mar 12
Posts: 103
Credit: 14,948,929,771
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42577 - Posted: 8 Jan 2016, 19:22:01 UTC

Same here, no joy for Linux hosts, five days in a row, we don't seem to be anything worthy for the project.
ID: 42577 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42583 - Posted: 9 Jan 2016, 14:30:56 UTC
Last modified: 9 Jan 2016, 14:31:37 UTC

The tasks were doing OK on my XP box. I moved the cards to a Win7 box and now they all error out in 2 seconds. Looks like moving the cards was a mistake but I can't move them back ATM.
ID: 42583 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42585 - Posted: 9 Jan 2016, 20:32:11 UTC
Last modified: 9 Jan 2016, 20:38:43 UTC

I managed to download 1 task that didn't error out in 2 seconds. *fingers crossed*

Still having issues getting tasks to download all the files needed to run. From event log.

4680 GPUGRID 1/9/2016 3:25:00 PM Temporarily failed download of e17s19_e13s27p1f405-GERARD_CXCL12_CHALC2_MON1-0-pdb_file: transient HTTP error

After 5 attempts and 30inutes the last file did finally download.
EDIT: Second task downloaded and is running. Stay tuned.
ID: 42585 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile microchip
Avatar

Send message
Joined: 4 Sep 11
Posts: 110
Credit: 326,102,587
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42587 - Posted: 10 Jan 2016, 15:12:24 UTC

Same here on Linux. All WUs error out, even after a reset of the project.

Team Belgium
ID: 42587 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Stephen Farrell

Send message
Joined: 3 Nov 14
Posts: 10
Credit: 57,322,675
RAC: 0
Level
Thr
Scientific publications
wat
Message 42591 - Posted: 12 Jan 2016, 11:13:57 UTC - in response to Message 42585.  

Hi nanoprobe,

did you successfully complete the work unit that started?

ID: 42591 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
bormolino

Send message
Joined: 16 May 13
Posts: 41
Credit: 145,731,947
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwat
Message 42597 - Posted: 13 Jan 2016, 13:31:53 UTC

Still not running under linux ...
ID: 42597 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42598 - Posted: 13 Jan 2016, 19:38:08 UTC - in response to Message 42591.  

Hi nanoprobe,

did you successfully complete the work unit that started?



Yes. It had previously errored out on a Linux machine with 0 runtime and a Windows machine after about 60 minutes of run time. I have also received 6 more since that one that have completed and currently have 2 more in progress. For me all the version 8.4.1 tasks error out. Version 8.4.7 tasks seem to run fine with only an occasional error and unfortunately they run for hours before they go south.
ID: 42598 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Trotador

Send message
Joined: 25 Mar 12
Posts: 103
Credit: 14,948,929,771
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42600 - Posted: 13 Jan 2016, 20:29:36 UTC

One day more without Linux crunching and without status info...who cares?
ID: 42600 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42602 - Posted: 14 Jan 2016, 1:36:14 UTC - in response to Message 42600.  

One day more without Linux crunching and without status info...who cares?


Someone didn't get their nap today.
ID: 42602 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bikermatt

Send message
Joined: 8 Apr 10
Posts: 37
Credit: 4,431,457,619
RAC: 36,378
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42615 - Posted: 15 Jan 2016, 2:01:56 UTC - in response to Message 42602.  

One day more without Linux crunching and without status info...who cares?


Someone didn't get their nap today.


No, don't be a jerk. This has been a known problem with a known cause for a week now and no one has bothered to fix it.

For many years there was a significant performance boost when crunching with Linux at this project. The developers actually recommended that you crunch with Linux. Many of us have dedicated Linux hosts to this project due to that fact. Now my Linux hosts are having to crunch mathematics crap and look for pulsars to keep my house warm.

Could someone please fix this?
ID: 42615 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42616 - Posted: 15 Jan 2016, 4:32:33 UTC - in response to Message 42615.  

One day more without Linux crunching and without status info...who cares?


Someone didn't get their nap today.


No, don't be a jerk. This has been a known problem with a known cause for a week now and no one has bothered to fix it.

For many years there was a significant performance boost when crunching with Linux at this project. The developers actually recommended that you crunch with Linux. Many of us have dedicated Linux hosts to this project due to that fact. Now my Linux hosts are having to crunch mathematics crap and look for pulsars to keep my house warm.

Could someone please fix this?

No nap and lost your sense of humor? Go look in a mirror and take a chill pill man. This ain't life or death and GPUGrid doesn't revolve around you.
ID: 42616 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42617 - Posted: 15 Jan 2016, 4:33:04 UTC - in response to Message 42615.  
Last modified: 15 Jan 2016, 4:35:18 UTC

That was weird. Triple post.?????
ID: 42617 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42618 - Posted: 15 Jan 2016, 4:33:21 UTC - in response to Message 42615.  
Last modified: 15 Jan 2016, 4:36:53 UTC

Can't explain the triple post.
ID: 42618 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Trotador

Send message
Joined: 25 Mar 12
Posts: 103
Credit: 14,948,929,771
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42619 - Posted: 15 Jan 2016, 5:19:04 UTC - in response to Message 42618.  

Can't explain the triple post.


You missed your nap? :)
ID: 42619 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Gerard

Send message
Joined: 26 Mar 14
Posts: 101
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 42620 - Posted: 15 Jan 2016, 10:35:21 UTC - in response to Message 42615.  

Guys! Matt is trying to fix it, see https://www.gpugrid.net/forum_thread.php?id=4235 . Apparently the solution must not be trivial. Please be patient!
ID: 42620 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bedrich Hajek

Send message
Joined: 28 Mar 09
Posts: 490
Credit: 11,739,145,728
RAC: 95,752
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 42621 - Posted: 15 Jan 2016, 11:45:24 UTC
Last modified: 15 Jan 2016, 11:53:58 UTC

Now I am getting this same "linux" error on my both my windows machines.


https://www.gpugrid.net/hosts_user.php?userid=19626


Also, when I downloaded a new unit, and I suspended a good unit to test the new unit. The new unit would crash, and when I resumed the previously good unit, it also crashed.
ID: 42621 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
nanoprobe

Send message
Joined: 26 Feb 12
Posts: 184
Credit: 222,376,233
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 42622 - Posted: 15 Jan 2016, 12:24:05 UTC - in response to Message 42619.  
Last modified: 15 Jan 2016, 12:35:01 UTC

Can't explain the triple post.


You missed your nap? :)

Or I fell asleep at the keyboard. ;-)

FWIW most of the tasks I'm getting are resends that have failed at least once on a Linux host. So far they have all run to completion on my host. Win7, Xeon E5 2683, twin GTX 970. Along with GPUGrid tasks I'm also running a full load of CPU tasks minus 2 threads each for the cards if that means anything.
ID: 42622 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Number crunching : All Gerard WUs erroring

©2026 Universitat Pompeu Fabra