Everyone is getting computation errors

flashawk

Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Message 49862 - Posted: 15 Jul 2018, 1:08:24 UTC

I suspended GPUGrid
kksplace

Joined: 4 Mar 18
Posts: 53
Credit: 2,815,476,011
RAC: 0
Level
Phe
Message 49863 - Posted: 15 Jul 2018, 1:16:15 UTC

Same. Errors on last 5 WUs within 2 seconds.
flashawk

Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Message 49864 - Posted: 15 Jul 2018, 1:27:18 UTC

Yeah, I was scrambling around downclocking my cards and turning up the voltage, having a litter of kittens, wondering how all 4 of my 1180's went bad at the same time.

It's not a good feeling, these things aren't cheap. I'm glad it's the work units rather than my cards.
tullio

Joined: 8 May 18
Posts: 190
Credit: 104,426,808
RAC: 0
Level
Cys
Message 49865 - Posted: 15 Jul 2018, 1:27:58 UTC

I get no errors on my Linux systems.
Tullio
flashawk

Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Message 49866 - Posted: 15 Jul 2018, 1:35:24 UTC - in response to Message 49865.  

Maybe it's a Windows-only thing; both Pablo and Adria WUs are getting errors on Windows, not on Linux.
tullio

Joined: 8 May 18
Posts: 190
Credit: 104,426,808
RAC: 0
Level
Cys
Message 49868 - Posted: 15 Jul 2018, 1:57:39 UTC

On my Windows 10 PC, which is always kept up to date and has the NVIDIA drivers installed, I also get errors, so I am running GPUGRID only on the two Linux boxes.
Tullio
Erich56

Joined: 1 Jan 15
Posts: 1166
Credit: 12,260,898,501
RAC: 1
Level
Trp
Message 49870 - Posted: 15 Jul 2018, 6:10:39 UTC

Same thing here: all newly downloaded tasks (regardless of whether PABLO or ADRIA) error out after a few seconds:

(unknown error) - exit code -44 (0xffffffd4)

Did no one at GPUGRID notice this problem?
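The exit code in the error line above is shown two ways: -44 as a signed integer and 0xffffffd4 as the same 32 bits read as unsigned. A minimal Python sketch of that conversion follows. The helper names are made up for illustration and are not part of BOINC or the GPUGRID application, and the sketch says nothing about what actually causes the error.

```python
# Hypothetical helpers: convert between the signed and unsigned 32-bit
# views of a process exit code (two's complement).

def to_unsigned32(code: int) -> int:
    """Return the 32-bit unsigned view of a signed exit code."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Return the signed view of a 32-bit unsigned exit code."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value is negative in two's complement.
    return value - 0x1_0000_0000 if value & 0x8000_0000 else value


if __name__ == "__main__":
    print(hex(to_unsigned32(-44)))   # 0xffffffd4
    print(to_signed32(0xFFFFFFD4))   # -44
```

So the two numbers in the log line are one and the same code, not two separate errors.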
Retvari Zoltan

Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Message 49872 - Posted: 15 Jul 2018, 8:59:11 UTC

I think this time the license of the Windows / CUDA8.0 client has expired, as the Windows XP / CUDA6.5 and the Linux / CUDA8.0 clients are working fine.
Too bad that my Windows XP hosts are offline for the summer.
Many workunits will be lost, because most of the hosts run Windows 10 or 7.
I think the Windows / CUDA8.0 client should be deprecated immediately.
[PUGLIA] kidkidkid3

Joined: 23 Feb 11
Posts: 101
Credit: 1,589,743,957
RAC: 439
Level
His
Message 49873 - Posted: 15 Jul 2018, 9:25:16 UTC - in response to Message 49872.  

After a new license is purchased, all of us will need a reset of the daily quota to crunch WUs, is that correct?
Thanks
K.
Dreams do not always come true. But not because they are too big or impossible. Why did we stop believing.
(Martin Luther King)
Betting Slip

Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Message 49874 - Posted: 15 Jul 2018, 9:39:40 UTC - in response to Message 49873.  

No need to reset it for the sake of just one day.


Radio Caroline, the world's most famous offshore pirate radio station.
Great music since April 1964. Support Radio Caroline Team -
Radio Caroline
Retvari Zoltan

Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Message 49875 - Posted: 15 Jul 2018, 10:57:19 UTC

The error rates of three workunit batches on the Server status page are in the red range (above 75%) now.
flashawk

Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Message 49884 - Posted: 15 Jul 2018, 18:55:30 UTC

No word from the staff yet on when it's safe to start crunching? The Linux guys should have enough work to last the next couple of days. Doesn't anyone monitor the servers and software over the weekend?

I asked this question twice before and got no answer: what happened to the moderators?
kain

Joined: 3 Sep 14
Posts: 152
Credit: 918,557,369
RAC: 28
Level
Glu
Message 49885 - Posted: 15 Jul 2018, 19:32:28 UTC

It is a small (but still very productive) team, it is the weekend, and today was the World Cup final. Let them live ;)
MartinKanne

Joined: 27 Dec 16
Posts: 6
Credit: 53,210,225
RAC: 0
Level
Thr
Message 49906 - Posted: 16 Jul 2018, 20:07:16 UTC

I do not suspend GPUGrid, because a failing task only takes 3 to 8 seconds.
But I would be glad if GPUGrid were able to fix the problem within the next 48 hours.
Jim1348

Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Message 49908 - Posted: 16 Jul 2018, 21:15:10 UTC

I was running an ADRIA_FOLDT1015 on my GTX 1060 (Ubuntu 16.04) when it crashed. Not only that, but it took out the QC work units running on the CPU also. I will lay off the GPU for a while; it is too warm anyway.
Bedrich Hajek

Joined: 28 Mar 09
Posts: 490
Credit: 11,731,645,728
RAC: 69
Level
Trp
Message 49910 - Posted: 16 Jul 2018, 22:05:07 UTC

Retvari Zoltan

Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 0
Level
Trp
Message 49913 - Posted: 16 Jul 2018, 22:33:05 UTC - in response to Message 49910.  

Quote: Now, I am getting the same error on cuda 6.5 / windows xp.
Yep, me too. Too bad... At least my electricity bill will be the lowest in years...
flashawk

Joined: 18 Jun 12
Posts: 297
Credit: 3,572,627,986
RAC: 0
Level
Arg
Message 49915 - Posted: 17 Jul 2018, 1:01:19 UTC

I can't believe they haven't fixed this yet; over 4300 work units now and growing. It's obvious the Linux machines can't keep up. This is starting to get strange.
Erich56

Joined: 1 Jan 15
Posts: 1166
Credit: 12,260,898,501
RAC: 1
Level
Trp
Message 49916 - Posted: 17 Jul 2018, 4:50:07 UTC - in response to Message 49913.  

Quote: Now, I am getting the same error on cuda 6.5 / windows xp.
Quote: Yep, me too. Too bad... At least my electricity bill will be the lowest in years...

Same here :-(

Is GPUGRID falling apart?

©2025 Universitat Pompeu Fabra