Message boards :
News :
Server up again
Message board moderation
Author | Message |
---|---|
![]() Send message Joined: 14 Mar 07 Posts: 1958 Credit: 629,356 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
The server is up again and all should get back to normal very quickly. gdf |
Send message Joined: 5 Jan 09 Posts: 670 Credit: 2,498,095,550 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Server down again Radio Caroline, the world's most famous offshore pirate radio station. Great music since April 1964. Support Radio Caroline Team - Radio Caroline |
Send message Joined: 11 Jul 09 Posts: 1639 Credit: 10,159,968,649 RAC: 295,172 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I'm getting GPUGRID 31/01/2011 16:53:28 [error] Error reported by file upload server: can't open file on all file uploads - apparently after all data has successfully transferred. (task 3631188) |
![]() ![]() Send message Joined: 18 Sep 08 Posts: 36 Credit: 100,352,867 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Same here. No dn/up loads. Have 13 uploads stuck in the pipe. It will take a bit to get these sorted. :) ![]() ![]() ![]() |
![]() Send message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Yeah, looks like the gpugrid_file_deleter program failed on the server. Might be to do with changes in task size, a bad batch or just a random service failure event, ie I know nothing. For the next few hours I'm turning my systems off and doing some dusting ;) For those that can micromanage you might want to suspend the pending uploads and keep an eye on the server status page Some crunchers might want to hook up to MilkyWay (or other GPU project), keep a low cache (0.01 days) and wait it out. |
Send message Joined: 11 Jul 09 Posts: 1639 Credit: 10,159,968,649 RAC: 295,172 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Would a failed 'deleter' daemon really cause this error? Does their data storage really fill up that quickly? Feels more like a NAS mounting error to me. [Edit - especially as there are - reportedly - only a few files waiting to be deleted] |
![]() Send message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Possibly but for all I know they have stopped the service. Yes data storage is a repeating issue. It could well be a NAS issue. Good point because if it is a NAS issue its probably down to the technicians to fix, not the research team, and that means it could be down until tomorrow morning. |
![]() ![]() Send message Joined: 25 May 09 Posts: 224 Credit: 34,057,374,498 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Bags me first in queue when it gets sorted, lol |
Send message Joined: 16 Nov 10 Posts: 22 Credit: 24,712,746 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() |
Possibly but for all I know they have stopped the service. Even if the upload/download server is displayed as running, I cannot download or upload anything. Here is the messages I get: 1/31/2011 10:21:58 PM GPUGRID Started upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_0 1/31/2011 10:21:58 PM GPUGRID Started upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_1 1/31/2011 10:22:05 PM GPUGRID [error] Error reported by file upload server: can't open file 1/31/2011 10:22:05 PM GPUGRID Temporarily failed upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_0: transient upload error 1/31/2011 10:22:05 PM GPUGRID Backing off 56 min 8 sec on upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_0 1/31/2011 10:22:05 PM GPUGRID Started upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_2 1/31/2011 10:22:15 PM GPUGRID [error] Error reported by file upload server: can't open file 1/31/2011 10:22:15 PM GPUGRID Temporarily failed upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_1: transient upload error 1/31/2011 10:22:15 PM GPUGRID Backing off 45 min 37 sec on upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_1 1/31/2011 10:22:15 PM GPUGRID Started upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_3 1/31/2011 10:22:21 PM GPUGRID [error] Error reported by file upload server: can't open file 1/31/2011 10:22:21 PM GPUGRID [error] Error reported by file upload server: can't open file 1/31/2011 10:22:21 PM GPUGRID Temporarily failed upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_2: transient upload error 1/31/2011 10:22:21 PM GPUGRID Backing off 36 min 21 sec on upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_2 1/31/2011 10:22:21 PM GPUGRID Temporarily failed upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_3: transient upload error 1/31/2011 10:22:21 PM GPUGRID Backing off 3 min 26 sec on upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_3 1/31/2011 10:23:22 PM GPUGRID Started upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_7 1/31/2011 10:23:22 PM GPUGRID Started upload of p40-IBUCH_1_wtEGFR_110121-7-20-RND6307_0_0 1/31/2011 10:23:23 PM GPUGRID [error] Error reported by file upload server: can't open file 1/31/2011 10:23:23 PM GPUGRID Temporarily failed upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_7: transient upload error 1/31/2011 10:23:23 PM GPUGRID Backing off 1 min 0 sec on upload of p18-IBUCH_4_mutEGFR_110124-6-20-RND8641_1_7 1/31/2011 10:23:28 PM GPUGRID [error] Error reported by file upload server: can't open file 1/31/2011 10:23:28 PM GPUGRID Temporarily failed upload of p40-IBUCH_1_wtEGFR_110121-7-20-RND6307_0_0: transient upload error 1/31/2011 10:23:28 PM GPUGRID Backing off 1 min 0 sec on upload of p40-IBUCH_1_wtEGFR_110121-7-20-RND6307_0_0 12 Teraflops going to sleep..... |
![]() Send message Joined: 14 Mar 07 Posts: 1958 Credit: 629,356 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
Fixed. gdf |
![]() Send message Joined: 23 Apr 09 Posts: 3968 Credit: 1,995,359,260 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Thanks, and just to confirm I have downloaded new tasks and uploaded finished work. Most tasks should report back automatically in a reasonable time, but if you don't have work do a manual update. |
![]() ![]() Send message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I did a manual update, all 4 tasks were uploaded successfully two of them were successfully reported. I received two new WUs. But the two other WU cannot be reported, and I'm still receiving: 2011.01.31. 23:34:38 GPUGRID Message from server: Server can't open database |
![]() ![]() Send message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
It's ok now. The remaining two tasks were successfully reported, and I received two more WUs. |
![]() ![]() Send message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
But the error message persists: 2011.01.31. 23:45:49 GPUGRID Message from server: Server can't open database I don't understand this error message, everything seems to be working fine. |
©2025 Universitat Pompeu Fabra