Welcome back Gpugrid
| Author | Message |
|---|---|
|
Joined: 5 Jan 09 Posts: 670 Credit: 2,498,095,550 RAC: 0
|
Nice to see you reconnected. |
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
My house is nice and chilly now, -22C outside |
|
Joined: 1 Jan 15 Posts: 1171 Credit: 12,662,148,501 RAC: 1,014,572
|
I am asking just out of curiosity: what was the reason for this lengthy outage? |
Logan Carr Joined: 12 Aug 15 Posts: 240 Credit: 64,069,811 RAC: 0
|
I checked the boinc project website last night and it said that Boinc was down. This is probably the reason gpugrid was down. Also, I don't know if anybody's said this but my upload was pending saying "project backoff" something like that. Hope this is helpful -Logan |
|
Joined: 16 Dec 10 Posts: 4 Credit: 19,812,500 RAC: 0
|
I can't seem to find any reason for the down time nor an apology for it. Maybe my GPU cycles are better spent on a project that monitors their systems over a weekend and has better up-time. Since they don't seem to look after their own systems, what would they care about my hard worked data? Or at least point me at the post that shows you care about us. |
Beyond Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0
|
> I checked the boinc project website last night and it said that Boinc was down. This is probably the reason gpugrid was down. Also, I don't know if anybody's said this but my upload was pending saying "project backoff" something like that. Hope this is helpful -Logan

Logan, the BOINC site being down has nothing to do with GPUGrid. Apparently the GPUGrid server crashed during the weekend and nobody noticed. It sure did create a crazy backlog of WUs trying to upload. :-( |
Logan Carr Joined: 12 Aug 15 Posts: 240 Credit: 64,069,811 RAC: 0
|
> I checked the boinc project website last night and it said that Boinc was down. This is probably the reason gpugrid was down. Also, I don't know if anybody's said this but my upload was pending saying "project backoff" something like that. Hope this is helpful -Logan

Ah alright. Thanks for letting me know! The timing must have been just right then, haha. My assumption is rather that the scientists might have taken the weekend off and maybe that's why it wasn't noticed right away. We all need to step away from our jobs sometimes, so maybe that's what they did. Either way, the website is back up and that's all that counts, right? Let's all try to think positively about these situations. Also, if the website is ever down for much longer, check GPUGrid's Twitter account. They post useful stuff there. I have no intention of lecturing, if it appears that way. I'm just trying to make positive vibes :)

Cruncher/Learner in progress. |
|
Joined: 28 Mar 09 Posts: 490 Credit: 11,850,145,728 RAC: 301,281
|
> I am asking just out of curiosity: what was the reason for this lengthy outage?

I would like to know as well. And somehow, I received 5 ghost units during this outage! |
|
Joined: 1 Jan 15 Posts: 1171 Credit: 12,662,148,501 RAC: 1,014,572
|
> My assumption is rather that the scientists might have taken the weekend off and maybe that's why it wasn't noticed right away.

No problem if the scientists themselves had taken the weekend off; they produced plenty of WUs during the last week anyway. However, I was a little surprised that there was not even one IT person on some kind of standby who would have noticed already on Saturday evening that there was a problem, one that got even worse by Sunday morning. |
|
Joined: 15 Oct 11 Posts: 17 Credit: 81,085,378 RAC: 0
|
> My assumption is rather that the scientists might have taken the weekend off and maybe that's why it wasn't noticed right away.

Was wondering the same... nobody checking on the server(s) for approx. 2 days... Did not get my 24 hr bonus because of this. I know, small potatoes... :) |
|
Joined: 5 Mar 13 Posts: 348 Credit: 0 RAC: 0 |
There was a server crash. Sometimes it can take us a day to notice if we are not currently actively monitoring everything. Sorry for any inconvenience caused by it. Maybe best send us a mail if it happens again. |
|
Joined: 22 Nov 09 Posts: 114 Credit: 589,114,683 RAC: 0
|
Thanks for the update. Unfortunately, the WU that I had gotten before the crash, and that had finished without error, uploaded but was not credited. That has happened before, but not that often, and these were extenuating circumstances, so I am not all that concerned. |
|
Joined: 9 May 13 Posts: 171 Credit: 4,739,796,466 RAC: 334,273
|
Stephan said,
Where do we send the email when the GPUGRID site is unavailable? |
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
I had two BNBS2 WUs run for 100k+ seconds and had a validation error, can anyone explain this? |
Retvari Zoltan Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0
|
> I had two BNBS2 WUs run for 100k+ seconds and had a validation error, can anyone explain this?

Perhaps your host had a power outage, and these GPUGrid tasks restarted from 0%. In such cases it is practical to abort the workunits manually, as there's no point in spending time and electricity crunching them. Here are two excerpts from the stderr.txt of your failed tasks:

1st:

    # GPU 2 : 73C
    # GPU [GeForce GTX 970] Platform [Windows] Rev [3212] VERSION [65]
    # SWAN Device 0 :
    # Name : GeForce GTX 970

2nd:

    # GPU 0 : 73C
    # GPU [GeForce GTX 690] Platform [Windows] Rev [3212] VERSION [65]
    # SWAN Device 1 :
    # Name : GeForce GTX 690

Note that there's no line explaining the reason for the exit from the application between the 1st and the 2nd line, which is usually the sign of a dirty shutdown. |
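The check described in this post can be roughly automated. What follows is a minimal Python sketch, not a GPUGrid or BOINC tool. It assumes, based only on the excerpts quoted above, that each application start prints a `# GPU [...] Platform [...] Rev [...] VERSION [...]` banner and that temperature readings look like `# GPU 2 : 73C`. The function name `find_abrupt_restarts` and the sample text are hypothetical; the sample is assembled from lines of the first excerpt and is not a real stderr.txt.

```python
import re

# Assumed pattern for the application start banner (taken from the quoted excerpts).
BANNER = re.compile(r"^# GPU \[.+\] Platform \[.+\] Rev \[\d+\] VERSION \[\d+\]")
# Assumed pattern for a periodic temperature reading such as "# GPU 2 : 73C".
TEMPERATURE = re.compile(r"^# GPU \d+ : \d+C$")


def find_abrupt_restarts(stderr_text):
    """Return 1-based line numbers of start banners directly preceded by a temperature line.

    A banner that follows a temperature reading, with no exit message in between,
    is treated as a hint that the task restarted after a dirty shutdown.
    """
    restarts = []
    previous = None  # last non-empty line seen so far
    for number, raw in enumerate(stderr_text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue
        if BANNER.match(line) and previous is not None and TEMPERATURE.match(previous):
            restarts.append(number)
        previous = line
    return restarts


# Made-up sample built from the lines of the first excerpt.
sample = """\
# GPU [GeForce GTX 970] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 970
# GPU 2 : 73C
# GPU [GeForce GTX 970] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 0 :
# Name : GeForce GTX 970
"""

print(find_abrupt_restarts(sample))
```

This is only a heuristic: a real stderr.txt may contain other lines between a temperature reading and the next banner, so a hit suggests a restart worth inspecting by hand rather than proving one.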
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
How did you get that information zoltan? I've been curious to see some of your WUs |
Retvari Zoltan Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0
|
> How did you get that information zoltan? I've been curious to see some of your WUs

Every host computer has a list of workunits. If you click on the ID of a finished task (or its name in the other view; it's the first column of the task list), you can see detailed information about that task. The second part of that page is the "stderr output", which is generated by the task while it is running. |
©2026 Universitat Pompeu Fabra