No work? Fixed a bug in the scheduler

Message boards : Graphics cards (GPUs) : No work? Fixed a bug in the scheduler
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4736 - Posted: 22 Dec 2008, 17:24:06 UTC

@Mike: sorry, don't know much about how to handle this in Linux. Could it have anything to do with your distro being rather *old*?

@Pat: the different WU sizes are not causing these problems. And when you close and restart BOINC manager, do you also shut down BOINC? Or just the manager? Seems like you're on of the few who can still reproduce the issue on a regular basis, so you may have the chance to find the cause ;)

MrS
Scanning for our furry friends since Jan 2002
ID: 4736 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
frankhagen

Send message
Joined: 18 Sep 08
Posts: 65
Credit: 3,037,414
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 4752 - Posted: 22 Dec 2008, 19:36:21 UTC - in response to Message 4734.  

Frank, have you tried the 180.84 driver yet? It seems to have solved all my 64bit issues. It's only been a day, but no problems since :)
Fish


180.84 has been running fine for some days - in the middle of PG-challenge boinc suddenly wasn't able to get fresh work from GPU.
ID: 4752 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mike047

Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4756 - Posted: 22 Dec 2008, 20:17:54 UTC - in response to Message 4736.  
Last modified: 22 Dec 2008, 20:22:57 UTC

@Mike: sorry, don't know much about how to handle this in Linux. Could it have anything to do with your distro being rather *old*?

@Pat: the different WU sizes are not causing these problems. And when you close and restart BOINC manager, do you also shut down BOINC? Or just the manager? Seems like you're on of the few who can still reproduce the issue on a regular basis, so you may have the chance to find the cause ;)

MrS


6.04 is a typing mistake by me it is actually;

Ubuntu 8.04LTS isn't outdated, there is another release 8.10. Both are reliable os's.

There is some other issue, because yesterday it worked for a couple of hours.

I guess I will put this project on the back burner for awhile. Good science but difficult [for me] to set up and run with out babysitting it.

mike
ID: 4756 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4759 - Posted: 22 Dec 2008, 21:55:50 UTC

You're right, 8.04 is surely not causing this problem.

MrS
Scanning for our furry friends since Jan 2002
ID: 4759 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Nognlite

Send message
Joined: 9 Nov 08
Posts: 69
Credit: 25,106,923
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwat
Message 4762 - Posted: 22 Dec 2008, 22:06:30 UTC - in response to Message 4736.  
Last modified: 22 Dec 2008, 22:08:57 UTC

MrS:

Seems that both rigs are responding properly (knock on wood) but will keep an eye on it.

When I shut BOINCmgr down I also selected "Stop running science applications when exiting the Manager".

Pat
ID: 4762 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4770 - Posted: 22 Dec 2008, 23:24:43 UTC - in response to Message 4756.  

Mike,
do you have installed boinc for Linux x86_64 or 32 bit?

gdf
ID: 4770 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mike047

Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4782 - Posted: 23 Dec 2008, 9:34:40 UTC - in response to Message 4770.  
Last modified: 23 Dec 2008, 9:42:12 UTC

64 bit with ia32-libs

mike

edit; I just tried again and got 2 work units and it is working one. I did not change anything.....let's see how long it will work.
ID: 4782 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4810 - Posted: 23 Dec 2008, 21:19:41 UTC - in response to Message 4782.  

edit; I just tried again and got 2 work units and it is working one. I did not change anything.....let's see how long it will work.

I just got up a little bit ago and my machine had completed one task and I reported it in and got another. Just like it is supposed to work ... :)

Now, was it a one time miracle or will it repeat?

About half way through the next task and I have two in queue ... so, theory says I should be keeping about that many locally ...

Fingers crossed ...
ID: 4810 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile mike047

Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4820 - Posted: 24 Dec 2008, 8:52:17 UTC

My two units will run about 8 hours and then go to "waiting on memory". I re boot and then it will pick one back up and crunch????? When it goes to waiting on memory, it does a constant write to hard drive. I will try 180.06 drivers later today or after it finishes these units and see if it will behave:)

mike
ID: 4820 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile koschi
Avatar

Send message
Joined: 14 Aug 08
Posts: 127
Credit: 913,858,161
RAC: 13
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 4824 - Posted: 24 Dec 2008, 12:31:24 UTC

Sounds like the memory leak in the linux app. See also:
http://www.gpugrid.net/forum_thread.php?id=571
ID: 4824 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Donnie

Send message
Joined: 13 Nov 08
Posts: 11
Credit: 11,185,470
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 4829 - Posted: 24 Dec 2008, 17:49:42 UTC - in response to Message 4692.  

It's back!!!

12/24/2008 11:47:25 AM|GPUGRID|Sending scheduler request: Requested by user. Requesting 387007 seconds of work, reporting 0 completed tasks
12/24/2008 11:47:30 AM|GPUGRID|Scheduler request completed: got 0 new tasks
12/24/2008 11:47:30 AM|GPUGRID|Message from server: No work sent
12/24/2008 11:47:30 AM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/24/2008 11:47:30 AM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
ID: 4829 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4831 - Posted: 24 Dec 2008, 19:20:50 UTC - in response to Message 4829.  

Simply server was out of work.

More workunits now and many more in the next few days.

gdf
ID: 4831 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Donnie

Send message
Joined: 13 Nov 08
Posts: 11
Credit: 11,185,470
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 4834 - Posted: 24 Dec 2008, 23:40:28 UTC - in response to Message 4831.  

My bad!!! Thanks GDF!!! I guess the next time I cry wolf, I'll check the server first. Thanks again to you and all of your staff (if any) for all of your hard & dedicated work to correct these problems and listening to all of us complain.
ID: 4834 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4836 - Posted: 25 Dec 2008, 0:03:07 UTC

GDF,

A little cheer maybe. The one machine I am running at the moment seemed to have auto-magically obtained another task on the 24th at about 1400 and my last 2.56 task is nearing completion. So ... encouraging news and I am getting tempted to fire up the other machine and let it rip! :)

But, it is looking like, at least for me, that I am in the "normal" group now (about the only thing normal about me).
ID: 4836 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Graphics cards (GPUs) : No work? Fixed a bug in the scheduler

©2025 Universitat Pompeu Fabra