Top hosts exceed 30,000+ RAC

Message boards : Graphics cards (GPUs) : Top hosts exceed 30,000+ RAC
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
J.D.

Send message
Joined: 2 Jan 09
Posts: 40
Credit: 16,762,688
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwat
Message 6046 - Posted: 27 Jan 2009, 2:45:25 UTC
Last modified: 27 Jan 2009, 2:47:15 UTC

I just noticed that now two machines exceed the 30,000 mark of recent average credit. (Top Hosts)

Anyone care to speculate when the first machine will exceed 40K and 50K of RAC?
:-)
ID: 6046 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matthew Lei
Avatar

Send message
Joined: 4 Dec 08
Posts: 7
Credit: 2,718,779
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 6047 - Posted: 27 Jan 2009, 4:48:58 UTC

Care to share the specs of your host?
ID: 6047 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
J.D.

Send message
Joined: 2 Jan 09
Posts: 40
Credit: 16,762,688
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwat
Message 6051 - Posted: 27 Jan 2009, 6:33:58 UTC - in response to Message 6047.  
Last modified: 27 Jan 2009, 6:35:26 UTC

It's a 64-bit Linux system with a total of 4 GT200 class CUDA devices, made possible due to the 2-in-1 GeForce GTX 295. The Phenom 9550 CPU cores are not so impressive as those of Core i7, but they always seem able to keep the GPUs satisfied. Actively running more than two CUDA devices required an upgrade from a 750 Watt to a 1000 Watt power supply, now a Zalman ZM1000-HP.

Meanwhile... anyone yet running an eight GPU quad GTX 295 rig? ;-)
ID: 6051 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
rapt0r

Send message
Joined: 4 Sep 08
Posts: 16
Credit: 9,366,617
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 6053 - Posted: 27 Jan 2009, 9:55:41 UTC - in response to Message 6051.  
Last modified: 27 Jan 2009, 9:57:29 UTC

Consider the Price of this AMD Phenom the delivered power is although impressive. Now you can upgrade to Phenom II and show me a Intel-System with such a plattform-compatibility.

Question: Do you have 2 CPU-Socket's?
ID: 6053 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 6071 - Posted: 27 Jan 2009, 23:46:09 UTC - in response to Message 6053.  

Sorry, but what are you talking about? This is about the GPUs.. you just need a board with 2 PCIe slots, 2 GTX 295 and preferrably 4 CPU cores, though on Linux less may do for 4 GPUs.

MrS
Scanning for our furry friends since Jan 2002
ID: 6071 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 6073 - Posted: 28 Jan 2009, 0:07:53 UTC - in response to Message 6071.  

Sorry, but what are you talking about? This is about the GPUs.. you just need a board with 2 PCIe slots, 2 GTX 295 and preferrably 4 CPU cores, though on Linux less may do for 4 GPUs.

MrS


With 6.62 you might not even need that much on windows ... I am seeing less than 1% average CPU with that application. At that CPU rate you could even run all 4 GPU cores on a single CPU system (if we ignore bus bandwidth and CPU I/O bandwidth issues) ...
ID: 6073 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile UL1

Send message
Joined: 16 Sep 07
Posts: 56
Credit: 35,013,195
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 6100 - Posted: 28 Jan 2009, 11:32:41 UTC - in response to Message 6046.  

J.D. wrote:
Anyone care to speculate when the first machine will exceed 40K and 50K of RAC? :-)

" If " the rig will crunch without producing computation errors and doesn't freeze I'd expect the 40K to be reached around the weekend... ;) (Knock on wood)
ID: 6100 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 6139 - Posted: 28 Jan 2009, 21:37:20 UTC - in response to Message 6100.  

Good luck mate! That would be almost half the output of my entire team.. :D

@Paul: yes, with 6.62 fewer cores may be perfectly fine. If they're not I wouldn't look for a problem with bandwidth (because that's in the realm of nanoseconds) but rather the 1 ms scheduler interval. If a single CPU core is busy serving GPU 1 and right now GPU 2, 3 and 4 also need an *whatever*, then they'll have to wait until serving GPU 1 is done and the scheduler grants them a time slice. Thus I choose the careful term "preferrably" ;)

MrS
Scanning for our furry friends since Jan 2002
ID: 6139 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Paul D. Buck

Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 6142 - Posted: 29 Jan 2009, 0:08:27 UTC - in response to Message 6139.  

Good luck mate! That would be almost half the output of my entire team.. :D

@Paul: yes, with 6.62 fewer cores may be perfectly fine. If they're not I wouldn't look for a problem with bandwidth (because that's in the realm of nanoseconds) but rather the 1 ms scheduler interval. If a single CPU core is busy serving GPU 1 and right now GPU 2, 3 and 4 also need an *whatever*, then they'll have to wait until serving GPU 1 is done and the scheduler grants them a time slice. Thus I choose the careful term "preferrably" ;)

MrS


Um, well that is what I would class as CPU I/O bandwidth, because the CPU has only the one channel to service the interrupts ... a rose by any other name ...

But even a multi-core system still has potential issues with bandwidth for the same reason unless the MB has distinct and separate channels for each GPU to be serviced. Then we can get into the same issue with the dual, and soon to come quad, core systems where there is one I/O channel for each card and the two/four GPU cores are contending for service at the same time.

This is an issue that has dogged PCs for, like, forever ... though the CPUs we use have more power than the CPUs of mainframes of yore the I/O is simply not really there ... they are getting there, slowly ... but, some of those old systems were masters at I/O ...

In any case, my opinion we are in violent agreement ...
ID: 6142 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
J.D.

Send message
Joined: 2 Jan 09
Posts: 40
Credit: 16,762,688
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwat
Message 6213 - Posted: 30 Jan 2009, 12:53:27 UTC - in response to Message 6100.  

" If " the rig will crunch without producing computation errors and doesn't freeze I'd expect the 40K to be reached around the weekend... ;) (Knock on wood)


40K!
Even sooner than the weekend. :-)
ID: 6213 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile UL1

Send message
Joined: 16 Sep 07
Posts: 56
Credit: 35,013,195
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 6214 - Posted: 30 Jan 2009, 15:24:10 UTC - in response to Message 6213.  

I was pleasantly surprised too...especially because the rig had freezes and produced computation errors...
My next estimate would be 50 K on next wednesday... ;)
ID: 6214 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 6257 - Posted: 31 Jan 2009, 16:56:01 UTC - in response to Message 6142.  

Um, well that is what I would class as CPU I/O bandwidth, because the CPU has only the one channel to service the interrupts ... a rose by any other name ...


I'm still not convinced.

Is there an interrupt at all? I don't know about the new method, but as far as I understand the polling is not an interrupt, it's just a normal task switch, which the scheduler would have done anyway.

The way I see it: a single core executes only one thread at a time. Thus when multiple GPUs need work all except one are blocked.. no matter how much I/O bandwidth you give that cpu, it couldn't execute the other threads at the same time. If you have multiple CPUs (be it physical ones, more cores or logical ones via multithreading) then each of them can process one thread at the same time and, with otherwise perfect software, lags / breaks could be avoided. What I need is the ability to execute several threads at once, not I/O bandwidth.

So.. I'm not sure if we're talking about the same thing ;)

MrS
Scanning for our furry friends since Jan 2002
ID: 6257 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Edboard
Avatar

Send message
Joined: 24 Sep 08
Posts: 72
Credit: 12,410,275
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 6307 - Posted: 2 Feb 2009, 14:41:36 UTC

Four GPUs and 4 CPU cores means one gpugrid WU/core and so, no WU in cache. I have a 2 cores CPU and a gtx295 (2 GPUs) and I can not get that the Boinc scheduler feeds them without my personal intervention (which, e.g., is impossible if I'm sleeping) (Boinc 6.6.3)
ID: 6307 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Phoneman1

Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 6311 - Posted: 2 Feb 2009, 15:38:25 UTC - in response to Message 6307.  

Four GPUs and 4 CPU cores means one gpugrid WU/core and so, no WU in cache. I have a 2 cores CPU and a gtx295 (2 GPUs) and I can not get that the Boinc scheduler feeds them without my personal intervention (which, e.g., is impossible if I'm sleeping) (Boinc 6.6.3)


As mentioned in another thread recently 6.6.3 has a problem with uninitialized variables. Sooner or later, it won't get GPU work reliably.

Boinc version 6.5.0 seems to cause the least trouble; get it from here.

Phoneman1
ID: 6311 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile UL1

Send message
Joined: 16 Sep 07
Posts: 56
Credit: 35,013,195
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 6435 - Posted: 5 Feb 2009, 14:09:54 UTC

Couldn't keep my promise to reach 50K just in time (yesterday)...
...and am wondering if anyone else had some "ghost WUs"...?

Explanation: during the last two days I had WUs, that could only be seen by the BOINC-manager, but not in the web sites task list. So I had eight WUs to crunch whilst in the task list there could only be seen five or six as "in progress". Could have been acceptable...if these WUs would have been listed after they had finished and were submitted...but they seem to have vanished in the Lost-WU-Nirvana...an unnecessary loss of time and credits...kind of annoying...
ID: 6435 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Alain Maes

Send message
Joined: 8 Sep 08
Posts: 63
Credit: 1,696,957,181
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 6436 - Posted: 5 Feb 2009, 14:46:06 UTC - in response to Message 6435.  

Yes, I thought also something like that was happening.

Further investigation learned me that these WU were on the web page task list, but then on page two or even three. So just try "next" on top of the web page to see your next 20 WUs and so on. That is were you will find them.

Kind regards

Alain
ID: 6436 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile UL1

Send message
Joined: 16 Sep 07
Posts: 56
Credit: 35,013,195
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 6439 - Posted: 5 Feb 2009, 15:40:26 UTC

When I saw that there were less than the usual eight tasks "in progress" I checked the previous task-sides, but with no success: I couldn't find any new ones. Also, after submitting these "ghosts" neither the 'avg. cr' nor the 'tot. cr' for this host did change...
ID: 6439 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Phoneman1

Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 6440 - Posted: 5 Feb 2009, 16:26:06 UTC - in response to Message 6439.  
Last modified: 5 Feb 2009, 16:26:40 UTC

Ul1, your list of computers shows 4 x i7s but 3 have not contacted the server this month. Those 3 also have a number of work units marked no reply. I wonder if the missing work units are to be found on these i7s??

Did you change your email details on this project or make some other change?? If so it might be worth merging those computers with the same name within this project.

Phoneman1
ID: 6440 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile UL1

Send message
Joined: 16 Sep 07
Posts: 56
Credit: 35,013,195
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwat
Message 6449 - Posted: 5 Feb 2009, 19:27:38 UTC - in response to Message 6440.  

As you mentioned: these rigs haven't done anything for the project this month...but the days I was dealing with the 'ghosts' were late monday and the whole tuesday...

And no: I didn't change anything...and these rigs will be back here as soon as they have cleaned their cache over at SETI...
ID: 6449 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
J.D.

Send message
Joined: 2 Jan 09
Posts: 40
Credit: 16,762,688
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwat
Message 6738 - Posted: 17 Feb 2009, 23:53:05 UTC - in response to Message 6435.  

Couldn't keep my promise to reach 50K just in time (yesterday)...


Woo! 50K!
Here too!
Okay, so my machine took 12 days longer, but still. :-)

Meanwhile, the stats haven't yet shown a machine over 60K...
ID: 6738 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Graphics cards (GPUs) : Top hosts exceed 30,000+ RAC

©2025 Universitat Pompeu Fabra