New CUDA4.2 applications are out for Kepler GPUs

Message boards : News : New CUDA4.2 applications are out for Kepler GPUs
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Profile ritterm
Avatar

Send message
Joined: 31 Jul 09
Posts: 88
Credit: 244,413,897
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25941 - Posted: 27 Jun 2012, 20:07:32 UTC

Do the 4.2 tasks need/make use of as much of a CPU as the 3.1 tasks? I've been watching a 4.2 task run on my GTX 570 and noticed that the CPU utilization by the core I've set aside for GPU tasks is much lower than before. It looks like it uses significantly less RAM, too.
ID: 25941 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Simba123

Send message
Joined: 5 Dec 11
Posts: 147
Credit: 69,970,684
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 25946 - Posted: 27 Jun 2012, 23:10:56 UTC

@ Wiyosaya: my 460 which is clocked at 880 normally gets poala tasks in around 20 hours. It's a bit of a golden card though, they normally don't clock that high.

if the current trend of around a 50% reduction in runtime holds true, It should have no problem getting Poala tasks in on time.

@ ritterm: funny, I've noticed that while, according to my Boinc manager it requires around the same amount of CPU time (.42 cores) when I run, I actually have to free up 2 cores to get the time-to-completion counter to move. If I don't it just sits at the same time until about 50% is completed and then starts counting down > increases the runtime by about 25% over only having 1 core free.

I have 8 cores/threads and used to run 7 WCG threads and 1 GPUGrid, now have to run 6 WCG threads and 2 GpuGrid. The load on the CPU drops accordingly, so it seems it might be a scheduling issue with the faster GPU app fighting for CPU cycles and needs the lower load to run at optimum speed.


My question: What is driving this HUGE increase in speed [decrease in runtime]. Is it solely due to CUDA 4.2, or have the programmers achieved an amazing increase in the efficiency of the 4.2 app?

ID: 25946 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Betting Slip

Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25960 - Posted: 28 Jun 2012, 11:23:41 UTC

Nathans jobs run well as long as I don't use PC for anything else. They are crippling. Remote computer not so good but have no access to it as yet to make adjustments.

NATHAN get your jobs sorted out nobody else's cause the computer to become useless.
Radio Caroline, the world's most famous offshore pirate radio station.
Great music since April 1964. Support Radio Caroline Team -
Radio Caroline
ID: 25960 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Simba123

Send message
Joined: 5 Dec 11
Posts: 147
Credit: 69,970,684
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 25963 - Posted: 28 Jun 2012, 12:01:09 UTC - in response to Message 25960.  

Nathans jobs run well as long as I don't use PC for anything else. They are crippling. Remote computer not so good but have no access to it as yet to make adjustments.

NATHAN get your jobs sorted out nobody else's cause the computer to become useless.



Strange, Nathan tasks don't have that effect on my PC. Even the new 4.2 tasks that run 95-99% GPU utilization.

even when I'm running 7 threads of WCG and 1 thread for GPUGrid with a Nathan task I have no problems.
A slight lag when changing programs/screens etc, but nothing else. I can sort that slight issue out by freeing up another CPU core.

What's your CPU load like?
maybe you need to look at that....
ID: 25963 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
5pot

Send message
Joined: 8 Mar 12
Posts: 411
Credit: 2,083,882,218
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25964 - Posted: 28 Jun 2012, 13:31:50 UTC

I have no slowdown with mine either. All 4.2 tasks on W7 are using a minimum of 95 GPU Aw well.
ID: 25964 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Betting Slip

Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25970 - Posted: 28 Jun 2012, 14:02:42 UTC - in response to Message 25963.  
Last modified: 28 Jun 2012, 14:03:58 UTC

Graphics card is only GTX460

Runs everything fine apart from Nathans so he must be doing something different.

or it could be my PCIE 1.1 slot
Radio Caroline, the world's most famous offshore pirate radio station.
Great music since April 1964. Support Radio Caroline Team -
Radio Caroline
ID: 25970 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Simba123

Send message
Joined: 5 Dec 11
Posts: 147
Credit: 69,970,684
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 26020 - Posted: 29 Jun 2012, 22:23:12 UTC - in response to Message 25970.  

Graphics card is only GTX460

Runs everything fine apart from Nathans so he must be doing something different.

or it could be my PCIE 1.1 slot



I run a 460 as well as a 560, no difference in functionality of the computer when either or both of them are running NATE tasks.

Why do you have it in a 1.1 slot??????
Is this card also your primary graphics card?
It could be that you are running out of PCIE bandwidth.

ID: 26020 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Betting Slip

Send message
Joined: 5 Jan 09
Posts: 670
Credit: 2,498,095,550
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26027 - Posted: 30 Jun 2012, 8:15:30 UTC - in response to Message 26020.  

It's a 16 X slot just MB is a few years old now and I don't have the option of PCIE2
Radio Caroline, the world's most famous offshore pirate radio station.
Great music since April 1964. Support Radio Caroline Team -
Radio Caroline
ID: 26027 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26029 - Posted: 30 Jun 2012, 9:53:14 UTC - in response to Message 26027.  
Last modified: 30 Jun 2012, 10:00:19 UTC

Might be due to the high amount of memory required to run these tasks and W7; I'm seeing ~990MB in use. I have a GTX 470 (1279MB), so I have some headroom. However W7 eats some GPU memory leaving you short of 1024MB. Possibly too short. That said Boinc reports 1023MB (maybe another rounding error in the driver), and might not be true anyway (W7 is probably using more, ~60 to 90MB I think).
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 26029 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Paul Raney

Send message
Joined: 26 Dec 10
Posts: 115
Credit: 416,576,946
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 26031 - Posted: 30 Jun 2012, 10:47:17 UTC - in response to Message 26029.  

I just check the GPUGrid computing preferences and noticed we still only have 3 queues, long, short and beta. It would be good to find a way to get 4.2 tasks exclusively to machines with the correct mix of hardware and drivers.

All of my cards are now optimized for 4.2 work units so when 3.1 work units hit my systems, they run a bit slower than in the past.

If the problem is a bug on the server, do we have an estimated time to repair?

So far it looks like we don't have a solution to the mixed work unit issue. Aborting 3.1 work units usually works but recently I aborted 2 of the 3.1 work units and received 2 more 3.1 work units.
Thx - Paul

Note: Please don't use driver version 295 or 296! Recommended versions are 266 - 285.
ID: 26031 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Simba123

Send message
Joined: 5 Dec 11
Posts: 147
Credit: 69,970,684
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 26043 - Posted: 30 Jun 2012, 14:20:17 UTC - in response to Message 26031.  

I just check the GPUGrid computing preferences and noticed we still only have 3 queues, long, short and beta. It would be good to find a way to get 4.2 tasks exclusively to machines with the correct mix of hardware and drivers.

All of my cards are now optimized for 4.2 work units so when 3.1 work units hit my systems, they run a bit slower than in the past.

If the problem is a bug on the server, do we have an estimated time to repair?

So far it looks like we don't have a solution to the mixed work unit issue. Aborting 3.1 work units usually works but recently I aborted 2 of the 3.1 work units and received 2 more 3.1 work units.



I imagine that most/all new tasks will be coded in 4.2.

Just have to run the 3.1 hoppers dry. (I hope this is the case anyway. Seems fairly pointless to code in 3.1 now that 4.2 is here and ~40% more efficient)
ID: 26043 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26050 - Posted: 30 Jun 2012, 20:55:24 UTC - in response to Message 26043.  
Last modified: 30 Jun 2012, 20:56:04 UTC

There might be a lot of non-4.2 capable drivers in use.

While I doubt there would be any issues, I suppose it's good science to compare cuda3.1 with cuda4.2 runs.

If a solution has to come in the form of a bespoke server patch, who knows how long it will take?

For now we could use the 3.1 to 4.2 workaround
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 26050 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile robertmiles

Send message
Joined: 16 Apr 09
Posts: 503
Credit: 769,991,668
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26053 - Posted: 30 Jun 2012, 23:33:36 UTC - in response to Message 25867.  

i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage.

Is there any way to make it less aggressive? like 90-95% usage or am I doing something wrong?


99% usage of what? If it's a CPU core, try telling BOINC to leave one CPU core free for programs other than BOINC workunits.

If it's the GPU, I haven't found a usable method yet.
ID: 26053 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tom Philippart

Send message
Joined: 12 Feb 09
Posts: 57
Credit: 23,376,686
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 26079 - Posted: 1 Jul 2012, 18:59:25 UTC - in response to Message 26053.  

i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage.

Is there any way to make it less aggressive? like 90-95% usage or am I doing something wrong?


99% usage of what? If it's a CPU core, try telling BOINC to leave one CPU core free for programs other than BOINC workunits.

If it's the GPU, I haven't found a usable method yet.


it's the gpu usage
ID: 26079 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile SMTB1963
Avatar

Send message
Joined: 27 Jun 10
Posts: 38
Credit: 524,420,921
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 26080 - Posted: 1 Jul 2012, 20:37:55 UTC - in response to Message 25867.  

i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage.

Is there any way to make it less aggressive? like 90-95% usage or am I doing something wrong?


Same thing's happening on the GTX275 in my wife's machine. I've simply unchecked "Use GPU while computer is in use" until I get around to upgrading the card.
ID: 26080 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
oscark

Send message
Joined: 3 Nov 11
Posts: 4
Credit: 460,884,503
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26089 - Posted: 2 Jul 2012, 11:27:35 UTC

Can I receive only cuda 4.2 ?
ID: 26089 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26095 - Posted: 2 Jul 2012, 11:58:42 UTC - in response to Message 26089.  
Last modified: 2 Jul 2012, 22:31:19 UTC

Not unless you use the 3.1 to 4.2 workaround
- Actually you will still get 3.1 tasks but they will run just as fast as 4.2, so it's a fix.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 26095 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
oscark

Send message
Joined: 3 Nov 11
Posts: 4
Credit: 460,884,503
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26108 - Posted: 2 Jul 2012, 20:46:06 UTC - in response to Message 26095.  

thanks
ID: 26108 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
JLConawayII

Send message
Joined: 31 May 10
Posts: 48
Credit: 28,893,779
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 26118 - Posted: 3 Jul 2012, 4:35:57 UTC

The 4.2 units actually run slower on my old GTX 260, and additionally the newer drivers are still causing threadsafe exit downclocks with other projects. I guess it may finally be time to get some new hardware and relegate the old card to running einstein full time.
ID: 26118 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 26128 - Posted: 3 Jul 2012, 10:33:35 UTC - in response to Message 26118.  

For now, go back to an older driver (285).
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 26128 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : News : New CUDA4.2 applications are out for Kepler GPUs

©2025 Universitat Pompeu Fabra