Cuda 4.2 WUs seem much faster than the old 3.1

Message boards : Graphics cards (GPUs) : Cuda 4.2 WUs seem much faster than the old 3.1
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
Paul Raney

Send message
Joined: 26 Dec 10
Posts: 115
Credit: 416,576,946
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 25801 - Posted: 21 Jun 2012, 11:45:41 UTC - in response to Message 25780.  

Should we migrate our Fermi cards to Cuda 4.2 compliant drivers? What drivers are required? Do we have a new recommendation on the driver version to be used?

The 301.42 drivers work well on one of my systems. Is it recommended that I upgrade all of my systems?

Thx - Paul

Note: Please don't use driver version 295 or 296! Recommended versions are 266 - 285.
ID: 25801 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 25802 - Posted: 21 Jun 2012, 12:57:20 UTC - in response to Message 25801.  

Should we migrate our Fermi cards to Cuda 4.2 compliant drivers? What drivers are required? Do we have a new recommendation on the driver version to be used?


Yes, if you can. The 4.2 app is fast. The latest drivers (301 series) are fine.

Matt


ID: 25802 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,801,881,008
RAC: 50,765
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25803 - Posted: 21 Jun 2012, 13:24:19 UTC

No, I had problems with my GTX570 SC card. More from the office.
ID: 25803 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25808 - Posted: 21 Jun 2012, 19:34:04 UTC - in response to Message 25804.  

Does the card feature a substantial factory-OC and is failing some tasks with 4.2? It has been observed that the new app is somewhat less tolerant to overclocking, or demands higherstability - however you want to look at it.

MrS
Scanning for our furry friends since Jan 2002
ID: 25808 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
5pot

Send message
Joined: 8 Mar 12
Posts: 411
Credit: 2,083,882,218
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25812 - Posted: 21 Jun 2012, 21:51:43 UTC

MJH:

No doubt it's fast. First time I've ever seen that short queue get depleted so quickly ;)

Seems like the hopper needs more added in quite often now...


As a side note, any idea one when we can finally start taking a crack at those long queue tasks. Seems like, from what I read in the forums, that everything seems to being a-ok!!

Cheers

PS, Yours seem to have the greatest speed difference gain. Congrats!!
ID: 25812 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,801,881,008
RAC: 50,765
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25813 - Posted: 21 Jun 2012, 23:17:02 UTC - in response to Message 25803.  

No, I had problems with my GTX570 SC card. More from the office.


Sorry I have not written during the day.

I transferred my GTX 570 SC to another computer to make room for my GTX670 SC in my most potent PC. At the same time I did up-date drivers on both computers to 301.42. All short work units Cuda 42 work well on both systems. However when I have attached the computer with the GTX 570 SC card to the long work queue, this long work unites (Cuda 31) failed. Means the computer shuts down randomly. I was able to restart computer as well as continuo the work unites, but the computer shuts down again after half an hour or so.

The GTX 570 SC card worked very well short and long work units with Driver 275.33 on the computer, which has actually the GTX 670.

The GTX 570 SC card works actually well with Primegrid Genefer (cuda32_13) and PPS (Sieve) (cuda23), as well with Einstein (BRP4cuda32nv301), so it is not the Cuda version.

The video cards are factory overclocked. I have only modified the fan speed, so the temperature of the cards is below 70º C.
ID: 25813 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
5pot

Send message
Joined: 8 Mar 12
Posts: 411
Credit: 2,083,882,218
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25814 - Posted: 22 Jun 2012, 0:29:38 UTC

I only see two workunits which have failed with energies have become nan. Maybe increase the voltage slightly and see if this helps stability? Or of energy usage is a concern you could try underclocking it as well.
ID: 25814 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Paul Raney

Send message
Joined: 26 Dec 10
Posts: 115
Credit: 416,576,946
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 25817 - Posted: 22 Jun 2012, 3:31:18 UTC - in response to Message 25815.  

I recently had some issues with work units causing the computer to hang or freeze. The only way to clear the condition was a reset of the computer (or power cycle). It happened a couple of times to one of my systems that never has issues. I did not change drivers or any of the OC settings on the system. I assumed it was a couple of bad work units.

Try the 570 SC with the new drivers again. I bet you will find it was bad work units and nothing to do with the driver.

Best of Luck!
Thx - Paul

Note: Please don't use driver version 295 or 296! Recommended versions are 266 - 285.
ID: 25817 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile dskagcommunity
Avatar

Send message
Joined: 28 Apr 11
Posts: 463
Credit: 979,266,958
RAC: 76,910
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25819 - Posted: 22 Jun 2012, 11:00:52 UTC

Paul i had/have that too with some nathans rps ( perhaps it was nathan too in your case??) at the time, some units freeze the machine in a special way. Desktop with no icons but windows is running. Often @ the beginning of a new computation while he uploads the finished wu. Good to hear they are the wus perhaps but itcomputes to finishing line after reboot. Strange. Changed fan to manual to bring it from 90 to 80 degrees. Will see how it works over weekend bacause this is litle annoing on a unaatended machine.
DSKAG Austria: http://www.dskag.at

ID: 25819 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25820 - Posted: 22 Jun 2012, 13:10:52 UTC - in response to Message 25817.  
Last modified: 22 Jun 2012, 13:20:06 UTC

I recently had some issues with work units causing the computer to hang or freeze. The only way to clear the condition was a reset of the computer (or power cycle). It happened a couple of times to one of my systems that never has issues. I did not change drivers or any of the OC settings on the system. I assumed it was a couple of bad work units.

Try the 570 SC with the new drivers again. I bet you will find it was bad work units and nothing to do with the driver.

Best of Luck!


Probably haven't made the apps threadsafe which is why you need to reboot the machine. I know Richard asked the same question back in March but never got a reply. See this message

Edit:
Found the original message with code snippets here

Maybe GDF or one of the other developers can confirm Acemd 6.16 (cuda42) does use the above logic (ie its threadsafe)
BOINC blog
ID: 25820 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Paul Raney

Send message
Joined: 26 Dec 10
Posts: 115
Credit: 416,576,946
RAC: 0
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 25821 - Posted: 22 Jun 2012, 13:11:46 UTC - in response to Message 25819.  

Wow - 90 is hot! I flashed my EVGA GTX cards with the newest BIOS and it unlocks the fans to 100%. I have them set at 90% at 70C and my 580s are down to 72C - 74C

This summer is going to be a challenge to keep these systems cool.

Dry Ice could be a solution!

BTW - is dry ice safe for the computers?
Thx - Paul

Note: Please don't use driver version 295 or 296! Recommended versions are 266 - 285.
ID: 25821 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,801,881,008
RAC: 50,765
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25823 - Posted: 22 Jun 2012, 15:38:08 UTC - in response to Message 25814.  

I only see two workunits which have failed with energies have become nan. Maybe increase the voltage slightly and see if this helps stability? Or of energy usage is a concern you could try underclocking it as well.

As I have said, the problems are only with the cuda 31 app, not with cuda 42. In the night the computer failed with two cuda 31 apps. The strange thing: When I restarted the computer again, blue screen with warning message and the GPUGRID project was detached afterwards, so I attached anew and got one new work unit with cuda 42. Hope this works out just fine. The other two are just lost, and won´t be finished ever.

Side note: I am running malariacontrol.net at all computers at the moment and this writes heavily on hard disks, so this might also cause the observed problems. Also my co-workers experience problems with this project, whereas with climateprediction,net - my normal CPU load - I normally do not have problems.
ID: 25823 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25826 - Posted: 22 Jun 2012, 15:55:09 UTC - in response to Message 25823.  
Last modified: 22 Jun 2012, 16:02:09 UTC

The CPU tasks may be the source of some of your problems.

On your 6core all the failures were 'Aborted by user'.
On your GTX670 system, one nan failure might have been as a result of the CPU projects, the other failure happened because you started running the task on a GTX570 and then continued to run it on a GTX670.
The 9800GT error was the old cufft issue, common with CC1.1 cards.

How many CPU cores are you using to crunch CPU tasks? I know you have a 6core, an 8core and an 8thread system.
Generally it's more stable to use one less CPU core/thread than the total. This also tends to expedite the GPU tasks and even the other CPU tasks slightly, so you gain stability and GPU task performance at a very limited CPU expense.
It's better to run a mix of CPU task types, for lots of reasons.
It's advisable to increase you write to disk times on anything more than a dual core. I use 600 to 900sec.
It's also beneficial to use a second drive for the Boinc data directory, where possible.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 25826 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,801,881,008
RAC: 50,765
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25828 - Posted: 22 Jun 2012, 16:25:01 UTC - in response to Message 25826.  

Yes it might be that the CPU task cause the problems. Unfortunately my computer knowledge is very limited, so what goes behind Boinc Manager, will have to be explained step by step. So I will be able to try it on my own, which I would be willing to do.

The GTX670 on the 8-core does not worry me that much, because it is true, I changed the GPU, and the other might happen ones in a while.

The 9800GT will not crunch GPUGRID anymore as intend to use only GTX570 and GTX670 for this project, lower end cards I will use for Primegrid, as these do not work very well with GPUGRID.

Which really worries me is the GTX570 on the 6core computer.

All CPU cores are crunching CPU projects, I have seen that you disable one core for the GPU with Svan_sync (I think), which I am willing to do but I do not see how. However with climateprediction.net (my main focus) I never had any issues with using GPUs, as well as write to disk times.

That said I normally running climteprediction.net on all computers, malariacontrol.net is only my “back-up” project, if the first runs out of work.

So if you are willing to guide me, you are welcome!

8core, has solid state disk and normal hard disk – but if I remember correctly I am writing boinc to solid state disk as I assumed this will improve speed. 6core has a small solid state disk for the programs as all data is loaded to an external disk.
ID: 25828 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25829 - Posted: 22 Jun 2012, 16:29:46 UTC - in response to Message 25828.  
Last modified: 22 Jun 2012, 16:36:30 UTC

When you reattached to the project you lost any GPU tasks that were running. These must just appear as 'Aborted by user'.
Did you reattach twice?

I suggest you configure Boinc to use one less CPU core on your 6core system, and write to disk less frequently:
Boinc Manager (Advanced View),
Tools Tab,
Computing Preferences,
Processor Usage Tab,
On multiprocessor systems use at most 99% of the processors,
Disk and Memory Usage Tab,
Tasks Checkpoint to Disk at most every 900 seconds,
OK.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 25829 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,801,881,008
RAC: 50,765
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25832 - Posted: 22 Jun 2012, 17:08:36 UTC - in response to Message 25829.  

When you reattached to the project you lost any GPU tasks that were running. These must just appear as 'Aborted by user'.
Did you reattach twice?


No, I reattached once.
It is: Task 5522912 and Task 5522851. Both are still marked in Progress on GPUGRID Homepage. But are not mentioned any more in Boinc Manager.
ID: 25832 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
KING100N

Send message
Joined: 27 Mar 09
Posts: 2
Credit: 113,212,081
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 25844 - Posted: 24 Jun 2012, 8:28:19 UTC

Should i have something besides boinc 7.0.25 and 301.42 drivers to get cuda 4.2 tasks
ID: 25844 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Snow Crash

Send message
Joined: 4 Apr 09
Posts: 450
Credit: 539,316,349
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25845 - Posted: 24 Jun 2012, 9:35:00 UTC - in response to Message 25844.  

Currently the CUDA 4.2 app is only being used for the Standard WUs and it looks like you are only processing the Longs. Hopefully GDF will release it next week? for the Longs also.
Thanks - Steve
ID: 25845 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile dskagcommunity
Avatar

Send message
Joined: 28 Apr 11
Posts: 463
Credit: 979,266,958
RAC: 76,910
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25846 - Posted: 24 Jun 2012, 16:29:33 UTC - in response to Message 25844.  
Last modified: 24 Jun 2012, 16:30:12 UTC

Should i have something besides boinc 7.0.25 and 301.42 drivers to get cuda 4.2 tasks


I have the same problem ^^ Hope somebody tells us if we need to reattach or something ^^ (an nooo i tried short/standart units and it worked on a fermi card, only not on the 285gtx)

I must wait until a WU is finished AND im in the room where the machine is, then i will try a reattach, could need a few days :)
DSKAG Austria: http://www.dskag.at

ID: 25846 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [PUGLIA] kidkidkid3
Avatar

Send message
Joined: 23 Feb 11
Posts: 103
Credit: 1,628,751,957
RAC: 678,699
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 25849 - Posted: 25 Jun 2012, 10:27:15 UTC

Hi all,
my pc is running first longqueue WU with Cuda 4.2 (a Paola RND).
The expected time on GTS450 (with GF116) under windows xp will be 17 hours !
Here the result of Gpu-z
url]http://gpuz.techpowerup.com/12/06/25/3aa.png[/url]
Good work, great !
K.
Dreams do not always come true. But not because they are too big or impossible. Why did we stop believing.
(Martin Luther King)
ID: 25849 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : Graphics cards (GPUs) : Cuda 4.2 WUs seem much faster than the old 3.1

©2026 Universitat Pompeu Fabra