Nvidia 190.38, Cuda 2.3

Message boards : Graphics cards (GPUs) : Nvidia 190.38, Cuda 2.3
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
zpm
Avatar

Send message
Joined: 2 Mar 09
Posts: 159
Credit: 13,639,818
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 11519 - Posted: 30 Jul 2009, 19:54:51 UTC - in response to Message 11514.  
Last modified: 30 Jul 2009, 19:56:28 UTC

some of us do but that has be deduced to problems with the driver itself and posted on nvidia...

were talking about app errors wit the app and boinc...

and some these are related to the fact that gpugrid is still built upon 2.1 cuda which is in the upgrade department right now.
ID: 11519 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
OldChap

Send message
Joined: 5 Jul 09
Posts: 14
Credit: 28,617,272
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwat
Message 11525 - Posted: 30 Jul 2009, 22:18:49 UTC - in response to Message 11519.  

Thanks for that info..would you then be interested in the fact that rolling back to 182.08 then 185.85,186's and 190's they all display the same behaviour
Forcing Riva and locking 3D has no impact nor does 6.6.36 to 6.6.37.(this on vista 64 and 2* gtx260 (192's) Cuda 2.1.

Perhaps the most annoying thing for me though is that errors can occur from just a few seconds out to almost 100% complete.

I think I may just try with xp again.

Mike
ID: 11525 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mark Henderson

Send message
Joined: 21 Dec 08
Posts: 51
Credit: 26,320,167
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 11527 - Posted: 31 Jul 2009, 0:00:52 UTC - in response to Message 11504.  

It worked on mine until it downloaded a new task and it errored out on the download " Cuda.dll Unexpected size " or something like that.

Hello Admin,
Would it be possible to do as Seti and have it to where the 2 DLL files for Cuda 2.3 could be " dropped " into the GPU folder and replace the 2.2 files?
This is possible on the Seti optimized apps. and works great.
For the people that 190.. drivers are running problem free this could give a small added boost.
I tried it one time and it would not work because the "file size " was incorrect and it errored out.
If the software did not look at the file size it would maybe work.
It may also help the developers get a jump on any issues with 2.3 also.

Just a question.
Thanks

That sounded like an interesting question, so I'm giving it a try on two hosts.

Preliminary observation - makes no difference at all, but both tasks are still below 10% - it'll be tomorrow morning (UTC) before I get a definitive answer. I'm running this on my two Q6600/9800GT hosts (XP SP3), if anyone wants to watch.

I can only presume that GPUGrid relies much less on Fourier Transforms than SETI - it was fft that made the difference at SETI.

ID: 11527 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile koschi
Avatar

Send message
Joined: 14 Aug 08
Posts: 127
Credit: 919,858,161
RAC: 154,051
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 11529 - Posted: 31 Jul 2009, 5:18:27 UTC - in response to Message 11527.  

http://boincfaq.mundayweb.com/index.php?language=1&view=91

Please check if you can convince BOINC with this option to ignore the incorrect size and proceed with 2.3 libs:

<dont_check_file_sizes>0|1</dont_check_file_sizes>
ID: 11529 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 2
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 11531 - Posted: 31 Jul 2009, 7:25:38 UTC

First task with 2.3 runtime DLLs (1041263) errored out after 14 hours with "Cuda error: Kernel [frc_sum_kernel_dihed] failed in file 'force.cu' in line 313 : unspecified launch failure."

Task on second machine, and replacement for this one, still running: but proceed with caution.
ID: 11531 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 11540 - Posted: 31 Jul 2009, 11:50:06 UTC - in response to Message 11527.  

It worked on mine until it downloaded a new task and it errored out on the download " Cuda.dll Unexpected size " or something like that.

Hello Admin,
Would it be possible to do as Seti and have it to where the 2 DLL files for Cuda 2.3 could be " dropped " into the GPU folder and replace the 2.2 files?
This is possible on the Seti optimized apps. and works great.
For the people that 190.. drivers are running problem free this could give a small added boost.
I tried it one time and it would not work because the "file size " was incorrect and it errored out.
If the software did not look at the file size it would maybe work.
It may also help the developers get a jump on any issues with 2.3 also.

Just a question.
Thanks

That sounded like an interesting question, so I'm giving it a try on two hosts.

Preliminary observation - makes no difference at all, but both tasks are still below 10% - it'll be tomorrow morning (UTC) before I get a definitive answer. I'm running this on my two Q6600/9800GT hosts (XP SP3), if anyone wants to watch.

I can only presume that GPUGrid relies much less on Fourier Transforms than SETI - it was fft that made the difference at SETI.



You'd need to use an app_info file. That way you'd bypass BOINC checking the signatures on the files.
BOINC blog
ID: 11540 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mark Henderson

Send message
Joined: 21 Dec 08
Posts: 51
Credit: 26,320,167
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 11543 - Posted: 31 Jul 2009, 12:22:16 UTC

Would just adding this line into the CC_Config file work?

<dont_check_file_sizes>0|1</dont_check_file_sizes>
ID: 11543 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 2
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 11544 - Posted: 31 Jul 2009, 13:08:06 UTC - in response to Message 11540.  

I can only presume that GPUGrid relies much less on Fourier Transforms than SETI - it was fft that made the difference at SETI.

You'd need to use an app_info file. That way you'd bypass BOINC checking the signatures on the files.

Strangely, the CUDA RT DLL is signed, but the FFT isn't. Seriously, does anyone know whether the mathematics here requires Fourier Transforms? If not, it's not worth chasing this.

And yes, I am using an app_info.xml
ID: 11544 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 2
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 11550 - Posted: 31 Jul 2009, 14:51:48 UTC

1040687 completed and validated using the 2.3 DLLs. Took slightly longer than previous tasks on that box, but got slightly more credit - must have been an tougher job. Hourly rate exactly the same. Conclusion? It's not worth the hassle. Reverting to stock so I can get the new compiles as they come out.
ID: 11550 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 11552 - Posted: 31 Jul 2009, 16:33:37 UTC - in response to Message 11550.  

We had already optimized the FFT our selves, so no much speedup will be seen using 2.3.
By the way, we have updated the application to cuda 2.2.

gdf
ID: 11552 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
zpm
Avatar

Send message
Joined: 2 Mar 09
Posts: 159
Credit: 13,639,818
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 11554 - Posted: 31 Jul 2009, 17:17:30 UTC - in response to Message 11552.  
Last modified: 31 Jul 2009, 17:17:49 UTC

We had already optimized the FFT our selves, so no much speedup will be seen using 2.3.
By the way, we have updated the application to cuda 2.2.

gdf


thanks for updating, although last night i grabbed 40 seti cuda wu b/c i knew this change would happen today, and i just wanted to see what feedback was coming out from here... on the 6.67 app.
ID: 11554 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 2
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 11555 - Posted: 31 Jul 2009, 17:47:56 UTC - in response to Message 11552.  

We had already optimized the FFT our selves, so no much speedup will be seen using 2.3.

Ah well, experimental proof is always good to go alongside the theoretical conjecture.... :-)

By the way, we have updated the application to cuda 2.2.

gdf

And started distributing the 2.2 DLLs. Nice workround for the 'same name, different version' problem that BOINC handles so badly.

Sorry, I tried to unravel my app_info.xml without doing a full project reset - and failed. I've released the WU back into the wild for someone else to do.

And ended up with a reset project anyway. DCF 1.000000, <rsc_fpops_est>250000000000000 works out to 01:08:53 on this host, where it should be about 16 hours.

I could edit client_stats again, or you could - pretty please - put a more realistic estimate into the workunit generator?
ID: 11555 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 11557 - Posted: 31 Jul 2009, 23:02:12 UTC - in response to Message 11555.  

We will start to increase by a factor 2. Then we need to wait a little.

gdf
ID: 11557 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,174,624,885
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 11559 - Posted: 1 Aug 2009, 9:38:17 UTC
Last modified: 1 Aug 2009, 9:40:28 UTC

I tried the Grid Wu's again last night when I seen the Application went to CUDA 2.2 but I'm still getting error's on at least 2 Box's with the 190.38 Drivers & 6.6.36 Client, same 2 Box's that have given me the most trouble since Updating to the 190.38 Drivers that didn't have any trouble before Upgrading.
ID: 11559 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 2
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 11560 - Posted: 1 Aug 2009, 9:45:20 UTC - in response to Message 11559.  
Last modified: 1 Aug 2009, 9:45:48 UTC

I tried the Grid Wu's again last night when I seen the Application went to CUDA 2.2 but I'm still getting error's on at least 2 Box's with the 190.38 Drivers & 6.6.36 Client, same 2 Box's that have given me the most trouble since Updating to the 190.38 Drivers that didn't have any trouble before Upgrading.

So which two boxes are they, and what graphics cards do they have installed?

Consensus seems to be settling down on 'G92-based cards run the new drivers OK, g200-based cards have problems'.
ID: 11560 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,174,624,885
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 11562 - Posted: 1 Aug 2009, 11:42:30 UTC - in response to Message 11560.  
Last modified: 1 Aug 2009, 11:54:15 UTC

I tried the Grid Wu's again last night when I seen the Application went to CUDA 2.2 but I'm still getting error's on at least 2 Box's with the 190.38 Drivers & 6.6.36 Client, same 2 Box's that have given me the most trouble since Updating to the 190.38 Drivers that didn't have any trouble before Upgrading.

So which two boxes are they, and what graphics cards do they have installed?

Consensus seems to be settling down on 'G92-based cards run the new drivers OK, g200-based cards have problems'.


Right now it's Host ID 17909 & Host ID 38967 ... On the 17909 Host both 260 Cores were running @ 300/Core & 400/Memory when I checked it, On Host 38967 1 of the 260 Cores was running @ 300/Core & 400/Memory while the other Core was running 650/Core & 1100/Memory.

Both Box's had been running the Collatz Projects Wu's for the last 3-4 Days with no Error's or Re-setting of the Cores to Lower Speeds. I did a complete Uninstall of the NVIDIA Drivers again on both Box's, then Re-installed them to each 260 Card thru the Device Manager on both Box's, did the Riva Tuner Fix thingy on both Box's gave them both a Big Hug & Kiss to let them know they were still Loved ... hahaha ... and sent them on their way running the Grid Wu's again to see if that fixes them.

If they still spit out the errors then it's back to Collatz with them, so far the rest of my Pharm has behaved it's self and hasn't had any errors since last night.

What is the difference in the 'G92-based cards & the g200-based card ??? I just thought they all were 200 Series Cards. Funny thing is I have 4 Box's with 2 BFG GTX 260's each. 2 of the Box's seem to run the Wu's Okay but then the other 2 BFG GTX 260 Box's just spit out errors. All 8 Cards are the BFG 216 Shader OCX Version Type Cards ... Weird

NOTE: One thing I have noticed is that if one of the Cores drops or resets it's self to a lower running speed but the other one doesn't your still just going to get error's off both Cores. So the Speed drop must have an effect on both Cores for some reason or other.

I did a RMA on a BFG GTX 260 Card the same as the ones I have now just last week for the same reason of the Speed Drop. Since I got it's replacement back it's run okay, could just be something with the BFG Cards but I don't think so with all the other people reporting problems too.
ID: 11562 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 11563 - Posted: 1 Aug 2009, 12:01:03 UTC - in response to Message 11562.  

it does not seem that you have successfully downgraded to 185.xx:
http://www.gpugrid.net/show_host_detail.php?hostid=17909

Your are having problems with the CUDA FFT library, which also seem to suggest that you are using the wrong driver. Please check.

Do you have the toolkit installed, if yes uninstall it. I am not sure if Windows would choose system installed libs rather than the one we provide.

gdf
ID: 11563 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,174,624,885
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 11564 - Posted: 1 Aug 2009, 12:14:10 UTC - in response to Message 11563.  

it does not seem that you have successfully downgraded to 185.xx:
http://www.gpugrid.net/show_host_detail.php?hostid=17909

Your are having problems with the CUDA FFT library, which also seem to suggest that you are using the wrong driver. Please check.

Do you have the toolkit installed, if yes uninstall it. I am not sure if Windows would choose system installed libs rather than the one we provide.

gdf


Hmmmmmm, but I tried that already (Downgrade to 185.18 I believe) & it din't work, I still got the errors, that and all my other Box's are running the 190.38 Drivers. I'll wait and see if they run okay now with the uninstall & reinstall I did with them an hour or so ago. If they still get the errors I'll try the 185.xx Drivers again.

No, I don't have the Toolkit installed on any of my Box's, did at first when I joined the Project but then un-installed them when I realized I didn't need them.
ID: 11564 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,174,624,885
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 11565 - Posted: 1 Aug 2009, 12:33:36 UTC
Last modified: 1 Aug 2009, 12:57:46 UTC

GDF, do you mean Downgrade to 185.xx Drivers or 186.xx Drivers, I think I tried the 186.18 Drivers but they didn't stop the Errors or the Cards from Dropping speed after just a few minutes of running sometimes, other times it took an hour or so before they dropped their speed.

PS: Crap, looks like something is wrong with one of the Box's already, 1 of the Wu's isn't Progressing according to BOINCView, will have to go out to the Garage & see what's up with it. Think I'll just go ahead and put the 185.xx Drivers in it & try them.
ID: 11565 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 18 Sep 08
Posts: 368
Credit: 4,174,624,885
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 11566 - Posted: 1 Aug 2009, 13:19:49 UTC
Last modified: 1 Aug 2009, 14:13:31 UTC

Looks like it did a Benchmark Test & upon Resuming the Wu's one of them erred out ... Host ID 17909 One Core & 300/Core & 400/Memory & One Core @ 650/Core & 1100/Memory ... Going to the 185.85 Drivers & see if they work or not ...

8/1/2009 8:14:24 AM Running CPU benchmarks
009 8:14:24 AM Suspending computation - running CPU benchmarks
8/1/2009 8:14:55 AM Benchmark results:
8/1/2009 8:14:55 AM Number of CPUs: 4
8/1/2009 8:14:55 AM 2275 floating point MIPS (Whetstone) per CPU
8/1/2009 8:14:55 AM 6944 integer MIPS (Dhrystone) per CPU
8/1/2009 8:14:56 AM Resuming computation
8/1/2009 8:29:00 AM AQUA@home Sending scheduler request: To fetch work.
8/1/2009 8:29:00 AM AQUA@home Requesting new tasks for CPU
8/1/2009 8:29:05 AM AQUA@home Scheduler request completed: got 0 new tasks
8/1/2009 8:29:05 AM AQUA@home Message from server: No work sent
8/1/2009 8:29:05 AM AQUA@home Message from server: (reached limit of 2 CPU tasks)
8/1/2009 9:10:05 AM GPUGRID Computation for task 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0 finished
8/1/2009 9:10:05 AM GPUGRID Output file 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_1 for task 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0 absent
8/1/2009 9:10:05 AM GPUGRID Output file 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_2 for task 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0 absent
8/1/2009 9:10:05 AM GPUGRID Output file 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_3 for task 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0 absent
8/1/2009 9:10:06 AM Collatz Conjecture Restarting task collatz_1248745476_67361_1 using collatz version 110
8/1/2009 9:10:08 AM GPUGRID Started upload of 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_0
8/1/2009 9:10:08 AM GPUGRID Started upload of 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_4
8/1/2009 9:10:12 AM GPUGRID Finished upload of 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_0
8/1/2009 9:11:37 AM GPUGRID Finished upload of 93-KASHIF_HIVPR_twomons_ba2-14-100-RND4610_0_4

PS: Have the 185'85 Drivers installed now, did find a couple of Odd Ball nv.dll's in the system 32 Directory, or at least i thought they were Odd Ball ones because the referred to an older Driver Version so I canned them before installing the 185.85 Drivers.

I also Aborted the running Grid Wu because I haven't had a Wu finish yet that was already running when one erred out, the remaining Wu would always error out eventually too so I figured 2 Fresh ones would be better. Am running both of them now with the 185.85 Drivers & 6.6.36 Client.
ID: 11566 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Graphics cards (GPUs) : Nvidia 190.38, Cuda 2.3

©2026 Universitat Pompeu Fabra