New CUDA65 beta app

Message boards : News : New CUDA65 beta app
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · Next

AuthorMessage
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38181 - Posted: 29 Sep 2014, 22:21:54 UTC

All WU's completed & validated thus far on my GTX980 with beta app versions 8.44, 8.45 and 8.46. I'm running windows 8.1 and nvidia drivers v. 344.16.

http://www.gpugrid.net/results.php?hostid=142719
ID: 38181 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Bedrich Hajek

Send message
Joined: 28 Mar 09
Posts: 490
Credit: 11,731,645,728
RAC: 47,738
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38182 - Posted: 29 Sep 2014, 23:05:55 UTC

All the beta units are finishing valid. Though, the output files are rather large, 44 Megabytes.

52-MJHARVEY_TEST4000-0-10-RND4601_3
Workunit 10123299
Created 29 Sep 2014 | 12:46:43 UTC
Sent 29 Sep 2014 | 19:41:27 UTC
Received 29 Sep 2014 | 22:41:52 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 127986
Report deadline 4 Oct 2014 | 19:41:27 UTC
Run time 6,363.92
CPU time 6,033.14
Validate state Valid
Credit 1,500.00
Application version ACEMD beta version v8.46 (cuda65)
Stderr output
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
# GPU [GeForce GTX 690] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 1 :
# Name : GeForce GTX 690
# ECC : Disabled
# Global mem : 2048MB
# Capability : 3.0
# PCI ID : 0000:04:00.0
# Device clock : 1019MHz
# Memory clock : 3004MHz
# Memory width : 256bit
# Driver version : r343_98 : 34411
# GPU 0 : 63C
# GPU 1 : 73C
# GPU 2 : 74C
# GPU 3 : 74C
# GPU 0 : 64C
# GPU 0 : 65C
# GPU 0 : 66C
# GPU 0 : 67C
# GPU 0 : 68C
# GPU 0 : 69C
# GPU 0 : 70C
# GPU 0 : 71C
# Time per step (avg over 2500000 steps): 2.549 ms
# Approximate elapsed time for entire WU: 6371.977 s
# PERFORMANCE: 23558 Natoms 2.549 ns/day 0.000 ms/step 0.000 us/step/atom
18:24:40 (5228): called boinc_finish

</stderr_txt>
]]>


ID: 38182 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
eXaPower

Send message
Joined: 25 Sep 13
Posts: 293
Credit: 1,897,601,978
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 38183 - Posted: 29 Sep 2014, 23:25:59 UTC

What's the meaning of ns/day performance? Number is same as time (ms) per step.

23558 Natoms 4.726 ns/day-GTX650Ti
23558 Natoms 2.549 ns/day-GTX690
23558 Natoms 1.633 ns/day-GTX980
ID: 38183 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38184 - Posted: 30 Sep 2014, 8:31:12 UTC - in response to Message 38180.  

It completed OK on the GTX 650 Ti, but seems to be causing problems on some higher-end cards. But their versions of ACEMD probably have more changes than the one I got (8.46).
http://www.gpugrid.net/workunit.php?wuid=10123336

I will be trying my GTX 660 Ti next on the same machine to see what happens.

I think the errors on the higher-end cards where caused by to old drivers Jim. I had a lot errors on my 780Ti's yesterday, but when I updated to the latest driver, they run smooth as usual again.
The beta did okay on my 660, so your 660Ti will do great as well.
Greetings from TJ
ID: 38184 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38186 - Posted: 30 Sep 2014, 10:28:53 UTC - in response to Message 38184.  

TJ,

Thanks, that is probably it. My GTX 660 Ti did finish fine; I will be trying a couple of GTX 750 Ti's now just for fun.
ID: 38186 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matt
Avatar

Send message
Joined: 11 Jan 13
Posts: 216
Credit: 846,538,252
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38208 - Posted: 1 Oct 2014, 1:46:11 UTC
Last modified: 1 Oct 2014, 1:49:43 UTC

Just enabled Test Apps for my GTX 680 and GTX 780Ti cards. I'll check back in a while to see how they're doing.

Edit: I saw that TJ recommended updating to the latest drivers. Is this the latest Beta or WHQL driver? I'm currently running 344.11. Thanks.
ID: 38208 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38210 - Posted: 1 Oct 2014, 2:03:50 UTC - in response to Message 38208.  

Edit: I saw that TJ recommended updating to the latest drivers. Is this the latest Beta or WHQL driver? I'm currently running 344.11. Thanks.

344.11 works fine on my GTX 650 Ti and 660 Ti on the test apps. I am running it on my GTX 750 Ti also, but haven't picked up the new apps yet
ID: 38210 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matt
Avatar

Send message
Joined: 11 Jan 13
Posts: 216
Credit: 846,538,252
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38213 - Posted: 1 Oct 2014, 2:59:31 UTC

Thanks, Jim1348. I'll stick with 344.11 for now, then.
ID: 38213 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jacob Klein

Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38214 - Posted: 1 Oct 2014, 3:45:25 UTC - in response to Message 38213.  

Thanks, Jim1348. I'll stick with 344.11 for now, then.


What other options are there? :)
ID: 38214 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38216 - Posted: 1 Oct 2014, 7:23:02 UTC - in response to Message 38208.  

Just enabled Test Apps for my GTX 680 and GTX 780Ti cards. I'll check back in a while to see how they're doing.

Edit: I saw that TJ recommended updating to the latest drivers. Is this the latest Beta or WHQL driver? I'm currently running 344.11. Thanks.

Hello Matt, yes I am running 344.11 the latest WHQL driver. But to be clear it was recommended by Matt from the project.
The older driver I was using, was a bit faster on Win7 as the WDDM was introduced with Vista and can not be switched of, but that is besides the scope of this thread.
Greetings from TJ
ID: 38216 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38217 - Posted: 1 Oct 2014, 8:46:10 UTC - in response to Message 38216.  
Last modified: 1 Oct 2014, 8:47:07 UTC

If I've got things right, the 65 apps shouldn't be sent any driver older than 343.00. The exception to that will be the Linux app, when that finally exists. That will give the WU out to any client that reports CUDA 6.5 capability, as only our patched client reports the driver version.

Matt
ID: 38217 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38219 - Posted: 1 Oct 2014, 8:58:34 UTC - in response to Message 38217.  
Last modified: 1 Oct 2014, 9:00:13 UTC

If I've got things right, the 65 apps shouldn't be sent any driver older than 343.00. The exception to that will be the Linux app, when that finally exists. That will give the WU out to any client that reports CUDA 6.5 capability, as only our patched client reports the driver version.

Matt

Well Matt with driver 331 on my 780Ti's win7 where a bit faster but then I got cuda65 tasks and errored out. With your advice I updated the driver and no more errors (yesterday one, but that was another reason).
But if you have made changes yesterday or today, then you are probably right.
Greetings from TJ
ID: 38219 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Retvari Zoltan
Avatar

Send message
Joined: 20 Jan 09
Posts: 2380
Credit: 16,897,957,044
RAC: 1
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38220 - Posted: 1 Oct 2014, 8:58:46 UTC

I think it's safe to promote the CUDA6.5 application to the long queue.
ID: 38220 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38221 - Posted: 1 Oct 2014, 8:59:36 UTC - in response to Message 38220.  

Not just yet...
ID: 38221 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38222 - Posted: 1 Oct 2014, 9:28:39 UTC - in response to Message 38217.  

If I've got things right, the 65 apps shouldn't be sent any driver older than 343.00. The exception to that will be the Linux app, when that finally exists. That will give the WU out to any client that reports CUDA 6.5 capability, as only our patched client reports the driver version.

Matt


boinc 7.4.22 (development version) now reports the driver version:

Starting BOINC client version 7.4.22 for x86_64-pc-linux-gnu
CUDA: NVIDIA GPU 0: GeForce GTX 780 Ti (driver version 343.22, CUDA version 6.5, compute capability 3.5, 3072MB, 2814MB available, 5345 GFLOPS peak)

Shows up here too:

http://www.gpugrid.net/show_host_detail.php?hostid=183991
ID: 38222 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jacob Klein

Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38224 - Posted: 1 Oct 2014, 10:18:09 UTC
Last modified: 1 Oct 2014, 10:19:40 UTC

MJH:

I've been processing Beta tasks, and although nearly all are successful for me on the 8.46 app, I did have a failure last night. This is on a completely-stable Windows 8.1 Update 1 x64 machine, on one of my GTX 660 Ti GPUs, using 344.11 driver.

Any ideas?

http://www.gpugrid.net/result.php?resultid=13154266

Name 79-MJHARVEY_TEST4001-2-10-RND8149_0
Workunit 10126844
Created 30 Sep 2014 | 18:55:59 UTC
Sent 1 Oct 2014 | 3:25:13 UTC
Received 1 Oct 2014 | 4:35:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -97 (0xffffffffffffff9f) Unknown error number
Computer ID 153764
Report deadline 6 Oct 2014 | 3:25:13 UTC
Run time 1,431.05
CPU time 384.63
Validate state Invalid
Credit 0.00
Application version ACEMD beta version v8.46 (cuda65)
Stderr output

<core_client_version>7.4.22</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code -97 (0xffffff9f)
</message>
<stderr_txt>
# GPU [GeForce GTX 660 Ti] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 2 :
# Name : GeForce GTX 660 Ti
# ECC : Disabled
# Global mem : 3072MB
# Capability : 3.0
# PCI ID : 0000:08:00.0
# Device clock : 1045MHz
# Memory clock : 3004MHz
# Memory width : 192bit
# Driver version : r343_98 : 34411
# GPU 0 : 69C
# GPU 1 : 64C
# GPU 2 : 69C
# GPU 1 : 65C
# GPU 1 : 66C
# GPU 1 : 67C
# GPU 0 : 70C
# The simulation has become unstable. Terminating to avoid lock-up (1)
# Attempting restart (step 5505000)

# GPU [GeForce GTX 660 Ti] Platform [Windows] Rev [3212] VERSION [65]
# SWAN Device 2 :
# Name : GeForce GTX 660 Ti
# ECC : Disabled
# Global mem : 3072MB
# Capability : 3.0
# PCI ID : 0000:08:00.0
# Device clock : 1045MHz
# Memory clock : 3004MHz
# Memory width : 192bit
# Driver version : r343_98 : 34411
# The simulation has become unstable. Terminating to avoid lock-up (1)

</stderr_txt>
]]>
ID: 38224 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
eXaPower

Send message
Joined: 25 Sep 13
Posts: 293
Credit: 1,897,601,978
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 38225 - Posted: 1 Oct 2014, 11:05:22 UTC
Last modified: 1 Oct 2014, 11:06:43 UTC

Question: would a 4.2CUDA long task running on one card slow down CUDA 6.5 short or Beta tasks running on other or vise versa? I just noticed a CUDA 4.2 Noelia Long task running, with 6.5 Beta and Short. Runtime for Long task is more than normal. It takes ~40Hr to complete, but at ~40Hr the 4.2 task is 80%.
ID: 38225 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 38226 - Posted: 1 Oct 2014, 14:55:20 UTC - in response to Message 38225.  

Maybe, if the processes are competing for CPU.

Matt
ID: 38226 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jacob Klein

Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 38227 - Posted: 1 Oct 2014, 15:09:47 UTC
Last modified: 1 Oct 2014, 15:10:05 UTC

Any idea why my task failed, 3 posts up?
ID: 38227 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
eXaPower

Send message
Joined: 25 Sep 13
Posts: 293
Credit: 1,897,601,978
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 38228 - Posted: 1 Oct 2014, 16:12:36 UTC - in response to Message 38227.  
Last modified: 1 Oct 2014, 16:13:52 UTC

Any idea why my task failed, 3 posts up?


Have you checked event viewer to locate any occurrences at the time task failed? Any kernel failures ? Or database instances? If you have automatic windows updates or auto Maintenance enabled- this can trigger random failures for other processes. (or sometimes fault any heavy usage process) Also, a security "audit" can trigger background task (GPUGRID) failures.
ID: 38228 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · Next

Message boards : News : New CUDA65 beta app

©2025 Universitat Pompeu Fabra