New nvidia beta application

Message boards : Graphics cards (GPUs) : New nvidia beta application
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 11 · Next

AuthorMessage
Profile Beyond
Avatar

Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14798 - Posted: 29 Jan 2010, 17:15:10 UTC - in response to Message 14790.  

Does that mean the Windows version now works well?
ID: 14798 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14800 - Posted: 29 Jan 2010, 17:26:56 UTC - in response to Message 14798.  

What are these workunits called?
ID: 14800 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 14801 - Posted: 29 Jan 2010, 17:29:10 UTC - in response to Message 14800.  

We actually uploaded only the Windows one now.
It works well, 50 more WUs uploaded.

gdf
ID: 14801 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 14803 - Posted: 29 Jan 2010, 17:44:04 UTC - in response to Message 14801.  

They are called something like L*-TEST.
ID: 14803 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14805 - Posted: 29 Jan 2010, 19:11:22 UTC - in response to Message 14803.  
Last modified: 29 Jan 2010, 19:17:40 UTC

Sorry guys. I let my ION try to pick up tasks and it downloaded and spat out 13 tests. Its now set to not pick up any more tasks. Not sure why I could not pick up any tests on my GTS 250 or GTX 260, perhaps just timing.

My GT 240 (G215 core) picked up one task and it worked a bit better. Lasted 5min 34sec.

1792896, 1127796, 29 Jan 2010 17:35:16 UTC, 29 Jan 2010 18:57:03 UTC Completed and validated 334.26 331.80 36.03 48.64

PS the Test Application is called,

ACEMD beta version v6.05 (cuda)

Details:
Name B40-TONI_TEST2901-1-5-RND7016_0
Workunit 1127796
Created 29 Jan 2010 16:28:36 UTC
Sent 29 Jan 2010 17:35:16 UTC
Received 29 Jan 2010 18:57:03 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 55914
Report deadline 3 Feb 2010 17:35:16 UTC
Run time 334.2612
CPU time 331.7985
stderr out

<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
# There is 1 device supporting CUDA
# Device 0: "GeForce GT 240"
# Clock rate: 1.46 GHz
# Total amount of global memory: 1073741824 bytes
# Number of multiprocessors: 12
# Number of cores: 96
MDIO ERROR: cannot open file "restart.coor"
called boinc_finish

</stderr_txt>
]]>

Validate state Valid
Claimed credit 36.0299189814815
Granted credit 48.640390625
application version ACEMD beta version v6.05 (cuda)


The Ions were all swanMalloc failed.

One of the failed Ion efforts:
Name Lhp2-TONI_TEST2901-0-10-RND3354_0
Workunit 1127831
Created 29 Jan 2010 16:53:30 UTC
Sent 29 Jan 2010 18:56:55 UTC
Received 29 Jan 2010 18:58:22 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status -40 (0xffffffffffffffd8)
Computer ID 55951
Report deadline 3 Feb 2010 18:56:55 UTC
Run time 9.484798
CPU time 8.704856
stderr out

<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
- exit code -40 (0xffffffd8)
</message>
<stderr_txt>
# There is 1 device supporting CUDA
# Device 0: "ION"
# Clock rate: 1.25 GHz
# Total amount of global memory: 268435456 bytes
# Number of multiprocessors: 2
# Number of cores: 16
SWAN: FATAL : swanMalloc failed


</stderr_txt>
]]>

Validate state Invalid
Claimed credit 0.0147670414732421
Granted credit 0
application version ACEMD beta version v6.05 (cuda)
ID: 14805 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Siegfried Niklas
Avatar

Send message
Joined: 23 Feb 09
Posts: 39
Credit: 144,654,294
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 14807 - Posted: 29 Jan 2010, 21:12:02 UTC
Last modified: 29 Jan 2010, 21:40:44 UTC

L15-TONI_TEST2901-0-10-RND2988 (state: In progress)

i7-860 / GTX295 / Vista 64



Elapsed Time (wall clock time: 3:43:58)

CPU Time: 3:42:32 !

CPU Eff.: 99,365% !



EDIT: completed

L15-TONI_TEST2901-0-10-RND2988_0
Workunit 1127719
Created 29 Jan 2010 16:04:07 UTC
Sent 29 Jan 2010 16:08:46 UTC
Received 29 Jan 2010 21:32:36 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 53295
Report deadline 3 Feb 2010 16:08:46 UTC
Run time 16040.183401
CPU time 15938.39
stderr out

<core_client_version>6.10.17</core_client_version>
<![CDATA[
<stderr_txt>
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 295"
# Clock rate: 1.51 GHz
# Total amount of global memory: 939524096 bytes
# Number of multiprocessors: 30
# Number of cores: 240
# Device 1: "GeForce GTX 295"
# Clock rate: 1.51 GHz
# Total amount of global memory: 939524096 bytes
# Number of multiprocessors: 30
# Number of cores: 240
MDIO ERROR: cannot open file "restart.coor"
# Time per step: 25.659 ms
# Approximate elapsed time for entire WU: 16036.845 s
called boinc_finish

</stderr_txt>
]]>

Validate state Valid
Claimed credit 4503.73958333333
Granted credit 6080.0484375
application version ACEMD beta version v6.05 (cuda)
ID: 14807 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14810 - Posted: 29 Jan 2010, 22:46:08 UTC - in response to Message 14807.  

4 more tasks completed and validated:

1. 1793522 B32-TONI_TEST2901-1-5-RND4868_0 203.16 199.92 36.03 48.64
2. 1793556 B31-TONI_TEST2901-1-5-RND1267_0 326.62 320.47 36.03 48.64
3. 1793228 B39-TONI_TEST2901-1-5-RND7704_0 311.53 301.89 36.03 48.64
4. 1792896 B40-TONI_TEST2901-1-5-RND7016_0 334.26 331.80 36.03 48.64

Systems:
1. GTX 260 sp216 (55nm) driver 19038 VistaU64bit Phenom II 940 4GB
2&3. GeForce 8800 GTS 512 Driver 19038 W7Pro 64bit Q6600 4GB
4. GTS 250 (Factory OC) driver 19562 W7Pro 64bit Q9400(OC) 4GB


One Beta task still Running after 43min on Q9400 system,

L6-TONI_TEST2901-0-10-RND2222
ID: 14810 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14811 - Posted: 29 Jan 2010, 23:12:43 UTC - in response to Message 14807.  

gtx260 OC 696/1500/999 Boinc 6:10:25 XP64

All four Bxx - TONI units completed successfully after only two minutes.
One Lxx - TONI unit has just finished normally after only 4h:18m. Excellent performance increase.
Temperatures look normal, however the cpu utilisation of 100% is going to be an issue for me and others running cpu projects alongside GPUgrid.



ID: 14811 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist

Send message
Joined: 14 Mar 07
Posts: 1958
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 14812 - Posted: 29 Jan 2010, 23:16:09 UTC - in response to Message 14811.  

the cpu utilization should have remained the same. We will look into it.
gdf
ID: 14812 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Aardvark
Avatar

Send message
Joined: 27 Nov 08
Posts: 28
Credit: 82,362,324
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 14816 - Posted: 30 Jan 2010, 1:33:28 UTC - in response to Message 14812.  
Last modified: 30 Jan 2010, 2:06:33 UTC

I have two of these WU's running on my Windows 7 64 bit Pro', i7 920 with two GTX260 (216 core 55nm) GPU's. After they were about 66% completed I noticed that they were each using 100% of one of the eight multi-threaded cores. I closed down Boinc manager and switched off multithreading. On restarting Boinc manager, each WU was now running 100% of one of the four,non multithreaded cores. The elapsed timer on Boinc manager started again at the point at which it had previously stopped (3.5 hours). The time to completion looks to be about the same as when previously using hyperthreading (approximately 5 hours). I will forward details when completed.
ID: 14816 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Aardvark
Avatar

Send message
Joined: 27 Nov 08
Posts: 28
Credit: 82,362,324
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 14817 - Posted: 30 Jan 2010, 1:52:50 UTC - in response to Message 14816.  
Last modified: 30 Jan 2010, 2:08:17 UTC

The result of one of the WU's which completed after approximately five hours and seven minutes. The other is similar. Temperatures normal.

Name L18-TONI_TEST2901-0-10-RND0675_0
Workunit 1127722
Created 29 Jan 2010 16:04:17 UTC
Sent 29 Jan 2010 20:14:35 UTC
Received 30 Jan 2010 1:47:32 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 55059
Report deadline 3 Feb 2010 20:14:35 UTC
Run time 18451.701774
CPU time 18065.93
stderr out <core_client_version>6.10.25</core_client_version>
<![CDATA[
<stderr_txt>
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 260"
# Clock rate: 1.47 GHz
# Total amount of global memory: 939524096 bytes
# Number of multiprocessors: 27
# Number of cores: 216
# Device 1: "GeForce GTX 260"
# Clock rate: 1.47 GHz
# Total amount of global memory: 939524096 bytes
# Number of multiprocessors: 27
# Number of cores: 216
MDIO ERROR: cannot open file "restart.coor"
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 260"
# Clock rate: 1.47 GHz
# Total amount of global memory: 939524096 bytes
# Number of multiprocessors: 27
# Number of cores: 216
# Device 1: "GeForce GTX 260"
# Clock rate: 1.47 GHz
# Total amount of global memory: 939524096 bytes
# Number of multiprocessors: 27
# Number of cores: 216
called boinc_finish

</stderr_txt>
]]>


Validate state Valid
Claimed credit 4503.73958333333
Granted credit 6080.0484375
application version ACEMD beta version v6.05 (cuda)

--------------------------------------------------------------------------------
ID: 14817 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
MarkJ
Volunteer moderator
Volunteer tester

Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14820 - Posted: 30 Jan 2010, 7:01:02 UTC

I picked up 2 so far. Links are here and here

They only ran for a couple of minutes but finished successfully. They were run on a GTX295.
BOINC blog
ID: 14820 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14822 - Posted: 30 Jan 2010, 9:34:54 UTC - in response to Message 14820.  

L6-TONI_TEST2901-0-10-RND2222_2

http://www.gpugrid.net/result.php?resultid=1793679
Completed and validated 37,082.82 35,765.96 4,503.74 6,080.05

The result mirrors Stoneageman's result in that CPU was used almost 100%. Note I am crunching on WCG at the same time (not sure if that is being picked up rather than actual GPUGrid CPU time used)!
That said, great improvement. The GPU is GTS250 and CC1.1 and tasks normally take from 53000s to 60000s. So completion time dropped from 100% to less than 70%. If you take the normal 60000s the WU was 60% faster.

I have one more long Beta task running on a GT 240 CC1.1 - should finish late tonight.
ID: 14822 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Snow Crash

Send message
Joined: 4 Apr 09
Posts: 450
Credit: 539,316,349
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14825 - Posted: 30 Jan 2010, 11:31:10 UTC - in response to Message 14822.  

GTX285, XP32/ i7-920 HT ON @4.0 GHz.
100% cpu utilization and I did see the estimated 60% runtime improvement.

GTX295 Vista 64 Ultimate/ i7-920 HT OFF @ 4.4 GHz.

On my 295 when I did not suspend other projects to make a full core available for each GPUGrid beta (meaning they had to share) they ran 7.5 hours which is not substantively better than normal, certainly not in the 60% range. I did also get a couple of the real shorties and their CPU utilization was also close to 100%.
It looks like on the i7 there may be an issue with how the processor is handling the instruction set if HT is OFF. Maybe internally the CPU is turning HT on??? Observing task manager if I made sure 2 cores were free then each of the two WUs would use 25% but when I ran 4 CPU WUs from other projects the GPUGrid betas were using 11-13% which is just what a CPU WU looks like when I have HT on. The 7.5 hour runtimes I mentioned above were processed with HT off.


Thanks - Steve
ID: 14825 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Krunchin-Keith [USA]
Avatar

Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14826 - Posted: 30 Jan 2010, 12:25:18 UTC

Kaboom, in the middle of the night.

Name L38-TONI_TEST2901-0-10-RND1295_0
Workunit 1127742
Created 29 Jan 2010 16:05:13 UTC
Sent 29 Jan 2010 16:13:31 UTC
Received 30 Jan 2010 12:20:21 UTC
Server state Over
Outcome Client error
Client state Compute error
Exit status 3 (0x3)
Computer ID 6133
Report deadline 3 Feb 2010 16:13:31 UTC
Run time 10312.796875
CPU time 5985.359
stderr out <core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
The system cannot find the path specified. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
# There is 1 device supporting CUDA
# Device 0: "GeForce 8800 GT"
# Clock rate: 1.62 GHz
# Total amount of global memory: 536543232 bytes
# Number of multiprocessors: 14
# Number of cores: 112
MDIO ERROR: cannot open file "restart.coor"
SWAN : FATAL : Failure executing kernel sync [frc_sum_kernel] [700]
Assertion failed: 0, file ../swan/swanlib_nv.cpp, line 203

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.

</stderr_txt>
]]>


Validate state Invalid
Claimed credit 4503.73958333333
Granted credit 0
application version ACEMD beta version v6.05 (cuda)
ID: 14826 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Siegfried Niklas
Avatar

Send message
Joined: 23 Feb 09
Posts: 39
Credit: 144,654,294
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 14827 - Posted: 30 Jan 2010, 13:26:51 UTC
Last modified: 30 Jan 2010, 13:40:40 UTC

9800 GT, 607MHz/1517MHz/900MHz (512MB) driver: 19062 / QX9650 @3.66GHz / Vista64

B28-TONI_TEST2901-0-5-RND4647_0
Run time 357.2244
CPU time 348.1474
Validate state Valid

B47-TONI_TEST2901-0-5-RND7649_1
Run time 357.2556
CPU time 349.2238
Validate state Valid

B42-TONI_TEST2901-0-5-RND0199_2
Run time 356.179199
CPU time 348.943
Validate state Valid


GTX 295, 701MHz/1509MHz/1086MHz (896MB) driver: 19062 / i7-860 HT @3.8 GHz/ Vista64

L14-TONI_TEST2901-0-10-RND7593_0
Run time 15980.329535
CPU time 15870.54
Validate state Valid

L15-TONI_TEST2901-0-10-RND2988_0
Run time 16040.183401
CPU time 15938.39
Validate state Valid
ID: 14827 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14828 - Posted: 30 Jan 2010, 13:33:25 UTC - in response to Message 14826.  

L45-TONI_TEST2901-0-10-RND5880_0 is 43% complete after 5h, so should complete in around 6h (though the estimate time to finish is 12h). On a 2.2GHz opteron Quad with a GT240. On that system typical task turnaround is about 17 or 18h. So it is preforming about 60% faster on that CC1.2 card.
ID: 14828 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Beyond
Avatar

Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 14832 - Posted: 30 Jan 2010, 15:23:31 UTC - in response to Message 14811.  

gtx260 OC 696/1500/999 Boinc 6:10:25 XP64

the cpu utilisation of 100% is going to be an issue for me and others running cpu projects alongside GPUgrid.

8800GT, BOINC v6.10.29, Win7-64, AMD X2

Running with 2 instances of Wieferich@home, beta runs a bit slower than old app, machine is barely responsive, 1 instance of Wieferich stalls. If I free up one complete core (close 1 instance of Wieferich) beta runs faster and machine becomes responsive. With the old app everything ran fine with an instance of Wieferich on each core. The 100% CPU core utilization of the beta is a problem...
ID: 14832 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Siegfried Niklas
Avatar

Send message
Joined: 23 Feb 09
Posts: 39
Credit: 144,654,294
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 14833 - Posted: 30 Jan 2010, 15:46:31 UTC

GTX 295, 701MHz/1509MHz/1086MHz (896MB) driver: 19062 / i7-860 HT @3.8 GHz/ Vista64

L13-TONI_TEST2901-1-10-RND4450_0

Aborted by myself - Run time 2726: 100% CPU-load, no GPU-utilization shown by GPU-Z, no progress.

(Dump by BOINC Windows Runtime Debugger)
ID: 14833 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Zydor

Send message
Joined: 8 Feb 09
Posts: 252
Credit: 1,309,451
RAC: 0
Level
Ala
Scientific publications
watwatwatwat
Message 14834 - Posted: 30 Jan 2010, 15:48:13 UTC
Last modified: 30 Jan 2010, 15:55:24 UTC

9800GTX Stock no o/c Phenom2 o/c 3.2Ghz 8Gb RAM Vista64

Also running Aqua and Freehal concurrently.

L36_TONI_TEST2901-1-10-RND9113_0

Saw the Beta running, so came back to give it a go, I had been blown away by too many errors on the old apps. Grabbed two about 20 mins ago, early days but the speed is impressive.

CPU utilisation is out of wack as noted by others, its using a complete core on the Phenom2 Quad (Task Manager reports cpu utilisation as "23". That needs addressing as its a showstopper because its draining from cpu based Projects (in my case Aqua - has no effect on Freehal).

It managed to muscle out Aqua from one core - quite a feat as the Aqua app is somewhat territorial grabbing all cores usually kicking off others :) Temps are ok.

If the cpu utilisation is resolved its looking like I'll be back again, which is a relief - I have serious GPUGRID withdrawal symptoms :)

Regards
Zy
ID: 14834 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 11 · Next

Message boards : Graphics cards (GPUs) : New nvidia beta application

©2026 Universitat Pompeu Fabra