New Beta app for AMD GPUs

Message boards : News : New Beta app for AMD GPUs
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 14 · Next

AuthorMessage
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 351
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39995 - Posted: 1 Feb 2015, 23:54:21 UTC - in response to Message 39989.  

There is the website (or local) preference "On multiprocessors, use at most xxx% of the processors", but this controls how many CPU cores BOINC can use as a whole, across all projects

I was under the impression that that's per project. (Otherwise, ¿how does that setting work when a client's attached to multiple projects?). In fact, I'm pretty sure changing that setting that will do the job, as I'm seeing jobs go out with odd numbers of cores assigned.

Matt

Well, for a start it's in the global preferences section: http://www.gpugrid.net/prefs.php?subset=global

and that section is headed (I copy from the GPUGrid website)

"These apply to all BOINC projects in which you participate.
On computers participating in multiple projects, the most recently modified preferences will be used."

Since the projects don't actually communicate directly between each other, this works by each user's host computer(s) acting as a message relay between all the projects that computer is attached to.

Each scheduler contact (both request and reply) contains the user's current global preference settings. That includes:

Current values (for all four venues)
Timestamp of last change
Project where the most recent update is believed to have been made

If any RPC contains different values from those held by the receiver, and if the timestamp is newer, then the receiving computer/project updates its internal records.

Provided there is a continuous chain connecting every project, and every computer, on the user's account, eventually all computers and all projects end up with an identical set of preference values for that global set. It only breaks down if a user has computers 1,2,3 attached to projects A,B,C - and computers 4,5,6 attached to projects D,E,F with no overlap. Then, there's no migration route between the first cluster and the second cluster.
ID: 39995 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jacob Klein

Send message
Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 39996 - Posted: 1 Feb 2015, 23:58:30 UTC
Last modified: 2 Feb 2015, 0:04:44 UTC


Sorry MJH, but you are wrong on how that setting works.


So how does the client resolve conflicting settings from different projects?


The computer has resources, and the client keeps the resources occupied.

Unless a task is in jeopardy of missing deadline, the client feeds coprocessor resources first, and then feeds the rest of the CPUs. If a coprocessor job has been included in the "run-list", the client schedules [ncpus + 1] work of work. Otherwise, if only CPU jobs, it schedules [ncpus] worth of work. If a multithreading job enters the run-list, we are allowed to overcommit the CPU.

As an example, let's say we have a quad-core hyperthreaded machine, such that Windows sees 8 virtual CPUs. Let's say I have 2 ASIC coprocessor devices, 2 AMD GPU coprocessors that are the exact same model, and 3 NVIDIA GPU coprocessors that are all different models. Let's say we're attached to 30 different projects.

So, a possible schedule for this might be:
ASIC + 0.05 CPU
ASIC + 0.05 CPU
AMD + 0.25 CPU
AMD + 0.25 CPU
NVIDIA + 0.3 CPU
NVIDIA + 0.4 CPU
NVIDIA + 0.5 CPU
7 single-threaded CPU tasks
... all of differing projects. Note: The coprocessor CPU portions add up to: 1.8, and ncpus is 8, and so BOINC schedules 7 CPU tasks, such that the sum of CPU is 8.8, which is less than 9 [ncpus + 1].

Let's say we have the "Use at most x% CPUs" setting set to 75%, such that BOINC knows we only want to schedule for 6 virtual CPUs. Now a schedule might be:
ASIC + 0.05 CPU
ASIC + 0.05 CPU
AMD + 0.7 CPU
AMD + 0.7 CPU
NVIDIA + 0.3 CPU
NVIDIA + 0.4 CPU
NVIDIA + 0.5 CPU
1 6-threaded MT CPU task
... all of differing projects. In this example, for the coprocessor jobs, the sum of CPU portions is 2.7. Even though 2.7 + 6 = 8.7 is larger than [ncpus + 1 = 7], we are allowed to overcommit the CPU to get that MT task to process.

Hope this helps explain the scheduler a bit better. TLDR; The "Use at most X% CPUs" just tells BOINC how many CPUs it should consider available for it to allocate across the list of projects and tasks.

Your app here is the first I've ever seen that is both MT and GPU Coprocessor. I expect you to have some scheduling problems with it :)

One final note: The server only knows the make/model of the highest-powered GPU, for each GPU type (NVIDIA or AMD or Intel), I believe. You probably already knew this. Not sure if that will affect your task assignment logic.
ID: 39996 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
poppageek
Avatar

Send message
Joined: 4 Jul 09
Posts: 76
Credit: 114,610,402
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40007 - Posted: 2 Feb 2015, 21:08:12 UTC

If you allow multi-gpu setups to get work we can limit GPUGRID.net to one GPU via the <exclude_gpu> tag in cc_config.xml.


ID: 40007 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
hangya2

Send message
Joined: 15 Apr 12
Posts: 1
Credit: 121,031
RAC: 0
Level

Scientific publications
wat
Message 40015 - Posted: 3 Feb 2015, 5:57:31 UTC
Last modified: 3 Feb 2015, 6:00:13 UTC

about the AMDTEST wu:s
-on linux they did not download at all; same driver version as on w7; boinc asked only for cpu wu:s, even if it detected the amd card in the logs
-on w7 they finish without error, but do not follow the amount of dedicated cpu:s, they run on 4 cores regardless of local specification
ID: 40015 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
TJ

Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40016 - Posted: 3 Feb 2015, 6:54:16 UTC - in response to Message 40015.  

about the AMDTEST wu:s
-on linux they did not download at all; same driver version as on w7; boinc asked only for cpu wu:s, even if it detected the amd card in the logs
-on w7 they finish without error, but do not follow the amount of dedicated cpu:s, they run on 4 cores regardless of local specification


If you read through this thread you can read that Matt says, it is Windows x64 only. Hope this helps.

Greetings from TJ
ID: 40016 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile tito

Send message
Joined: 21 May 09
Posts: 22
Credit: 2,002,780,169
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 40018 - Posted: 3 Feb 2015, 14:15:17 UTC - in response to Message 40016.  

No more WUs?
ID: 40018 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 40019 - Posted: 3 Feb 2015, 18:42:24 UTC - in response to Message 40018.  

No more WUs?


Filled up now.
ID: 40019 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile nenym

Send message
Joined: 31 Mar 09
Posts: 137
Credit: 1,429,587,071
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40020 - Posted: 3 Feb 2015, 19:18:29 UTC - in response to Message 40019.  
Last modified: 3 Feb 2015, 19:29:03 UTC

The fix credit 1500 per WU set. or?
Some tasks errored out after only-CPU stage. This and that.
part of stderr
clCreateProgramWithSource
starting mdrun 'Protein in water'
200000 steps,    400.0 ps.
Inner product between old and new vector <= 0.0!
constraint #0 atoms 1730 and 1731
Wrote pdb files with previous and current coordinates

WARNING: Listed nonbonded interaction between particles 1713 and 1731
at distance 2.985 which is larger than the table limit 1.995 nm.

This is likely either a 1,4 interaction, or a listed interaction inside
a smaller molecule you are decoupling during a free energy calculation.
Since interactions at distances beyond the table cannot be computed,
they are skipped until they are inside the table limit again. You will
only see this message once, even if it occurs for several interactions.

IMPORTANT: This should not happen in a stable simulation, so there is
probably something wrong with your system. Only change the table-extension
distance in the mdp file if you are really sure that is the reason.


Inner product between old and new vector <= 0.0!
constraint #0 atoms 73 and 74
Wrote pdb files with previous and current coordinates


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x000000013F573B37 read attempt to address 0x123DE8A0

Engaging BOINC Windows Runtime Debugger...
ID: 40020 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
poppageek
Avatar

Send message
Joined: 4 Jul 09
Posts: 76
Credit: 114,610,402
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40022 - Posted: 3 Feb 2015, 20:41:38 UTC - in response to Message 40007.  

If you allow multi-gpu setups to get work we can limit GPUGRID.net to one GPU via the <exclude_gpu> tag in cc_config.xml.



Is this a possibility?

Would groveling help? :D





lol

Cheers!
ID: 40022 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 40023 - Posted: 3 Feb 2015, 21:31:56 UTC - in response to Message 40022.  

Not just yet. It will require a change in the application too

Matt
ID: 40023 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile tito

Send message
Joined: 21 May 09
Posts: 22
Credit: 2,002,780,169
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwat
Message 40032 - Posted: 4 Feb 2015, 18:18:28 UTC

How about credits? How are they calculated per WU? For now it's 1500 per WU but on XEON 1230V2 + 7950@1100 it gives ~ 160000 daily? Sounds little disappointing cause it takes 8 threads of CPU. Do You plan any adjustment in future or is it final?
ID: 40032 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 40033 - Posted: 4 Feb 2015, 18:46:24 UTC - in response to Message 40032.  
Last modified: 4 Feb 2015, 18:47:31 UTC

How about credits? How are they calculated per WU? For now it's 1500 per WU but on XEON 1230V2 + 7950@1100 it gives ~ 160000 daily? Sounds little disappointing cause it takes 8 threads of CPU. Do You plan any adjustment in future or is it final?


A fixed amount of credits are now allocated per WU - like acemdlong , it's 1000 + 50% bonus for prompt return.

The actual amount of credits are still a placeholder value, while we are in beta.
ID: 40033 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Werkstatt

Send message
Joined: 23 May 09
Posts: 121
Credit: 400,300,664
RAC: 16
Level
Gln
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40034 - Posted: 4 Feb 2015, 19:40:11 UTC
Last modified: 4 Feb 2015, 19:41:01 UTC

At the moment this is a beta app which does not make good use of the gpu and it does not make much differece how many cpu's are used.
Unfortunately the project does not allow to download and run 2 or even mor wu's. So I looked for a way to increase the gpu usage to values from 93% to 100%.
https://dl.dropboxusercontent.com/u/50246791/gpugrid%20setup%201.JPG
Works on three pc's, all wu finish and validate. But may be different with a new version.

Alexander
ID: 40034 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile caffeineyellow5
Avatar

Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 40035 - Posted: 4 Feb 2015, 21:58:26 UTC - in response to Message 40034.  

I just did a new build that has onboard AMD Radeon(TM) R7 Graphics. The driver version is Catalyst 14.12. I am not getting any work for this task on it.

MSI Afterburner is seeing 5 different lines/instances for this GPU. I also put 2 GTX780 cards in this build. I have the device management (for the profile called home on the GPUGRID preferences page of the website) set to use 100% of everything and to run everything except the CPU specific items. (As follows)
--------------------------------------------------------
Resource share 100
Use CPU yes
Use NVIDIA GPU yes
Run test applications? yes
Is it OK for GPUGRID and your team (if any) to email you? yes
Should GPUGRID show your computers on its web site? yes
Default computer location home
Maximum CPU % for graphics 100
Run only the selected applications
ACEMD short runs (2-3 hours on fastest card): yes
ACEMD beta: yes
ACEMD long runs (8-12 hours on fastest GPU): yes
Docking CPU only app: no
Molecular Dynamics on CPU: no
Molecular Dynamics on AMD GPUs: yes
If no work for selected applications is available, accept work from other applications? no
Use Graphics Processing Unit (GPU) if available yes
Use Central Processing Unit (CPU) yes
---------------------------------------------------------

Regardless of these settings and the fact the there are WUs showing on the Server Status page, this system is not getting any of these units. It did get one unit as I was building, before I added the GTX cards and it had an error while computing. This was my fault for adding software while still adding hardware. I was adding cards as they finished WUs on their originally installed computer before transferring to this new one. The new system is http://www.gpugrid.net/results.php?hostid=197994 This ADM GPU is showing as AMD Spectre (762MB) on the PC reference page here http://www.gpugrid.net/show_host_detail.php?hostid=197994

Do I have a setting or misconfiguration or do you need more info to determine? Or is this GPU just m=not able to do the work? Thx
1 Corinthians 9:16 "For though I preach the gospel, I have nothing to glory of: for necessity is laid upon me; yea, woe is unto me, if I preach not the gospel!"
Ephesians 6:18-20, please ;-)
http://tbc-pa.org
ID: 40035 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 40036 - Posted: 4 Feb 2015, 22:23:04 UTC - in response to Message 40035.  

Tasks aren't assigned to machines with both Nvidia and AMD cards.

Matt
ID: 40036 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile caffeineyellow5
Avatar

Send message
Joined: 30 Jul 14
Posts: 225
Credit: 2,658,976,345
RAC: 0
Level
Phe
Scientific publications
watwatwatwatwatwatwatwat
Message 40037 - Posted: 4 Feb 2015, 22:32:09 UTC - in response to Message 40036.  

Tasks aren't assigned to machines with both Nvidia and AMD cards.

Matt
OK, Thanks. I went AMD over Intel on the board so that maybe I could double up the tasks being done with AMD and NVidia. Hopefully I can get use out of the onboard GPU somewhere.
ID: 40037 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 40038 - Posted: 4 Feb 2015, 22:53:09 UTC

916 is up.
Should report progress and run on hosts with multiple gpus
ID: 40038 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
poppageek
Avatar

Send message
Joined: 4 Jul 09
Posts: 76
Credit: 114,610,402
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40041 - Posted: 5 Feb 2015, 1:29:41 UTC

That explains why my 2x AMD GPU rig got 6 tasks. Says they are 9.15 (mt) though.


Never underestimate the power of proper groveling. ;-)


Cheers!
ID: 40041 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
poppageek
Avatar

Send message
Joined: 4 Jul 09
Posts: 76
Credit: 114,610,402
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40042 - Posted: 5 Feb 2015, 1:59:38 UTC
Last modified: 5 Feb 2015, 2:00:26 UTC

Vista 6 core 2x 7870 GPUs machine got 9.15 (mt). Using <exclude_gpu> it was on GPU 0. Seldom touched the GPU CPU usage was 0-100 on all cores bouncing around. While Time Remaining stoped after about 10 minutes WU continued for, well it is at 97% and has run 44:55.

Windows 8.1 6 core 2 x 7870 9.15 (mt) did hit the gpus for 30% for a few minutes then to 0% for a few. CPU was all cores at about 20-30% and computation error after 119-122 seconds.
ID: 40042 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
poppageek
Avatar

Send message
Joined: 4 Jul 09
Posts: 76
Credit: 114,610,402
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 40043 - Posted: 5 Feb 2015, 3:57:07 UTC - in response to Message 40042.  

Vista 6 core 2x 7870 GPUs machine got 9.15 (mt). Using <exclude_gpu> it was on GPU 0. Seldom touched the GPU CPU usage was 0-100 on all cores bouncing around. While Time Remaining stoped after about 10 minutes WU continued for, well it is at 97% and has run 44:55.



After running for 2:46:41 and 14 minutes of that at 100% complete I aborted.

ID: 40043 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 14 · Next

Message boards : News : New Beta app for AMD GPUs

©2025 Universitat Pompeu Fabra