Message boards :
Number crunching :
Immensive credits difference: ADRIA vs. PABLO tasks
Message board moderation
| Author | Message |
|---|---|
Michael H.W. WeberSend message Joined: 9 Feb 16 Posts: 78 Credit: 656,229,684 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Same GPUs, VERY similar run times, almost 4-fold less credits for ADRIAs tasks. Why is that? ADRIAs tasks: 15997518 12330750 16 Feb 2017 | 9:13:03 UTC 17 Feb 2017 | 12:03:51 UTC Fertig und Bestätigt 48,100.37 22,760.09 146,375.00 Long runs (8-12 hours on fastest card) v8.48 (cuda65) PABLOs tasks: 15993065 12326872 14 Feb 2017 | 8:43:45 UTC 14 Feb 2017 | 23:43:29 UTC Fertig und Bestätigt 53,238.56 20,422.74 421,800.00 Long runs (8-12 hours on fastest card) v8.48 (cuda65) Michael. President of Rechenkraft.net - Germany's first and largest distributed computing organization. |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Your first ADRIA task missed the 24h bonus (+50%), so it has only +25% bonus. PABLO's tasks have 49.000~54.000 atoms and 15.625.000 steps ADRIA's tasks have 104.159 atoms and 3.750.000 steps Credit ratio: ~2.4 times Runtime ratio: ~1.09 times (~1.38 times on my hosts using SWAN_SYNC) ADRIA's tasks do more calculations on the CPU, so they do less calculations on the GPU. However, the disparity in the credit/time ratio should be addressed. The method used for estimating the credits before a batch is issued should be corrected. |
Michael H.W. WeberSend message Joined: 9 Feb 16 Posts: 78 Credit: 656,229,684 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
While I do very well understand your point with the 24 hrs deadline, the examples below demonstrate that despite the deadline issue something has to be fixed with the credits system. So, this time the 24 hrs deadline is met in each case: GTX 770 system - all tasks match 24 hrs deadline but differ unproportionally in credits: PABLO: 16008824 12340825 20 Feb 2017 | 19:40:19 UTC 21 Feb 2017 | 8:30:18 UTC Fertig und Bestätigt 46,014.89 6,223.11 252,750.00 Long runs (8-12 hours on fastest card) v8.48 (cuda65) ADRIA: 15993427 12327182 14 Feb 2017 | 13:10:23 UTC 15 Feb 2017 | 3:39:39 UTC Fertig und Bestätigt 52,101.35 17,701.32 175,650.00 Long runs (8-12 hours on fastest card) v8.48 (cuda65) In the above case, the task with less run time receives significantly more credits. GTX 970 system - all tasks match 24 hrs deadline but differ unproportionally in credits: PABLO: 15991483 12325510 13 Feb 2017 | 12:15:47 UTC 14 Feb 2017 | 8:36:56 UTC Fertig und Bestätigt 52,702.50 18,334.29 421,800.00 Long runs (8-12 hours on fastest card) v8.48 (cuda65) ADRIA: 15997517 12330749 16 Feb 2017 | 9:13:03 UTC 16 Feb 2017 | 23:12:08 UTC Fertig und Bestätigt 48,463.93 23,111.61 175,650.00 Long runs (8-12 hours on fastest card) v8.48 (cuda65) So, although exactly the same system was used for computation and in each case shown above the 24 hrs deadline was met, the credits still differ significantly at comparable run times (= electricity consumption). Your argument above regarding # of steps vs. # of atoms is also somehow less conceivable to me (but might require more in depth understanding): In ADRIAs case we have less steps but significantly more atoms while in PABLOS case it is vice versa. In toto, a given GPU runs comparably long and consumes comparably much electricity. Shouldn't these virtual credits somehow reflect this? When can we expect a change? Michael. President of Rechenkraft.net - Germany's first and largest distributed computing organization. |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Your argument above regarding # of steps vs. # of atoms is also somehow less conceivable to me (but might require more in depth understanding): In ADRIAs case we have less steps but significantly more atoms while in PABLOS case it is vice versa.This is not an argument, simply the facts. Maybe I wasn't clear on this, but I agree with you. To further understand the behavior of the GPUGrid app, take a look at the "CPU Time" column: PABLO: 16008824 12340825 20 Feb 2017 | 19:40:19 UTC 21 Feb 2017 | 8:30:18 UTC Fertig und Bestätigt 46,014.89 6,223.11 252,750.00 ADRIA: 15993427 12327182 14 Feb 2017 | 13:10:23 UTC 15 Feb 2017 | 3:39:39 UTC Fertig und Bestätigt 52,101.35 17,701.32 175,650.00 The ADRIA task has 2.84 times CPU time than the PABLO task. In toto, a given GPU runs comparably long and consumes comparably much electricity.The actual power consumption of the GPU is different while crunching workunits from different batches, regardless of the GPU usage percentage you can read by 3rd party tools. In this instance the GPU's power consumption is less while crunching an ADRIA_MI_FAAH_TRP445TYR_INH than crunchhing a PABLO_adaptive_goal_KIX_CMYB. Shouldn't these virtual credits somehow reflect this?They should, but these credits are assigned "manually" for each batch, based on an estimation. This estimation is inadequate. When can we expect a change?I don't know. The good thing is that this affects everyone the same. The bad thing is that this disparity encourage some volunteers to selectively abort the less "profitable" workunits - but as these abortions counts as failures, therefore you can quickly reduce the daily quota of this host to zero by doing so. To moderate this large difference in the credit over time ratio you can use the SWAN_SYNC environmental setting, which will make every workunit a bit faster, hopefully the slower ones speed up more. |
|
Send message Joined: 5 Mar 13 Posts: 348 Credit: 0 RAC: 0 Level ![]() Scientific publications ![]() |
Looking into it. Pablo probably has to adjust his |
Michael H.W. WeberSend message Joined: 9 Feb 16 Posts: 78 Credit: 656,229,684 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Well, I have now a power meter attached to my computer showing me that there is indeed less electricity consumption with ADRIAs task but, again, it is not proportional to the credits difference. But OK, let's see what happens. I wished GPUGRID would have more tasks available. The way it currently is handled, unfortunately, it is quite an unreliable project which leaves my machines unnecessarily idling for significant durations. Also, the quota of 2 tasks per machine is, under these idling conditions, a bad idea - at least for the small tasks. Michael. President of Rechenkraft.net - Germany's first and largest distributed computing organization. |
Retvari ZoltanSend message Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Well, I have now a power meter attached to my computer showing me that there is indeed less electricity consumption with ADRIAs task but, again, it is not proportional to the credits difference.True. It should be corrected. But from the scientific point of view credits are unimportant. To put it in another way: I rather help a worthy project for less credits, than a worthless for more credits. To put it in another perspective: the scientists / students rather spend their time on finding new (worthy) simulations, than on correcting the credit system. I wished GPUGRID would have more tasks available. The way it currently is handled, unfortunately, it is quite an unreliable project which leaves my machines unnecessarily idling for significant durations.This comes up every time when there's a shortage, but remember that we volunteered to help this project, not the project volunteered to keep our computers busy. Also, it is advised to have backup project(s) (=resource share set to zero) to avoid idling. Also, the quota of 2 tasks per machine is, under these idling conditions, a bad idea - at least for the small tasks.There's an ongoing argument about this, as others have quite the opposite point of view: 1 Work Unit Per GPU |
Logan CarrSend message Joined: 12 Aug 15 Posts: 240 Credit: 64,069,811 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]()
|
Zoltan, How does one go about making a back-up project that can stop working when a gpugrid project becomes available? I'd love to do this, but only know how to do one project at a time, so my computer has just been sitting turned off in a corner. Cruncher/Learner in progress. |
|
Send message Joined: 30 Sep 15 Posts: 1 Credit: 161,775,361 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]()
|
A backup project is any project with the resource share set to 0. Just attach another project, and set the share on said project's website to 0. Once that's done, your client will request a single unit at a time while all non-backup projects have no work, and you're out of units to run. Although do keep in mind that other projects have CPU tasks too, and you'll need to uncheck "use CPU" if you wish to keep running GPU only. |
Logan CarrSend message Joined: 12 Aug 15 Posts: 240 Credit: 64,069,811 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]()
|
A backup project is any project with the resource share set to 0. Just attach another project, and set the share on said project's website to 0. Once that's done, your client will request a single unit at a time while all non-backup projects have no work, and you're out of units to run. Thank you Cruncher/Learner in progress. |
|
Send message Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Once more, there is a big mis-relation between credits for certain tasks: e3s5_e1s1p0f218-ADRIA_FOLDA3D75_crystal_ss_contacts_50_a3D_3-0-1-RND1216_0 runtime: 14.317 secs; 137.850 credit points e3s6_e2s2p0f249-ADRIA_FOLDPG01_crystal_ss_contacts_50_proteinG_2-0-1-RND8994_0 runtime: 26.299 secs; 86.100 credit points e73s7_e65s33p0f13-PABLO_P05067_0_IDP-0-1-RND9622_1 runtime: 22.978 secs; 252.750 credit points |
|
Send message Joined: 23 Dec 09 Posts: 189 Credit: 4,798,881,008 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Just like to ask project-team: Why these "ADRIA_FOLDA3D50_crystal_ss_contacts_20" do not run as short tasks? They take less than 2 hours on a GTX1070. This sseems perfect to me to fit under the definition of short runs. Is it just negligence of ADRIA or is there a deeper reason? I am asking as I think it would be helpful to stick to the definitions, so that participants with lesser cards get there fair share of the work units. Especially, when there are so few short WUs available. |
©2025 Universitat Pompeu Fabra