2 GPUs - but only 1 ATMML downloads - why?
| Author | Message |
|---|---|
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
In one of my boxes, which I bought 3 years ago, I've been crunching with two RTX 3070 cards. For the past few months they have mainly run ATMML tasks from GPUGRID, 1 task on each GPU. Since last week, though, only 1 task runs at a time and no second one is downloaded; the next task downloads only once the running one has finished. This is in complete contrast to how it has always worked before. To make sure the second GPU is not defective, I tried a few other GPU projects: they all downloaded 2 (or more) tasks and crunched 2 tasks in parallel (1 task per GPU, as usual). Any idea what problem I'm having with GPUGRID? |
ServicEnginIC Joined: 24 Sep 10 Posts: 592 Credit: 11,972,186,510 RAC: 1,447
|
First, you can check that you have set fraction_done_exact for the ATMML app on that host. This is how I have it configured in the app_config.xml file: <app_config> Once edited, you have to go to "Options" - "Read config files" in BOINC Manager for it to take effect. This setting affects how the estimated completion time is calculated for running ATMML tasks. I'm running Linux. On Windows, I think the path to this file is "C:\ProgramData\BOINC\projects\www.gpugrid.net\app_config.xml". If it does not exist, you can create it with Notepad. |
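For reference, here is a minimal sketch of what an app_config.xml with this setting could look like. It is not the file from the post above, whose contents are not shown here. It assumes the application's short name is ATMML; check that against the app `<name>` entries in your client_state.xml before using it.

```xml
<app_config>
  <app>
    <!-- Assumed short name of the application; verify it in client_state.xml -->
    <name>ATMML</name>
    <!-- Base remaining-time estimates only on the fraction done reported by the task -->
    <fraction_done_exact/>
  </app>
</app_config>
```

The file goes in the GPUGRID project folder inside the BOINC data directory and is picked up when the client re-reads its config files or restarts.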
|
Joined: 9 May 24 Posts: 8 Credit: 4,621,433,524 RAC: 6,526
|
I think the answer might be simpler. At the moment, there simply aren't enough tasks for Windows for the number of hosts. The number of unsent (ATMML) tasks is 0 most of the time. New tasks trickle in one at a time rather than appearing in larger batches. So it comes down to luck whether your computer asks at the moment a task is available for download. And they disappear very quickly, within seconds really. The situation has improved over the weekend, but during the week my host barely managed to get one task per day. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
Quote: "I think the answer might be simpler. At the moment, there simply aren't enough tasks for Windows for the number of hosts. The number of unsent (ATMML) tasks is 0 most of the time. New tasks trickle in one at a time rather than appearing in larger batches. So it comes down to luck whether your computer asks at the moment a task is available for download. And they disappear very quickly, within seconds really." I am aware that the number of unsent tasks has been zero for about 2 weeks. But as long as the number of "tasks in process" is about 500 or higher (currently 700+), it has always been possible to download a new task within a few hours. That has been my experience ever since I joined GPUGRID. Also, I can see that once a task is finished and uploaded, either on this host or on one of my other 3, it normally doesn't take longer than 1-2 hours until a new one comes in. So I am sure the problem is a different one, unfortunately :-( |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
ServicEnginIC wrote: "First, you can check that you have set fraction_done_exact for ATMML app at that host." Thanks for the hint. I added an app_config.xml according to your suggestion. However, this did not solve the problem :-( |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
Shortly after I had added the above-mentioned app_config.xml, the one running task finished and was uploaded, but no other task was downloaded for hours :-( So I decided to remove the app_config.xml (maybe it does not work on Windows), and surprise: within a few minutes, not just 1 new task was downloaded, but 2, one for each GPU. So the problem seems to be solved, although it's still not clear how it got solved. |
ServicEnginIC Joined: 24 Sep 10 Posts: 592 Credit: 11,972,186,510 RAC: 1,447
|
The effect of app_config.xml will persist until you re-read config files, or restart BOINC/computer... |
©2025 Universitat Pompeu Fabra