Message boards :
News :
Experimental Python tasks (beta)
Message board moderation
Author | Message |
---|---|
Send message Joined: 9 Dec 08 Posts: 1006 Credit: 5,068,599 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() |
I'm creating some experimental tasks for the Python app (made Beta). They are Linux and CUDA specific and serve in preparation for future batches. They may use a relatively large amount of disk space (order of 1-10 GB) which persists between runs, and is cleared if you reset the project. |
Send message Joined: 4 Aug 14 Posts: 266 Credit: 2,219,935,054 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I'm creating some experimental tasks for the Python app (made Beta). They are Linux and CUDA specific and serve in preparation for future batches. Preference Ticked, ready and waiting... EDIT: Received some already https://www.gpugrid.net/result.php?resultid=29466771 https://www.gpugrid.net/result.php?resultid=29466770 Conda Warnings reported. Will you push out update with app (or safe to ignore)? Also Warnings about path not found: WARNING conda.core.envs_manager:register_env(50): Unable to register environment. Path not writable or missing. environment location: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda registry file: /root/.conda/environments.txt Registry file location ( /root/ ) will not be accessible to boinc user unless conda is already installed on the host (by root user) and conda file is world readable Otherwise the task status is Completed and Validated |
Send message Joined: 9 Dec 08 Posts: 1006 Credit: 5,068,599 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() |
Looks harmless, thanks for reporting. It's because the "boinc" user doesn't have a HOME directory I think. |
Send message Joined: 4 Aug 14 Posts: 266 Credit: 2,219,935,054 RAC: 0 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Looks harmless, thanks for reporting. It's because the "boinc" user doesn't have a HOME directory I think. Agreed Perhaps adding "./envs" switch to the end of the command: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/conda install May help with setting up the environment. This switch should add environment file to current directory from which command is executed. |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
I got one of these tasks which confused me as I have not set "accept beta applications" in my project preferences. Failed after 1200 seconds. Any idea why I got this task even when I have not accepted the app through beta settings? https://www.gpugrid.net/result.php?resultid=30508976 |
Send message Joined: 21 Feb 20 Posts: 1099 Credit: 40,331,687,595 RAC: 101,874 Level ![]() Scientific publications ![]() |
What is the difference between these test Python apps and the standard one? Is it just that this application is coded in Python? what language are the default apps coded in? ![]() |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
Both apps are wrappered. One is the stock acemd3 and I assume is written in some form of C. The new Anaconda Python task is a conda application. And Python. I think Toni is going to have to explain what and how these new tasks and application work. Very strange behavior. I think the conda and python parts run first and communicate with the project doing some intermediary calculation/configuration/formatting or something. Lots of upstream network activity and nothing going on in the client transfers screen. I saw the tasks get to 100% progress and no time remaining and then stall out. No upload of the finished task. Looked away from the machine and looked again and now both tasks have reset their progress and now have 3 hours to run. I first saw conda show up in the process list and now that has disappeared to be replaced with a acemd3 and python process for each task. Must be doing something other than insta-failing that the previous tries. |
Send message Joined: 22 Oct 20 Posts: 4 Credit: 34,434,982 RAC: 0 Level ![]() Scientific publications ![]() |
CondaHTTPError: HTTP 000 CONNECTION FAILED for url <https://conda.anaconda.org/conda-forge/linux-64/_libgcc_mutex-0.1-conda_forge.tar.bz2> I am receiving this error in STDerr Output for Experimental Python tasks on all my hosts. This is probably due to the fact all my PCs are behind a proxy. Can you please set the Python tasks to use the Proxy defined in the Boinc Client? Work Units here: https://www.gpugrid.net/result.php?resultid=31672354 https://www.gpugrid.net/result.php?resultid=31668427 https://www.gpugrid.net/result.php?resultid=31665961 |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
Boy, mixing both regular acemd3 and the python anaconda tasks sure F*s up the APR for both tasks. The insanely low APR for the Python tasks is forcing all GPUGrid tasks into High Priority. The regular acemd3 tasks are getting 3-6 day estimated completions. |
Send message Joined: 21 Feb 20 Posts: 1099 Credit: 40,331,687,595 RAC: 101,874 Level ![]() Scientific publications ![]() |
Boy, mixing both regular acemd3 and the python anaconda tasks sure F*s up the APR for both tasks. The insanely low APR for the Python tasks is forcing all GPUGrid tasks into High Priority. I'm seeing that too lol. but it doesnt seem to be causing too much trouble for me since I don't run more than one GPU project concurrently. Only have Prime and backup. copying my message from another thread with my observations about these tasks for Toni to see if he doesnt check the other threads: Looks like I have 11 successful tasks, and 2 failures. ![]() |
Send message Joined: 11 Jul 09 Posts: 1639 Credit: 9,948,917,649 RAC: 8,720,931 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Boy, mixing both regular acemd3 and the python anaconda tasks sure F*s up the APR for both tasks. The insanely low APR for the Python tasks is forcing all GPUGrid tasks into High Priority. Actually, that won't be the cause. The APRs are kept separately for each application, and once you have an 'active' APR (11 or more 'completions' - validated tasks for that app), they should keep out of each others way. What will F* things up is that this project still allows DCF to run free - and that's a single value which is applied to both task types. |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
Yeah, after I wrote that I realized I meant the DCF is what is messing up the runtime estimations. I wonder if the regular acemd3 tasks will ever get their normal DCF's back to normal. I haven't run ANY of my other gpu project tasks since these anaconda python tasks have shown up. I will eventually when the other projects deadlines approach of course. |
Send message Joined: 21 Feb 20 Posts: 1099 Credit: 40,331,687,595 RAC: 101,874 Level ![]() Scientific publications ![]() |
what's DCF? ![]() |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
what's DCF? Task Duration Correction Factor. The older BOINC server versions use it like Einstein. It messes up gpu tasks of different apps there too. |
Send message Joined: 11 Jul 09 Posts: 1639 Credit: 9,948,917,649 RAC: 8,720,931 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
You can't talk about 'their DCFs' - there is only one (there could have been more than one, but that's the way David chose to play it) You can see it in BOINC Manager, on the Projects|properties dialog. If it gets really, really high (above 90), it'll inch downwards at 1% per task. Below 90, it'll speed up to 10% par task. The standard advice used to be "two weeks to stabilise", but with modern machines (multi-core, multi-GPU, and faster), the tasks fly by, and it should be quicker. |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
What is also messed up is the size of the Anaconda Python task estimated computation size shown in the task properties. The ones I crunched were only set for 3,000 GFLOPS. The regular acemd3 tasks are set for 5,000,000 GFLOPS. This also probably influenced the wildly inaccurate DCF's for the new python tasks. |
Send message Joined: 13 Dec 17 Posts: 1387 Credit: 8,177,442,190 RAC: 6,674,176 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() |
You can't talk about 'their DCFs' - there is only one (there could have been more than one, but that's the way David chose to play it) This daily driver has GPUGrid DCF Project properties currently at 85 and change. |
Send message Joined: 21 Feb 20 Posts: 1099 Credit: 40,331,687,595 RAC: 101,874 Level ![]() Scientific publications ![]() |
What is also messed up is the size of the Anaconda Python task estimated computation size shown in the task properties. can confirm. could this be why the credit reward is so high too? I wonder what the flop estimate was on this one from Kevvy: https://www.gpugrid.net/result.php?resultid=31679003 he got wrecked on this one, over 5hrs on a 2080ti, and got a mere 20 credits lol. ![]() |
Send message Joined: 26 Aug 08 Posts: 183 Credit: 10,085,929,375 RAC: 4 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I've got one running now on an RTX 2070S and the only real issue is low GPU utilization (60-70%). The current task is using ~2 GB of VRAM and ~3 GB of system RAM. I have one thread free on a ryzen 3900X to support the GPU and that thread is running at 100%. This computer has complete 3 of the new python tasks successfully. Linux Mint 20; Driver Version: 440.95.01; CUDA Version: 10.2 |
Send message Joined: 21 Feb 20 Posts: 1099 Credit: 40,331,687,595 RAC: 101,874 Level ![]() Scientific publications ![]() |
I've got one running now on an RTX 2070S and the only real issue is low GPU utilization (60-70%). The current task is using ~2 GB of VRAM and ~3 GB of system RAM. I have one thread free on a ryzen 3900X to support the GPU and that thread is running at 100%. This computer has complete 3 of the new python tasks successfully. what kind of BOINC install do you have? does it run as a service? or a standalone install that runs from an executable? what is the clock speed of your 3900X and memory speed as well? try letting there be 2 spare free threads (so you have one doing nothing) to avoid maxing out the CPU to 100% utilization on all threads. this is known to slow down GPU work. this might increase your GPU utilization a bit. ![]() |
©2025 Universitat Pompeu Fabra