Message boards :
News :
PYSCFbeta: Quantum chemistry calculations on GPU
Message board moderation
Previous · 1 . . . 10 · 11 · 12 · 13 · 14 · Next
| Author | Message |
|---|---|
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
schedule requests from your host are not specific about what it's asking for. it just asks for work for "Nvidia" and the scheduler on the project side decides what you need and what to send based on your preferences. the way the scheduler is setup right now, you wont be sent both types of work when both are available, only ATM. you will need to move the GPUs to different hosts and setup the project preferences to be different for each of them. or run two clients on one host with one gpu attached to each, or just stay with ATM on both cards.
|
|
Send message Joined: 15 Jul 20 Posts: 95 Credit: 2,550,803,412 RAC: 248 Level ![]() Scientific publications
|
ok merci |
ServicEnginICSend message Joined: 24 Sep 10 Posts: 592 Credit: 11,972,186,510 RAC: 1,447 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
QChem seems to not be classified in the scheduler as "test" or beta. despite being treated as such by the staff and the app name literally has the word beta in it. if you disable test tasks, and enable only QChem, you will get them still. Giving a bit more assortment to current GPUGRID apps spectrum, I happened to be watching Server status page when a limited number (about 215) of "ATM: Free energy calculations of protein-ligand binding" tasks grew up. To be distinguished from previously existing ATMbeta branch. I managed to configure a venue at GPUGRID preferences page to catch one of them before unsent tasks vanished. Task: tnks2_m5f_m5l_1_RE-QUICO_ATM_GAFF2_1fs-0-5-RND3367_1 To achieve this, I disabled getting test apps, and enabled only (somehow paradoxical ;-) "ATM (beta)" app. That task is currently running at my GTX 1660 Ti GPU, at an estimated rate of 9,72% per hour. And quickly returning to PYSCFbeta (QChem) topic: tasks for this app grew up today to a noticeable amount of 80K+ ready to send ones. After peaking, QChem unsent tasks are now decreasing again. |
|
Send message Joined: 15 Jul 20 Posts: 95 Credit: 2,550,803,412 RAC: 248 Level ![]() Scientific publications
|
Bonjour y a t il des unités de calcul pour windows disponible? Hello Are there computing units for windows available? |
ServicEnginICSend message Joined: 24 Sep 10 Posts: 592 Credit: 11,972,186,510 RAC: 1,447 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Yes, ATM and ATMbeta apps have both Windows and Linux versions currently available. Edit. Regarding Quantum chemistry, there is no still any Windows version |
|
Send message Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
Regarding Quantum chemistry, there is no still any Windows version :-( :-( :-( |
|
Send message Joined: 28 Mar 09 Posts: 490 Credit: 11,731,645,728 RAC: 69 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
|
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
https://imgur.com/evCBB73 GPUGRID error rate across 2x 3070 8GB, 2x 3080 10GB & 1 4070 Super 12GB (early part is with 3x 3070 8GB one of which was replaced by 4070S 2/20). Skip |
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
|
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
to be expected with 8-10GB cards. might get better context if you split the graphs up by card type. so you can see the relative error rate vs different VRAM sizes. I'm guessing most errors come from the 8GB cards.
|
|
Send message Joined: 27 May 21 Posts: 54 Credit: 1,004,151,720 RAC: 0 Level ![]() Scientific publications
|
On my GTX1080ti 11GB, I've only got about 1% error rate due to memory. But watching 'nvidia-smi dmon' there are a lot of close shaves, where I'm only a couple of MB's below the limit... So from a 10GB card, I'd already expect a non-trivial error rate. |
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
to be expected with 8-10GB cards. They do: 8GB – last 2 checks of 2 cards 44.07 10GB – last 2 checks of 2 cards 30.80 12GB – last 2 checks of 1 card 7.62 But I need to look at the last day or two as rates have been going up. - da shu @ HeliOS, "A child's exposure to technology should never be predicated on an ability to afford it." |
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
Anyone have insight into this error: <stderr_txt> 09:06:00 (130033): wrapper (7.7.26016): starting [x86_64-pc-linux-gnu__cuda1121.zip] End-of-central-directory signature not found. Either this file is not a zipfile, or it constitutes one disk of a multi-part archive. In the latter case the central directory and zipfile comment will be found on the last disk(s) of this archive. unzip: cannot find zipfile directory in one of x86_64-pc-linux-gnu__cuda1121.zip or x86_64-pc-linux-gnu__cuda1121.zip.zip, and cannot find x86_64-pc-linux-gnu__cuda1121.zip.ZIP, period. boinc_unzip() error: 9 It looks like every WU since the afternoon of the 7th (Zulu) is getting this but only on my single 12GB 4070S Skip |
|
Send message Joined: 13 Dec 17 Posts: 1419 Credit: 9,119,446,190 RAC: 891 Level ![]() Scientific publications ![]() ![]() ![]() ![]()
|
Download error causing the zip file to be corrupted because it is missing the end of file signature. I was getting that on a Google Drive zip archive a couple of days ago. Switching browsers let me download the archive correctly so it would unpack. |
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
Download error causing the zip file to be corrupted because it is missing the end of file signature. Well after 100+ of these errors I finally got 3 good ones out of that box after a reboot for a different reason. Thanx, Skip |
|
Send message Joined: 15 Jul 20 Posts: 95 Credit: 2,550,803,412 RAC: 248 Level ![]() Scientific publications
|
Bonjour y a t il des unités de calcul pour windows disponible? Hello Are there computing units for windows available? |
|
Send message Joined: 27 Aug 21 Posts: 38 Credit: 7,254,068,306 RAC: 0 Level ![]() Scientific publications
|
There are not for this project (at this time). |
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
Error rates skyrocketed on me for this app... even on the 10GB cards (12GB card will be back on Thursday). This started late on April 7th. Error rate now over 50% so I will have to NNW till I can figure it out. Skip - da shu @ HeliOS, "A child's exposure to technology should never be predicated on an ability to afford it." |
|
Send message Joined: 21 Feb 20 Posts: 1116 Credit: 40,839,470,595 RAC: 6,423 Level ![]() Scientific publications
|
Error rates skyrocketed on me for this app... even on the 10GB cards (12GB card will be back on Thursday). This started late on April 7th. It's not you. its the new v4 tasks require more VRAM. I asked about this on their discord. I asked: it seems the newer "v4" tasks on average require a bit more VRAM than the previous v3 tasks. I'm seeing a higher error percentage on 12GB cards. Steve replied: yes this make sense unfortunately. In the previous round of "inputs_v3**" it was calculating things incorrectly for any molecule containing Iodine. This is heaviest element in our dataset. The computational cost of this QM method scales with the size of the elements (it depends on the number of electrons). We are resending the incorrect calculations for Iodine containing molecules in this round of "v4" work units. Therefore the v4 set is a subset of the previous v3 WUs containing heavier elements, hence there are more OOM errors.
|
|
Send message Joined: 13 Jul 09 Posts: 64 Credit: 2,922,790,120 RAC: 98 Level ![]() Scientific publications ![]() ![]() ![]() ![]() ![]() ![]()
|
Thank you. U probably just saved me hours of wasted time. Error % AVG ALL: 29.1 AVG – last 3: 59.0 8GB – last 2 72.76 10GB – last 2 66.52 12GB – last 2 3.55 (card out for a week) |
©2025 Universitat Pompeu Fabra