ACEMD 4

Message boards : News : ACEMD 4
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 7 · Next

AuthorMessage
Raimondas

Send message
Joined: 26 Mar 18
Posts: 7
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 58395 - Posted: 1 Mar 2022, 12:37:14 UTC

Hello everybody,

You probably have noticed the new ACEMD 4 app and have a few questions.

    What is the difference between ACEMD 3 and ACEMD 4?
    A key new feature is the integration of machine learning into molecular simulations. Specifically, we are implementing a new method called NNP/MM.

    What is NNP/MM?
    NNP/MM is a hybrid simulation method combining neural network potentials (NNP) and molecular mechanics (MM). NNP can model the molecular interactions more accurately than the conventional force fields in MM, but it still is not as fast as MM. Thus, only the important part of a molecular system is simulated with NNP, while the rest part is using MM. You can read more in a pre-print of the NNP/MM article: https://arxiv.org/abs/2201.08110

    How much more accurate is NNP?
    You can read a pre-print of the TorchMD-NET article: https://arxiv.org/abs/2202.02541

    What are software/hardware requirements for ACEMD 4?
    Pretty much the same as for ACEMD 3. Only the significant change is the size of the software stack. ACEMD 3 and all its dependencies need just 1 GB, while for ACEMD 4 that has increased to 3 GB, notably due to PyTorch (https://pytorch.org). Also, at the moment, there are just the Linux version for CUDA >=11.2 available.

    When will ACEMD 3 be replaced by ACEMD 4?
    Within a few months, we will release ACEMD 4 officially and ACEMD 3 will be deprecated. For a moment, the apps will coexist and you will receive WUs for both of them.

    What will happen next?
    We have already sent several WUs to test the deployment of ACEMD 4 and will continue this week. Let us know, if you notice some irregularities. Next week, we are aiming to start sending production WUs.


Happy computing,

Raimondas

ID: 58395 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 2
Level
Trp
Scientific publications
watwatwat
Message 58396 - Posted: 1 Mar 2022, 15:37:27 UTC

Thanks for the explanations! I'll be reading those preprints soon.

Do we need to check the Run Test Applications preference?

There's no acemd4 option in preferences?

Why are there 2 Use CPU boxes?
ID: 58396 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Raimondas

Send message
Joined: 26 Mar 18
Posts: 7
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 58397 - Posted: 1 Mar 2022, 17:25:27 UTC - in response to Message 58396.  

Thanks! I have added the ACEMD 4 option.

Why are there 2 Use CPU boxes?

Where do you see these boxes?
ID: 58397 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 2
Level
Trp
Scientific publications
watwatwat
Message 58398 - Posted: 1 Mar 2022, 18:59:08 UTC - in response to Message 58397.  

Thanks! I have added the ACEMD 4 option.

Why are there 2 Use CPU boxes?

Where do you see these boxes?


Click on Edit Preferences and there's one at the top and one at the bottom.
ID: 58398 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1416
Credit: 9,119,446,190
RAC: 614,515
Level
Tyr
Scientific publications
watwatwatwatwat
Message 58399 - Posted: 1 Mar 2022, 22:30:50 UTC - in response to Message 58398.  

Thanks! I have added the ACEMD 4 option.

Why are there 2 Use CPU boxes?

Where do you see these boxes?


Click on Edit Preferences and there's one at the top and one at the bottom.

Not seeing this on my account.
ID: 58399 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58400 - Posted: 1 Mar 2022, 22:41:02 UTC - in response to Message 58399.  

Resource share

Use CPU Enforced by version 6.10+ no
Use ATI GPU Enforced by version 6.10+ no
Use NVIDIA GPU Enforced by version 6.10+ yes

Run test applications? yes
Is it OK for GPUGRID and your team (if any) to email you? yes
Should GPUGRID show your computers on its web site? yes

Default computer location ---

Maximum CPU % for graphics 0 ... 100 20

Run only the selected applications ACEMD 3: no
ACEMD 4: no
Quantum Chemistry (CPU): no
Quantum Chemistry (CPU, beta): no
Python Runtime (CPU, beta): no
Python Runtime (GPU, beta): yes

If no work for selected applications is available, accept work from other applications? yes

Use Graphics Processing Unit (GPU) if available yes
Use Central Processing Unit (CPU) yes


This is what my default location looks like.
The CPU is at the top and bottom.
ID: 58400 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1416
Credit: 9,119,446,190
RAC: 614,515
Level
Tyr
Scientific publications
watwatwatwatwat
Message 58401 - Posted: 1 Mar 2022, 23:15:18 UTC - in response to Message 58400.  

Resource share

Use CPU Enforced by version 6.10+ no
Use ATI GPU Enforced by version 6.10+ no
Use NVIDIA GPU Enforced by version 6.10+ yes

Run test applications? yes
Is it OK for GPUGRID and your team (if any) to email you? yes
Should GPUGRID show your computers on its web site? yes

Default computer location ---

Maximum CPU % for graphics 0 ... 100 20

Run only the selected applications ACEMD 3: no
ACEMD 4: no
Quantum Chemistry (CPU): no
Quantum Chemistry (CPU, beta): no
Python Runtime (CPU, beta): no
Python Runtime (GPU, beta): yes

If no work for selected applications is available, accept work from other applications? yes

Use Graphics Processing Unit (GPU) if available yes
Use Central Processing Unit (CPU) yes


This is what my default location looks like.
The CPU is at the top and bottom.


OK, after refreshing the browser, I am seeing use cpu in both places also.
ID: 58401 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1416
Credit: 9,119,446,190
RAC: 614,515
Level
Tyr
Scientific publications
watwatwatwatwat
Message 58402 - Posted: 1 Mar 2022, 23:16:24 UTC - in response to Message 58397.  

Thanks! I have added the ACEMD 4 option.

Why are there 2 Use CPU boxes?

Where do you see these boxes?

In Project Preferences
ID: 58402 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1416
Credit: 9,119,446,190
RAC: 614,515
Level
Tyr
Scientific publications
watwatwatwatwat
Message 58403 - Posted: 1 Mar 2022, 23:17:13 UTC

Next week, we are aiming to start sending production WUs.


Can you say at what quantities?
ID: 58403 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 295,172
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58404 - Posted: 2 Mar 2022, 13:24:19 UTC
Last modified: 2 Mar 2022, 14:00:02 UTC

Just tried to run one of the new v1.01 tasks. Failed with this error message:

13:16:51 (224808): wrapper: running bin/conda-unpack ()
/usr/bin/env: ‘python’: No such file or directory
13:16:52 (224808): bin/conda-unpack exited; CPU time 0.000612
13:16:52 (224808): app exit status: 0x7f

Another is downloading on my second Linux machine as I type: I'll try to watch it running.

Edit - second task failed the same way. Trying to work out what those unprintable (?punctuation?) characters represent.

Edit2 - no, all mine are failing, and I can't work out those character codes (not a format I recognise). &#152/153 feels like a start-stop pair (open quote/close quote?), but I can't get further than that.
ID: 58404 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Raimondas

Send message
Joined: 26 Mar 18
Posts: 7
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 58405 - Posted: 2 Mar 2022, 14:41:56 UTC - in response to Message 58403.  

About 400 WUs
ID: 58405 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Raimondas

Send message
Joined: 26 Mar 18
Posts: 7
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 58406 - Posted: 2 Mar 2022, 14:43:19 UTC - in response to Message 58404.  

Just tried to run one of the new v1.01 tasks. Failed with this error message:

13:16:51 (224808): wrapper: running bin/conda-unpack ()
/usr/bin/env: ‘python’: No such file or directory
13:16:52 (224808): bin/conda-unpack exited; CPU time 0.000612
13:16:52 (224808): app exit status: 0x7f

Another is downloading on my second Linux machine as I type: I'll try to watch it running.

Edit - second task failed the same way. Trying to work out what those unprintable (?punctuation?) characters represent.

Edit2 - no, all mine are failing, and I can't work out those character codes (not a format I recognise). &#152/153 feels like a start-stop pair (open quote/close quote?), but I can't get further than that.


Don't worry, I'll get rid of that conda-unpack.
ID: 58406 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 295,172
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58407 - Posted: 2 Mar 2022, 15:31:06 UTC - in response to Message 58406.  

Don't worry, I'll get rid of that conda-unpack.

No probs. I've just failed another one, so I'll set 'No new Tasks' until you're ready. Can you let us know when you're set for the next test, please?
ID: 58407 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 2
Level
Trp
Scientific publications
watwatwat
Message 58408 - Posted: 2 Mar 2022, 16:25:24 UTC
Last modified: 2 Mar 2022, 16:29:25 UTC

One out of nine have passed so far. Here's the latest failure, I can't help with what code 195 means, just sharing.
Edit: Dr Google found it: "ERR_NO_APP_VERSION -195 - BOINC couldn't find the application's version number."
https://boinc.mundayweb.com/wiki/index.php?title=Error_code_-191_to_-200_explained
Stderr output
<core_client_version>7.16.6</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)</message>
<stderr_txt>
04:49:47 (986503): wrapper (7.7.26016): starting
04:49:47 (986503): wrapper (7.7.26016): starting
04:49:47 (986503): wrapper: running /bin/tar (xf x86_64-pc-linux-gnu__cuda1121.tar.bz2)
04:56:02 (986503): /bin/tar exited; CPU time 369.002826
04:56:02 (986503): wrapper: running bin/conda-unpack ()
/usr/bin/env: &#226;&#128;&#152;python&#226;&#128;&#153;: No such file or directory
04:56:03 (986503): bin/conda-unpack exited; CPU time 0.001065
04:56:03 (986503): app exit status: 0x7f
04:56:03 (986503): called boinc_finish(195)
</stderr_txt>
]]>
ID: 58408 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 295,172
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58409 - Posted: 2 Mar 2022, 17:08:44 UTC - in response to Message 58408.  

Your computers are hidden, so I can't see the details of the task which succeeded. Was it the most recent (implying that Raimondas has fixed conda-unpack already), or an old v100 left over from the previous (working) run?
ID: 58409 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Erich56

Send message
Joined: 1 Jan 15
Posts: 1166
Credit: 12,260,898,501
RAC: 869
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwat
Message 58410 - Posted: 2 Mar 2022, 19:04:06 UTC - in response to Message 58405.  

Raimondas wrote:
About 400 WUs

Linux only, or Windows also?
ID: 58410 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Keith Myers
Avatar

Send message
Joined: 13 Dec 17
Posts: 1416
Credit: 9,119,446,190
RAC: 614,515
Level
Tyr
Scientific publications
watwatwatwatwat
Message 58413 - Posted: 3 Mar 2022, 4:29:19 UTC - in response to Message 58410.  

He said earlier it would be a Linux only application. For now.
ID: 58413 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 10,159,968,649
RAC: 295,172
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58414 - Posted: 3 Mar 2022, 13:53:12 UTC

Raimondas has issued a new download - 2.86 gigabytes this time. It's downloading (for now) at around 7.5 megabytes/sec, which is great. But I fear that speeds may drop if too many people try to download it at once (as some people reported yesterday).

Other BOINC projects have overcome this problem by redirecting these big downloads for new releases to a caching online server service: because everyone is downloading the identical file, caching is viable and valuable.

The new file has now completed downloading, and is running as v1.02 - seemingly successfully. It started making accurate progress reports with 50%, and then progressed in 5% steps. It completed without errors in 6 minutes 44 sec (GTX 1660). Just waiting for a slot in the DDOS protection to report.

Task 32747705
ID: 58414 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 58415 - Posted: 3 Mar 2022, 15:30:45 UTC - in response to Message 58414.  

Mine will take almost 11 hours to download. It might finish in time on a GTX 1070.
ID: 58415 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Aurum
Avatar

Send message
Joined: 12 Jul 17
Posts: 404
Credit: 17,408,899,587
RAC: 2
Level
Trp
Scientific publications
watwatwat
Message 58416 - Posted: 3 Mar 2022, 15:37:49 UTC
Last modified: 3 Mar 2022, 16:28:59 UTC

T13_7-RAIMIS_TEST-0-2-RND0280_1 did a bad thing. It took over from an acemd3 WU that was 88% done. It should wait its turn.
I'm checking to see if it'll Suspend On Checkpoint.

Edit: It got to 80% at 9:06 minutes and suspended. So it has at least one checkpoint.
ID: 58416 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · 3 · 4 . . . 7 · Next

Message boards : News : ACEMD 4

©2025 Universitat Pompeu Fabra