Task 38582891

Name PFKFB3_A31_A38_r0_3-QUICO_ATM_GAFF2_LOMAP_PFKFB3-2-5-RND6765_0
Workunit 31547125
Created 13 Dec 2025, 14:16:15 UTC
Sent 13 Dec 2025, 14:16:21 UTC
Report deadline 18 Dec 2025, 14:16:21 UTC
Received 13 Dec 2025, 15:10:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 646402
Run time 33 min 22 sec
CPU time 31 min 21 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 4,732.61 GFLOPS
Application version ATM: Free energy calculations of protein-ligand binding v1.20 (cuda1121)
x86_64-pc-linux-gnu
Peak working set size 719.93 MB
Peak swap size 14.95 GB
Peak disk usage 3.11 GB

Stderr output

<core_client_version>8.2.8</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)</message>
<stderr_txt>
2025-12-13 15:31:18 (430463): wrapper (8.1.26018): starting
2025-12-13 15:31:57 (430463): wrapper: running bin/python (bin/conda-unpack)
2025-12-13 15:31:57 (430463): wrapper: created child process 430477
2025-12-13 15:31:59 (430463): bin/python exited; CPU time 0.163356
2025-12-13 15:31:59 (430463): wrapper: running bin/tar (xjvf input.tar.bz2)
2025-12-13 15:31:59 (430463): wrapper: created child process 430478
2025-12-13 15:32:02 (430463): bin/tar exited; CPU time 2.239350
2025-12-13 15:32:02 (430463): wrapper: running bin/bash (run.sh)
2025-12-13 15:32:02 (430463): wrapper: created child process 430485
+ echo 'Setup environment'
+ source bin/activate
++ _conda_pack_activate
++ local _CONDA_SHELL_FLAVOR
++ '[' -n x ']'
++ _CONDA_SHELL_FLAVOR=bash
++ local script_dir
++ case "$_CONDA_SHELL_FLAVOR" in
+++ dirname bin/activate
++ script_dir=bin
+++ cd bin
+++ pwd
++ local full_path_script_dir=/var/lib/boinc/slots/0/bin
+++ dirname /var/lib/boinc/slots/0/bin
++ local full_path_env=/var/lib/boinc/slots/0
+++ basename /var/lib/boinc/slots/0
++ local env_name=0
++ '[' -n '' ']'
++ export CONDA_PREFIX=/var/lib/boinc/slots/0
++ CONDA_PREFIX=/var/lib/boinc/slots/0
++ export _CONDA_PACK_OLD_PS1=
++ _CONDA_PACK_OLD_PS1=
++ PATH=/var/lib/boinc/slots/0/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
++ PS1='(0) '
++ case "$_CONDA_SHELL_FLAVOR" in
++ hash -r
++ local _script_dir=/var/lib/boinc/slots/0/etc/conda/activate.d
++ '[' -d /var/lib/boinc/slots/0/etc/conda/activate.d ']'
+ export PATH=/var/lib/boinc/slots/0:/var/lib/boinc/slots/0/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
+ PATH=/var/lib/boinc/slots/0:/var/lib/boinc/slots/0/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
+ echo 'Create a temporary directory'
+ export TMP=/var/lib/boinc/slots/0/tmp
+ TMP=/var/lib/boinc/slots/0/tmp
+ mkdir -p /var/lib/boinc/slots/0/tmp
+ echo 'Configure AToM'
+ echo localhost,0:0,1,CUDA,,/var/lib/boinc/slots/0/tmp
+ FILE=starting_sample
+ '[' -f starting_sample ']'
+ echo 'starting_sample does not exist.'
+ echo 'Install ATM and deps'
+ python -m pip list
+ python -m pip install pyaml -v
+ python -m pip install --no-dependencies atom-openmm -v
+ python -m pip list
+ echo 'Extract restart'
+ tar xjvf restart.tar.bz2
+ echo 'Run AToM'
+ CONFIG_FILE=QB_A31_A38_input.yaml
+ python bin/rbfe_production QB_A31_A38_input.yaml
Warning: importing 'simtk.openmm' is deprecated.  Import 'openmm' instead.
Traceback (most recent call last):
  File "/var/lib/boinc/slots/0/bin/rbfe_production", line 8, in <module>
    sys.exit(rbfe_production())
             ^^^^^^^^^^^^^^^^^
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/atom_openmm/rbfe_production.py", line 33, in rbfe_production
    rx.scheduleJobs()
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/atom_openmm/async_re.py", line 321, in scheduleJobs
    self.transport.ProcessJobQueue(min_time,cycle_time)
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/atom_openmm/local_openmm_transport.py", line 186, in ProcessJobQueue
    self.isDone(repl,0)
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/atom_openmm/local_openmm_transport.py", line 287, in isDone
    retcode = self._update_replica(job)
              ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/atom_openmm/local_openmm_transport.py", line 210, in _update_replica
    (pos,vel) = job['openmm_worker'].get_posvel()
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/atom_openmm/ommworker.py", line 467, in get_posvel
    state = self.context.getState(getPositions=True, getVelocities=True)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/var/lib/boinc/slots/0/lib/python3.11/site-packages/openmm/openmm.py", line 15506, in getState
    state = _openmm.Context_getState(self, types, enforcePeriodicBox, groups_mask)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
openmm.OpenMMException: Error downloading array posq: CUDA_ERROR_LAUNCH_TIMEOUT (702)
terminate called after throwing an instance of 'OpenMM::OpenMMException'
  what():  Error deleting array bondParams: CUDA_ERROR_LAUNCH_TIMEOUT (702)
run.sh: line 34: 430537 Aborted                 (core dumped) python bin/rbfe_production $CONFIG_FILE
2025-12-13 16:04:38 (430463): bin/bash exited; CPU time 940.790001
2025-12-13 16:04:38 (430463): app exit status: 0x86
2025-12-13 16:04:38 (430463): called boinc_finish(195)

</stderr_txt>
]]>


©2025 Universitat Pompeu Fabra