Task 33800345

Name inputs_ace_pch_ms_gc_filt_af05_index_24192_to_24195-SFARR_PYSCF_ace_pch_ms_gc_filt_af05_v2-0-1-RND5826_3
Workunit 27678213
Created 2 Feb 2024, 11:15:59 UTC
Sent 2 Feb 2024, 11:20:08 UTC
Report deadline 7 Feb 2024, 11:20:08 UTC
Received 2 Feb 2024, 12:05:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 132158
Run time 38 min 8 sec
CPU time 37 min 48 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 17,076.54 GFLOPS
Application version Quantum chemistry calculations on GPU v1.04 (cuda1121)
x86_64-pc-linux-gnu
Peak working set size 0.00 MB
Peak swap size 0.00 MB
Peak disk usage 0.00 MB

Stderr output

<core_client_version>7.23.0</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)</message>
<stderr_txt>
11:24:51 (88110): wrapper (7.7.26016): starting
11:25:14 (88110): wrapper (7.7.26016): starting
11:25:14 (88110): wrapper: running bin/python (bin/conda-unpack)
11:25:15 (88110): bin/python exited; CPU time 0.126043
11:25:15 (88110): wrapper: running bin/tar (xjvf input.tar.bz2)
11:25:16 (88110): bin/tar exited; CPU time 0.001933
11:25:16 (88110): wrapper: running bin/bash (run.sh)
+ echo 'Setup environment'
+ source bin/activate
++ _conda_pack_activate
++ local _CONDA_SHELL_FLAVOR
++ '[' -n x ']'
++ _CONDA_SHELL_FLAVOR=bash
++ local script_dir
++ case "$_CONDA_SHELL_FLAVOR" in
+++ dirname bin/activate
++ script_dir=bin
+++ cd bin
+++ pwd
++ local full_path_script_dir=/hdd/boinc-client/slots/0/bin
+++ dirname /hdd/boinc-client/slots/0/bin
++ local full_path_env=/hdd/boinc-client/slots/0
+++ basename /hdd/boinc-client/slots/0
++ local env_name=0
++ '[' -n '' ']'
++ export CONDA_PREFIX=/hdd/boinc-client/slots/0
++ CONDA_PREFIX=/hdd/boinc-client/slots/0
++ export _CONDA_PACK_OLD_PS1=
++ _CONDA_PACK_OLD_PS1=
++ PATH=/hdd/boinc-client/slots/0/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
++ PS1='(0) '
++ case "$_CONDA_SHELL_FLAVOR" in
++ hash -r
++ local _script_dir=/hdd/boinc-client/slots/0/etc/conda/activate.d
++ '[' -d /hdd/boinc-client/slots/0/etc/conda/activate.d ']'
+ export PATH=/hdd/boinc-client/slots/0:/hdd/boinc-client/slots/0/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
+ PATH=/hdd/boinc-client/slots/0:/hdd/boinc-client/slots/0/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
+ echo 'Create a temporary directory'
+ export TMP=/hdd/boinc-client/slots/0/tmp
+ TMP=/hdd/boinc-client/slots/0/tmp
+ mkdir -p /hdd/boinc-client/slots/0/tmp
+ export OMP_NUM_THREADS=1
+ OMP_NUM_THREADS=1
+ export CUDA_VISIBLE_DEVICES=1
+ CUDA_VISIBLE_DEVICES=1
+ export CUPY_CUDA_LIB_PATH=/hdd/boinc-client/slots/0/cupy
+ CUPY_CUDA_LIB_PATH=/hdd/boinc-client/slots/0/cupy
+ echo 'Running PySCF'
+ python compute_dft.py
/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/lib/cutensor.py:174: UserWarning: using cupy as the tensor contraction engine.
  warnings.warn(f'using {contract_engine} as the tensor contraction engine.')
/hdd/boinc-client/slots/0/lib/python3.11/site-packages/pyscf/dft/libxc.py:771: UserWarning: Since PySCF-2.3, B3LYP (and B3P86) are changed to the VWN-RPA variant, corresponding to the original definition by Stephens et al. (issue 1480) and the same as the B3LYP functional in Gaussian. To restore the VWN5 definition, you can put the setting "B3LYP_WITH_VWN5 = True" in pyscf_conf.py
  warnings.warn('Since PySCF-2.3, B3LYP (and B3P86) are changed to the VWN-RPA variant, '
/hdd/boinc-client/slots/0/lib/python3.11/site-packages/pyscf/gto/mole.py:1280: UserWarning: Function mol.dumps drops attribute charge because it is not JSON-serializable
  warnings.warn(msg)
Traceback (most recent call last):
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/pyscf/lib/misc.py", line 1094, in __exit__
    handler.result()
  File "/hdd/boinc-client/slots/0/lib/python3.11/concurrent/futures/_base.py", line 456, in result
    return self.__get_result()
           ^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/concurrent/futures/_base.py", line 401, in __get_result
    raise self._exception
  File "/hdd/boinc-client/slots/0/lib/python3.11/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/df/df_jk.py", line 52, in build_df
    rsh_df.build(omega=omega)
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/df/df.py", line 102, in build
    self._cderi = cholesky_eri_gpu(intopt, mol, auxmol, self.cd_low, omega=omega)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/df/df.py", line 256, in cholesky_eri_gpu
    if lj>1: ints_slices = cart2sph(ints_slices, axis=1, ang=lj)
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/lib/cupy_helper.py", line 333, in cart2sph
    t_sph = contract('min,ip->mpn', t_cart, c2s, out=out)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/lib/cutensor.py", line 177, in contract
    return cupy.asarray(einsum(pattern, a, b), order='C')
                        ^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/cupy/linalg/_einsum.py", line 676, in einsum
    arr_out, sub_out = reduced_binary_einsum(
                       ^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/cupy/linalg/_einsum.py", line 421, in reduced_binary_einsum
    arr_out = cupy.matmul(tmp0, tmp1).reshape(shapes_out)
              ^^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/cupy/_core/_gufuncs.py", line 669, in __call__
    outs = cupy.empty(out_shape, dtype=ret_dtype, order=filter_order)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/cupy/_creation/basic.py", line 22, in empty
    return cupy.ndarray(shape, dtype, order=order)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "cupy/_core/core.pyx", line 132, in cupy._core.core.ndarray.__new__
  File "cupy/_core/core.pyx", line 220, in cupy._core.core._ndarray_base._init
  File "cupy/cuda/memory.pyx", line 740, in cupy.cuda.memory.alloc
  File "cupy/cuda/memory.pyx", line 1426, in cupy.cuda.memory.MemoryPool.malloc
  File "cupy/cuda/memory.pyx", line 1447, in cupy.cuda.memory.MemoryPool.malloc
  File "cupy/cuda/memory.pyx", line 1118, in cupy.cuda.memory.SingleDeviceMemoryPool.malloc
  File "cupy/cuda/memory.pyx", line 1139, in cupy.cuda.memory.SingleDeviceMemoryPool._malloc
  File "cupy/cuda/memory.pyx", line 1346, in cupy.cuda.memory.SingleDeviceMemoryPool._try_malloc
  File "cupy/cuda/memory.pyx", line 1358, in cupy.cuda.memory.SingleDeviceMemoryPool._try_malloc
cupy.cuda.memory.OutOfMemoryError: Out of memory allocating 580,048,896 bytes (allocated so far: 5,241,304,064 bytes, limit set to: 5,602,502,246 bytes).

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/hdd/boinc-client/slots/0/compute_dft.py", line 125, in <module>
    e,f,dip,q = compute_gpu(mol)
                ^^^^^^^^^^^^^^^^
  File "/hdd/boinc-client/slots/0/compute_dft.py", line 32, in compute_gpu
    e_dft = mf.kernel()  # compute total energy
            ^^^^^^^^^^^
  File "<string>", line 2, in kernel
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/scf/hf.py", line 586, in scf
    _kernel(self, self.conv_tol, self.conv_tol_grad,
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/scf/hf.py", line 393, in _kernel
    mf.init_workflow(dm0=dm)
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/gpu4pyscf/df/df_jk.py", line 56, in init_workflow
    with lib.call_in_background(build_df) as build:
  File "/hdd/boinc-client/slots/0/lib/python3.11/site-packages/pyscf/lib/misc.py", line 1096, in __exit__
    raise ThreadRuntimeError('Error on thread %s:\n%s' % (self, e))
pyscf.lib.misc.ThreadRuntimeError: Error on thread <pyscf.lib.misc.call_in_background object at 0x7f48b2235810>:
Out of memory allocating 580,048,896 bytes (allocated so far: 5,241,304,064 bytes, limit set to: 5,602,502,246 bytes).
12:02:57 (88110): bin/bash exited; CPU time 2021.913823
12:02:57 (88110): app exit status: 0x1
12:02:57 (88110): called boinc_finish(195)

</stderr_txt>
]]>


©2025 Universitat Pompeu Fabra