Task 38575381

Name wu_ec069b1f-GIANNI_GPROTO7-0-1-RND5952_0
Workunit 31541218
Created 22 Sep 2025, 22:33:38 UTC
Sent 22 Sep 2025, 22:34:46 UTC
Report deadline 27 Sep 2025, 22:34:46 UTC
Received 23 Sep 2025, 6:14:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 643932
Run time 35 sec
CPU time 20 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 109,735.62 GFLOPS
Application version LLM: LLMs for chemistry v1.00 (cuda124L) x86_64-pc-linux-gnu
Peak working set size 599.61 MB
Peak swap size 6.31 GB
Peak disk usage 8.23 GB

Stderr output

<core_client_version>8.3.0</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)</message>
<stderr_txt>
2025-09-23 02:45:14 (646889): wrapper (8.1.26018): starting
2025-09-23 02:45:43 (646889): wrapper: running bin/python (bin/conda-unpack)
2025-09-23 02:45:43 (646889): wrapper: created child process 647047
2025-09-23 02:45:44 (646889): bin/python exited; CPU time 0.327976
2025-09-23 02:45:44 (646889): wrapper: running bin/tar (xjvf input.tar.bz2)
2025-09-23 02:45:44 (646889): wrapper: created child process 647059
2025-09-23 02:45:58 (646889): bin/tar exited; CPU time 0.013344
2025-09-23 02:45:58 (646889): wrapper: running bin/bash (run.sh)
2025-09-23 02:45:58 (646889): wrapper: created child process 647138
+ echo 'Setup environment'
+ source bin/activate
++ _conda_pack_activate
++ local _CONDA_SHELL_FLAVOR
++ '[' -n x ']'
++ _CONDA_SHELL_FLAVOR=bash
++ local script_dir
++ case "$_CONDA_SHELL_FLAVOR" in
+++ dirname bin/activate
++ script_dir=bin
+++ cd bin
+++ pwd
++ local full_path_script_dir=/var/lib/boinc/slots/3/bin
+++ dirname /var/lib/boinc/slots/3/bin
++ local full_path_env=/var/lib/boinc/slots/3
+++ basename /var/lib/boinc/slots/3
++ local env_name=3
++ '[' -n '' ']'
++ export CONDA_PREFIX=/var/lib/boinc/slots/3
++ CONDA_PREFIX=/var/lib/boinc/slots/3
++ export _CONDA_PACK_OLD_PS1=
++ _CONDA_PACK_OLD_PS1=
++ PATH=/var/lib/boinc/slots/3/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
++ PS1='(3) '
++ case "$_CONDA_SHELL_FLAVOR" in
++ hash -r
++ local _script_dir=/var/lib/boinc/slots/3/etc/conda/activate.d
++ '[' -d /var/lib/boinc/slots/3/etc/conda/activate.d ']'
+ export PATH=/var/lib/boinc/slots/3:/var/lib/boinc/slots/3/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
+ PATH=/var/lib/boinc/slots/3:/var/lib/boinc/slots/3/bin:/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.
+ echo 'Create a temporary directory'
+ export TMP=/var/lib/boinc/slots/3/tmp
+ TMP=/var/lib/boinc/slots/3/tmp
+ mkdir -p /var/lib/boinc/slots/3/tmp
+ which python
+ pip install main_generation-0.1.0-py3-none-any.whl -v --no-deps
+ export CUDA_VISIBLE_DEVICES=1
+ CUDA_VISIBLE_DEVICES=1
+ export HF_HOME=../.cache
+ HF_HOME=../.cache
+ export VLLM_ASSETS_CACHE=../.cache
+ VLLM_ASSETS_CACHE=../.cache
+ export VLLM_CACHE_ROOT=../.cache
+ VLLM_CACHE_ROOT=../.cache
+ echo RUNNING
+ pythonbinary=/var/lib/boinc/slots/3/lib/python3.12/site-packages/aiengine/main_generation.pyc
+ python /var/lib/boinc/slots/3/lib/python3.12/site-packages/aiengine/main_generation.pyc --conf conf.yaml

Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 2500 examples [00:00, 758628.27 examples/s]
/var/lib/boinc/slots/3/lib/python3.12/site-packages/torch/cuda/__init__.py:235: UserWarning: 
NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition with CUDA capability sm_120 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_70 sm_75 sm_80 sm_86 sm_90.
If you want to use the NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/

  warnings.warn(
run.sh: line 26: 647162 Killed                  python ${pythonbinary} --conf conf.yaml
2025-09-23 02:46:29 (646889): bin/bash exited; CPU time 29.303397
2025-09-23 02:46:29 (646889): app exit status: 0x89
2025-09-23 02:46:29 (646889): called boinc_finish(195)

</stderr_txt>
]]>
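
Note on the status codes above: the wrapper reports `app exit status: 0x89`, and the client message shows `195 (0xc3, -61)`. A minimal sketch of how these values relate, assuming the usual shell convention that a status above 128 means "128 + signal number" (the helper names `decode_shell_status` and `as_signed_byte` are illustrative, not part of BOINC). Under that assumption 0x89 = 137 = 128 + 9, which matches the `Killed` line printed by run.sh; the log does not say what sent the signal.

```python
import signal


def decode_shell_status(status: int) -> str:
    """Interpret an exit status using the common shell convention 128 + signal number."""
    if status > 128:
        signum = status - 128
        try:
            # signal.Signals maps a platform signal number to its symbolic name.
            name = signal.Signals(signum).name
        except ValueError:
            # Number is not a known signal on this platform.
            name = f"signal {signum}"
        return f"terminated by {name}"
    return f"exited with code {status}"


def as_signed_byte(status: int) -> int:
    """Reinterpret the low 8 bits of a status as a signed byte."""
    status &= 0xFF
    return status - 256 if status >= 128 else status


if __name__ == "__main__":
    print(hex(0x89), 0x89, decode_shell_status(0x89))
    print(hex(195), 195, as_signed_byte(195))
```

The second helper only reproduces the arithmetic behind the `-61` in the client message: 195 read as a signed 8-bit value is 195 - 256.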

