Task 38575356

Name wu_a4e219a5-GIANNI_GPROTO7-0-1-RND2248_0
Workunit 31541193
Created 22 Sep 2025, 22:28:18 UTC
Sent 22 Sep 2025, 22:48:51 UTC
Report deadline 27 Sep 2025, 22:48:51 UTC
Received 22 Sep 2025, 23:34:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 604231
Run time 13 min 7 sec
CPU time 1 min 5 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 82,585.56 GFLOPS
Application version LLM: LLMs for chemistry v1.01 (cuda124L)
windows_x86_64
Peak working set size 1.64 GB
Peak swap size 13.36 GB
Peak disk usage 11.51 GB

Stderr output

<core_client_version>8.2.4</core_client_version>
<![CDATA[
<message>
The operating system cannot run (null).
 (0xc3) - exit code 195 (0xc3)</message>
<stderr_txt>
18:25:29 (214560): wrapper (7.9.26016): starting
18:25:29 (214560): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
tasks.json
run.bat
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.sh
18:25:30 (214560): Library/usr/bin/tar.exe exited; CPU time 0.015625
18:25:30 (214560): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)

Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 2500 examples [00:00, 231560.63 examples/s]
[W922 18:27:28.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [EMD-FL9]:58078 (system error: 10049 - The requested address is not valid in its context.).
[rank0]: Traceback (most recent call last):
[rank0]:   File "wheel_contents/aiengine/main_generation.py", line 87, in <module>
[rank0]:   File "wheel_contents/aiengine/model.py", line 36, in __init__
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\utils.py", line 1096, in inner
[rank0]:     return fn(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
[rank0]:     self.llm_engine = LLMEngine.from_engine_args(
[rank0]:                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
[rank0]:     return engine_cls.from_vllm_config(
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
[rank0]:     return cls(
[rank0]:            ^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
[rank0]:     self.model_executor = executor_class(vllm_config=vllm_config, )
[rank0]:                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
[rank0]:     self._init_executor()
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\executor\uniproc_executor.py", line 47, in _init_executor
[rank0]:     self.collective_rpc("load_model")
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
[rank0]:     answer = run_method(self.driver_worker, method, args, kwargs)
[rank0]:              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\utils.py", line 2359, in run_method
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\worker\worker.py", line 184, in load_model
[rank0]:     self.model_runner.load_model()
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\worker\model_runner.py", line 1113, in load_model
[rank0]:     self.model = get_model(vllm_config=self.vllm_config)
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\__init__.py", line 14, in get_model
[rank0]:     return loader.load_model(vllm_config=vllm_config)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1280, in load_model
[rank0]:     self._load_weights(model_config, model)
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1183, in _load_weights
[rank0]:     self._get_quantized_weights_iterator(model_config.model,
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 891, in _get_quantized_weights_iterator
[rank0]:     hf_weights_files, use_safetensors = self._prepare_weights(
[rank0]:                                         ^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 820, in _prepare_weights
[rank0]:     hf_folder, hf_weights_files, matched_pattern = self._get_weight_files(
[rank0]:                                                    ^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 801, in _get_weight_files
[rank0]:     hf_folder = download_weights_from_hf(
[rank0]:                 ^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\weight_utils.py", line 270, in download_weights_from_hf
[rank0]:     hf_folder = snapshot_download(
[rank0]:                 ^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\utils\_validators.py", line 114, in _inner_fn
[rank0]:     return fn(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\_snapshot_download.py", line 296, in snapshot_download
[rank0]:     thread_map(
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\tqdm\contrib\concurrent.py", line 69, in thread_map
[rank0]:     return _executor_map(ThreadPoolExecutor, fn, *iterables, **tqdm_kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\tqdm\contrib\concurrent.py", line 51, in _executor_map
[rank0]:     return list(tqdm_class(ex.map(fn, *iterables, chunksize=chunksize), **kwargs))
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\tqdm\std.py", line 1169, in __iter__
[rank0]:     for obj in iterable:
[rank0]:                ^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 619, in result_iterator
[rank0]:     yield _result_or_cancel(fs.pop())
[rank0]:           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 317, in _result_or_cancel
[rank0]:     return fut.result(timeout)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 456, in result
[rank0]:     return self.__get_result()
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 401, in __get_result
[rank0]:     raise self._exception
[rank0]:   File "M:\BOINC\slots\1\Lib\concurrent\futures\thread.py", line 59, in run
[rank0]:     result = self.fn(*self.args, **self.kwargs)
[rank0]:              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\_snapshot_download.py", line 270, in _inner_hf_hub_download
[rank0]:     return hf_hub_download(
[rank0]:            ^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\utils\_validators.py", line 114, in _inner_fn
[rank0]:     return fn(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 961, in hf_hub_download
[rank0]:     return _hf_hub_download_to_cache_dir(
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 1112, in _hf_hub_download_to_cache_dir
[rank0]:     _download_to_tmp_and_move(
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 1661, in _download_to_tmp_and_move
[rank0]:     xet_get(
[rank0]:   File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 580, in xet_get
[rank0]:     download_files(
[rank0]: RuntimeError: Data processing error: CAS service error : Error : single flight error: Real call failed: CasObjectError(InternalIOError(Custom { kind: Other, error: reqwest::Error { kind: Decode, source: hyper::Error(Body, Custom { kind: UnexpectedEof, error: IncompleteBody }) } }))
18:32:39 (214560): C:/Windows/system32/cmd.exe exited; CPU time 65.000000
18:32:39 (214560): app exit status: 0x16
18:32:39 (214560): called boinc_finish(195)
0 bytes in 0 Free Blocks.
176 bytes in 6 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 6613833 bytes.
Dumping objects ->
{1601490} normal block at 0x00000200BFBC61C0, 48 bytes long.
 Data: <PATH=M:\BOINC\sl> 50 41 54 48 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C 
{1601479} normal block at 0x00000200C1876970, 32 bytes long.
 Data: <HOME=M:\BOINC\sl> 48 4F 4D 45 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C 
{1601468} normal block at 0x00000200C18774B0, 32 bytes long.
 Data: <TMP=M:\BOINC\slo> 54 4D 50 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C 6F 
{1601457} normal block at 0x00000200C1876730, 32 bytes long.
 Data: <TEMP=M:\BOINC\sl> 54 45 4D 50 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C 
{1601446} normal block at 0x00000200C18761F0, 32 bytes long.
 Data: <TMPDIR=M:\BOINC\> 54 4D 50 44 49 52 3D 4D 3A 5C 42 4F 49 4E 43 5C 
{1601415} normal block at 0x00000200BFBC6150, 48 bytes long.
 Data: <PATH=M:\BOINC\sl> 50 41 54 48 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C 
..\api\boinc_api.cpp(309) : {1601402} normal block at 0x00000200BFBBE960, 8 bytes long.
 Data: <  &#181;&#191;    > 00 00 B5 BF 00 02 00 00 
{1599918} normal block at 0x00000200BFBBE870, 8 bytes long.
 Data: <Pm&#193;&#191;    > 50 6D C1 BF 00 02 00 00 
..\zip\boinc_zip.cpp(122) : {306} normal block at 0x00000200BFBB2AC0, 260 bytes long.
 Data: <                > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 
{291} normal block at 0x00000200BFBB0100, 80 bytes long.
 Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C 
{290} normal block at 0x00000200BFBC7520, 16 bytes long.
 Data: <&#168;y&#188;&#191;            > A8 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{289} normal block at 0x00000200BFBC6FD0, 16 bytes long.
 Data: < y&#188;&#191;            > 80 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{288} normal block at 0x00000200BFBC7160, 16 bytes long.
 Data: <Xy&#188;&#191;            > 58 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{287} normal block at 0x00000200BFBC6B70, 16 bytes long.
 Data: <0y&#188;&#191;            > 30 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{286} normal block at 0x00000200BFBC6850, 16 bytes long.
 Data: < y&#188;&#191;            > 08 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{285} normal block at 0x00000200BFBC6F30, 16 bytes long.
 Data: <&#224;x&#188;&#191;            > E0 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{284} normal block at 0x00000200BFBC6000, 48 bytes long.
 Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F 
{283} normal block at 0x00000200BFBC6DA0, 16 bytes long.
 Data: < '&#187;&#191;            > 08 27 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{282} normal block at 0x00000200BFBC0E20, 32 bytes long.
 Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 
{281} normal block at 0x00000200BFBC7070, 16 bytes long.
 Data: <&#224;&&#187;&#191;            > E0 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{279} normal block at 0x00000200BFBC74D0, 16 bytes long.
 Data: <&#184;&&#187;&#191;            > B8 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{278} normal block at 0x00000200BFBC7110, 16 bytes long.
 Data: < &&#187;&#191;            > 90 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{277} normal block at 0x00000200BFBC72A0, 16 bytes long.
 Data: <h&&#187;&#191;            > 68 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{276} normal block at 0x00000200BFBC75C0, 16 bytes long.
 Data: <@&&#187;&#191;            > 40 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{275} normal block at 0x00000200BFBC7480, 16 bytes long.
 Data: < &&#187;&#191;            > 18 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{274} normal block at 0x00000200BFBC0DC0, 32 bytes long.
 Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 
{273} normal block at 0x00000200BFBC7020, 16 bytes long.
 Data: <&#240;%&#187;&#191;            > F0 25 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 
{272} normal block at 0x00000200BFBB25F0, 320 bytes long.
 Data: < p&#188;&#191;    &#192; &#188;&#191;    > 20 70 BC BF 00 02 00 00 C0 0D BC BF 00 02 00 00 
{271} normal block at 0x00000200BFBC7610, 16 bytes long.
 Data: <&#192;x&#188;&#191;            > C0 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{270} normal block at 0x00000200BFBC6A30, 16 bytes long.
 Data: < x&#188;&#191;            > 98 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{269} normal block at 0x00000200BFBC0640, 32 bytes long.
 Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 
{268} normal block at 0x00000200BFBC6800, 16 bytes long.
 Data: <px&#188;&#191;            > 70 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{267} normal block at 0x00000200BFBC1240, 32 bytes long.
 Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 
{266} normal block at 0x00000200BFBC71B0, 16 bytes long.
 Data: <&#184;w&#188;&#191;            > B8 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{265} normal block at 0x00000200BFBC6D00, 16 bytes long.
 Data: < w&#188;&#191;            > 90 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{264} normal block at 0x00000200BFBC7430, 16 bytes long.
 Data: <hw&#188;&#191;            > 68 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{263} normal block at 0x00000200BFBC72F0, 16 bytes long.
 Data: <@w&#188;&#191;            > 40 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{262} normal block at 0x00000200BFBC70C0, 16 bytes long.
 Data: < w&#188;&#191;            > 18 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{261} normal block at 0x00000200BFBC6A80, 16 bytes long.
 Data: <&#240;v&#188;&#191;            > F0 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{259} normal block at 0x00000200BFBC6F80, 16 bytes long.
 Data: < c&#188;&#191;            > 80 63 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{258} normal block at 0x00000200BFBC6380, 40 bytes long.
 Data: < o&#188;&#191;    Pa&#188;&#191;    > 80 6F BC BF 00 02 00 00 50 61 BC BF 00 02 00 00 
{257} normal block at 0x00000200BFBC6760, 16 bytes long.
 Data: <&#208;v&#188;&#191;            > D0 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{256} normal block at 0x00000200BFBC7250, 16 bytes long.
 Data: <&#168;v&#188;&#191;            > A8 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{255} normal block at 0x00000200BFBC0940, 32 bytes long.
 Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F 
{254} normal block at 0x00000200BFBC6710, 16 bytes long.
 Data: < v&#188;&#191;            > 80 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{253} normal block at 0x00000200BFBC7680, 992 bytes long.
 Data: < g&#188;&#191;    @	&#188;&#191;    > 10 67 BC BF 00 02 00 00 40 09 BC BF 00 02 00 00 
{97} normal block at 0x00000200BFBC0D00, 32 bytes long.
 Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F 
{96} normal block at 0x00000200BFBBEAA0, 16 bytes long.
 Data: <&#224;`&#188;&#191;            > E0 60 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 
{95} normal block at 0x00000200BFBC60E0, 40 bytes long.
 Data: <&#160;&#234;&#187;&#191;      &#188;&#191;    > A0 EA BB BF 00 02 00 00 00 0D BC BF 00 02 00 00 
{74} normal block at 0x00000200BFBBECD0, 16 bytes long.
 Data: < &#234;&#226; &#247;           > 80 EA E2 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{73} normal block at 0x00000200BFBBEDC0, 16 bytes long.
 Data: <@&#233;&#226; &#247;           > 40 E9 E2 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{72} normal block at 0x00000200BFBBE640, 16 bytes long.
 Data: <&#248;W&#223; &#247;           > F8 57 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{71} normal block at 0x00000200BFBBEBE0, 16 bytes long.
 Data: <&#216;W&#223; &#247;           > D8 57 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{70} normal block at 0x00000200BFBBE410, 16 bytes long.
 Data: <P &#223; &#247;           > 50 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{69} normal block at 0x00000200BFBBEEB0, 16 bytes long.
 Data: <0 &#223; &#247;           > 30 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{68} normal block at 0x00000200BFBBE140, 16 bytes long.
 Data: <&#224; &#223; &#247;           > E0 02 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{67} normal block at 0x00000200BFBBE2D0, 16 bytes long.
 Data: <  &#223; &#247;           > 10 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{66} normal block at 0x00000200BFBBEF00, 16 bytes long.
 Data: <p &#223; &#247;           > 70 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
{65} normal block at 0x00000200BFBBE820, 16 bytes long.
 Data: < &#192;&#221; &#247;           > 18 C0 DD 06 F7 7F 00 00 00 00 00 00 00 00 00 00 
Object dump complete.

</stderr_txt>
]]>


©2025 Universitat Pompeu Fabra