Task 38579313

Name wu_b376cb6f-GIANNI_GPROTO7-0-1-RND1056_0
Workunit 31544699
Created 30 Sep 2025, 3:34:22 UTC
Sent 30 Sep 2025, 3:34:23 UTC
Report deadline 5 Oct 2025, 3:34:23 UTC
Received 30 Sep 2025, 7:08:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 644390
Run time 6 min 36 sec
CPU time 2 min 55 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 82,576.36 GFLOPS
Application version LLM: LLMs for chemistry v1.01 (cuda124L)
windows_x86_64
Peak working set size 3.03 GB
Peak swap size 20.11 GB
Peak disk usage 6.35 GB

Stderr output

<core_client_version>8.2.4</core_client_version>
<![CDATA[
<message>
The operating system cannot run (null).
 (0xc3) - exit code 195 (0xc3)</message>
<stderr_txt>
00:03:09 (18716): wrapper (7.9.26016): starting
00:03:09 (18716): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
tasks.json
run.bat
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.sh
00:03:10 (18716): Library/usr/bin/tar.exe exited; CPU time 0.000000
00:03:10 (18716): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)

Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 2500 examples [00:00, 184992.77 examples/s]
[W930 00:04:38.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [host.docker.internal]:58676 (system error: 10049 - The requested address is not valid in its context.).

Loading safetensors checkpoint shards:   0% Completed | 0/2 [00:00<?, ?it/s]

Loading safetensors checkpoint shards:  50% Completed | 1/2 [00:05<00:05,  5.58s/it]

Loading safetensors checkpoint shards: 100% Completed | 2/2 [00:10<00:00,  5.28s/it]

Loading safetensors checkpoint shards: 100% Completed | 2/2 [00:10<00:00,  5.32s/it]


Loading safetensors checkpoint shards:   0% Completed | 0/2 [00:00<?, ?it/s]

Loading safetensors checkpoint shards:  50% Completed | 1/2 [00:04<00:04,  4.81s/it]

Loading safetensors checkpoint shards: 100% Completed | 2/2 [00:09<00:00,  4.71s/it]

Loading safetensors checkpoint shards: 100% Completed | 2/2 [00:09<00:00,  4.73s/it]

[rank0]: Traceback (most recent call last):
[rank0]:   File "wheel_contents/aiengine/main_generation.py", line 87, in <module>
[rank0]:   File "wheel_contents/aiengine/model.py", line 36, in __init__
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\utils.py", line 1096, in inner
[rank0]:     return fn(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
[rank0]:     self.llm_engine = LLMEngine.from_engine_args(
[rank0]:                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
[rank0]:     return engine_cls.from_vllm_config(
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
[rank0]:     return cls(
[rank0]:            ^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\engine\llm_engine.py", line 284, in __init__
[rank0]:     self._initialize_kv_caches()
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\engine\llm_engine.py", line 446, in _initialize_kv_caches
[rank0]:     self.model_executor.initialize_cache(num_gpu_blocks, num_cpu_blocks)
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\executor\executor_base.py", line 123, in initialize_cache
[rank0]:     self.collective_rpc("initialize_cache",
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
[rank0]:     answer = run_method(self.driver_worker, method, args, kwargs)
[rank0]:              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\utils.py", line 2359, in run_method
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\worker\worker.py", line 292, in initialize_cache
[rank0]:     raise_if_cache_size_invalid(
[rank0]:   File "C:\ProgramData\BOINC\slots\38\Lib\site-packages\vllm\worker\worker.py", line 546, in raise_if_cache_size_invalid
[rank0]:     raise ValueError("No available memory for the cache blocks. "
[rank0]: ValueError: No available memory for the cache blocks. Try increasing `gpu_memory_utilization` when initializing the engine.
00:06:43 (18716): C:/Windows/system32/cmd.exe exited; CPU time 175.437500
00:06:43 (18716): app exit status: 0x16
00:06:43 (18716): called boinc_finish(195)
0 bytes in 0 Free Blocks.
256 bytes in 6 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 3670877 bytes.
Dumping objects ->
{1601203} normal block at 0x0000024AACE2ECC0, 48 bytes long.
 Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601192} normal block at 0x0000024AACE2F510, 48 bytes long.
 Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601181} normal block at 0x0000024AACE2F0B0, 48 bytes long.
 Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61 
{1601170} normal block at 0x0000024AACE2EE10, 48 bytes long.
 Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601159} normal block at 0x0000024AACE06060, 48 bytes long.
 Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61 
{1601128} normal block at 0x0000024AAED91C90, 64 bytes long.
 Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
..\api\boinc_api.cpp(309) : {1601115} normal block at 0x0000024AACE02510, 8 bytes long.
 Data: <  2&#173;J   > 00 00 32 AD 4A 02 00 00 
{1599920} normal block at 0x0000024AACE01C50, 8 bytes long.
 Data: <P&#235;&#237;&#172;J   > 50 EB ED AC 4A 02 00 00 
..\zip\boinc_zip.cpp(122) : {307} normal block at 0x0000024AACDF6A10, 260 bytes long.
 Data: <                > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 
{292} normal block at 0x0000024AACDF53A0, 80 bytes long.
 Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C 
{291} normal block at 0x0000024AACE0A930, 16 bytes long.
 Data: <&#232;]&#224;&#172;J           > E8 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{290} normal block at 0x0000024AACE0ABB0, 16 bytes long.
 Data: <&#192;]&#224;&#172;J           > C0 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{289} normal block at 0x0000024AACE0A6B0, 16 bytes long.
 Data: < ]&#224;&#172;J           > 98 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{288} normal block at 0x0000024AACE0AE80, 16 bytes long.
 Data: <p]&#224;&#172;J           > 70 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{287} normal block at 0x0000024AACE0AB60, 16 bytes long.
 Data: <H]&#224;&#172;J           > 48 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{286} normal block at 0x0000024AACE0AB10, 16 bytes long.
 Data: < ]&#224;&#172;J           > 20 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{285} normal block at 0x0000024AACE065A0, 48 bytes long.
 Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F 
{284} normal block at 0x0000024AACE0AA70, 16 bytes long.
 Data: <  &#223;&#172;J           > 18 1A DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{283} normal block at 0x0000024AACE03CE0, 32 bytes long.
 Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 
{282} normal block at 0x0000024AACE0A480, 16 bytes long.
 Data: <&#240; &#223;&#172;J           > F0 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{280} normal block at 0x0000024AACE0AC50, 16 bytes long.
 Data: <&#200; &#223;&#172;J           > C8 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{279} normal block at 0x0000024AACE0A3E0, 16 bytes long.
 Data: <&#160; &#223;&#172;J           > A0 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{278} normal block at 0x0000024AACE0A390, 16 bytes long.
 Data: <x &#223;&#172;J           > 78 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{277} normal block at 0x0000024AACE0A980, 16 bytes long.
 Data: <P &#223;&#172;J           > 50 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{276} normal block at 0x0000024AACE0AA20, 16 bytes long.
 Data: <( &#223;&#172;J           > 28 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{275} normal block at 0x0000024AACE04580, 32 bytes long.
 Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 
{274} normal block at 0x0000024AACE0A7F0, 16 bytes long.
 Data: <  &#223;&#172;J           > 00 19 DF AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{273} normal block at 0x0000024AACDF1900, 320 bytes long.
 Data: <&#240;&#167;&#224;&#172;J    E&#224;&#172;J   > F0 A7 E0 AC 4A 02 00 00 80 45 E0 AC 4A 02 00 00 
{272} normal block at 0x0000024AACE0AE30, 16 bytes long.
 Data: < ]&#224;&#172;J           > 00 5D E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{271} normal block at 0x0000024AACE0A4D0, 16 bytes long.
 Data: <&#216;\&#224;&#172;J           > D8 5C E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{270} normal block at 0x0000024AACE03C80, 32 bytes long.
 Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 
{269} normal block at 0x0000024AACE0ADE0, 16 bytes long.
 Data: <&#176;\&#224;&#172;J           > B0 5C E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{268} normal block at 0x0000024AACE04B20, 32 bytes long.
 Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 
{267} normal block at 0x0000024AACE0A430, 16 bytes long.
 Data: <&#248;[&#224;&#172;J           > F8 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{266} normal block at 0x0000024AACE0A8E0, 16 bytes long.
 Data: <&#208;[&#224;&#172;J           > D0 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{265} normal block at 0x0000024AACE0AD90, 16 bytes long.
 Data: <&#168;[&#224;&#172;J           > A8 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{264} normal block at 0x0000024AACE0A9D0, 16 bytes long.
 Data: < [&#224;&#172;J           > 80 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{263} normal block at 0x0000024AACE0AFC0, 16 bytes long.
 Data: <X[&#224;&#172;J           > 58 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{262} normal block at 0x0000024AACE0ACA0, 16 bytes long.
 Data: <0[&#224;&#172;J           > 30 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{260} normal block at 0x0000024AACE0AD40, 16 bytes long.
 Data: <`g&#224;&#172;J           > 60 67 E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{259} normal block at 0x0000024AACE06760, 40 bytes long.
 Data: <@&#173;&#224;&#172;J     &#217;&#174;J   > 40 AD E0 AC 4A 02 00 00 90 1C D9 AE 4A 02 00 00 
{258} normal block at 0x0000024AACE0B240, 16 bytes long.
 Data: < [&#224;&#172;J           > 10 5B E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{257} normal block at 0x0000024AACE0B1F0, 16 bytes long.
 Data: <&#232;Z&#224;&#172;J           > E8 5A E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{256} normal block at 0x0000024AACE04A60, 32 bytes long.
 Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F 
{255} normal block at 0x0000024AACE0A340, 16 bytes long.
 Data: <&#192;Z&#224;&#172;J           > C0 5A E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{254} normal block at 0x0000024AACE05AC0, 992 bytes long.
 Data: <@&#163;&#224;&#172;J   `J&#224;&#172;J   > 40 A3 E0 AC 4A 02 00 00 60 4A E0 AC 4A 02 00 00 
{98} normal block at 0x0000024AACE04340, 32 bytes long.
 Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F 
{97} normal block at 0x0000024AACE01AC0, 16 bytes long.
 Data: < c&#224;&#172;J           > 00 63 E0 AC 4A 02 00 00 00 00 00 00 00 00 00 00 
{96} normal block at 0x0000024AACE06300, 40 bytes long.
 Data: <&#192; &#224;&#172;J   @C&#224;&#172;J   > C0 1A E0 AC 4A 02 00 00 40 43 E0 AC 4A 02 00 00 
{75} normal block at 0x0000024AACE02830, 16 bytes long.
 Data: < &#234;&#159;=&#246;           > 80 EA 9F 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{74} normal block at 0x0000024AACE01BB0, 16 bytes long.
 Data: <@&#233;&#159;=&#246;           > 40 E9 9F 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{73} normal block at 0x0000024AACE01FC0, 16 bytes long.
 Data: <&#248;W&#156;=&#246;           > F8 57 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{72} normal block at 0x0000024AACE024C0, 16 bytes long.
 Data: <&#216;W&#156;=&#246;           > D8 57 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{71} normal block at 0x0000024AACE02150, 16 bytes long.
 Data: <P &#156;=&#246;           > 50 04 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{70} normal block at 0x0000024AACE02790, 16 bytes long.
 Data: <0 &#156;=&#246;           > 30 04 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{69} normal block at 0x0000024AACE023D0, 16 bytes long.
 Data: <&#224; &#156;=&#246;           > E0 02 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{68} normal block at 0x0000024AACE02740, 16 bytes long.
 Data: <  &#156;=&#246;           > 10 04 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{67} normal block at 0x0000024AACE02470, 16 bytes long.
 Data: <p &#156;=&#246;           > 70 04 9C 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
{66} normal block at 0x0000024AACE01A20, 16 bytes long.
 Data: < &#192;&#154;=&#246;           > 18 C0 9A 3D F6 7F 00 00 00 00 00 00 00 00 00 00 
Object dump complete.

</stderr_txt>
]]>


©2025 Universitat Pompeu Fabra