Task 38579461

Name wu_75220a06-GIANNI_GPROTO7-0-1-RND5284_0
Workunit 31544839
Created 30 Sep 2025, 6:46:19 UTC
Sent 30 Sep 2025, 6:48:06 UTC
Report deadline 5 Oct 2025, 6:48:06 UTC
Received 30 Sep 2025, 6:53:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 639165
Run time 1 min 56 sec
CPU time 14 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 54,671.31 GFLOPS
Application version LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64
Peak working set size 830.00 MB
Peak swap size 2.98 GB
Peak disk usage 5.98 GB

Stderr output

<core_client_version>8.2.4</core_client_version>
<![CDATA[
<message>
The operating system cannot run (null).
 (0xc3) - exit code 195 (0xc3)</message>
<stderr_txt>
16:50:45 (33384): wrapper (7.9.26016): starting
16:50:45 (33384): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
tasks.json
run.bat
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.sh
16:50:46 (33384): Library/usr/bin/tar.exe exited; CPU time 0.000000
16:50:46 (33384): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)

Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 2500 examples [00:00, 384516.32 examples/s]
C:\ProgramData\BOINC\slots\0\Lib\site-packages\torch\cuda\__init__.py:235: UserWarning: 
NVIDIA GeForce RTX 5090 with CUDA capability sm_120 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90.
If you want to use the NVIDIA GeForce RTX 5090 GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/

  warnings.warn(
[W930 16:51:26.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [Nebula_PC]:55478 (system error: 10049 - The requested address is not valid in its context.).
[rank0]: Traceback (most recent call last):
[rank0]:   File "wheel_contents/aiengine/main_generation.py", line 87, in <module>
[rank0]:   File "wheel_contents/aiengine/model.py", line 36, in __init__
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\utils.py", line 1096, in inner
[rank0]:     return fn(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
[rank0]:     self.llm_engine = LLMEngine.from_engine_args(
[rank0]:                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
[rank0]:     return engine_cls.from_vllm_config(
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
[rank0]:     return cls(
[rank0]:            ^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
[rank0]:     self.model_executor = executor_class(vllm_config=vllm_config, )
[rank0]:                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
[rank0]:     self._init_executor()
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\executor\uniproc_executor.py", line 47, in _init_executor
[rank0]:     self.collective_rpc("load_model")
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
[rank0]:     answer = run_method(self.driver_worker, method, args, kwargs)
[rank0]:              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\utils.py", line 2359, in run_method
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\worker\worker.py", line 184, in load_model
[rank0]:     self.model_runner.load_model()
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\worker\model_runner.py", line 1113, in load_model
[rank0]:     self.model = get_model(vllm_config=self.vllm_config)
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\model_loader\__init__.py", line 14, in get_model
[rank0]:     return loader.load_model(vllm_config=vllm_config)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1278, in load_model
[rank0]:     model = _initialize_model(vllm_config=vllm_config)
[rank0]:             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 127, in _initialize_model
[rank0]:     return model_class(vllm_config=vllm_config, prefix=prefix)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 431, in __init__
[rank0]:     self.model = Qwen2Model(vllm_config=vllm_config,
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\compilation\decorators.py", line 151, in __init__
[rank0]:     old_init(self, vllm_config=vllm_config, prefix=prefix, **kwargs)
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 300, in __init__
[rank0]:     self.start_layer, self.end_layer, self.layers = make_layers(
[rank0]:                                                     ^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\models\utils.py", line 610, in make_layers
[rank0]:     maybe_offload_to_cpu(layer_fn(prefix=f"{prefix}.{idx}"))
[rank0]:                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 302, in <lambda>
[rank0]:     lambda prefix: Qwen2DecoderLayer(config=config,
[rank0]:                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 206, in __init__
[rank0]:     self.self_attn = Qwen2Attention(
[rank0]:                      ^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 153, in __init__
[rank0]:     self.rotary_emb = get_rope(
[rank0]:                       ^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 1180, in get_rope
[rank0]:     rotary_emb = RotaryEmbedding(head_size, rotary_dim, max_position, base,
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 99, in __init__
[rank0]:     cache = self._compute_cos_sin_cache()
[rank0]:             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 116, in _compute_cos_sin_cache
[rank0]:     inv_freq = self._compute_inv_freq(self.base)
[rank0]:                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 110, in _compute_inv_freq
[rank0]:     inv_freq = 1.0 / (base**(torch.arange(
[rank0]:                              ^^^^^^^^^^^^^
[rank0]:   File "C:\ProgramData\BOINC\slots\0\Lib\site-packages\torch\utils\_device.py", line 104, in __torch_function__
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]: RuntimeError: CUDA error: no kernel image is available for execution on the device
[rank0]: CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
[rank0]: For debugging consider passing CUDA_LAUNCH_BLOCKING=1
[rank0]: Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.

16:51:27 (33384): C:/Windows/system32/cmd.exe exited; CPU time 14.218750
16:51:27 (33384): app exit status: 0x16
16:51:27 (33384): called boinc_finish(195)
0 bytes in 0 Free Blocks.
256 bytes in 6 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 785909 bytes.
Dumping objects ->
{1601485} normal block at 0x0000018BEF8DCA80, 48 bytes long.
 Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601474} normal block at 0x0000018BEF8DC540, 48 bytes long.
 Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601463} normal block at 0x0000018BEF8DC3F0, 48 bytes long.
 Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61 
{1601452} normal block at 0x0000018BEF8DC8C0, 48 bytes long.
 Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601441} normal block at 0x0000018BEF8B6F60, 48 bytes long.
 Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61 
{1601410} normal block at 0x0000018BF1520E70, 64 bytes long.
 Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
..\api\boinc_api.cpp(309) : {1601397} normal block at 0x0000018BEF8AFB30, 8 bytes long.
 Data: <  &#212;&#239;&#139;   > 00 00 D4 EF 8B 01 00 00 
{1599914} normal block at 0x0000018BEF8AF3B0, 8 bytes long.
 Data: <P&#208;\&#241;&#139;   > 50 D0 5C F1 8B 01 00 00 
..\zip\boinc_zip.cpp(122) : {304} normal block at 0x0000018BEF8A4220, 260 bytes long.
 Data: <                > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 
{289} normal block at 0x0000018BEF89F440, 80 bytes long.
 Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C 
{288} normal block at 0x0000018BEF8B7EF0, 16 bytes long.
 Data: < &#139;&#139;&#239;&#139;           > 98 8B 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{287} normal block at 0x0000018BEF8B8670, 16 bytes long.
 Data: <p&#139;&#139;&#239;&#139;           > 70 8B 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{286} normal block at 0x0000018BEF8B7E50, 16 bytes long.
 Data: <H&#139;&#139;&#239;&#139;           > 48 8B 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{285} normal block at 0x0000018BEF8B7CC0, 16 bytes long.
 Data: < &#139;&#139;&#239;&#139;           > 20 8B 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{284} normal block at 0x0000018BEF8B84E0, 16 bytes long.
 Data: <&#248;&#138;&#139;&#239;&#139;           > F8 8A 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{283} normal block at 0x0000018BEF8B8440, 16 bytes long.
 Data: <&#208;&#138;&#139;&#239;&#139;           > D0 8A 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{282} normal block at 0x0000018BEF8B7040, 48 bytes long.
 Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F 
{281} normal block at 0x0000018BEF8B8620, 16 bytes long.
 Data: < u&#139;&#239;&#139;           > 98 75 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{280} normal block at 0x0000018BEF8B3860, 32 bytes long.
 Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 
{279} normal block at 0x0000018BEF8B83F0, 16 bytes long.
 Data: <pu&#139;&#239;&#139;           > 70 75 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{277} normal block at 0x0000018BEF8B80D0, 16 bytes long.
 Data: <Hu&#139;&#239;&#139;           > 48 75 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{276} normal block at 0x0000018BEF8B85D0, 16 bytes long.
 Data: < u&#139;&#239;&#139;           > 20 75 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{275} normal block at 0x0000018BEF8B7E00, 16 bytes long.
 Data: <&#248;t&#139;&#239;&#139;           > F8 74 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{274} normal block at 0x0000018BEF8B83A0, 16 bytes long.
 Data: <&#208;t&#139;&#239;&#139;           > D0 74 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{273} normal block at 0x0000018BEF8B8490, 16 bytes long.
 Data: <&#168;t&#139;&#239;&#139;           > A8 74 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{272} normal block at 0x0000018BEF8B3F80, 32 bytes long.
 Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 
{271} normal block at 0x0000018BEF8B8170, 16 bytes long.
 Data: < t&#139;&#239;&#139;           > 80 74 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{270} normal block at 0x0000018BEF8B7480, 320 bytes long.
 Data: <p &#139;&#239;&#139;    ?&#139;&#239;&#139;   > 70 81 8B EF 8B 01 00 00 80 3F 8B EF 8B 01 00 00 
{269} normal block at 0x0000018BEF8B7EA0, 16 bytes long.
 Data: <&#176;&#138;&#139;&#239;&#139;           > B0 8A 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{268} normal block at 0x0000018BEF8B8210, 16 bytes long.
 Data: < &#138;&#139;&#239;&#139;           > 88 8A 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{267} normal block at 0x0000018BEF8B3200, 32 bytes long.
 Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 
{266} normal block at 0x0000018BEF8B7950, 16 bytes long.
 Data: <`&#138;&#139;&#239;&#139;           > 60 8A 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{265} normal block at 0x0000018BEF8B35C0, 32 bytes long.
 Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 
{264} normal block at 0x0000018BEF8B8760, 16 bytes long.
 Data: <&#168;&#137;&#139;&#239;&#139;           > A8 89 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{263} normal block at 0x0000018BEF8B78B0, 16 bytes long.
 Data: < &#137;&#139;&#239;&#139;           > 80 89 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{262} normal block at 0x0000018BEF8B8350, 16 bytes long.
 Data: <X&#137;&#139;&#239;&#139;           > 58 89 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{261} normal block at 0x0000018BEF8B8580, 16 bytes long.
 Data: <0&#137;&#139;&#239;&#139;           > 30 89 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{260} normal block at 0x0000018BEF8B8300, 16 bytes long.
 Data: < &#137;&#139;&#239;&#139;           > 08 89 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{259} normal block at 0x0000018BEF8B81C0, 16 bytes long.
 Data: <&#224; &#139;&#239;&#139;           > E0 88 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{257} normal block at 0x0000018BEF8B7DB0, 16 bytes long.
 Data: < n&#139;&#239;&#139;           > 80 6E 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{256} normal block at 0x0000018BEF8B6E80, 40 bytes long.
 Data: <&#176;}&#139;&#239;&#139;   p R&#241;&#139;   > B0 7D 8B EF 8B 01 00 00 70 0E 52 F1 8B 01 00 00 
{255} normal block at 0x0000018BEF8B7900, 16 bytes long.
 Data: <&#192; &#139;&#239;&#139;           > C0 88 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{254} normal block at 0x0000018BEF8B8710, 16 bytes long.
 Data: <  &#139;&#239;&#139;           > 98 88 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{253} normal block at 0x0000018BEF8B38C0, 32 bytes long.
 Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F 
{252} normal block at 0x0000018BEF8B82B0, 16 bytes long.
 Data: <p &#139;&#239;&#139;           > 70 88 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{251} normal block at 0x0000018BEF8B8870, 992 bytes long.
 Data: <&#176;&#130;&#139;&#239;&#139;   &#192;8&#139;&#239;&#139;   > B0 82 8B EF 8B 01 00 00 C0 38 8B EF 8B 01 00 00 
{95} normal block at 0x0000018BEF8B3500, 32 bytes long.
 Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F 
{94} normal block at 0x0000018BEF8B0120, 16 bytes long.
 Data: <&#192;s&#139;&#239;&#139;           > C0 73 8B EF 8B 01 00 00 00 00 00 00 00 00 00 00 
{93} normal block at 0x0000018BEF8B73C0, 40 bytes long.
 Data: <  &#139;&#239;&#139;    5&#139;&#239;&#139;   > 20 01 8B EF 8B 01 00 00 00 35 8B EF 8B 01 00 00 
{72} normal block at 0x0000018BEF8AFD10, 16 bytes long.
 Data: < &#234;xe&#246;           > 80 EA 78 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{71} normal block at 0x0000018BEF8AFC70, 16 bytes long.
 Data: <@&#233;xe&#246;           > 40 E9 78 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{70} normal block at 0x0000018BEF8AF360, 16 bytes long.
 Data: <&#248;Wue&#246;           > F8 57 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{69} normal block at 0x0000018BEF8B0170, 16 bytes long.
 Data: <&#216;Wue&#246;           > D8 57 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{68} normal block at 0x0000018BEF8AF950, 16 bytes long.
 Data: <P ue&#246;           > 50 04 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{67} normal block at 0x0000018BEF8B0030, 16 bytes long.
 Data: <0 ue&#246;           > 30 04 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{66} normal block at 0x0000018BEF8AFC20, 16 bytes long.
 Data: <&#224; ue&#246;           > E0 02 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{65} normal block at 0x0000018BEF8AF540, 16 bytes long.
 Data: <  ue&#246;           > 10 04 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{64} normal block at 0x0000018BEF8AF450, 16 bytes long.
 Data: <p ue&#246;           > 70 04 75 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
{63} normal block at 0x0000018BEF8AFFE0, 16 bytes long.
 Data: < &#192;se&#246;           > 18 C0 73 65 F6 7F 00 00 00 00 00 00 00 00 00 00 
Object dump complete.

</stderr_txt>
]]>

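The UserWarning in the log above lists the architectures this PyTorch build was compiled for (sm_50 through sm_90) and reports the device as sm_120. The later "no kernel image is available for execution on the device" error is consistent with that mismatch. The following is a minimal sketch of the comparison, not the project's or PyTorch's actual code. The helper names `parse_sm` and `build_covers_device` are hypothetical, and the rule it applies (a compiled `sm_XY` binary is usable on devices with the same major version) is a simplifying assumption that ignores any PTX (`compute_XY`) entries. On a live machine the same inputs could be read with `torch.cuda.get_arch_list()` and `torch.cuda.get_device_capability()`.

```python
import re


def parse_sm(arch: str) -> int:
    """Turn a string such as 'sm_86' into the integer 86."""
    match = re.fullmatch(r"sm_(\d+)", arch)
    if match is None:
        raise ValueError(f"not an sm_XX architecture: {arch!r}")
    return int(match.group(1))


def build_covers_device(device_sm: str, built_for: list[str]) -> bool:
    """Assumed rule: some compiled architecture shares the device's major version."""
    device_major = parse_sm(device_sm) // 10
    return any(parse_sm(arch) // 10 == device_major for arch in built_for)


# Architecture list copied from the warning in the stderr output.
BUILT_FOR = "sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90".split()

print(build_covers_device("sm_120", BUILT_FOR))  # False: no major-version 12 binary
print(build_covers_device("sm_86", BUILT_FOR))   # True: sm_80 and sm_86 are present
```

Under this assumption the RTX 5090 (sm_120) has no matching kernel image in the bundled build, so the task would be expected to fail on this host until the application ships a build that includes that architecture.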

©2025 Universitat Pompeu Fabra