Name | wu_e0a02ffd-GIANNI_GPROTO7-0-1-RND9561_0 |
Workunit | 31544348 |
Created | 27 Sep 2025, 6:39:43 UTC |
Sent | 27 Sep 2025, 6:40:18 UTC |
Report deadline | 2 Oct 2025, 6:40:18 UTC |
Received | 27 Sep 2025, 8:36:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
Computer ID | 642323 |
Run time | 2 min 27 sec |
CPU time | 22 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 55,497.90 GFLOPS |
Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
Peak working set size | 720.74 MB |
Peak swap size | 2.43 GB |
Peak disk usage | 5.97 GB |
<core_client_version>8.2.4</core_client_version> <![CDATA[ <message> The operating system cannot run (null). (0xc3) - exit code 195 (0xc3)</message> <stderr_txt> 04:34:21 (548): wrapper (7.9.26016): starting 04:34:21 (548): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2) tasks.json run.bat conf.yaml main_generation-0.1.0-py3-none-any.whl run.sh 04:34:22 (548): Library/usr/bin/tar.exe exited; CPU time 0.000000 04:34:22 (548): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat) Generating train split: 0 examples [00:00, ? examples/s] Generating train split: 2500 examples [00:00, 238085.46 examples/s] D:\ProgramData\BOINC\slots\2\Lib\site-packages\huggingface_hub\file_download.py:144: UserWarning: `huggingface_hub` cache-system uses symlinks by default to efficiently store duplicated files but your machine does not support them in D:\ProgramData\BOINC\slots\.cache\hub\models--Acellera--proto. Caching files will still work but in a degraded version that might require more space on your disk. This warning can be disabled by setting the `HF_HUB_DISABLE_SYMLINKS_WARNING` environment variable. For more details, see https://huggingface.co/docs/huggingface_hub/how-to-cache#limitations. To support symlinks on Windows, you either need to activate Developer Mode or to run Python as an administrator. In order to activate developer mode, see this article: https://docs.microsoft.com/en-us/windows/apps/get-started/enable-your-device-for-development warnings.warn(message) D:\ProgramData\BOINC\slots\2\Lib\site-packages\huggingface_hub\file_download.py:144: UserWarning: `huggingface_hub` cache-system uses symlinks by default to efficiently store duplicated files but your machine does not support them in D:\ProgramData\BOINC\slots\.cache\hub\models--unsloth--Qwen2.5-14B-Instruct-bnb-4bit. Caching files will still work but in a degraded version that might require more space on your disk. This warning can be disabled by setting the `HF_HUB_DISABLE_SYMLINKS_WARNING` environment variable. For more details, see https://huggingface.co/docs/huggingface_hub/how-to-cache#limitations. To support symlinks on Windows, you either need to activate Developer Mode or to run Python as an administrator. In order to activate developer mode, see this article: https://docs.microsoft.com/en-us/windows/apps/get-started/enable-your-device-for-development warnings.warn(message) D:\ProgramData\BOINC\slots\2\Lib\site-packages\torch\cuda\__init__.py:235: UserWarning: NVIDIA GeForce RTX 5090 with CUDA capability sm_120 is not compatible with the current PyTorch installation. The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90. If you want to use the NVIDIA GeForce RTX 5090 GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/ warnings.warn( [W927 04:35:16.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [VENGEANCE]:62165 (system error: 10049 - The requested address is not valid in its context.). [rank0]: Traceback (most recent call last): [rank0]: File "wheel_contents/aiengine/main_generation.py", line 87, in <module> [rank0]: File "wheel_contents/aiengine/model.py", line 36, in __init__ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\utils.py", line 1096, in inner [rank0]: return fn(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__ [rank0]: self.llm_engine = LLMEngine.from_engine_args( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args [rank0]: return engine_cls.from_vllm_config( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config [rank0]: return cls( [rank0]: ^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__ [rank0]: self.model_executor = executor_class(vllm_config=vllm_config, ) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__ [rank0]: self._init_executor() [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\executor\uniproc_executor.py", line 47, in _init_executor [rank0]: self.collective_rpc("load_model") [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc [rank0]: answer = run_method(self.driver_worker, method, args, kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\utils.py", line 2359, in run_method [rank0]: return func(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\worker\worker.py", line 184, in load_model [rank0]: self.model_runner.load_model() [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\worker\model_runner.py", line 1113, in load_model [rank0]: self.model = get_model(vllm_config=self.vllm_config) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\model_loader\__init__.py", line 14, in get_model [rank0]: return loader.load_model(vllm_config=vllm_config) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1278, in load_model [rank0]: model = _initialize_model(vllm_config=vllm_config) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 127, in _initialize_model [rank0]: return model_class(vllm_config=vllm_config, prefix=prefix) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 431, in __init__ [rank0]: self.model = Qwen2Model(vllm_config=vllm_config, [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\compilation\decorators.py", line 151, in __init__ [rank0]: old_init(self, vllm_config=vllm_config, prefix=prefix, **kwargs) [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 300, in __init__ [rank0]: self.start_layer, self.end_layer, self.layers = make_layers( [rank0]: ^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\models\utils.py", line 610, in make_layers [rank0]: maybe_offload_to_cpu(layer_fn(prefix=f"{prefix}.{idx}")) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 302, in <lambda> [rank0]: lambda prefix: Qwen2DecoderLayer(config=config, [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 206, in __init__ [rank0]: self.self_attn = Qwen2Attention( [rank0]: ^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 153, in __init__ [rank0]: self.rotary_emb = get_rope( [rank0]: ^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 1180, in get_rope [rank0]: rotary_emb = RotaryEmbedding(head_size, rotary_dim, max_position, base, [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 99, in __init__ [rank0]: cache = self._compute_cos_sin_cache() [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 116, in _compute_cos_sin_cache [rank0]: inv_freq = self._compute_inv_freq(self.base) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\vllm\model_executor\layers\rotary_embedding.py", line 110, in _compute_inv_freq [rank0]: inv_freq = 1.0 / (base**(torch.arange( [rank0]: ^^^^^^^^^^^^^ [rank0]: File "D:\ProgramData\BOINC\slots\2\Lib\site-packages\torch\utils\_device.py", line 104, in __torch_function__ [rank0]: return func(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^ [rank0]: RuntimeError: CUDA error: no kernel image is available for execution on the device [rank0]: CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. [rank0]: For debugging consider passing CUDA_LAUNCH_BLOCKING=1 [rank0]: Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 04:35:19 (548): C:/Windows/system32/cmd.exe exited; CPU time 22.484375 04:35:19 (548): app exit status: 0x16 04:35:19 (548): called boinc_finish(195) 0 bytes in 0 Free Blocks. 256 bytes in 6 Normal Blocks. 1144 bytes in 1 CRT Blocks. 0 bytes in 0 Ignore Blocks. 0 bytes in 0 Client Blocks. Largest number used: 0 bytes. Total allocations: 914917 bytes. Dumping objects -> {1601515} normal block at 0x000002B67C278E50, 48 bytes long. Data: <PATH=D:\ProgramD> 50 41 54 48 3D 44 3A 5C 50 72 6F 67 72 61 6D 44 {1601504} normal block at 0x000002B67C254610, 48 bytes long. Data: <HOME=D:\ProgramD> 48 4F 4D 45 3D 44 3A 5C 50 72 6F 67 72 61 6D 44 {1601493} normal block at 0x000002B67C254530, 48 bytes long. Data: <TMP=D:\ProgramDa> 54 4D 50 3D 44 3A 5C 50 72 6F 67 72 61 6D 44 61 {1601482} normal block at 0x000002B67C2544C0, 48 bytes long. Data: <TEMP=D:\ProgramD> 54 45 4D 50 3D 44 3A 5C 50 72 6F 67 72 61 6D 44 {1601471} normal block at 0x000002B67C254A00, 48 bytes long. Data: <TMPDIR=D:\Progra> 54 4D 50 44 49 52 3D 44 3A 5C 50 72 6F 67 72 61 {1601440} normal block at 0x000002B67DFCC3A0, 64 bytes long. Data: <PATH=D:\ProgramD> 50 41 54 48 3D 44 3A 5C 50 72 6F 67 72 61 6D 44 ..\api\boinc_api.cpp(309) : {1601427} normal block at 0x000002B67C24F890, 8 bytes long. Data: < q|¶ > 00 00 71 7C B6 02 00 00 {1599902} normal block at 0x000002B67C24F700, 8 bytes long. Data: <Pé2|¶ > 50 E9 32 7C B6 02 00 00 ..\zip\boinc_zip.cpp(122) : {298} normal block at 0x000002B67C241850, 260 bytes long. Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 {283} normal block at 0x000002B67C23C130, 80 bytes long. Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C {282} normal block at 0x000002B67C2558C0, 16 bytes long. Data: <Ø^%|¶ > D8 5E 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {281} normal block at 0x000002B67C254CE0, 16 bytes long. Data: <°^%|¶ > B0 5E 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {280} normal block at 0x000002B67C255230, 16 bytes long. Data: < ^%|¶ > 88 5E 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {279} normal block at 0x000002B67C2557D0, 16 bytes long. Data: <`^%|¶ > 60 5E 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {278} normal block at 0x000002B67C2551E0, 16 bytes long. Data: <8^%|¶ > 38 5E 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {277} normal block at 0x000002B67C255410, 16 bytes long. Data: < ^%|¶ > 10 5E 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {276} normal block at 0x000002B67C2543E0, 48 bytes long. Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F {275} normal block at 0x000002B67C255370, 16 bytes long. Data: <Ø $|¶ > D8 10 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {274} normal block at 0x000002B67C250270, 32 bytes long. Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 {273} normal block at 0x000002B67C2553C0, 16 bytes long. Data: <° $|¶ > B0 10 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {271} normal block at 0x000002B67C255190, 16 bytes long. Data: < $|¶ > 88 10 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {270} normal block at 0x000002B67C254FB0, 16 bytes long. Data: <` $|¶ > 60 10 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {269} normal block at 0x000002B67C2555A0, 16 bytes long. Data: <8 $|¶ > 38 10 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {268} normal block at 0x000002B67C255AF0, 16 bytes long. Data: < $|¶ > 10 10 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {267} normal block at 0x000002B67C255000, 16 bytes long. Data: <è $|¶ > E8 0F 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {266} normal block at 0x000002B67C250B10, 32 bytes long. Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 {265} normal block at 0x000002B67C255140, 16 bytes long. Data: <À $|¶ > C0 0F 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {264} normal block at 0x000002B67C240FC0, 320 bytes long. Data: <@Q%|¶ %|¶ > 40 51 25 7C B6 02 00 00 10 0B 25 7C B6 02 00 00 {263} normal block at 0x000002B67C254F60, 16 bytes long. Data: <ð]%|¶ > F0 5D 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {262} normal block at 0x000002B67C255730, 16 bytes long. Data: <È]%|¶ > C8 5D 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {261} normal block at 0x000002B67C250450, 32 bytes long. Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 {260} normal block at 0x000002B67C255460, 16 bytes long. Data: < ]%|¶ > A0 5D 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {259} normal block at 0x000002B67C250630, 32 bytes long. Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 {258} normal block at 0x000002B67C255320, 16 bytes long. Data: <è\%|¶ > E8 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {257} normal block at 0x000002B67C255550, 16 bytes long. Data: <À\%|¶ > C0 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {256} normal block at 0x000002B67C254F10, 16 bytes long. Data: < \%|¶ > 98 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {255} normal block at 0x000002B67C254E20, 16 bytes long. Data: <p\%|¶ > 70 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {254} normal block at 0x000002B67C2552D0, 16 bytes long. Data: <H\%|¶ > 48 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {253} normal block at 0x000002B67C255A50, 16 bytes long. Data: < \%|¶ > 20 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {251} normal block at 0x000002B67C254EC0, 16 bytes long. Data: <ðF%|¶ > F0 46 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {250} normal block at 0x000002B67C2546F0, 40 bytes long. Data: <ÀN%|¶  Ãü}¶ > C0 4E 25 7C B6 02 00 00 A0 C3 FC 7D B6 02 00 00 {249} normal block at 0x000002B67C255500, 16 bytes long. Data: < \%|¶ > 00 5C 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {248} normal block at 0x000002B67C255870, 16 bytes long. Data: <Ø[%|¶ > D8 5B 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {247} normal block at 0x000002B67C2502D0, 32 bytes long. Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F {246} normal block at 0x000002B67C255640, 16 bytes long. Data: <°[%|¶ > B0 5B 25 7C B6 02 00 00 00 00 00 00 00 00 00 00 {245} normal block at 0x000002B67C255BB0, 992 bytes long. Data: <@V%|¶ Ð %|¶ > 40 56 25 7C B6 02 00 00 D0 02 25 7C B6 02 00 00 {89} normal block at 0x000002B67C2506F0, 32 bytes long. Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F {88} normal block at 0x000002B67C24F390, 16 bytes long. Data: <°î$|¶ > B0 EE 24 7C B6 02 00 00 00 00 00 00 00 00 00 00 {87} normal block at 0x000002B67C24EEB0, 40 bytes long. Data: < ó$|¶ ð %|¶ > 90 F3 24 7C B6 02 00 00 F0 06 25 7C B6 02 00 00 {66} normal block at 0x000002B67C24FAC0, 16 bytes long. Data: < êüÆö > 80 EA FC C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {65} normal block at 0x000002B67C24F250, 16 bytes long. Data: <@éüÆö > 40 E9 FC C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {64} normal block at 0x000002B67C24F4D0, 16 bytes long. Data: <øWùÆö > F8 57 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {63} normal block at 0x000002B67C24FC50, 16 bytes long. Data: <ØWùÆö > D8 57 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {62} normal block at 0x000002B67C24F200, 16 bytes long. Data: <P ùÆö > 50 04 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {61} normal block at 0x000002B67C24F9D0, 16 bytes long. Data: <0 ùÆö > 30 04 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {60} normal block at 0x000002B67C24F1B0, 16 bytes long. Data: <à ùÆö > E0 02 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {59} normal block at 0x000002B67C24F070, 16 bytes long. Data: < ùÆö > 10 04 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {58} normal block at 0x000002B67C24FA70, 16 bytes long. Data: <p ùÆö > 70 04 F9 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 {57} normal block at 0x000002B67C24FDE0, 16 bytes long. Data: < À÷Æö > 18 C0 F7 C6 F6 7F 00 00 00 00 00 00 00 00 00 00 Object dump complete. </stderr_txt> ]]>
©2025 Universitat Pompeu Fabra