Name | wu_309dfcdb-GIANNI_GPROTO7-0-1-RND6839_0 |
Workunit | 31542416 |
Created | 24 Sep 2025, 9:29:06 UTC |
Sent | 24 Sep 2025, 9:29:26 UTC |
Report deadline | 29 Sep 2025, 9:29:26 UTC |
Received | 24 Sep 2025, 9:42:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
Computer ID | 632751 |
Run time | 9 min 23 sec |
CPU time | 36 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 83,567.27 GFLOPS |
Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
Peak working set size | 5.49 GB |
Peak swap size | 25.83 GB |
Peak disk usage | 6.35 GB |
<core_client_version>8.2.4</core_client_version> <![CDATA[ <message> Das Betriebssystem kann (null) nicht ausf�hren. (0xc3) - exit code 195 (0xc3)</message> <stderr_txt> 11:35:10 (496): wrapper (7.9.26016): starting 11:35:10 (496): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2) tasks.json run.bat conf.yaml main_generation-0.1.0-py3-none-any.whl run.sh 11:35:11 (496): Library/usr/bin/tar.exe exited; CPU time 0.000000 11:35:11 (496): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat) Generating train split: 0 examples [00:00, ? examples/s] Generating train split: 2500 examples [00:00, 282749.36 examples/s] E:\BOINC\DATA\slots\0\Lib\site-packages\huggingface_hub\file_download.py:144: UserWarning: `huggingface_hub` cache-system uses symlinks by default to efficiently store duplicated files but your machine does not support them in E:\BOINC\DATA\slots\.cache\hub\models--Acellera--proto. Caching files will still work but in a degraded version that might require more space on your disk. This warning can be disabled by setting the `HF_HUB_DISABLE_SYMLINKS_WARNING` environment variable. For more details, see https://huggingface.co/docs/huggingface_hub/how-to-cache#limitations. To support symlinks on Windows, you either need to activate Developer Mode or to run Python as an administrator. In order to activate developer mode, see this article: https://docs.microsoft.com/en-us/windows/apps/get-started/enable-your-device-for-development warnings.warn(message) [W924 11:38:53.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [ZEUSLORD.fritz.box]:62489 (system error: 10049 - Die angeforderte Adresse ist in diesem Kontext ungültig.). Loading safetensors checkpoint shards: 0% Completed | 0/2 [00:00<?, ?it/s] Loading safetensors checkpoint shards: 50% Completed | 1/2 [00:32<00:32, 32.18s/it] Loading safetensors checkpoint shards: 100% Completed | 2/2 [01:07<00:00, 33.83s/it] Loading safetensors checkpoint shards: 100% Completed | 2/2 [01:07<00:00, 33.58s/it] Loading safetensors checkpoint shards: 0% Completed | 0/2 [00:00<?, ?it/s] Loading safetensors checkpoint shards: 50% Completed | 1/2 [00:03<00:03, 3.58s/it] Loading safetensors checkpoint shards: 100% Completed | 2/2 [00:07<00:00, 3.67s/it] Loading safetensors checkpoint shards: 100% Completed | 2/2 [00:07<00:00, 3.66s/it] [rank0]: Traceback (most recent call last): [rank0]: File "wheel_contents/aiengine/main_generation.py", line 87, in <module> [rank0]: File "wheel_contents/aiengine/model.py", line 36, in __init__ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\utils.py", line 1096, in inner [rank0]: return fn(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__ [rank0]: self.llm_engine = LLMEngine.from_engine_args( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args [rank0]: return engine_cls.from_vllm_config( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config [rank0]: return cls( [rank0]: ^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 284, in __init__ [rank0]: self._initialize_kv_caches() [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\engine\llm_engine.py", line 446, in _initialize_kv_caches [rank0]: self.model_executor.initialize_cache(num_gpu_blocks, num_cpu_blocks) [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\executor\executor_base.py", line 123, in initialize_cache [rank0]: self.collective_rpc("initialize_cache", [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc [rank0]: answer = run_method(self.driver_worker, method, args, kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\utils.py", line 2359, in run_method [rank0]: return func(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\worker\worker.py", line 308, in initialize_cache [rank0]: self._init_cache_engine() [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\worker\worker.py", line 314, in _init_cache_engine [rank0]: CacheEngine(self.cache_config, self.model_config, [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\worker\cache_engine.py", line 66, in __init__ [rank0]: self.cpu_cache = self._allocate_kv_cache(self.num_cpu_blocks, "cpu") [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "E:\BOINC\DATA\slots\0\Lib\site-packages\vllm\worker\cache_engine.py", line 83, in _allocate_kv_cache [rank0]: layer_kv_cache = torch.zeros(kv_cache_shape, [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: RuntimeError: CUDA error: resource already mapped [rank0]: CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. [rank0]: For debugging consider passing CUDA_LAUNCH_BLOCKING=1 [rank0]: Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 11:40:22 (496): C:/Windows/system32/cmd.exe exited; CPU time 36.812500 11:40:22 (496): app exit status: 0x16 11:40:22 (496): called boinc_finish(195) 0 bytes in 0 Free Blocks. 176 bytes in 6 Normal Blocks. 1144 bytes in 1 CRT Blocks. 0 bytes in 0 Ignore Blocks. 0 bytes in 0 Client Blocks. Largest number used: 0 bytes. Total allocations: 5163651 bytes. Dumping objects -> {1601212} normal block at 0x000002451B244610, 48 bytes long. Data: <PATH=E:\BOINC\DA> 50 41 54 48 3D 45 3A 5C 42 4F 49 4E 43 5C 44 41 {1601201} normal block at 0x000002451D1AC310, 32 bytes long. Data: <HOME=E:\BOINC\DA> 48 4F 4D 45 3D 45 3A 5C 42 4F 49 4E 43 5C 44 41 {1601190} normal block at 0x000002451D1AB170, 32 bytes long. Data: <TMP=E:\BOINC\DAT> 54 4D 50 3D 45 3A 5C 42 4F 49 4E 43 5C 44 41 54 {1601179} normal block at 0x000002451D1AC370, 32 bytes long. Data: <TEMP=E:\BOINC\DA> 54 45 4D 50 3D 45 3A 5C 42 4F 49 4E 43 5C 44 41 {1601168} normal block at 0x000002451D1AC130, 32 bytes long. Data: <TMPDIR=E:\BOINC\> 54 4D 50 44 49 52 3D 45 3A 5C 42 4F 49 4E 43 5C {1601137} normal block at 0x000002451B244B50, 48 bytes long. Data: <PATH=E:\BOINC\DA> 50 41 54 48 3D 45 3A 5C 42 4F 49 4E 43 5C 44 41 ..\api\boinc_api.cpp(309) : {1601124} normal block at 0x000002451B24BFB0, 8 bytes long. Data: < n E > 00 00 6E 1B 45 02 00 00 {1599918} normal block at 0x000002451B24C4B0, 8 bytes long. Data: <0T" E > 30 54 22 1D 45 02 00 00 ..\zip\boinc_zip.cpp(122) : {306} normal block at 0x000002451B220330, 260 bytes long. Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 {291} normal block at 0x000002451B226A30, 80 bytes long. Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C {290} normal block at 0x000002451B24C460, 16 bytes long. Data: <øY$ E > F8 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {289} normal block at 0x000002451B24C820, 16 bytes long. Data: <ÐY$ E > D0 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {288} normal block at 0x000002451B24CC30, 16 bytes long. Data: <¨Y$ E > A8 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {287} normal block at 0x000002451B24BF10, 16 bytes long. Data: < Y$ E > 80 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {286} normal block at 0x000002451B24CB40, 16 bytes long. Data: <XY$ E > 58 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {285} normal block at 0x000002451B24CAA0, 16 bytes long. Data: <0Y$ E > 30 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {284} normal block at 0x000002451B247AC0, 48 bytes long. Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F {283} normal block at 0x000002451B24BEC0, 16 bytes long. Data: < T$ E > 08 54 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {282} normal block at 0x000002451B24D240, 32 bytes long. Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 {281} normal block at 0x000002451B24C190, 16 bytes long. Data: <àS$ E > E0 53 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {279} normal block at 0x000002451B24CBE0, 16 bytes long. Data: <¸S$ E > B8 53 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {278} normal block at 0x000002451B24C730, 16 bytes long. Data: < S$ E > 90 53 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {277} normal block at 0x000002451B24C690, 16 bytes long. Data: <hS$ E > 68 53 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {276} normal block at 0x000002451B24CB90, 16 bytes long. Data: <@S$ E > 40 53 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {275} normal block at 0x000002451B24C870, 16 bytes long. Data: < S$ E > 18 53 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {274} normal block at 0x000002451B24D420, 32 bytes long. Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 {273} normal block at 0x000002451B24CA50, 16 bytes long. Data: <ðR$ E > F0 52 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {272} normal block at 0x000002451B2452F0, 320 bytes long. Data: <PÊ$ E Ô$ E > 50 CA 24 1B 45 02 00 00 20 D4 24 1B 45 02 00 00 {271} normal block at 0x000002451B24CA00, 16 bytes long. Data: < Y$ E > 10 59 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {270} normal block at 0x000002451B24C500, 16 bytes long. Data: <èX$ E > E8 58 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {269} normal block at 0x000002451B24D960, 32 bytes long. Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 {268} normal block at 0x000002451B24C050, 16 bytes long. Data: <ÀX$ E > C0 58 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {267} normal block at 0x000002451B24DC00, 32 bytes long. Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 {266} normal block at 0x000002451B24C410, 16 bytes long. Data: < X$ E > 08 58 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {265} normal block at 0x000002451B24C960, 16 bytes long. Data: <àW$ E > E0 57 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {264} normal block at 0x000002451B24C550, 16 bytes long. Data: <¸W$ E > B8 57 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {263} normal block at 0x000002451B24CAF0, 16 bytes long. Data: < W$ E > 90 57 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {262} normal block at 0x000002451B24C3C0, 16 bytes long. Data: <hW$ E > 68 57 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {261} normal block at 0x000002451B24C640, 16 bytes long. Data: <@W$ E > 40 57 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {259} normal block at 0x000002451B24C9B0, 16 bytes long. Data: <ðu$ E > F0 75 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {258} normal block at 0x000002451B2475F0, 40 bytes long. Data: <°É$ E PK$ E > B0 C9 24 1B 45 02 00 00 50 4B 24 1B 45 02 00 00 {257} normal block at 0x000002451B24C2D0, 16 bytes long. Data: < W$ E > 20 57 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {256} normal block at 0x000002451B24C6E0, 16 bytes long. Data: <øV$ E > F8 56 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {255} normal block at 0x000002451B24D540, 32 bytes long. Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F {254} normal block at 0x000002451B24C910, 16 bytes long. Data: <ÐV$ E > D0 56 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {253} normal block at 0x000002451B2456D0, 992 bytes long. Data: < É$ E @Õ$ E > 10 C9 24 1B 45 02 00 00 40 D5 24 1B 45 02 00 00 {97} normal block at 0x000002451B232850, 32 bytes long. Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F {96} normal block at 0x000002451B238CF0, 16 bytes long. Data: <py$ E > 70 79 24 1B 45 02 00 00 00 00 00 00 00 00 00 00 {95} normal block at 0x000002451B247970, 40 bytes long. Data: <ðŒ# E P(# E > F0 8C 23 1B 45 02 00 00 50 28 23 1B 45 02 00 00 {74} normal block at 0x000002451B238AC0, 16 bytes long. Data: < êrá÷ > 80 EA 72 E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {73} normal block at 0x000002451B238A20, 16 bytes long. Data: <@érá÷ > 40 E9 72 E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {72} normal block at 0x000002451B2386B0, 16 bytes long. Data: <øWoá÷ > F8 57 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {71} normal block at 0x000002451B238980, 16 bytes long. Data: <ØWoá÷ > D8 57 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {70} normal block at 0x000002451B238A70, 16 bytes long. Data: <P oá÷ > 50 04 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {69} normal block at 0x000002451B238610, 16 bytes long. Data: <0 oá÷ > 30 04 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {68} normal block at 0x000002451B2388E0, 16 bytes long. Data: <à oá÷ > E0 02 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {67} normal block at 0x000002451B2385C0, 16 bytes long. Data: < oá÷ > 10 04 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {66} normal block at 0x000002451B238570, 16 bytes long. Data: <p oá÷ > 70 04 6F E1 F7 7F 00 00 00 00 00 00 00 00 00 00 {65} normal block at 0x000002451B2389D0, 16 bytes long. Data: < Àmá÷ > 18 C0 6D E1 F7 7F 00 00 00 00 00 00 00 00 00 00 Object dump complete. </stderr_txt> ]]>
©2025 Universitat Pompeu Fabra