Name | test_1-SFARR_TEST_LLM_WINDOWS_101_7-0-1-RND2806_0 |
Workunit | 31482405 |
Created | 24 Apr 2025, 14:44:47 UTC |
Sent | 24 Apr 2025, 14:47:42 UTC |
Report deadline | 29 Apr 2025, 14:47:42 UTC |
Received | 24 Apr 2025, 14:58:50 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
Computer ID | 627900 |
Run time | 6 min 56 sec |
CPU time | 35 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 40,002.59 GFLOPS |
Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
Peak working set size | 753.47 MB |
Peak swap size | 2.46 GB |
Peak disk usage | 12.03 GB |
<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
(null) cannot be run. (0xc3) - exit code 195 (0xc3)</message>
<stderr_txt>
16:53:37 (4480): wrapper (7.9.26016): starting
16:53:37 (4480): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.bat
run.sh
tasks.json
16:53:38 (4480): Library/usr/bin/tar.exe exited; CPU time 0.015625
16:53:38 (4480): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)
Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 1000 examples [00:00, 63594.38 examples/s]
[W424 16:55:48.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [Pioneer-4v3]:50827 (system error: 10049 - The requested address is not valid in its context.).
[rank0]: Traceback (most recent call last):
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\layers\quantization\bitsandbytes.py", line 158, in __init__
[rank0]:     import bitsandbytes
[rank0]: ModuleNotFoundError: No module named 'bitsandbytes'
[rank0]: The above exception was the direct cause of the following exception:
[rank0]: Traceback (most recent call last):
[rank0]:   File "wheel_contents/aiengine/main_generation.py", line 86, in <module>
[rank0]:   File "wheel_contents/aiengine/model.py", line 36, in __init__
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\utils.py", line 1096, in inner
[rank0]:     return fn(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
[rank0]:     self.llm_engine = LLMEngine.from_engine_args(
[rank0]:                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
[rank0]:     return engine_cls.from_vllm_config(
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
[rank0]:     return cls(
[rank0]:            ^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
[rank0]:     self.model_executor = executor_class(vllm_config=vllm_config, )
[rank0]:                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
[rank0]:     self._init_executor()
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\executor\uniproc_executor.py", line 47, in _init_executor
[rank0]:     self.collective_rpc("load_model")
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
[rank0]:     answer = run_method(self.driver_worker, method, args, kwargs)
[rank0]:              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\utils.py", line 2359, in run_method
[rank0]:     return func(*args, **kwargs)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\worker\worker.py", line 184, in load_model
[rank0]:     self.model_runner.load_model()
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\worker\model_runner.py", line 1113, in load_model
[rank0]:     self.model = get_model(vllm_config=self.vllm_config)
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\model_loader\__init__.py", line 14, in get_model
[rank0]:     return loader.load_model(vllm_config=vllm_config)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1278, in load_model
[rank0]:     model = _initialize_model(vllm_config=vllm_config)
[rank0]:             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 127, in _initialize_model
[rank0]:     return model_class(vllm_config=vllm_config, prefix=prefix)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 431, in __init__
[rank0]:     self.model = Qwen2Model(vllm_config=vllm_config,
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\compilation\decorators.py", line 151, in __init__
[rank0]:     old_init(self, vllm_config=vllm_config, prefix=prefix, **kwargs)
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 300, in __init__
[rank0]:     self.start_layer, self.end_layer, self.layers = make_layers(
[rank0]:                                                     ^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\models\utils.py", line 610, in make_layers
[rank0]:     maybe_offload_to_cpu(layer_fn(prefix=f"{prefix}.{idx}"))
[rank0]:                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 302, in <lambda>
[rank0]:     lambda prefix: Qwen2DecoderLayer(config=config,
[rank0]:                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 206, in __init__
[rank0]:     self.self_attn = Qwen2Attention(
[rank0]:                      ^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 136, in __init__
[rank0]:     self.qkv_proj = QKVParallelLinear(
[rank0]:                     ^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\layers\linear.py", line 833, in __init__
[rank0]:     super().__init__(input_size=input_size,
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\layers\linear.py", line 384, in __init__
[rank0]:     super().__init__(input_size,
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\layers\linear.py", line 231, in __init__
[rank0]:     self.quant_method = quant_config.get_quant_method(self,
[rank0]:                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\layers\quantization\bitsandbytes.py", line 128, in get_quant_method
[rank0]:     return BitsAndBytesLinearMethod(self)
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "S:\BOINCdata\slots\21\Lib\site-packages\vllm\model_executor\layers\quantization\bitsandbytes.py", line 163, in __init__
[rank0]:     raise ImportError("Please install bitsandbytes>=0.45.3 via "
[rank0]: ImportError: Please install bitsandbytes>=0.45.3 via `pip install bitsandbytes>=0.45.3` to use bitsandbytes quantizer.
16:55:51 (4480): C:/Windows/system32/cmd.exe exited; CPU time 34.984375
16:55:51 (4480): app exit status: 0x16
16:55:51 (4480): called boinc_finish(195)
0 bytes in 0 Free Blocks.
380 bytes in 8 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 2235441 bytes.
Dumping objects ->
{1601285} normal block at 0x000001814A491410, 48 bytes long. Data: <PATH=S:\BOINCdat> 50 41 54 48 3D 53 3A 5C 42 4F 49 4E 43 64 61 74 {1601274} normal block at 0x000001814A5869E0, 32 bytes long. Data: <HOME=S:\BOINCdat> 48 4F 4D 45 3D 53 3A 5C 42 4F 49 4E 43 64 61 74 {1601263} normal block at 0x000001814A586E00, 32 bytes long. Data: <TMP=S:\BOINCdata> 54 4D 50 3D 53 3A 5C 42 4F 49 4E 43 64 61 74 61 {1601252} normal block at 0x000001814A586560, 32 bytes long. Data: <TEMP=S:\BOINCdat> 54 45 4D 50 3D 53 3A 5C 42 4F 49 4E 43 64 61 74 {1601241} normal block at 0x000001814A585660, 32 bytes long. Data: <TMPDIR=S:\BOINCd> 54 4D 50 44 49 52 3D 53 3A 5C 42 4F 49 4E 43 64 {1601210} normal block at 0x000001814A4915D0, 48 bytes long.
Data: <PATH=S:\BOINCdat> 50 41 54 48 3D 53 3A 5C 42 4F 49 4E 43 64 61 74 {1601199} normal block at 0x000001814A55BA90, 102 bytes long. Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 ..\api\boinc_api.cpp(309) : {1601196} normal block at 0x000001814877E6E0, 8 bytes long. Data: < J > 00 00 20 4A 81 01 00 00 {1600538} normal block at 0x000001814A55C430, 102 bytes long. Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 {1599910} normal block at 0x000001814877D8D0, 8 bytes long. Data: < l]J > A0 6C 5D 4A 81 01 00 00 ..\zip\boinc_zip.cpp(122) : {302} normal block at 0x00000181487714C0, 260 bytes long. Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 {287} normal block at 0x00000181487723C0, 80 bytes long. Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C {286} normal block at 0x00000181487868E0, 16 bytes long. Data: <ÈqxH > C8 71 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {285} normal block at 0x0000018148786890, 16 bytes long. Data: < qxH > A0 71 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {284} normal block at 0x00000181487860C0, 16 bytes long. Data: <xqxH > 78 71 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {283} normal block at 0x0000018148786660, 16 bytes long. Data: <PqxH > 50 71 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {282} normal block at 0x0000018148786840, 16 bytes long. Data: <(qxH > 28 71 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {281} normal block at 0x0000018148786390, 16 bytes long. Data: < qxH > 00 71 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {280} normal block at 0x000001814877FED0, 48 bytes long. Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F {279} normal block at 0x0000018148786A20, 16 bytes long. Data: <x]xH > 78 5D 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {278} normal block at 0x0000018148782810, 32 bytes long. Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 {277} normal block at 0x0000018148786C00, 16 bytes long. 
Data: <P]xH > 50 5D 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {275} normal block at 0x0000018148786520, 16 bytes long. Data: <(]xH > 28 5D 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {274} normal block at 0x0000018148786430, 16 bytes long. Data: < ]xH > 00 5D 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {273} normal block at 0x00000181487862A0, 16 bytes long. Data: <Ø\xH > D8 5C 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {272} normal block at 0x0000018148786340, 16 bytes long. Data: <°\xH > B0 5C 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {271} normal block at 0x0000018148786750, 16 bytes long. Data: < \xH > 88 5C 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {270} normal block at 0x00000181487822D0, 32 bytes long. Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 {269} normal block at 0x0000018148786480, 16 bytes long. Data: <`\xH > 60 5C 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {268} normal block at 0x0000018148785C60, 320 bytes long. Data: < dxH Ð"xH > 80 64 78 48 81 01 00 00 D0 22 78 48 81 01 00 00 {267} normal block at 0x00000181487861B0, 16 bytes long. Data: <àpxH > E0 70 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {266} normal block at 0x00000181487864D0, 16 bytes long. Data: <¸pxH > B8 70 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {265} normal block at 0x0000018148782210, 32 bytes long. Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 {264} normal block at 0x0000018148786020, 16 bytes long. Data: < pxH > 90 70 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {263} normal block at 0x0000018148781A30, 32 bytes long. Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 {262} normal block at 0x0000018148786700, 16 bytes long. Data: <ØoxH > D8 6F 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {261} normal block at 0x0000018148786D40, 16 bytes long. Data: <°oxH > B0 6F 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {260} normal block at 0x00000181487865C0, 16 bytes long. 
Data: < oxH > 88 6F 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {259} normal block at 0x0000018148786250, 16 bytes long. Data: <`oxH > 60 6F 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {258} normal block at 0x0000018148786C50, 16 bytes long. Data: <8oxH > 38 6F 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {257} normal block at 0x00000181487863E0, 16 bytes long. Data: < oxH > 10 6F 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {255} normal block at 0x0000018148786BB0, 16 bytes long. Data: < ýwH > 80 FD 77 48 81 01 00 00 00 00 00 00 00 00 00 00 {254} normal block at 0x000001814877FD80, 40 bytes long. Data: <°kxH Ð IJ > B0 6B 78 48 81 01 00 00 D0 15 49 4A 81 01 00 00 {253} normal block at 0x00000181487867F0, 16 bytes long. Data: <ðnxH > F0 6E 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {252} normal block at 0x00000181487867A0, 16 bytes long. Data: <ÈnxH > C8 6E 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {251} normal block at 0x0000018148782570, 32 bytes long. Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F {250} normal block at 0x0000018148786CA0, 16 bytes long. Data: < nxH > A0 6E 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {249} normal block at 0x0000018148786EA0, 992 bytes long. Data: < lxH p%xH > A0 6C 78 48 81 01 00 00 70 25 78 48 81 01 00 00 {93} normal block at 0x00000181487820F0, 32 bytes long. Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F {92} normal block at 0x000001814877DB00, 16 bytes long. Data: <à xH > E0 01 78 48 81 01 00 00 00 00 00 00 00 00 00 00 {91} normal block at 0x00000181487801E0, 40 bytes long. Data: < ÛwH ð xH > 00 DB 77 48 81 01 00 00 F0 20 78 48 81 01 00 00 {70} normal block at 0x000001814877DAB0, 16 bytes long. Data: < êòÄö > 80 EA F2 C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {69} normal block at 0x000001814877E5F0, 16 bytes long. Data: <@éòÄö > 40 E9 F2 C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {68} normal block at 0x000001814877E730, 16 bytes long. 
Data: <øWïÄö > F8 57 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {67} normal block at 0x000001814877E000, 16 bytes long. Data: <ØWïÄö > D8 57 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {66} normal block at 0x000001814877E5A0, 16 bytes long. Data: <P ïÄö > 50 04 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {65} normal block at 0x000001814877E410, 16 bytes long. Data: <0 ïÄö > 30 04 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {64} normal block at 0x000001814877DF60, 16 bytes long. Data: <à ïÄö > E0 02 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {63} normal block at 0x000001814877E690, 16 bytes long. Data: < ïÄö > 10 04 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {62} normal block at 0x000001814877E140, 16 bytes long. Data: <p ïÄö > 70 04 EF C4 F6 7F 00 00 00 00 00 00 00 00 00 00 {61} normal block at 0x000001814877E500, 16 bytes long. Data: < ÀíÄö > 18 C0 ED C4 F6 7F 00 00 00 00 00 00 00 00 00 00 Object dump complete. </stderr_txt> ]]>
©2025 Universitat Pompeu Fabra