Name | wu_a4e219a5-GIANNI_GPROTO7-0-1-RND2248_0 |
Workunit | 31541193 |
Created | 22 Sep 2025, 22:28:18 UTC |
Sent | 22 Sep 2025, 22:48:51 UTC |
Report deadline | 27 Sep 2025, 22:48:51 UTC |
Received | 22 Sep 2025, 23:34:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
Computer ID | 604231 |
Run time | 13 min 7 sec |
CPU time | 1 min 5 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 82,585.56 GFLOPS |
Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
Peak working set size | 1.64 GB |
Peak swap size | 13.36 GB |
Peak disk usage | 11.51 GB |
<core_client_version>8.2.4</core_client_version> <![CDATA[ <message> The operating system cannot run (null). (0xc3) - exit code 195 (0xc3)</message> <stderr_txt> 18:25:29 (214560): wrapper (7.9.26016): starting 18:25:29 (214560): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2) tasks.json run.bat conf.yaml main_generation-0.1.0-py3-none-any.whl run.sh 18:25:30 (214560): Library/usr/bin/tar.exe exited; CPU time 0.015625 18:25:30 (214560): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat) Generating train split: 0 examples [00:00, ? examples/s] Generating train split: 2500 examples [00:00, 231560.63 examples/s] [W922 18:27:28.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [EMD-FL9]:58078 (system error: 10049 - The requested address is not valid in its context.). [rank0]: Traceback (most recent call last): [rank0]: File "wheel_contents/aiengine/main_generation.py", line 87, in <module> [rank0]: File "wheel_contents/aiengine/model.py", line 36, in __init__ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\utils.py", line 1096, in inner [rank0]: return fn(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__ [rank0]: self.llm_engine = LLMEngine.from_engine_args( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args [rank0]: return engine_cls.from_vllm_config( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config [rank0]: return cls( [rank0]: ^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__ [rank0]: self.model_executor = executor_class(vllm_config=vllm_config, ) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__ [rank0]: self._init_executor() [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\executor\uniproc_executor.py", line 47, in _init_executor [rank0]: self.collective_rpc("load_model") [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc [rank0]: answer = run_method(self.driver_worker, method, args, kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\utils.py", line 2359, in run_method [rank0]: return func(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\worker\worker.py", line 184, in load_model [rank0]: self.model_runner.load_model() [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\worker\model_runner.py", line 1113, in load_model [rank0]: self.model = get_model(vllm_config=self.vllm_config) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\__init__.py", line 14, in get_model [rank0]: return loader.load_model(vllm_config=vllm_config) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1280, in load_model [rank0]: self._load_weights(model_config, model) [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1183, in _load_weights [rank0]: self._get_quantized_weights_iterator(model_config.model, [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 891, in _get_quantized_weights_iterator [rank0]: hf_weights_files, use_safetensors = self._prepare_weights( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 820, in _prepare_weights [rank0]: hf_folder, hf_weights_files, matched_pattern = self._get_weight_files( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 801, in _get_weight_files [rank0]: hf_folder = download_weights_from_hf( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\vllm\model_executor\model_loader\weight_utils.py", line 270, in download_weights_from_hf [rank0]: hf_folder = snapshot_download( [rank0]: ^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\utils\_validators.py", line 114, in _inner_fn [rank0]: return fn(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\_snapshot_download.py", line 296, in snapshot_download [rank0]: thread_map( [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\tqdm\contrib\concurrent.py", line 69, in thread_map [rank0]: return _executor_map(ThreadPoolExecutor, fn, *iterables, **tqdm_kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\tqdm\contrib\concurrent.py", line 51, in _executor_map [rank0]: return list(tqdm_class(ex.map(fn, *iterables, chunksize=chunksize), **kwargs)) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\tqdm\std.py", line 1169, in __iter__ [rank0]: for obj in iterable: [rank0]: ^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 619, in result_iterator [rank0]: yield _result_or_cancel(fs.pop()) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 317, in _result_or_cancel [rank0]: return fut.result(timeout) [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 456, in result [rank0]: return self.__get_result() [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\concurrent\futures\_base.py", line 401, in __get_result [rank0]: raise self._exception [rank0]: File "M:\BOINC\slots\1\Lib\concurrent\futures\thread.py", line 59, in run [rank0]: result = self.fn(*self.args, **self.kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\_snapshot_download.py", line 270, in _inner_hf_hub_download [rank0]: return hf_hub_download( [rank0]: ^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\utils\_validators.py", line 114, in _inner_fn [rank0]: return fn(*args, **kwargs) [rank0]: ^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 961, in hf_hub_download [rank0]: return _hf_hub_download_to_cache_dir( [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 1112, in _hf_hub_download_to_cache_dir [rank0]: _download_to_tmp_and_move( [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 1661, in _download_to_tmp_and_move [rank0]: xet_get( [rank0]: File "M:\BOINC\slots\1\Lib\site-packages\huggingface_hub\file_download.py", line 580, in xet_get [rank0]: download_files( [rank0]: RuntimeError: Data processing error: CAS service error : Error : single flight error: Real call failed: CasObjectError(InternalIOError(Custom { kind: Other, error: reqwest::Error { kind: Decode, source: hyper::Error(Body, Custom { kind: UnexpectedEof, error: IncompleteBody }) } })) 18:32:39 (214560): C:/Windows/system32/cmd.exe exited; CPU time 65.000000 18:32:39 (214560): app exit status: 0x16 18:32:39 (214560): called boinc_finish(195) 0 bytes in 0 Free Blocks. 176 bytes in 6 Normal Blocks. 1144 bytes in 1 CRT Blocks. 0 bytes in 0 Ignore Blocks. 0 bytes in 0 Client Blocks. Largest number used: 0 bytes. Total allocations: 6613833 bytes. Dumping objects -> {1601490} normal block at 0x00000200BFBC61C0, 48 bytes long. Data: <PATH=M:\BOINC\sl> 50 41 54 48 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C {1601479} normal block at 0x00000200C1876970, 32 bytes long. Data: <HOME=M:\BOINC\sl> 48 4F 4D 45 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C {1601468} normal block at 0x00000200C18774B0, 32 bytes long. Data: <TMP=M:\BOINC\slo> 54 4D 50 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C 6F {1601457} normal block at 0x00000200C1876730, 32 bytes long. Data: <TEMP=M:\BOINC\sl> 54 45 4D 50 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C {1601446} normal block at 0x00000200C18761F0, 32 bytes long. Data: <TMPDIR=M:\BOINC\> 54 4D 50 44 49 52 3D 4D 3A 5C 42 4F 49 4E 43 5C {1601415} normal block at 0x00000200BFBC6150, 48 bytes long. Data: <PATH=M:\BOINC\sl> 50 41 54 48 3D 4D 3A 5C 42 4F 49 4E 43 5C 73 6C ..\api\boinc_api.cpp(309) : {1601402} normal block at 0x00000200BFBBE960, 8 bytes long. Data: < µ¿ > 00 00 B5 BF 00 02 00 00 {1599918} normal block at 0x00000200BFBBE870, 8 bytes long. Data: <PmÁ¿ > 50 6D C1 BF 00 02 00 00 ..\zip\boinc_zip.cpp(122) : {306} normal block at 0x00000200BFBB2AC0, 260 bytes long. Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 {291} normal block at 0x00000200BFBB0100, 80 bytes long. Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C {290} normal block at 0x00000200BFBC7520, 16 bytes long. Data: <¨y¼¿ > A8 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {289} normal block at 0x00000200BFBC6FD0, 16 bytes long. Data: < y¼¿ > 80 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {288} normal block at 0x00000200BFBC7160, 16 bytes long. Data: <Xy¼¿ > 58 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {287} normal block at 0x00000200BFBC6B70, 16 bytes long. Data: <0y¼¿ > 30 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {286} normal block at 0x00000200BFBC6850, 16 bytes long. Data: < y¼¿ > 08 79 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {285} normal block at 0x00000200BFBC6F30, 16 bytes long. Data: <àx¼¿ > E0 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {284} normal block at 0x00000200BFBC6000, 48 bytes long. Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F {283} normal block at 0x00000200BFBC6DA0, 16 bytes long. Data: < '»¿ > 08 27 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {282} normal block at 0x00000200BFBC0E20, 32 bytes long. Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 {281} normal block at 0x00000200BFBC7070, 16 bytes long. Data: <à&»¿ > E0 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {279} normal block at 0x00000200BFBC74D0, 16 bytes long. Data: <¸&»¿ > B8 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {278} normal block at 0x00000200BFBC7110, 16 bytes long. Data: < &»¿ > 90 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {277} normal block at 0x00000200BFBC72A0, 16 bytes long. Data: <h&»¿ > 68 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {276} normal block at 0x00000200BFBC75C0, 16 bytes long. Data: <@&»¿ > 40 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {275} normal block at 0x00000200BFBC7480, 16 bytes long. Data: < &»¿ > 18 26 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {274} normal block at 0x00000200BFBC0DC0, 32 bytes long. Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 {273} normal block at 0x00000200BFBC7020, 16 bytes long. Data: <ð%»¿ > F0 25 BB BF 00 02 00 00 00 00 00 00 00 00 00 00 {272} normal block at 0x00000200BFBB25F0, 320 bytes long. Data: < p¼¿ À ¼¿ > 20 70 BC BF 00 02 00 00 C0 0D BC BF 00 02 00 00 {271} normal block at 0x00000200BFBC7610, 16 bytes long. Data: <Àx¼¿ > C0 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {270} normal block at 0x00000200BFBC6A30, 16 bytes long. Data: < x¼¿ > 98 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {269} normal block at 0x00000200BFBC0640, 32 bytes long. Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 {268} normal block at 0x00000200BFBC6800, 16 bytes long. Data: <px¼¿ > 70 78 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {267} normal block at 0x00000200BFBC1240, 32 bytes long. Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 {266} normal block at 0x00000200BFBC71B0, 16 bytes long. Data: <¸w¼¿ > B8 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {265} normal block at 0x00000200BFBC6D00, 16 bytes long. Data: < w¼¿ > 90 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {264} normal block at 0x00000200BFBC7430, 16 bytes long. Data: <hw¼¿ > 68 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {263} normal block at 0x00000200BFBC72F0, 16 bytes long. Data: <@w¼¿ > 40 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {262} normal block at 0x00000200BFBC70C0, 16 bytes long. Data: < w¼¿ > 18 77 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {261} normal block at 0x00000200BFBC6A80, 16 bytes long. Data: <ðv¼¿ > F0 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {259} normal block at 0x00000200BFBC6F80, 16 bytes long. Data: < c¼¿ > 80 63 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {258} normal block at 0x00000200BFBC6380, 40 bytes long. Data: < o¼¿ Pa¼¿ > 80 6F BC BF 00 02 00 00 50 61 BC BF 00 02 00 00 {257} normal block at 0x00000200BFBC6760, 16 bytes long. Data: <Ðv¼¿ > D0 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {256} normal block at 0x00000200BFBC7250, 16 bytes long. Data: <¨v¼¿ > A8 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {255} normal block at 0x00000200BFBC0940, 32 bytes long. Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F {254} normal block at 0x00000200BFBC6710, 16 bytes long. Data: < v¼¿ > 80 76 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {253} normal block at 0x00000200BFBC7680, 992 bytes long. Data: < g¼¿ @ ¼¿ > 10 67 BC BF 00 02 00 00 40 09 BC BF 00 02 00 00 {97} normal block at 0x00000200BFBC0D00, 32 bytes long. Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F {96} normal block at 0x00000200BFBBEAA0, 16 bytes long. Data: <à`¼¿ > E0 60 BC BF 00 02 00 00 00 00 00 00 00 00 00 00 {95} normal block at 0x00000200BFBC60E0, 40 bytes long. Data: < ê»¿ ¼¿ > A0 EA BB BF 00 02 00 00 00 0D BC BF 00 02 00 00 {74} normal block at 0x00000200BFBBECD0, 16 bytes long. Data: < êâ ÷ > 80 EA E2 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {73} normal block at 0x00000200BFBBEDC0, 16 bytes long. Data: <@éâ ÷ > 40 E9 E2 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {72} normal block at 0x00000200BFBBE640, 16 bytes long. Data: <øWß ÷ > F8 57 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {71} normal block at 0x00000200BFBBEBE0, 16 bytes long. Data: <ØWß ÷ > D8 57 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {70} normal block at 0x00000200BFBBE410, 16 bytes long. Data: <P ß ÷ > 50 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {69} normal block at 0x00000200BFBBEEB0, 16 bytes long. Data: <0 ß ÷ > 30 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {68} normal block at 0x00000200BFBBE140, 16 bytes long. Data: <à ß ÷ > E0 02 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {67} normal block at 0x00000200BFBBE2D0, 16 bytes long. Data: < ß ÷ > 10 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {66} normal block at 0x00000200BFBBEF00, 16 bytes long. Data: <p ß ÷ > 70 04 DF 06 F7 7F 00 00 00 00 00 00 00 00 00 00 {65} normal block at 0x00000200BFBBE820, 16 bytes long. Data: < ÀÝ ÷ > 18 C0 DD 06 F7 7F 00 00 00 00 00 00 00 00 00 00 Object dump complete. </stderr_txt> ]]>
©2025 Universitat Pompeu Fabra