| Name | test_2-SFARR_TEST_LLM_WINDOWS_101_7-0-1-RND9798_0 |
| Workunit | 31482406 |
| Created | 24 Apr 2025, 14:44:47 UTC |
| Sent | 24 Apr 2025, 14:48:09 UTC |
| Report deadline | 29 Apr 2025, 14:48:09 UTC |
| Received | 24 Apr 2025, 14:54:24 UTC |
| Server state | Over |
| Outcome | Computation error |
| Client state | Compute error |
| Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
| Computer ID | 506550 |
| Run time | 4 min 28 sec |
| CPU time | 26 sec |
| Validate state | Invalid |
| Credit | 0.00 |
| Device peak FLOPS | 83,073.27 GFLOPS |
| Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
| Peak working set size | 714.66 MB |
| Peak swap size | 2.42 GB |
| Peak disk usage | 5.97 GB |
<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
The operating system cannot run (null).
(0xc3) - exit code 195 (0xc3)</message>
<stderr_txt>
07:50:18 (2716): wrapper (7.9.26016): starting
07:50:18 (2716): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.bat
run.sh
tasks.json
07:50:28 (2716): Library/usr/bin/tar.exe exited; CPU time 0.000000
07:50:30 (2716): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)
Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 1000 examples [00:00, 102540.19 examples/s]
[W424 07:51:47.000000000 socket.cpp:759] [c10d] The client socket has failed to connect to [professor-x]:60592 (system error: 10049 - The requested address is not valid in its context.).
[rank0]: Traceback (most recent call last):
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\layers\quantization\bitsandbytes.py", line 158, in __init__
[rank0]: import bitsandbytes
[rank0]: ModuleNotFoundError: No module named 'bitsandbytes'
[rank0]: The above exception was the direct cause of the following exception:
[rank0]: Traceback (most recent call last):
[rank0]: File "wheel_contents/aiengine/main_generation.py", line 86, in <module>
[rank0]: File "wheel_contents/aiengine/model.py", line 36, in __init__
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\utils.py", line 1096, in inner
[rank0]: return fn(*args, **kwargs)
[rank0]: ^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
[rank0]: self.llm_engine = LLMEngine.from_engine_args(
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
[rank0]: return engine_cls.from_vllm_config(
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
[rank0]: return cls(
[rank0]: ^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
[rank0]: self.model_executor = executor_class(vllm_config=vllm_config, )
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
[rank0]: self._init_executor()
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\executor\uniproc_executor.py", line 47, in _init_executor
[rank0]: self.collective_rpc("load_model")
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
[rank0]: answer = run_method(self.driver_worker, method, args, kwargs)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\utils.py", line 2359, in run_method
[rank0]: return func(*args, **kwargs)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\worker\worker.py", line 184, in load_model
[rank0]: self.model_runner.load_model()
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\worker\model_runner.py", line 1113, in load_model
[rank0]: self.model = get_model(vllm_config=self.vllm_config)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\model_loader\__init__.py", line 14, in get_model
[rank0]: return loader.load_model(vllm_config=vllm_config)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 1278, in load_model
[rank0]: model = _initialize_model(vllm_config=vllm_config)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\model_loader\loader.py", line 127, in _initialize_model
[rank0]: return model_class(vllm_config=vllm_config, prefix=prefix)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 431, in __init__
[rank0]: self.model = Qwen2Model(vllm_config=vllm_config,
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\compilation\decorators.py", line 151, in __init__
[rank0]: old_init(self, vllm_config=vllm_config, prefix=prefix, **kwargs)
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 300, in __init__
[rank0]: self.start_layer, self.end_layer, self.layers = make_layers(
[rank0]: ^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\models\utils.py", line 610, in make_layers
[rank0]: maybe_offload_to_cpu(layer_fn(prefix=f"{prefix}.{idx}"))
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 302, in <lambda>
[rank0]: lambda prefix: Qwen2DecoderLayer(config=config,
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 206, in __init__
[rank0]: self.self_attn = Qwen2Attention(
[rank0]: ^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\models\qwen2.py", line 136, in __init__
[rank0]: self.qkv_proj = QKVParallelLinear(
[rank0]: ^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\layers\linear.py", line 833, in __init__
[rank0]: super().__init__(input_size=input_size,
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\layers\linear.py", line 384, in __init__
[rank0]: super().__init__(input_size,
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\layers\linear.py", line 231, in __init__
[rank0]: self.quant_method = quant_config.get_quant_method(self,
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\layers\quantization\bitsandbytes.py", line 128, in get_quant_method
[rank0]: return BitsAndBytesLinearMethod(self)
[rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]: File "C:\ProgramData\BOINC\slots\33\Lib\site-packages\vllm\model_executor\layers\quantization\bitsandbytes.py", line 163, in __init__
[rank0]: raise ImportError("Please install bitsandbytes>=0.45.3 via "
[rank0]: ImportError: Please install bitsandbytes>=0.45.3 via `pip install bitsandbytes>=0.45.3` to use bitsandbytes quantizer.
07:51:58 (2716): C:/Windows/system32/cmd.exe exited; CPU time 26.171875
07:51:58 (2716): app exit status: 0x16
07:51:58 (2716): called boinc_finish(195)
0 bytes in 0 Free Blocks.
456 bytes in 8 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 609217 bytes.
Dumping objects ->
{1601264} normal block at 0x00000244C7E49EE0, 48 bytes long.
Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601253} normal block at 0x00000244C7E49930, 48 bytes long.
Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601242} normal block at 0x00000244C7E24EC0, 48 bytes long.
Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61
{1601231} normal block at 0x00000244C7E255C0, 48 bytes long.
Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601220} normal block at 0x00000244C7E254E0, 48 bytes long.
Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61
{1601189} normal block at 0x00000244C9C2F880, 64 bytes long.
Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601178} normal block at 0x00000244C7EDDFA0, 100 bytes long.
Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65
..\api\boinc_api.cpp(309) : {1601174} normal block at 0x00000244C7E1D160, 8 bytes long.
Data: < {ÉD > 00 00 7B C9 44 02 00 00
{1600526} normal block at 0x00000244C7EDCC40, 100 bytes long.
Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65
{1599906} normal block at 0x00000244C7E1D980, 8 bytes long.
Data: <ÐÚàÇD > D0 DA E0 C7 44 02 00 00
..\zip\boinc_zip.cpp(122) : {300} normal block at 0x00000244C7E11360, 260 bytes long.
Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
{285} normal block at 0x00000244C7E12370, 80 bytes long.
Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C
{284} normal block at 0x00000244C7E26120, 16 bytes long.
Data: <¸iâÇD > B8 69 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{283} normal block at 0x00000244C7E26350, 16 bytes long.
Data: < iâÇD > 90 69 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{282} normal block at 0x00000244C7E26030, 16 bytes long.
Data: <hiâÇD > 68 69 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{281} normal block at 0x00000244C7E25CC0, 16 bytes long.
Data: <@iâÇD > 40 69 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{280} normal block at 0x00000244C7E25FE0, 16 bytes long.
Data: < iâÇD > 18 69 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{279} normal block at 0x00000244C7E25C70, 16 bytes long.
Data: <ðhâÇD > F0 68 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{278} normal block at 0x00000244C7E24F30, 48 bytes long.
Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F
{277} normal block at 0x00000244C7E259A0, 16 bytes long.
Data: <è áÇD > E8 06 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{276} normal block at 0x00000244C7E213C0, 32 bytes long.
Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69
{275} normal block at 0x00000244C7E260D0, 16 bytes long.
Data: <À áÇD > C0 06 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{273} normal block at 0x00000244C7E25900, 16 bytes long.
Data: < áÇD > 98 06 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{272} normal block at 0x00000244C7E25B30, 16 bytes long.
Data: <p áÇD > 70 06 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{271} normal block at 0x00000244C7E25950, 16 bytes long.
Data: <H áÇD > 48 06 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{270} normal block at 0x00000244C7E25F90, 16 bytes long.
Data: < áÇD > 20 06 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{269} normal block at 0x00000244C7E25C20, 16 bytes long.
Data: <ø áÇD > F8 05 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{268} normal block at 0x00000244C7E21240, 32 bytes long.
Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55
{267} normal block at 0x00000244C7E26210, 16 bytes long.
Data: <Ð áÇD > D0 05 E1 C7 44 02 00 00 00 00 00 00 00 00 00 00
{266} normal block at 0x00000244C7E105D0, 320 bytes long.
Data: < bâÇD @ âÇD > 10 62 E2 C7 44 02 00 00 40 12 E2 C7 44 02 00 00
{265} normal block at 0x00000244C7E25F40, 16 bytes long.
Data: <ÐhâÇD > D0 68 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{264} normal block at 0x00000244C7E25EF0, 16 bytes long.
Data: <¨hâÇD > A8 68 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{263} normal block at 0x00000244C7E20D00, 32 bytes long.
Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65
{262} normal block at 0x00000244C7E25720, 16 bytes long.
Data: < hâÇD > 80 68 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{261} normal block at 0x00000244C7E211E0, 32 bytes long.
Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62
{260} normal block at 0x00000244C7E25D10, 16 bytes long.
Data: <ÈgâÇD > C8 67 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{259} normal block at 0x00000244C7E25AE0, 16 bytes long.
Data: < gâÇD > A0 67 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{258} normal block at 0x00000244C7E25BD0, 16 bytes long.
Data: <xgâÇD > 78 67 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{257} normal block at 0x00000244C7E25E00, 16 bytes long.
Data: <PgâÇD > 50 67 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{256} normal block at 0x00000244C7E25A90, 16 bytes long.
Data: <(gâÇD > 28 67 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{255} normal block at 0x00000244C7E26170, 16 bytes long.
Data: < gâÇD > 00 67 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{253} normal block at 0x00000244C7E261C0, 16 bytes long.
Data: <ÐQâÇD > D0 51 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{252} normal block at 0x00000244C7E251D0, 40 bytes long.
Data: <ÀaâÇD øÂÉD > C0 61 E2 C7 44 02 00 00 80 F8 C2 C9 44 02 00 00
{251} normal block at 0x00000244C7E257C0, 16 bytes long.
Data: <àfâÇD > E0 66 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{250} normal block at 0x00000244C7E26530, 16 bytes long.
Data: <¸fâÇD > B8 66 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{249} normal block at 0x00000244C7E20CA0, 32 bytes long.
Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F
{248} normal block at 0x00000244C7E25860, 16 bytes long.
Data: < fâÇD > 90 66 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{247} normal block at 0x00000244C7E26690, 992 bytes long.
Data: <`XâÇD   âÇD > 60 58 E2 C7 44 02 00 00 A0 0C E2 C7 44 02 00 00
{91} normal block at 0x00000244C7E21360, 32 bytes long.
Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F
{90} normal block at 0x00000244C7E1DD90, 16 bytes long.
Data: <ðPâÇD > F0 50 E2 C7 44 02 00 00 00 00 00 00 00 00 00 00
{89} normal block at 0x00000244C7E250F0, 40 bytes long.
Data: < ÝáÇD ` âÇD > 90 DD E1 C7 44 02 00 00 60 13 E2 C7 44 02 00 00
{68} normal block at 0x00000244C7E1DD40, 16 bytes long.
Data: < êôM÷ > 80 EA F4 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{67} normal block at 0x00000244C7E1D200, 16 bytes long.
Data: <@éôM÷ > 40 E9 F4 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{66} normal block at 0x00000244C7E1D430, 16 bytes long.
Data: <øWñM÷ > F8 57 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{65} normal block at 0x00000244C7E1D0C0, 16 bytes long.
Data: <ØWñM÷ > D8 57 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{64} normal block at 0x00000244C7E1D8E0, 16 bytes long.
Data: <P ñM÷ > 50 04 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{63} normal block at 0x00000244C7E1DFC0, 16 bytes long.
Data: <0 ñM÷ > 30 04 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{62} normal block at 0x00000244C7E1DCF0, 16 bytes long.
Data: <à ñM÷ > E0 02 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{61} normal block at 0x00000244C7E1D700, 16 bytes long.
Data: < ñM÷ > 10 04 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{60} normal block at 0x00000244C7E1DB60, 16 bytes long.
Data: <p ñM÷ > 70 04 F1 4D F7 7F 00 00 00 00 00 00 00 00 00 00
{59} normal block at 0x00000244C7E1D520, 16 bytes long.
Data: < ÀïM÷ > 18 C0 EF 4D F7 7F 00 00 00 00 00 00 00 00 00 00
Object dump complete.
</stderr_txt>
]]>
©2025 Universitat Pompeu Fabra