| Name | test_2-SFARR_TEST_LLM_WINDOWS_101_6-0-1-RND3891_0 |
| Workunit | 31482397 |
| Created | 24 Apr 2025, 14:07:16 UTC |
| Sent | 24 Apr 2025, 14:08:08 UTC |
| Report deadline | 29 Apr 2025, 14:08:08 UTC |
| Received | 24 Apr 2025, 14:12:46 UTC |
| Server state | Over |
| Outcome | Computation error |
| Client state | Compute error |
| Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
| Computer ID | 592287 |
| Run time | 3 min 3 sec |
| CPU time | 29 sec |
| Validate state | Invalid |
| Credit | 0.00 |
| Device peak FLOPS | 37,792.05 GFLOPS |
| Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
| Peak working set size | 717.24 MB |
| Peak swap size | 2.42 GB |
| Peak disk usage | 8.15 GB |
<core_client_version>8.0.2</core_client_version>
<![CDATA[
<message>
The operating system cannot run (null).
(0xc3) - exit code 195 (0xc3)</message>
<stderr_txt>
07:10:17 (51940): wrapper (7.9.26016): starting
07:10:17 (51940): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.bat
run.sh
tasks.json
07:10:18 (51940): Library/usr/bin/tar.exe exited; CPU time 0.000000
07:10:18 (51940): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)
Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 1000 examples [00:00, 114239.52 examples/s]
Traceback (most recent call last):
File "wheel_contents/aiengine/main_generation.py", line 86, in <module>
File "wheel_contents/aiengine/model.py", line 36, in __init__
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\utils.py", line 1096, in inner
return fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
self.llm_engine = LLMEngine.from_engine_args(
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
return engine_cls.from_vllm_config(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
return cls(
^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
self.model_executor = executor_class(vllm_config=vllm_config, )
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
self._init_executor()
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\executor\uniproc_executor.py", line 46, in _init_executor
self.collective_rpc("init_device")
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
answer = run_method(self.driver_worker, method, args, kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\utils.py", line 2359, in run_method
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\worker\worker_base.py", line 604, in init_device
self.worker.init_device() # type: ignore
^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\worker\worker.py", line 167, in init_device
init_worker_distributed_environment(self.vllm_config, self.rank,
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\worker\worker.py", line 506, in init_worker_distributed_environment
init_distributed_environment(
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\vllm\distributed\parallel_state.py", line 844, in init_distributed_environment
torch.distributed.init_process_group(
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\torch\distributed\c10d_logger.py", line 81, in wrapper
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\torch\distributed\c10d_logger.py", line 95, in wrapper
func_return = func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\torch\distributed\distributed_c10d.py", line 1714, in init_process_group
store, rank, world_size = next(rendezvous_iterator)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\torch\distributed\rendezvous.py", line 226, in _tcp_rendezvous_handler
store = _create_c10d_store(
^^^^^^^^^^^^^^^^^^^
File "C:\ProgramData\BOINC\slots\14\Lib\site-packages\torch\distributed\rendezvous.py", line 194, in _create_c10d_store
return TCPStore(
^^^^^^^^^
RuntimeError: use_libuv was requested but PyTorch was build without libuv support
07:11:19 (51940): C:/Windows/system32/cmd.exe exited; CPU time 29.500000
07:11:19 (51940): app exit status: 0x16
07:11:19 (51940): called boinc_finish(195)
0 bytes in 0 Free Blocks.
572 bytes in 8 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 1566903 bytes.
Dumping objects ->
{1601509} normal block at 0x0000021EEF9CBE60, 48 bytes long.
Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601498} normal block at 0x0000021EEF9CBBC0, 48 bytes long.
Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601487} normal block at 0x0000021EEF9CBDF0, 48 bytes long.
Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61
{1601476} normal block at 0x0000021EEF9CB8B0, 48 bytes long.
Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601465} normal block at 0x0000021EEF9A4D30, 48 bytes long.
Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61
{1601434} normal block at 0x0000021EF167CC20, 64 bytes long.
Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44
{1601423} normal block at 0x0000021EEFA23DC0, 158 bytes long.
Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65
..\api\boinc_api.cpp(309) : {1601420} normal block at 0x0000021EEF99E350, 8 bytes long.
Data: < “ï > 00 00 93 EF 1E 02 00 00
{1600651} normal block at 0x0000021EEFA236C0, 158 bytes long.
Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65
{1599916} normal block at 0x0000021EEF99E620, 8 bytes long.
Data: <ðã ï > F0 E3 98 EF 1E 02 00 00
..\zip\boinc_zip.cpp(122) : {305} normal block at 0x0000021EEF9920B0, 260 bytes long.
Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
{290} normal block at 0x0000021EEF9930C0, 80 bytes long.
Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C
{289} normal block at 0x0000021EEF9A6A00, 16 bytes long.
Data: <ø{šï > F8 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{288} normal block at 0x0000021EEF9A6D70, 16 bytes long.
Data: <Ð{šï > D0 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{287} normal block at 0x0000021EEF9A6BE0, 16 bytes long.
Data: <¨{šï > A8 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{286} normal block at 0x0000021EEF9A7720, 16 bytes long.
Data: < {šï > 80 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{285} normal block at 0x0000021EEF9A6B40, 16 bytes long.
Data: <X{šï > 58 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{284} normal block at 0x0000021EEF9A75E0, 16 bytes long.
Data: <0{šï > 30 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{283} normal block at 0x0000021EEF9A5350, 48 bytes long.
Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F
{282} normal block at 0x0000021EEF9A6960, 16 bytes long.
Data: <XÁ ï > 58 C1 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{281} normal block at 0x0000021EEF9A0F60, 32 bytes long.
Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69
{280} normal block at 0x0000021EEF9A7220, 16 bytes long.
Data: <0Á ï > 30 C1 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{278} normal block at 0x0000021EEF9A6F50, 16 bytes long.
Data: < Á ï > 08 C1 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{277} normal block at 0x0000021EEF9A6C30, 16 bytes long.
Data: <àÀ ï > E0 C0 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{276} normal block at 0x0000021EEF9A6F00, 16 bytes long.
Data: <¸À ï > B8 C0 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{275} normal block at 0x0000021EEF9A72C0, 16 bytes long.
Data: < À ï > 90 C0 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{274} normal block at 0x0000021EEF9A6AA0, 16 bytes long.
Data: <hÀ ï > 68 C0 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{273} normal block at 0x0000021EEF9A1200, 32 bytes long.
Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55
{272} normal block at 0x0000021EEF9A7180, 16 bytes long.
Data: <@À ï > 40 C0 98 EF 1E 02 00 00 00 00 00 00 00 00 00 00
{271} normal block at 0x0000021EEF98C040, 320 bytes long.
Data: < qšï šï > 80 71 9A EF 1E 02 00 00 00 12 9A EF 1E 02 00 00
{270} normal block at 0x0000021EEF9A74F0, 16 bytes long.
Data: < {šï > 10 7B 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{269} normal block at 0x0000021EEF9A6910, 16 bytes long.
Data: <èzšï > E8 7A 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{268} normal block at 0x0000021EEF9A0CC0, 32 bytes long.
Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65
{267} normal block at 0x0000021EEF9A6E60, 16 bytes long.
Data: <Àzšï > C0 7A 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{266} normal block at 0x0000021EEF9A1140, 32 bytes long.
Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62
{265} normal block at 0x0000021EEF9A7590, 16 bytes long.
Data: < zšï > 08 7A 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{264} normal block at 0x0000021EEF9A7860, 16 bytes long.
Data: <àyšï > E0 79 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{263} normal block at 0x0000021EEF9A6B90, 16 bytes long.
Data: <¸yšï > B8 79 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{262} normal block at 0x0000021EEF9A6AF0, 16 bytes long.
Data: < yšï > 90 79 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{261} normal block at 0x0000021EEF9A6DC0, 16 bytes long.
Data: <hyšï > 68 79 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{260} normal block at 0x0000021EEF9A7270, 16 bytes long.
Data: <@yšï > 40 79 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{258} normal block at 0x0000021EEF9A76D0, 16 bytes long.
Data: <ðNšï > F0 4E 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{257} normal block at 0x0000021EEF9A4EF0, 40 bytes long.
Data: <Ðvšï Ìgñ > D0 76 9A EF 1E 02 00 00 20 CC 67 F1 1E 02 00 00
{256} normal block at 0x0000021EEF9A69B0, 16 bytes long.
Data: < yšï > 20 79 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{255} normal block at 0x0000021EEF9A7540, 16 bytes long.
Data: <øxšï > F8 78 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{254} normal block at 0x0000021EEF9A10E0, 32 bytes long.
Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F
{253} normal block at 0x0000021EEF9A6CD0, 16 bytes long.
Data: <Ðxšï > D0 78 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{252} normal block at 0x0000021EEF9A78D0, 992 bytes long.
Data: <Ðlšï à šï > D0 6C 9A EF 1E 02 00 00 E0 10 9A EF 1E 02 00 00
{96} normal block at 0x0000021EEF9A1080, 32 bytes long.
Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F
{95} normal block at 0x0000021EEF99E850, 16 bytes long.
Data: <@Pšï > 40 50 9A EF 1E 02 00 00 00 00 00 00 00 00 00 00
{94} normal block at 0x0000021EEF9A5040, 40 bytes long.
Data: <Pè ï šï > 50 E8 99 EF 1E 02 00 00 80 10 9A EF 1E 02 00 00
{73} normal block at 0x0000021EEF99E210, 16 bytes long.
Data: < êÎà÷ > 80 EA CE E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{72} normal block at 0x0000021EEF99E5D0, 16 bytes long.
Data: <@éÎà÷ > 40 E9 CE E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{71} normal block at 0x0000021EEF99EFD0, 16 bytes long.
Data: <øWËà÷ > F8 57 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{70} normal block at 0x0000021EEF99EEE0, 16 bytes long.
Data: <ØWËà÷ > D8 57 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{69} normal block at 0x0000021EEF99E990, 16 bytes long.
Data: <P Ëà÷ > 50 04 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{68} normal block at 0x0000021EEF99E4E0, 16 bytes long.
Data: <0 Ëà÷ > 30 04 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{67} normal block at 0x0000021EEF99E670, 16 bytes long.
Data: <à Ëà÷ > E0 02 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{66} normal block at 0x0000021EEF99EF80, 16 bytes long.
Data: < Ëà÷ > 10 04 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{65} normal block at 0x0000021EEF99ED00, 16 bytes long.
Data: <p Ëà÷ > 70 04 CB E0 F7 7F 00 00 00 00 00 00 00 00 00 00
{64} normal block at 0x0000021EEF99E490, 16 bytes long.
Data: < ÀÉà÷ > 18 C0 C9 E0 F7 7F 00 00 00 00 00 00 00 00 00 00
Object dump complete.
</stderr_txt>
]]>
©2025 Universitat Pompeu Fabra