Task 38487474

Name test_2-SFARR_TEST_LLM_WINDOWS_101_6-0-1-RND3891_2
Workunit 31482397
Created 24 Apr 2025, 14:23:41 UTC
Sent 24 Apr 2025, 14:24:40 UTC
Report deadline 29 Apr 2025, 14:24:40 UTC
Received 24 Apr 2025, 14:29:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 195 (0x000000C3) EXIT_CHILD_FAILED
Computer ID 623816
Run time 2 min 42 sec
CPU time 25 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 37,475.84 GFLOPS
Application version LLM: LLMs for chemistry v1.01 (cuda124L)
windows_x86_64
Peak working set size 698.33 MB
Peak swap size 2.40 GB
Peak disk usage 5.99 GB

Stderr output

<core_client_version>8.0.4</core_client_version>
<![CDATA[
<message>
(unknown error) (0) - exit code 195 (0xc3)</message>
<stderr_txt>
16:26:33 (12604): wrapper (7.9.26016): starting
16:26:33 (12604): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.bat
run.sh
tasks.json
16:26:34 (12604): Library/usr/bin/tar.exe exited; CPU time 0.000000
16:26:34 (12604): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)
Traceback (most recent call last):
  File "wheel_contents/aiengine/main_generation.py", line 86, in <module>
  File "wheel_contents/aiengine/model.py", line 36, in __init__
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\utils.py", line 1096, in inner
    return fn(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
    self.llm_engine = LLMEngine.from_engine_args(
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
    return engine_cls.from_vllm_config(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
    return cls(
           ^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
    self.model_executor = executor_class(vllm_config=vllm_config, )
                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
    self._init_executor()
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\executor\uniproc_executor.py", line 46, in _init_executor
    self.collective_rpc("init_device")
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
    answer = run_method(self.driver_worker, method, args, kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\utils.py", line 2359, in run_method
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\worker\worker_base.py", line 604, in init_device
    self.worker.init_device()  # type: ignore
    ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\worker\worker.py", line 167, in init_device
    init_worker_distributed_environment(self.vllm_config, self.rank,
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\worker\worker.py", line 506, in init_worker_distributed_environment
    init_distributed_environment(
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\vllm\distributed\parallel_state.py", line 844, in init_distributed_environment
    torch.distributed.init_process_group(
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\torch\distributed\c10d_logger.py", line 81, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\torch\distributed\c10d_logger.py", line 95, in wrapper
    func_return = func(*args, **kwargs)
                  ^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\torch\distributed\distributed_c10d.py", line 1714, in init_process_group
    store, rank, world_size = next(rendezvous_iterator)
                              ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\torch\distributed\rendezvous.py", line 226, in _tcp_rendezvous_handler
    store = _create_c10d_store(
            ^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\23\Lib\site-packages\torch\distributed\rendezvous.py", line 194, in _create_c10d_store
    return TCPStore(
           ^^^^^^^^^
RuntimeError: use_libuv was requested but PyTorch was build without libuv support
16:27:29 (12604): C:/Windows/system32/cmd.exe exited; CPU time 25.640625
16:27:29 (12604): app exit status: 0x16
16:27:29 (12604): called boinc_finish(195)
0 bytes in 0 Free Blocks.
536 bytes in 8 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks.
Largest number used: 0 bytes.
Total allocations: 867801 bytes.
Dumping objects ->
{1601253} normal block at 0x00000000005338F0, 48 bytes long.
 Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601242} normal block at 0x0000000000533CE0, 48 bytes long.
 Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601231} normal block at 0x0000000000534370, 48 bytes long.
 Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61 
{1601220} normal block at 0x0000000000533960, 48 bytes long.
 Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601209} normal block at 0x000000000050D470, 48 bytes long.
 Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61 
{1601178} normal block at 0x00000000025DA8B0, 64 bytes long.
 Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 
{1601167} normal block at 0x000000000268F8E0, 140 bytes long.
 Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 
..\api\boinc_api.cpp(309) : {1601164} normal block at 0x00000000005068F0, 8 bytes long.
 Data: <        > 00 00 1A 00 00 00 00 00 
{1600516} normal block at 0x000000000268F810, 140 bytes long.
 Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 
{1599898} normal block at 0x0000000000506530, 8 bytes long.
 Data: < &#232;h     > 20 E8 68 02 00 00 00 00 
..\zip\boinc_zip.cpp(122) : {296} normal block at 0x000000000050DFB0, 260 bytes long.
 Data: <                > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 
{281} normal block at 0x00000000004FA990, 80 bytes long.
 Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C 
{280} normal block at 0x0000000000510080, 16 bytes long.
 Data: <&#200; Q             > C8 05 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{279} normal block at 0x000000000050F630, 16 bytes long.
 Data: <&#160; Q             > A0 05 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{278} normal block at 0x000000000050FC70, 16 bytes long.
 Data: <x Q             > 78 05 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{277} normal block at 0x000000000050FB80, 16 bytes long.
 Data: <P Q             > 50 05 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{276} normal block at 0x000000000050F4A0, 16 bytes long.
 Data: <( Q             > 28 05 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{275} normal block at 0x000000000050F400, 16 bytes long.
 Data: <  Q             > 00 05 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{274} normal block at 0x000000000050D630, 48 bytes long.
 Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F 
{273} normal block at 0x000000000050FD10, 16 bytes long.
 Data: <&#200;&#239;P             > C8 EF 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{272} normal block at 0x0000000000509E40, 32 bytes long.
 Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 
{271} normal block at 0x000000000050FB30, 16 bytes long.
 Data: <&#160;&#239;P             > A0 EF 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{269} normal block at 0x000000000050FAE0, 16 bytes long.
 Data: <x&#239;P             > 78 EF 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{268} normal block at 0x000000000050F5E0, 16 bytes long.
 Data: <P&#239;P             > 50 EF 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{267} normal block at 0x000000000050FCC0, 16 bytes long.
 Data: <(&#239;P             > 28 EF 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{266} normal block at 0x000000000050FEA0, 16 bytes long.
 Data: < &#239;P             > 00 EF 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{265} normal block at 0x000000000050F590, 16 bytes long.
 Data: <&#216;&#238;P             > D8 EE 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{264} normal block at 0x0000000000509B40, 32 bytes long.
 Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 
{263} normal block at 0x000000000050FF90, 16 bytes long.
 Data: <&#176;&#238;P             > B0 EE 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{262} normal block at 0x000000000050EEB0, 320 bytes long.
 Data: < &#255;P     @&#155;P     > 90 FF 50 00 00 00 00 00 40 9B 50 00 00 00 00 00 
{261} normal block at 0x000000000050F860, 16 bytes long.
 Data: <&#224; Q             > E0 04 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{260} normal block at 0x000000000050FE50, 16 bytes long.
 Data: <&#184; Q             > B8 04 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{259} normal block at 0x0000000000509C00, 32 bytes long.
 Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 
{258} normal block at 0x000000000050F950, 16 bytes long.
 Data: <  Q             > 90 04 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{257} normal block at 0x0000000000509F00, 32 bytes long.
 Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 
{256} normal block at 0x000000000050F900, 16 bytes long.
 Data: <&#216; Q             > D8 03 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{255} normal block at 0x000000000050F680, 16 bytes long.
 Data: <&#176; Q             > B0 03 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{254} normal block at 0x000000000050F3B0, 16 bytes long.
 Data: <  Q             > 88 03 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{253} normal block at 0x000000000050FBD0, 16 bytes long.
 Data: <` Q             > 60 03 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{252} normal block at 0x000000000050F8B0, 16 bytes long.
 Data: <8 Q             > 38 03 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{251} normal block at 0x000000000050F9F0, 16 bytes long.
 Data: <  Q             > 10 03 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{249} normal block at 0x000000000050F7C0, 16 bytes long.
 Data: <`&#216;P             > 60 D8 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{248} normal block at 0x000000000050D860, 40 bytes long.
 Data: <&#192;&#247;P     &#176;&#168;]     > C0 F7 50 00 00 00 00 00 B0 A8 5D 02 00 00 00 00 
{247} normal block at 0x000000000050FA90, 16 bytes long.
 Data: <&#240; Q             > F0 02 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{246} normal block at 0x000000000050FDB0, 16 bytes long.
 Data: <&#200; Q             > C8 02 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{245} normal block at 0x0000000000509CC0, 32 bytes long.
 Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F 
{244} normal block at 0x000000000050F810, 16 bytes long.
 Data: <&#160; Q             > A0 02 51 00 00 00 00 00 00 00 00 00 00 00 00 00 
{243} normal block at 0x00000000005102A0, 992 bytes long.
 Data: < &#248;P     &#192;&#156;P     > 10 F8 50 00 00 00 00 00 C0 9C 50 00 00 00 00 00 
{87} normal block at 0x000000000050A4A0, 32 bytes long.
 Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F 
{86} normal block at 0x00000000005067B0, 16 bytes long.
 Data: <&#208;&#216;P             > D0 D8 50 00 00 00 00 00 00 00 00 00 00 00 00 00 
{85} normal block at 0x000000000050D8D0, 40 bytes long.
 Data: <&#176;gP     &#160;&#164;P     > B0 67 50 00 00 00 00 00 A0 A4 50 00 00 00 00 00 
{64} normal block at 0x0000000000506A80, 16 bytes long.
 Data: < &#234;)&#131;&#247;           > 80 EA 29 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{63} normal block at 0x00000000005070C0, 16 bytes long.
 Data: <@&#233;)&#131;&#247;           > 40 E9 29 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{62} normal block at 0x00000000005068A0, 16 bytes long.
 Data: <&#248;W&&#131;&#247;           > F8 57 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{61} normal block at 0x0000000000506AD0, 16 bytes long.
 Data: <&#216;W&&#131;&#247;           > D8 57 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{60} normal block at 0x0000000000507160, 16 bytes long.
 Data: <P &&#131;&#247;           > 50 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{59} normal block at 0x0000000000506DF0, 16 bytes long.
 Data: <0 &&#131;&#247;           > 30 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{58} normal block at 0x0000000000506DA0, 16 bytes long.
 Data: <&#224; &&#131;&#247;           > E0 02 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{57} normal block at 0x0000000000507110, 16 bytes long.
 Data: <  &&#131;&#247;           > 10 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{56} normal block at 0x0000000000506F30, 16 bytes long.
 Data: <p &&#131;&#247;           > 70 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
{55} normal block at 0x00000000005066C0, 16 bytes long.
 Data: < &#192;$&#131;&#247;           > 18 C0 24 83 F7 7F 00 00 00 00 00 00 00 00 00 00 
Object dump complete.

</stderr_txt>
]]>


©2025 Universitat Pompeu Fabra