Name | test_1-SFARR_TEST_LLM_WINDOWS_101_4-0-1-RND4423_2 |
Workunit | 31482384 |
Created | 24 Apr 2025, 13:15:19 UTC |
Sent | 24 Apr 2025, 13:15:37 UTC |
Report deadline | 29 Apr 2025, 13:15:37 UTC |
Received | 24 Apr 2025, 13:20:16 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
Computer ID | 623816 |
Run time | 2 min 27 sec |
CPU time | 27 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 37,475.84 GFLOPS |
Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
Peak working set size | 808.11 MB |
Peak swap size | 2.53 GB |
Peak disk usage | 5.98 GB |
<core_client_version>8.0.4</core_client_version> <![CDATA[ <message> (unknown error) (0) - exit code 195 (0xc3)</message> <stderr_txt> 15:17:22 (15760): wrapper (7.9.26016): starting 15:17:22 (15760): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2) conf.yaml main_generation-0.1.0-py3-none-any.whl run.bat run.sh tasks.json 15:17:23 (15760): Library/usr/bin/tar.exe exited; CPU time 0.015625 15:17:23 (15760): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat) Generating train split: 0 examples [00:00, ? examples/s] Generating train split: 1000 examples [00:00, 99099.90 examples/s] Traceback (most recent call last): File "wheel_contents/aiengine/main_generation.py", line 86, in <module> File "wheel_contents/aiengine/model.py", line 36, in __init__ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\utils.py", line 1096, in inner return fn(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__ self.llm_engine = LLMEngine.from_engine_args( ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args return engine_cls.from_vllm_config( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config return cls( ^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__ self.model_executor = executor_class(vllm_config=vllm_config, ) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__ self._init_executor() File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\executor\uniproc_executor.py", line 45, in _init_executor self.collective_rpc("init_worker", args=([kwargs], )) File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc answer = run_method(self.driver_worker, method, args, kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\utils.py", line 2359, in run_method return func(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\worker\worker_base.py", line 558, in init_worker worker_class = resolve_obj_by_qualname( ^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\utils.py", line 2005, in resolve_obj_by_qualname module = importlib.import_module(module_name) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\importlib\__init__.py", line 90, in import_module return _bootstrap._gcd_import(name[level:], package, level) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "<frozen importlib._bootstrap>", line 1387, in _gcd_import File "<frozen importlib._bootstrap>", line 1360, in _find_and_load File "<frozen importlib._bootstrap>", line 1331, in _find_and_load_unlocked File "<frozen importlib._bootstrap>", line 935, in _load_unlocked File "<frozen importlib._bootstrap_external>", line 999, in exec_module File "<frozen importlib._bootstrap>", line 488, in _call_with_frames_removed File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\worker\worker.py", line 13, in <module> from vllm.device_allocator.cumem import CuMemAllocator File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\device_allocator\cumem.py", line 59, in <module> libcudart = CudaRTLibrary() ^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\distributed\device_communicators\cuda_wrapper.py", line 148, in __init__ so_file = find_loaded_library("libcudart") ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\ProgramData\BOINC\slots\3\Lib\site-packages\vllm\distributed\device_communicators\cuda_wrapper.py", line 95, in find_loaded_library raise ValueError( ValueError: VLLM_CUDART_SO_PATH is not set. VLLM_CUDART_SO_PATH need to be set with the absolute path to cudart dll on Windows (for example, set VLLM_CUDART_SO_PATH=C:\CUDA\v12.4\bin\cudart64_12.dll) 15:18:10 (15760): C:/Windows/system32/cmd.exe exited; CPU time 27.062500 15:18:10 (15760): app exit status: 0x16 15:18:10 (15760): called boinc_finish(195) 0 bytes in 0 Free Blocks. 536 bytes in 8 Normal Blocks. 1144 bytes in 1 CRT Blocks. 0 bytes in 0 Ignore Blocks. 0 bytes in 0 Client Blocks. Largest number used: 0 bytes. Total allocations: 554819 bytes. Dumping objects -> {1601253} normal block at 0x0000000000613D10, 48 bytes long. Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601242} normal block at 0x0000000000613B50, 48 bytes long. Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601231} normal block at 0x0000000000613F40, 48 bytes long. Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61 {1601220} normal block at 0x0000000000613300, 48 bytes long. Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601209} normal block at 0x00000000005ED040, 48 bytes long. Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61 {1601178} normal block at 0x00000000025CD640, 64 bytes long. Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601167} normal block at 0x0000000000614720, 140 bytes long. Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 ..\api\boinc_api.cpp(309) : {1601164} normal block at 0x00000000005E6750, 8 bytes long. Data: < > 00 00 1A 00 00 00 00 00 {1600516} normal block at 0x0000000000614CD0, 140 bytes long. Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 {1599898} normal block at 0x00000000005E66B0, 8 bytes long. Data: < ãg > 90 E3 67 02 00 00 00 00 ..\zip\boinc_zip.cpp(122) : {296} normal block at 0x00000000005EDB10, 260 bytes long. Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 {281} normal block at 0x00000000005DC830, 80 bytes long. Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C {280} normal block at 0x00000000005EEC90, 16 bytes long. Data: <( _ > 28 01 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {279} normal block at 0x00000000005EF7D0, 16 bytes long. Data: < _ > 00 01 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {278} normal block at 0x00000000005EF4B0, 16 bytes long. Data: <Ø _ > D8 00 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {277} normal block at 0x00000000005EF780, 16 bytes long. Data: <° _ > B0 00 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {276} normal block at 0x00000000005EF690, 16 bytes long. Data: < _ > 88 00 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {275} normal block at 0x00000000005EF370, 16 bytes long. Data: <` _ > 60 00 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {274} normal block at 0x00000000005ED510, 48 bytes long. Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F {273} normal block at 0x00000000005EF460, 16 bytes long. Data: <(ë^ > 28 EB 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {272} normal block at 0x00000000005E9760, 32 bytes long. Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 {271} normal block at 0x00000000005EF5F0, 16 bytes long. Data: < ë^ > 00 EB 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {269} normal block at 0x00000000005EF190, 16 bytes long. Data: <Øê^ > D8 EA 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {268} normal block at 0x00000000005EED30, 16 bytes long. Data: <°ê^ > B0 EA 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {267} normal block at 0x00000000005EF0F0, 16 bytes long. Data: < ê^ > 88 EA 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {266} normal block at 0x00000000005EF820, 16 bytes long. Data: <`ê^ > 60 EA 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {265} normal block at 0x00000000005EEE20, 16 bytes long. Data: <8ê^ > 38 EA 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {264} normal block at 0x00000000005E98E0, 32 bytes long. Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 {263} normal block at 0x00000000005EF730, 16 bytes long. Data: < ê^ > 10 EA 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {262} normal block at 0x00000000005EEA10, 320 bytes long. Data: <0÷^ à ^ > 30 F7 5E 00 00 00 00 00 E0 98 5E 00 00 00 00 00 {261} normal block at 0x00000000005EEF60, 16 bytes long. Data: <@ _ > 40 00 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {260} normal block at 0x00000000005EF410, 16 bytes long. Data: < _ > 18 00 5F 00 00 00 00 00 00 00 00 00 00 00 00 00 {259} normal block at 0x00000000005EA480, 32 bytes long. Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 {258} normal block at 0x00000000005EF6E0, 16 bytes long. Data: <ðÿ^ > F0 FF 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {257} normal block at 0x00000000005E9DC0, 32 bytes long. Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 {256} normal block at 0x00000000005EF280, 16 bytes long. Data: <8ÿ^ > 38 FF 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {255} normal block at 0x00000000005EF5A0, 16 bytes long. Data: < ÿ^ > 10 FF 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {254} normal block at 0x00000000005EF320, 16 bytes long. Data: <èþ^ > E8 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {253} normal block at 0x00000000005EF500, 16 bytes long. Data: <Àþ^ > C0 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {252} normal block at 0x00000000005EFA50, 16 bytes long. Data: < þ^ > 98 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {251} normal block at 0x00000000005EFB40, 16 bytes long. Data: <pþ^ > 70 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {249} normal block at 0x00000000005EF230, 16 bytes long. Data: < Ò^ > 00 D2 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {248} normal block at 0x00000000005ED200, 40 bytes long. Data: <0ò^ @Ö\ > 30 F2 5E 00 00 00 00 00 40 D6 5C 02 00 00 00 00 {247} normal block at 0x00000000005EF140, 16 bytes long. Data: <Pþ^ > 50 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {246} normal block at 0x00000000005EF9B0, 16 bytes long. Data: <(þ^ > 28 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {245} normal block at 0x00000000005E9EE0, 32 bytes long. Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F {244} normal block at 0x00000000005EF1E0, 16 bytes long. Data: < þ^ > 00 FE 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {243} normal block at 0x00000000005EFE00, 992 bytes long. Data: <àñ^ àž^ > E0 F1 5E 00 00 00 00 00 E0 9E 5E 00 00 00 00 00 {87} normal block at 0x00000000005E9640, 32 bytes long. Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F {86} normal block at 0x00000000005E6980, 16 bytes long. Data: <0Ô^ > 30 D4 5E 00 00 00 00 00 00 00 00 00 00 00 00 00 {85} normal block at 0x00000000005ED430, 40 bytes long. Data: < i^ @–^ > 80 69 5E 00 00 00 00 00 40 96 5E 00 00 00 00 00 {64} normal block at 0x00000000005E6660, 16 bytes long. Data: < ê)ƒ÷ > 80 EA 29 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {63} normal block at 0x00000000005E6430, 16 bytes long. Data: <@é)ƒ÷ > 40 E9 29 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {62} normal block at 0x00000000005E6C00, 16 bytes long. Data: <øW&ƒ÷ > F8 57 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {61} normal block at 0x00000000005E6840, 16 bytes long. Data: <ØW&ƒ÷ > D8 57 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {60} normal block at 0x00000000005E6480, 16 bytes long. Data: <P &ƒ÷ > 50 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {59} normal block at 0x00000000005E6340, 16 bytes long. Data: <0 &ƒ÷ > 30 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {58} normal block at 0x00000000005E6610, 16 bytes long. Data: <à &ƒ÷ > E0 02 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {57} normal block at 0x00000000005E6B10, 16 bytes long. Data: < &ƒ÷ > 10 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {56} normal block at 0x00000000005E6BB0, 16 bytes long. Data: <p &ƒ÷ > 70 04 26 83 F7 7F 00 00 00 00 00 00 00 00 00 00 {55} normal block at 0x00000000005E65C0, 16 bytes long. Data: < À$ƒ÷ > 18 C0 24 83 F7 7F 00 00 00 00 00 00 00 00 00 00 Object dump complete. </stderr_txt> ]]>
©2025 Universitat Pompeu Fabra