Name | test_2-SFARR_TEST_LLM_WINDOWS_101_5-0-1-RND9177_0 |
Workunit | 31482390 |
Created | 24 Apr 2025, 13:44:30 UTC |
Sent | 24 Apr 2025, 13:49:20 UTC |
Report deadline | 29 Apr 2025, 13:49:20 UTC |
Received | 24 Apr 2025, 13:58:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 195 (0x000000C3) EXIT_CHILD_FAILED |
Computer ID | 633425 |
Run time | 5 min 42 sec |
CPU time | 27 sec |
Validate state | Invalid |
Credit | 0.00 |
Device peak FLOPS | 41,292.04 GFLOPS |
Application version | LLM: LLMs for chemistry v1.01 (cuda124L) windows_x86_64 |
Peak working set size | 615.87 MB |
Peak swap size | 1.77 GB |
Peak disk usage | 6.14 GB |
<core_client_version>8.0.4</core_client_version>
<![CDATA[
<message>
(unknown error) (0) - exit code 195 (0xc3)</message>
<stderr_txt>
15:53:29 (26252): wrapper (7.9.26016): starting
15:53:29 (26252): wrapper: running Library/usr/bin/tar.exe (xjvf input.tar.bz2)
conf.yaml
main_generation-0.1.0-py3-none-any.whl
run.bat
run.sh
tasks.json
15:53:30 (26252): Library/usr/bin/tar.exe exited; CPU time 0.015625
15:53:30 (26252): wrapper: running C:/Windows/system32/cmd.exe (/c call Scripts\activate.bat && Scripts\conda-unpack.exe && run.bat)
Generating train split: 0 examples [00:00, ? examples/s]
Generating train split: 1000 examples [00:00, 52623.51 examples/s]
C:\ProgramData\BOINC\slots\18\Lib\site-packages\huggingface_hub\file_download.py:144: UserWarning: `huggingface_hub` cache-system uses symlinks by default to efficiently store duplicated files but your machine does not support them in C:\ProgramData\BOINC\slots\.cache\hub\models--unsloth--Qwen2.5-14B-Instruct-bnb-4bit. Caching files will still work but in a degraded version that might require more space on your disk. This warning can be disabled by setting the `HF_HUB_DISABLE_SYMLINKS_WARNING` environment variable. For more details, see https://huggingface.co/docs/huggingface_hub/how-to-cache#limitations.
To support symlinks on Windows, you either need to activate Developer Mode or to run Python as an administrator. In order to activate developer mode, see this article: https://docs.microsoft.com/en-us/windows/apps/get-started/enable-your-device-for-development
  warnings.warn(message)
Traceback (most recent call last):
  File "wheel_contents/aiengine/main_generation.py", line 86, in <module>
  File "wheel_contents/aiengine/model.py", line 36, in __init__
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\utils.py", line 1096, in inner
    return fn(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\entrypoints\llm.py", line 243, in __init__
    self.llm_engine = LLMEngine.from_engine_args(
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\engine\llm_engine.py", line 521, in from_engine_args
    return engine_cls.from_vllm_config(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\engine\llm_engine.py", line 497, in from_vllm_config
    return cls(
           ^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\engine\llm_engine.py", line 281, in __init__
    self.model_executor = executor_class(vllm_config=vllm_config, )
                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\executor\executor_base.py", line 52, in __init__
    self._init_executor()
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\executor\uniproc_executor.py", line 45, in _init_executor
    self.collective_rpc("init_worker", args=([kwargs], ))
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\executor\uniproc_executor.py", line 56, in collective_rpc
    answer = run_method(self.driver_worker, method, args, kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\utils.py", line 2359, in run_method
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\worker\worker_base.py", line 558, in init_worker
    worker_class = resolve_obj_by_qualname(
                   ^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\utils.py", line 2005, in resolve_obj_by_qualname
    module = importlib.import_module(module_name)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\importlib\__init__.py", line 90, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen importlib._bootstrap>", line 1387, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1360, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1331, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 935, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 999, in exec_module
  File "<frozen importlib._bootstrap>", line 488, in _call_with_frames_removed
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\worker\worker.py", line 13, in <module>
    from vllm.device_allocator.cumem import CuMemAllocator
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\device_allocator\cumem.py", line 59, in <module>
    libcudart = CudaRTLibrary()
                ^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\site-packages\vllm\distributed\device_communicators\cuda_wrapper.py", line 157, in __init__
    lib = ctypes.CDLL(so_file)
          ^^^^^^^^^^^^^^^^^^^^
  File "C:\ProgramData\BOINC\slots\18\Lib\ctypes\__init__.py", line 379, in __init__
    self._handle = _dlopen(self._name, mode)
                   ^^^^^^^^^^^^^^^^^^^^^^^^^
FileNotFoundError: Could not find module 'C:\ProgramData\BOINC\slots\18\"C:\ProgramData\BOINC\slots\18\Lib\site-packages\torch\lib\cudart64_12.dll"' (or one of its dependencies). Try using the full path with constructor syntax.
15:56:36 (26252): C:/Windows/system32/cmd.exe exited; CPU time 27.343750
15:56:36 (26252): app exit status: 0x16
15:56:36 (26252): called boinc_finish(195)
0 bytes in 0 Free Blocks.
616 bytes in 8 Normal Blocks.
1144 bytes in 1 CRT Blocks.
0 bytes in 0 Ignore Blocks.
0 bytes in 0 Client Blocks. Largest number used: 0 bytes. Total allocations: 2863291 bytes. Dumping objects -> {1601472} normal block at 0x000001C9A14BA4A0, 48 bytes long. Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601461} normal block at 0x000001C9A14BB000, 48 bytes long. Data: <HOME=C:\ProgramD> 48 4F 4D 45 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601450} normal block at 0x000001C9A14BAEB0, 48 bytes long. Data: <TMP=C:\ProgramDa> 54 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 61 {1601439} normal block at 0x000001C9A14BAA50, 48 bytes long. Data: <TEMP=C:\ProgramD> 54 45 4D 50 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601428} normal block at 0x000001C9A14943A0, 48 bytes long. Data: <TMPDIR=C:\Progra> 54 4D 50 44 49 52 3D 43 3A 5C 50 72 6F 67 72 61 {1601397} normal block at 0x000001C9A155FCF0, 64 bytes long. Data: <PATH=C:\ProgramD> 50 41 54 48 3D 43 3A 5C 50 72 6F 67 72 61 6D 44 {1601386} normal block at 0x000001C9A3266720, 180 bytes long. Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 ..\api\boinc_api.cpp(309) : {1601383} normal block at 0x000001C9A148E1F0, 8 bytes long. Data: < ê¢É > 00 00 EA A2 C9 01 00 00 {1600655} normal block at 0x000001C9A32669F0, 180 bytes long. Data: <<project_prefere> 3C 70 72 6F 6A 65 63 74 5F 70 72 65 66 65 72 65 {1599914} normal block at 0x000001C9A148E150, 8 bytes long. Data: <ÀÐ(£É > C0 D0 28 A3 C9 01 00 00 ..\zip\boinc_zip.cpp(122) : {304} normal block at 0x000001C9A1481750, 260 bytes long. Data: < > 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 {289} normal block at 0x000001C9A1482D70, 80 bytes long. Data: </c call Scripts\> 2F 63 20 63 61 6C 6C 20 53 63 72 69 70 74 73 5C {288} normal block at 0x000001C9A1496A70, 16 bytes long. Data: < nI¡É > 08 6E 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {287} normal block at 0x000001C9A14969D0, 16 bytes long. Data: <àmI¡É > E0 6D 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {286} normal block at 0x000001C9A1495D50, 16 bytes long. 
Data: <¸mI¡É > B8 6D 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {285} normal block at 0x000001C9A1495D00, 16 bytes long. Data: < mI¡É > 90 6D 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {284} normal block at 0x000001C9A14962F0, 16 bytes long. Data: <hmI¡É > 68 6D 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {283} normal block at 0x000001C9A1496340, 16 bytes long. Data: <@mI¡É > 40 6D 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {282} normal block at 0x000001C9A1493ED0, 48 bytes long. Data: <ComSpec=C:\Windo> 43 6F 6D 53 70 65 63 3D 43 3A 5C 57 69 6E 64 6F {281} normal block at 0x000001C9A1496250, 16 bytes long. Data: <(¾G¡É > 28 BE 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {280} normal block at 0x000001C9A148FCB0, 32 bytes long. Data: <SystemRoot=C:\Wi> 53 79 73 74 65 6D 52 6F 6F 74 3D 43 3A 5C 57 69 {279} normal block at 0x000001C9A1496750, 16 bytes long. Data: < ¾G¡É > 00 BE 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {277} normal block at 0x000001C9A14968E0, 16 bytes long. Data: <ؽG¡É > D8 BD 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {276} normal block at 0x000001C9A14963E0, 16 bytes long. Data: <°½G¡É > B0 BD 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {275} normal block at 0x000001C9A1495F80, 16 bytes long. Data: < ½G¡É > 88 BD 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {274} normal block at 0x000001C9A1495F30, 16 bytes long. Data: <`½G¡É > 60 BD 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {273} normal block at 0x000001C9A1495CB0, 16 bytes long. Data: <8½G¡É > 38 BD 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {272} normal block at 0x000001C9A14908B0, 32 bytes long. Data: <CUDA_DEVICE=0 PU> 43 55 44 41 5F 44 45 56 49 43 45 3D 30 00 50 55 {271} normal block at 0x000001C9A1496020, 16 bytes long. Data: < ½G¡É > 10 BD 47 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {270} normal block at 0x000001C9A147BD10, 320 bytes long. Data: < `I¡É ° I¡É > 20 60 49 A1 C9 01 00 00 B0 08 49 A1 C9 01 00 00 {269} normal block at 0x000001C9A14964D0, 16 bytes long. 
Data: < mI¡É > 20 6D 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {268} normal block at 0x000001C9A1496890, 16 bytes long. Data: <ølI¡É > F8 6C 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {267} normal block at 0x000001C9A148FC50, 32 bytes long. Data: <C:/Windows/syste> 43 3A 2F 57 69 6E 64 6F 77 73 2F 73 79 73 74 65 {266} normal block at 0x000001C9A1495DF0, 16 bytes long. Data: <ÐlI¡É > D0 6C 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {265} normal block at 0x000001C9A1490130, 32 bytes long. Data: <xjvf input.tar.b> 78 6A 76 66 20 69 6E 70 75 74 2E 74 61 72 2E 62 {264} normal block at 0x000001C9A1496610, 16 bytes long. Data: < lI¡É > 18 6C 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {263} normal block at 0x000001C9A1496160, 16 bytes long. Data: <ðkI¡É > F0 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {262} normal block at 0x000001C9A1496200, 16 bytes long. Data: <ÈkI¡É > C8 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {261} normal block at 0x000001C9A1496840, 16 bytes long. Data: < kI¡É > A0 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {260} normal block at 0x000001C9A1496930, 16 bytes long. Data: <xkI¡É > 78 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {259} normal block at 0x000001C9A14965C0, 16 bytes long. Data: <PkI¡É > 50 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {257} normal block at 0x000001C9A1496A20, 16 bytes long. Data: <ÀBI¡É > C0 42 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {256} normal block at 0x000001C9A14942C0, 40 bytes long. Data: < jI¡É ðüU¡É > 20 6A 49 A1 C9 01 00 00 F0 FC 55 A1 C9 01 00 00 {255} normal block at 0x000001C9A1496480, 16 bytes long. Data: <0kI¡É > 30 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {254} normal block at 0x000001C9A1496570, 16 bytes long. Data: < kI¡É > 08 6B 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {253} normal block at 0x000001C9A1490610, 32 bytes long. Data: <Library/usr/bin/> 4C 69 62 72 61 72 79 2F 75 73 72 2F 62 69 6E 2F {252} normal block at 0x000001C9A14960C0, 16 bytes long. 
Data: <àjI¡É > E0 6A 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {251} normal block at 0x000001C9A1496AE0, 992 bytes long. Data: <À`I¡É I¡É > C0 60 49 A1 C9 01 00 00 10 06 49 A1 C9 01 00 00 {95} normal block at 0x000001C9A1490010, 32 bytes long. Data: <windows_x86_64__> 77 69 6E 64 6F 77 73 5F 78 38 36 5F 36 34 5F 5F {94} normal block at 0x000001C9A148E650, 16 bytes long. Data: <ðDI¡É > F0 44 49 A1 C9 01 00 00 00 00 00 00 00 00 00 00 {93} normal block at 0x000001C9A14944F0, 40 bytes long. Data: <PæH¡É I¡É > 50 E6 48 A1 C9 01 00 00 10 00 49 A1 C9 01 00 00 {72} normal block at 0x000001C9A148DD40, 16 bytes long. Data: < ê@ ÷ > 80 EA 40 A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {71} normal block at 0x000001C9A148E3D0, 16 bytes long. Data: <@é@ ÷ > 40 E9 40 A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {70} normal block at 0x000001C9A148DF70, 16 bytes long. Data: <øW= ÷ > F8 57 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {69} normal block at 0x000001C9A148DCF0, 16 bytes long. Data: <ØW= ÷ > D8 57 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {68} normal block at 0x000001C9A148E510, 16 bytes long. Data: <P = ÷ > 50 04 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {67} normal block at 0x000001C9A148DF20, 16 bytes long. Data: <0 = ÷ > 30 04 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {66} normal block at 0x000001C9A148E600, 16 bytes long. Data: <à = ÷ > E0 02 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {65} normal block at 0x000001C9A148E380, 16 bytes long. Data: < = ÷ > 10 04 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {64} normal block at 0x000001C9A148E0B0, 16 bytes long. Data: <p = ÷ > 70 04 3D A0 F7 7F 00 00 00 00 00 00 00 00 00 00 {63} normal block at 0x000001C9A148E6A0, 16 bytes long. Data: < À; ÷ > 18 C0 3B A0 F7 7F 00 00 00 00 00 00 00 00 00 00 Object dump complete. </stderr_txt> ]]>
©2025 Universitat Pompeu Fabra