Message boards : Number crunching : have a lot of stuck tasks, abort some?
Author | Message |
---|---|
have never seen this many before other than when my router was turned off. Other systems are running fine even those with gpugrid tasks. Looks like all tasks completed just fine, no error, but cannot upload. A restart of boinc did not help. GPUGRID initial_1344-ELISA_GSN4V1-8-100-RND6294_0_1 15.093 3817.50 K 00:23:01 - 16:48:28 0.00 Kbps Upload pending (Retry in: 02:31:01), retried: 8 JYSArea51 GPUGRID initial_1344-ELISA_GSN4V1-8-100-RND6294_0_2 20.116 3817.50 K 00:26:10 0.71 Kbps Uploading JYSArea51 GPUGRID initial_1344-ELISA_GSN4V1-8-100-RND6294_0_9 0.850 67761.89 K 00:22:58 - 17:47:06 0.00 Kbps Upload pending (Retry in: 03:29:39), retried: 8 JYSArea51 GPUGRID initial_1381-ELISA_GSN0V1-9-100-RND4251_0_1 1.683 3816.54 K 00:02:34 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID initial_1381-ELISA_GSN0V1-9-100-RND4251_0_2 1.683 3816.54 K 00:02:34 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID initial_1381-ELISA_GSN0V1-9-100-RND4251_0_9 0.094 68042.38 K 00:02:36 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID initial_1512-ELISA_GSN0V1-6-100-RND5965_0_1 3.359 3816.54 K 00:05:13 0.00 Kbps Upload pending, retried: 2 JYSArea51 GPUGRID initial_1512-ELISA_GSN0V1-6-100-RND5965_0_2 3.359 3816.54 K 00:05:12 0.00 Kbps Upload pending, retried: 2 JYSArea51 GPUGRID initial_1512-ELISA_GSN0V1-6-100-RND5965_0_9 0.094 67987.78 K 00:02:36 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID initial_1719-ELISA_GSN0V1-5-100-RND4368_0_1 1.683 3816.54 K 00:02:37 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID initial_1719-ELISA_GSN0V1-5-100-RND4368_0_2 1.683 3816.54 K 00:02:35 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID initial_1719-ELISA_GSN0V1-5-100-RND4368_0_9 0.095 67536.57 K 00:02:36 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID test265-TONI_GSNTEST3-11-100-RND0660_0_1 1.682 3817.50 K 00:02:36 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID test265-TONI_GSNTEST3-11-100-RND0660_0_2 1.682 3817.50 K 00:02:34 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID test265-TONI_GSNTEST3-11-100-RND0660_0_9 0.094 68081.72 K 00:02:36 0.00 Kbps Upload pending, retried: 1 JYSArea51 GPUGRID test360-TONI_GSNTEST3-6-100-RND5366_0_1 18.440 3817.50 K 00:23:32 0.71 Kbps Uploading JYSArea51 GPUGRID test360-TONI_GSNTEST3-6-100-RND5366_0_2 10.064 3817.50 K 00:15:35 0.00 Kbps Upload pending, retried: 6 JYSArea51 GPUGRID test360-TONI_GSNTEST3-6-100-RND5366_0_9 0.470 68081.16 K 00:12:57 0.00 Kbps Upload pending, retried: 5 JYSArea51 [EDIT] reboot of windows started things going. I suspect the first 67mb files caused a problem which was compouned by subsequent ones of same size all trying to upload concurrently. Need to figure a was to stop this. Have three 1070ti boards but network cant seem to handle the large files when all get done near same time. In other news I got my first Linux cuda100. It is running on gtx 1660ti. | |
ID: 53140 | Rating: 0 | rate: / Reply Quote | |
JStateson wrote: I suspect the first 67mb files caused a problem which was compouned by subsequent ones of same size all trying to upload concurrently. Need to figure a was to stop this. Just a thought, in the cc_config.xml file there is an option for <max_file_xfers_per_project>N</max_file_xfers_per_project> .Maybe that would help. | |
ID: 53155 | Rating: 0 | rate: / Reply Quote | |
I am not alone! You are not alone! | |
ID: 53157 | Rating: 0 | rate: / Reply Quote | |
Since I run multiple projects on the same hosts, I need to provide sufficient network communication threads for all the uploads/downloads. | |
ID: 53160 | Rating: 0 | rate: / Reply Quote | |
I use these parameters in cc_config.xml So the cc_config.xml would like look like: <cc_config> <options> <max_file_xfers>16</max_file_xfers> <max_file_xfers_per_project>8</max_file_xfers_per_project> </options> </cc_config> | |
ID: 53162 | Rating: 0 | rate: / Reply Quote | |
All tasks finally uploaded after a reboot | |
ID: 53164 | Rating: 0 | rate: / Reply Quote | |
I have a few tasks that have not uploaded for a while and all have "Upload Pending Project Backoff" Do I just let them sit there and wait till they upload. I have tried stopping and starting BOINC but that did not fix it. | |
ID: 53245 | Rating: 0 | rate: / Reply Quote | |
01.12.2019 10:51:50 | GPUGRID | [error] Error reported by file upload server: Server is out of disk space | |
ID: 53246 | Rating: 0 | rate: / Reply Quote | |
This is being treated on this other thread: | |
ID: 53247 | Rating: 0 | rate: / Reply Quote | |
Please be patient, no need to abort | |
ID: 53250 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : have a lot of stuck tasks, abort some?