New multicore app and WUs

Message boards : News : New multicore app and WUs
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 5 · Next

AuthorMessage
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 48127 - Posted: 10 Nov 2017, 14:58:07 UTC
Last modified: 10 Nov 2017, 15:10:10 UTC

Dears,

we would like to test our new CPU multicore application for quantum chemistry tasks ("QC"). Since it’s the first time we have a CPU app out, I’ll test the behavior of GPUGRID with a relatively large batch that you will see soon. Workunits are named "*QC309big*".

Here’s some features of the app, in short (subject to change):

* Platform: Linux only for now, generic x64.
* Threads: as many as Boinc decides. I guess it depends on your machine, your preferences, and other running tasks in ways which are obscure to me…
* Run time: about 1 CPU hour per WU (so, shorter if multithreading)
* Credit: computed with the default algorithm (tasks are short, don’t expect much). Bonus mechanism for fast turnaround is still on.
* Known bugs: restarts and checkpoints. This should be mitigated with the “keep in memory when suspended” option. Sorry about that, it’s outside of our control.
* Network behavior: the first time you get a WU of this kind it downloads a Python interpreter (miniconda) and then some open-source packages, and installs them in the project directory. The installation is reused whenever possible.
* Disk usage: could go around 1 GB, perhaps more when tasks are running. Resetting the project should remove everything.
* Memory usage: should be around 1 GB when running.

Depending on the results of this test, we’ll start thinking about other platforms.

Thanks and nice crunching!

Toni
ID: 48127 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Sergey Kovalchuk

Send message
Joined: 18 Feb 16
Posts: 6
Credit: 1,121,331
RAC: 0
Level
Ala
Scientific publications
wat
Message 48130 - Posted: 10 Nov 2017, 15:37:26 UTC - in response to Message 48127.  

the client does not receive WUs, although there are almost a thousand of them and the client is suitable for the requirements (Linux x64). earlier this host was able to receive test tasks for QC and python

please write the exact requirements (memory, disk, OS) specified when generating tasks
ID: 48130 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 48131 - Posted: 10 Nov 2017, 15:41:06 UTC - in response to Message 48130.  
Last modified: 10 Nov 2017, 15:44:30 UTC

Can you check what applications are you accepting in your preferences?

By the way requests are currently as follows:

    <rsc_fpops_est>3e12</rsc_fpops_est>
    <rsc_fpops_bound>250e15</rsc_fpops_bound>
    <rsc_disk_bound>4e9</rsc_disk_bound>
    <rsc_memory_bound>1e9</rsc_memory_bound>
ID: 48131 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Sergey Kovalchuk

Send message
Joined: 18 Feb 16
Posts: 6
Credit: 1,121,331
RAC: 0
Level
Ala
Scientific publications
wat
Message 48132 - Posted: 10 Nov 2017, 16:05:30 UTC - in response to Message 48131.  

All apps selected & "accept work from other"


Preferences:
max memory usage when active: 1900.76MB
max memory usage when idle: 1980.80MB
max disk usage: 6.71GB (4,47 free)
ID: 48132 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 48133 - Posted: 10 Nov 2017, 16:09:28 UTC - in response to Message 48132.  

Another boinc mystery...
ID: 48133 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 48134 - Posted: 10 Nov 2017, 16:44:59 UTC - in response to Message 48133.  

Jobs only seem to go to a subset of eligible machines. If anybody out there has a clue of the reason, I'll be glad to hear.

ID: 48134 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,798,881,008
RAC: 343
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48135 - Posted: 10 Nov 2017, 17:25:12 UTC

All error out with this:
Stderr output

<core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)
</message>
<stderr_txt>
12:19:41 (31019): wrapper (7.7.26016): starting
12:19:41 (31019): wrapper (7.7.26016): starting
12:19:41 (31019): wrapper: running ../../projects/www.gpugrid.net/Miniconda3-4.3.30-Linux-x86_64.sh (-b -f -p /var/lib/boinc-client/projects/www.gpugrid.net/miniconda)
Python 3.6.3 :: Anaconda, Inc.
12:19:49 (31019): miniconda-installer exited; CPU time 6.649529
12:19:49 (31019): wrapper: running /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/python (pre_script.py)
12:19:59 (31019): $PROJECT_DIR/miniconda/bin/python exited; CPU time 7.101246
12:19:59 (31019): wrapper: running /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4 (-n 14 -i psi4.in -o psi4.out)
/var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: 3: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: readlink: not found
/var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: 9: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: /bin/psi4.bin: not found
12:20:00 (31019): $PROJECT_DIR/miniconda/bin/psi4 exited; CPU time 0.001541
12:20:00 (31019): app exit status: 0x7f
12:20:00 (31019): called boinc_finish(195)

</stderr_txt>
]]>
It is this computer:
http://www.gpugrid.net/show_host_detail.php?hostid=420971
ID: 48135 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48136 - Posted: 10 Nov 2017, 17:53:41 UTC

All error out after a few seconds on AMD and Intel machines
<core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)
</message>
<stderr_txt>
17:27:46 (14006): wrapper (7.7.26016): starting
17:27:46 (14006): wrapper (7.7.26016): starting
17:27:46 (14006): wrapper: running ../../projects/www.gpugrid.net/Miniconda3-4.3.30-Linux-x86_64.sh (-b -f -p /var/lib/boinc-client/projects/www.gpugrid.net/miniconda)
Python 3.6.3 :: Anaconda, Inc.
17:27:54 (14006): miniconda-installer exited; CPU time 6.648000
17:27:54 (14006): wrapper: running /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/python (pre_script.py)
17:28:05 (14006): $PROJECT_DIR/miniconda/bin/python exited; CPU time 7.584000
17:28:05 (14006): wrapper: running /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4 (-n 15 -i psi4.in -o psi4.out)
/var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: 3: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: readlink: not found
/var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: 9: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: /bin/psi4.bin: not found
17:28:06 (14006): $PROJECT_DIR/miniconda/bin/psi4 exited; CPU time 0.000000
17:28:06 (14006): app exit status: 0x7f
17:28:06 (14006): called boinc_finish(195)
ID: 48136 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [AF] fansyl

Send message
Joined: 26 Sep 13
Posts: 20
Credit: 1,714,356,441
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48137 - Posted: 10 Nov 2017, 18:09:57 UTC
Last modified: 10 Nov 2017, 18:11:15 UTC

Hello,

error on my computer: Ubuntu mate 16.04/kernel 4.13.11/Ryzen 5 1400

Stderr output

<core_client_version>7.6.31</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)
</message>
<stderr_txt>
19:00:23 (31619): wrapper (7.7.26016): starting
19:00:23 (31619): wrapper (7.7.26016): starting
19:00:23 (31619): wrapper: running ../../projects/www.gpugrid.net/Miniconda3-4.3.30-Linux-x86_64.sh (-b -f -p /var/lib/boinc-client/projects/www.gpugrid.net/miniconda)
Python 3.6.3 :: Anaconda, Inc.
19:00:33 (31619): miniconda-installer exited; CPU time 8.382948
19:00:33 (31619): wrapper: running /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/python (pre_script.py)
19:03:37 (31619): $PROJECT_DIR/miniconda/bin/python exited; CPU time 63.497739
19:03:37 (31619): wrapper: running /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4 (-n 7 -i psi4.in -o psi4.out)
/var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: 3: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: readlink: not found
/var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: 9: /var/lib/boinc-client/projects/www.gpugrid.net/miniconda/bin/psi4: /bin/psi4.bin: not found
19:03:38 (31619): $PROJECT_DIR/miniconda/bin/psi4 exited; CPU time 0.002335
19:03:38 (31619): app exit status: 0x7f
19:03:38 (31619): called boinc_finish(195)

</stderr_txt>
]]>


Good luck for debug
ID: 48137 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Level
Ser
Scientific publications
watwatwatwat
Message 48138 - Posted: 10 Nov 2017, 18:21:56 UTC - in response to Message 48137.  

Dears, all three errors mention a missing "readlink" executable. It is surprising, because it's a fairly basic command, but please check if you can run "readlink" in a terminal. If not installed, should be in the "coreutils" package.
ID: 48138 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48139 - Posted: 10 Nov 2017, 18:40:21 UTC

It is installed
readlink --version
readlink (GNU coreutils) 8.26
Copyright (C) 2016 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>.
ID: 48139 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
klepel

Send message
Joined: 23 Dec 09
Posts: 189
Credit: 4,798,881,008
RAC: 343
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48140 - Posted: 10 Nov 2017, 19:01:39 UTC

Same here. It is installed readlink version 8.26.
ID: 48140 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 17 Sep 16
Posts: 5
Credit: 382,453,727
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwat
Message 48141 - Posted: 10 Nov 2017, 19:12:06 UTC

I also have problem with getting new WUs on some of my machines. Looks that ones with Nvidia card get work, and ones without it do not get anything.
ID: 48141 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PappaLitto

Send message
Joined: 21 Mar 16
Posts: 513
Credit: 4,673,458,277
RAC: 0
Level
Arg
Scientific publications
watwatwatwatwatwatwatwat
Message 48142 - Posted: 10 Nov 2017, 19:21:43 UTC

Is there a particular reason this is a CPU application and not a GPU one?
ID: 48142 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jul 16
Posts: 338
Credit: 7,987,341,558
RAC: 197,587
Level
Tyr
Scientific publications
watwatwatwatwat
Message 48144 - Posted: 10 Nov 2017, 23:02:51 UTC - in response to Message 48140.  

Same here. It is installed readlink version 8.26.



Same here.

NNW until there's a fix.
ID: 48144 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [B@P] Daniel

Send message
Joined: 17 Sep 16
Posts: 5
Credit: 382,453,727
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwat
Message 48145 - Posted: 11 Nov 2017, 0:06:59 UTC - in response to Message 48144.  
Last modified: 11 Nov 2017, 0:19:28 UTC

Same here. It is installed readlink version 8.26.



Same here.

NNW until there's a fix.

On Linux CentOS 7.4 is works fine. I suspect that bolinc is not able to find or execute readlink cmd. Please try executing following commands:

which readlink
ls -l `which readlink`
sudo -iu boinc bash -c 'which readlink'
sudo -iu boinc bash -c 'ls -l `which readlink`'
sudo -iu boinc readlink /lib/libz.so.1


On my CentOS they return following results:

# which readlink
/usr/bin/readlink
# ls -l `which readlink`
-rwxr-xr-x. 1 root root 41800 2016-11-05  /usr/bin/readlink
# sudo -iu boinc bash -c 'which readlink'
/bin/readlink
# sudo -iu boinc bash -c 'ls -l `which readlink`'
-rwxr-xr-x. 1 root root 41800 2016-11-05  /bin/readlink
# sudo -iu boinc readlink /lib/libz.so.1
libz.so.1.2.7

ID: 48145 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48146 - Posted: 11 Nov 2017, 1:25:35 UTC

# which readlink
/bin/readlink


# ls -l `which readlink`
-rwxr-xr-x 1 root root 43192 Oct 4 20:56 /bin/readlink

The following return nothing
# sudo -iu boinc bash -c 'which readlink'
# sudo -iu boinc bash -c 'ls -l `which readlink`'
# sudo -iu boinc readlink /lib/libz.so.1
ID: 48146 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 2 Jul 16
Posts: 338
Credit: 7,987,341,558
RAC: 197,587
Level
Tyr
Scientific publications
watwatwatwatwat
Message 48147 - Posted: 11 Nov 2017, 2:51:59 UTC - in response to Message 48146.  

Commands do not work for me either.
ID: 48147 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Trotador

Send message
Joined: 25 Mar 12
Posts: 103
Credit: 14,948,929,771
RAC: 12,866
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48148 - Posted: 11 Nov 2017, 9:13:37 UTC

So, I copied readlink program to usr/bin and now it is working in my ubuntu hosts.
ID: 48148 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 0
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 48149 - Posted: 11 Nov 2017, 10:27:00 UTC - in response to Message 48148.  
Last modified: 11 Nov 2017, 10:35:56 UTC

Readlink path usually is /usr/bin but it depend on various packaging and configuration provided by the distro

Don't copy the file from /bin to /usr/bin (or whatever)

just create a symlink. If for same reason readlink will be updated , the file you've copied will not updated

$ sudo ln -sf /bin/readlink /usr/bin/readlink


PS : my readlink path is
$ which readlink
/usr/bin/readlink
ID: 48149 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · 3 · 4 . . . 5 · Next

Message boards : News : New multicore app and WUs

©2025 Universitat Pompeu Fabra