Important news for Linux crunchers

Message boards : News : Important news for Linux crunchers
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile MJH
Project administrator
Project developer
Project scientist

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 36577 - Posted: 23 Apr 2014, 8:11:22 UTC
Last modified: 24 Apr 2014, 8:33:41 UTC

Hi all,

We are having a problem with correctly scheduling work to Linux clients because the BOINC client for that platform does not report the Nvidia driver version to us.

If you find that you are receiving the cuda60 application version and it is not working for you, the fixes are:

*) Upgrade to driver 334.21 or later

*) Try the patched (v7.3.15) client I have made available at:

http://secure.acellera.com/insecure/boinc/boinc (RedHat etc)
http://secure.acellera.com/insecure/boinc/boinc_ubuntu (Ubuntu)

(the patch itself is at http://secure.acellera.com/insecure/boinc/gpu_nvidia.cpp_patch )

This client reports the driver version and allows a correct scheduling decision to be made.

Matt
ID: 36577 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
HA-SOFT, s.r.o.

Send message
Joined: 3 Oct 11
Posts: 100
Credit: 5,879,292,399
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36590 - Posted: 23 Apr 2014, 15:40:49 UTC - in response to Message 36577.  

Did you send this path to boinc developers also? This is a very annoying thing in linux version.

Thanks
Zdenek
ID: 36590 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Richard Haselgrove

Send message
Joined: 11 Jul 09
Posts: 1639
Credit: 9,948,917,649
RAC: 8,720,931
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36591 - Posted: 23 Apr 2014, 15:57:55 UTC - in response to Message 36590.  

If Matt doesn't, I'd be happy to.

I imagine he's waiting for some positive feedback to confirm that the patched version works as intended, without any unexpected side effects.
ID: 36591 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH
Project administrator
Project developer
Project scientist

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 36593 - Posted: 23 Apr 2014, 18:09:33 UTC - in response to Message 36591.  

Already sent upstream. Should be in the next release.

Matt
ID: 36593 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Stoneageman
Avatar

Send message
Joined: 25 May 09
Posts: 224
Credit: 34,057,374,498
RAC: 0
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36594 - Posted: 23 Apr 2014, 19:29:39 UTC

I'm probably having a senior moment but what am I supposed to do with that file as I can't get it to do anything?
ID: 36594 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH
Project administrator
Project developer
Project scientist

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 36597 - Posted: 23 Apr 2014, 20:13:47 UTC - in response to Message 36594.  

Download it, do a "chmod +x boinc" to make it executable, then run it.

If you normally use the boinc mangler or boinccmd, put it in the same directory as those, replacing the "boinc" executable that is already there (take a backup first).

If your OS isn't a redhat clone, you might need to make symlinks for libssl.so.10 and libcrypto.so.10, to whatever libssl/crypto you have in /usr/lib

Matt

ID: 36597 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 4
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36600 - Posted: 23 Apr 2014, 23:06:46 UTC

I'm running linux mint 16 but I'm getting errors libssl.so.10, libcrypto.so.10 not found when I run your boinc app.

ldd boinc (Matt's version)
libssl.so.10 => not found
libcrypto.so.10 => not found


ldd boinc (7.2.42 version)

libssl.so.1.0.0 => /lib/x86_64-linux-gnu/libssl.so.1.0.0 (0x00007f892f87f000)
libcrypto.so.1.0.0 => /lib/x86_64-linux-gnu/libcrypto.so.1.0.0 (0x00007f892f4a2000)


symlinks:

sudo ln -s /lib/x86_64-linux-gnu/libssl.so.1.0.0 /lib/x86_64-linux-gnu/libssl.so.10
sudo ln -s /lib/x86_64-linux-gnu/libcrypto.so.1.0.0 /lib/x86_64-linux-gnu/libcrypto.so.10


ldd boinc (Matt's version)
libssl.so.10 => /lib/x86_64-linux-gnu/libssl.so.10 (0x00007f1823727000)
libcrypto.so.10 => /lib/x86_64-linux-gnu/libcrypto.so.10 (0x00007f182334a000)

Everything looks ok, but when I run Matt's boinc app, I get an error "libssl.so.10, libcrypto.so.10 not found"

Did I miss anything?



ID: 36600 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile MJH
Project administrator
Project developer
Project scientist

Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 36605 - Posted: 24 Apr 2014, 8:34:05 UTC - in response to Message 36600.  

ID: 36605 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 4
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36606 - Posted: 24 Apr 2014, 8:51:23 UTC - in response to Message 36605.  

Try this build:

http://secure.acellera.com/insecure/boinc/boinc_ubuntu



ldd boinc_ubuntu

libssl.so.1.0.0 => /lib/x86_64-linux-gnu/libssl.so.1.0.0 (0x00007fdaaf900000)
libcrypto.so.1.0.0 => /lib/x86_64-linux-gnu/libcrypto.so.1.0.0 (0x00007fdaaf523000)

Thu 24 Apr 2014 04:49:30 AM EDT | | CUDA: NVIDIA GPU 0: GeForce GTX 780 Ti (driver version 337.12, CUDA version 6.0, compute capability 3.5, 3072MB, 2759MB available, 5345 GFLOPS peak)

It works! thanks!
ID: 36606 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 352,170
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36607 - Posted: 24 Apr 2014, 11:13:10 UTC

Hi

Without any patch(es) , no problems found

Gentoo-10

1			24/04/2014 13.09.11	Starting BOINC client version 7.2.33 for x86_64-pc-linux-gnu	

Gentoo-10

6			24/04/2014 13.09.11	CUDA: NVIDIA GPU 0: GeForce GTX 780 (driver version unknown, CUDA version 6.0, compute capability 3.5, 3071MB, 2989MB available, 4818 GFLOPS peak)	
7			24/04/2014 13.09.11	CUDA: NVIDIA GPU 1: GeForce GTX 780 (driver version unknown, CUDA version 6.0, compute capability 3.5, 3072MB, 3002MB available, 4818 GFLOPS peak)	
8			24/04/2014 13.09.11	OpenCL: NVIDIA GPU 0: GeForce GTX 780 (driver version 334.21, device version OpenCL 1.1 CUDA, 3071MB, 2989MB available, 4818 GFLOPS peak)	
9			24/04/2014 13.09.11	OpenCL: NVIDIA GPU 1: GeForce GTX 780 (driver version 334.21, device version OpenCL 1.1 CUDA, 3072MB, 3002MB available, 4818 GFLOPS peak)	
10			24/04/2014 13.09.11	OpenCL CPU: Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz (OpenCL driver vendor: Intel(R) Corporation, driver version 1.1, device version OpenCL 1.1 (Build 31360.31426))	


ldd /usr/bin/boinc
        linux-vdso.so.1 (0x00007fff317ff000)
        libcurl.so.4 => /usr/lib64/libcurl.so.4 (0x00007f9bb5bec000)
        libssl.so.1.0.0 => /usr/lib64/libssl.so.1.0.0 (0x00007f9bb5981000)
        libcrypto.so.1.0.0 => /usr/lib64/libcrypto.so.1.0.0 (0x00007f9bb55aa000)
        libdl.so.2 => /lib64/libdl.so.2 (0x00007f9bb53a6000)
        libz.so.1 => /lib64/libz.so.1 (0x00007f9bb5190000)
        libX11.so.6 => /usr/lib64/libX11.so.6 (0x00007f9bb4e53000)
        libXss.so.1 => /usr/lib64/libXss.so.1 (0x00007f9bb4c4f000)
        libXext.so.6 => /usr/lib64/libXext.so.6 (0x00007f9bb4a3c000)
        libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f9bb481f000)
        libstdc++.so.6 => /usr/lib/gcc/x86_64-pc-linux-gnu/4.8.2/libstdc++.so.6 (0x00007f9bb4515000)
        libm.so.6 => /lib64/libm.so.6 (0x00007f9bb4220000)
        libgcc_s.so.1 => /usr/lib/gcc/x86_64-pc-linux-gnu/4.8.2/libgcc_s.so.1 (0x00007f9bb400a000)
        libc.so.6 => /lib64/libc.so.6 (0x00007f9bb3c5f000)
        libldap-2.4.so.2 => /usr/lib64/libldap-2.4.so.2 (0x00007f9bb3a15000)
        liblber-2.4.so.2 => /usr/lib64/liblber-2.4.so.2 (0x00007f9bb3805000)
        libresolv.so.2 => /lib64/libresolv.so.2 (0x00007f9bb35ee000)
        /lib64/ld-linux-x86-64.so.2 (0x00007f9bb5e4f000)
        libxcb.so.1 => /usr/lib64/libxcb.so.1 (0x00007f9bb33cd000)
        libXau.so.6 => /usr/lib64/libXau.so.6 (0x00007f9bb31c9000)
        libXdmcp.so.6 => /usr/lib64/libXdmcp.so.6 (0x00007f9bb2fc3000)


ID: 36607 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 352,170
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36675 - Posted: 26 Apr 2014, 16:42:30 UTC
Last modified: 26 Apr 2014, 17:09:42 UTC

Hi huys :D

I have many WUs fault with 780Ti and cuda6



in gpugrid work directory have a problems with libcudart and libcufft

libraries exist :
$ ls -lh | grep lib
-rw-r--r-- 1 saba users 360K 16 mar 14.20 libcudart.so.4.2.9
-rwxr-xr-x 1 saba users 322K 12 apr 19.12 libcudart.so.6.0
-rw-r--r-- 1 saba users  30M 16 mar 14.20 libcufft.so.4.2.9
-rwxr-xr-x 1 saba users  32M 12 apr 19.41 libcufft.so.6.0


but excutable doesn't find them
$ ldd acemd.821-60.bin
        linux-vdso.so.1 (0x00007fffef10c000)
        libdl.so.2 => /lib64/libdl.so.2 (0x00007fd097934000)
        libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fd097717000)
        libnvidia-ml.so.1 => /usr/lib64/libnvidia-ml.so.1 (0x00007fd097478000)
        libcuda.so.1 => /usr/lib64/libcuda.so.1 (0x00007fd096513000)
        libcudart.so.6.0 => not found
        libcufft.so.6.0 => not found
        libstdc++.so.6 => /usr/lib/gcc/x86_64-pc-linux-gnu/4.8.2/libstdc++.so.6 (0x00007fd09620a000)
        libm.so.6 => /lib64/libm.so.6 (0x00007fd095f15000)
        libgcc_s.so.1 => /usr/lib/gcc/x86_64-pc-linux-gnu/4.8.2/libgcc_s.so.1 (0x00007fd095cff000)
        libc.so.6 => /lib64/libc.so.6 (0x00007fd095954000)
        /lib64/ld-linux-x86-64.so.2 (0x00007fd097b38000)
        libz.so.1 => /lib64/libz.so.1 (0x00007fd09573e000)
        librt.so.1 => /lib64/librt.so.1 (0x00007fd095536000)


will try to reset the proiject

now I'm installing cuda-6.0.37 and I will create a symlink to gpugrid work directory

[edit] libraries missing for all excutables
$ acemd.800-42.bin
acemd.800-55.bin
acemd.821-60.bin


libcudart.so.4 => not found
        libcufft.so.4 => not found
ID: 36675 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile microchip
Avatar

Send message
Joined: 4 Sep 11
Posts: 110
Credit: 326,102,587
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36740 - Posted: 30 Apr 2014, 15:01:47 UTC
Last modified: 30 Apr 2014, 15:03:14 UTC

I'm still getting errors on the CUDA6 short tasks on my GTX 560. Yes, I tried the boinc version in this thread. Doesn't make a difference. Almost 98% of all CUDA6 tasks error out either immediately or they crunch for some time and then error out. Error codes don't give a clue about what has gone wrong

GPUGRID is the buggiest project I've crunched for thus far

Team Belgium
ID: 36740 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
sis651

Send message
Joined: 25 Nov 13
Posts: 66
Credit: 239,224,028
RAC: 2,335,290
Level
Leu
Scientific publications
watwatwatwatwatwatwatwatwat
Message 36761 - Posted: 1 May 2014, 3:12:32 UTC - in response to Message 36740.  
Last modified: 1 May 2014, 3:12:54 UTC

I use the patched (v7.3.15) ubuntu client file above with nvidia-331.38 driver. GPUgrid doesn't get CUDA 6 jobs and run fine with lower version CUDA jobs.
ID: 36761 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
biodoc

Send message
Joined: 26 Aug 08
Posts: 183
Credit: 10,085,929,375
RAC: 4
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36762 - Posted: 1 May 2014, 8:08:09 UTC - in response to Message 36675.  

Hi huys :D


now I'm installing cuda-6.0.37 and I will create a symlink to gpugrid work directory

[edit] libraries missing for all excutables
$ acemd.800-42.bin
acemd.800-55.bin
acemd.821-60.bin


libcudart.so.4 => not found
        libcufft.so.4 => not found


As Matt said in the original post, you'll need nvidia driver 334.21 or later to run cuda 6.0 tasks. If you have installed the right drivers, try the following.

I would uninstall cuda-6.0.37. The project provides the necessary cuda libraries in the GPUgrid project directory.

Try adding the path to the GPUGrid folder to a new file in /etc/ld.so.conf.d directory.

sudo nano /etc/ld.so.conf.d/gpugrid.conf

on a single line, type in the complete path to your GPUgrid folder and then save the file.

next:

sudo ldconfig

Then check to see if the app finds the libraries.
ID: 36762 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
GPUGRID Role account

Send message
Joined: 15 Feb 07
Posts: 134
Credit: 1,349,535,983
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 36764 - Posted: 1 May 2014, 12:01:27 UTC - in response to Message 36762.  

You don't need to install CUDA yourself. We ship all the necessary runtime libraries along with the application.

Matt
ID: 36764 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
GPUGRID Role account

Send message
Joined: 15 Feb 07
Posts: 134
Credit: 1,349,535,983
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwat
Message 36765 - Posted: 1 May 2014, 12:05:09 UTC - in response to Message 36740.  

microchip,

Looking at the results from your GTX560 machine (#167410), I would say that your GPU is bad.

Matt
ID: 36765 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile microchip
Avatar

Send message
Joined: 4 Sep 11
Posts: 110
Credit: 326,102,587
RAC: 0
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36767 - Posted: 1 May 2014, 19:57:15 UTC - in response to Message 36765.  

microchip,

Looking at the results from your GTX560 machine (#167410), I would say that your GPU is bad.

Matt


I doubt that. I've done stress tests on both its memory and the GPU itself. It passes every time. It crunched without issues in the past the CUDA5 apps. Every since CUDA6 tasks came, they started to fail on it

Team Belgium
ID: 36767 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile skgiven
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36774 - Posted: 2 May 2014, 11:44:41 UTC - in response to Message 36767.  

I have a Gigabyte (GV-N66TOC-2GC) GTX660Ti which failed about 1/4 tasks under Linux. Tried different generations of Linux, system updates and a range of different drivers (from 304 to 337) and it kept failing work. It's a dual fan model and ran fairly cool and quiet, so temp probably wasn't an issue.

Since I swapped it into a Win7 system 3days ago it hasn't failed a single WU...
The GTX670 that went from the W7 to the Ubuntu 14.04 system hasn't failed either!

I've had a similar experience with a GTX660 and a GTX260 in the past. Such problems could be down to some oddity between the driver, OS and card, or be an OS, motherboard chipset issue or even a card firmware revision/driver problem.
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help
ID: 36774 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [VENETO] sabayonino

Send message
Joined: 4 Apr 10
Posts: 50
Credit: 650,142,596
RAC: 352,170
Level
Lys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36778 - Posted: 2 May 2014, 18:40:01 UTC - in response to Message 36762.  
Last modified: 2 May 2014, 18:43:30 UTC


I would uninstall cuda-6.0.37. The project provides the necessary cuda libraries in the GPUgrid project directory.

Try adding the path to the GPUGrid folder to a new file in /etc/ld.so.conf.d directory.

sudo nano /etc/ld.so.conf.d/gpugrid.conf

on a single line, type in the complete path to your GPUgrid folder and then save the file.

next:

sudo ldconfig

Then check to see if the app finds the libraries.


tnx for the trick
cuda is installed in /opt/ (outside boinc directory)

now i'm crunching 1 WU with cuda4.2

so ..
$ ldd acemd.800-42.bin
        linux-vdso.so.1 (0x00007fffeffff000)
        libdl.so.2 => /lib64/libdl.so.2 (0x00007f050160e000)
        libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f05013f1000)
        libnvidia-ml.so.1 => /usr/lib64/libnvidia-ml.so.1 (0x00007f0501152000)
        libcuda.so.1 => /usr/lib64/libcuda.so.1 (0x00007f05001ed000)
        libcudart.so.4 => /home/saba/boinc/projects/www.gpugrid.net/libcudart.so.4 (0x00007f04fff93000)
        libcufft.so.4 => /home/saba/boinc/projects/www.gpugrid.net/libcufft.so.4 (0x00007f04fdf6b000)
        libstdc++.so.6 => /usr/lib/gcc/x86_64-pc-linux-gnu/4.8.2/libstdc++.so.6 (0x00007f04fdc62000)
        libm.so.6 => /lib64/libm.so.6 (0x00007f04fd96d000)
        libgcc_s.so.1 => /usr/lib/gcc/x86_64-pc-linux-gnu/4.8.2/libgcc_s.so.1 (0x00007f04fd757000)
        libc.so.6 => /lib64/libc.so.6 (0x00007f04fd3ac000)
        /lib64/ld-linux-x86-64.so.2 (0x00007f0501812000)
        libz.so.1 => /lib64/libz.so.1 (0x00007f04fd196000)
        librt.so.1 => /lib64/librt.so.1 (0x00007f04fcf8e000)


waiting to finish
ID: 36778 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Carlesa25
Avatar

Send message
Joined: 13 Nov 10
Posts: 328
Credit: 72,619,453
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 36783 - Posted: 3 May 2014, 11:53:25 UTC
Last modified: 3 May 2014, 12:00:47 UTC

Hi, I think that says something does not work.

Right now I have installed the Nvidia driver 331.67 and BOINC 7.3.15 (the GPUGRID recommended) reporting the cuda version installed, but I get only tasks for CUDA 4.2 but step BOINC 7.2.42 (recommended version) I have tasks CUDA 6 . Greetings.
ID: 36783 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · 3 · Next

Message boards : News : Important news for Linux crunchers

©2025 Universitat Pompeu Fabra