Message boards : Number crunching : GPUGRID and Linux
Author | Message |
---|---|
Hello, | |
ID: 33586 | Rating: 0 | rate: / Reply Quote | |
Generally there is a lack of such sensor readers on Linux, however there is just about enough to get by on. | |
ID: 33592 | Rating: 0 | rate: / Reply Quote | |
Hello: One of the best ways to control the system possibly be "Gkrellm" capable of displaying all values; processor usage, Temperatures (CPU and GPU) fans, Processes etc. .. | |
ID: 33593 | Rating: 0 | rate: / Reply Quote | |
How do you get it to show GPU temperature or fan speed? | |
ID: 33635 | Rating: 0 | rate: / Reply Quote | |
Hello: control fan speed on the GPUs is through "Coolbits" in Nvidia-Settings, the new version shows the fan speed of the GPU if it is supported. | |
ID: 33666 | Rating: 0 | rate: / Reply Quote | |
Carlesa, that would be useful. | |
ID: 33669 | Rating: 0 | rate: / Reply Quote | |
I've seen reports at Einstein that the latest 331 linux beta drivers finally report some more diagnostic information, even listing GPU usage. | |
ID: 33676 | Rating: 0 | rate: / Reply Quote | |
Hello: I'm using Nvidia 331.17 for days and the basic difference with the above is that it shows the fan speed of the GPU and this supports it. | |
ID: 33678 | Rating: 0 | rate: / Reply Quote | |
Hello: I'm using Nvidia 331.17 for days and the basic difference with the above is that it shows the fan speed of the GPU and this supports it. Hello: Sorry I had misunderstood. If that information is the use of GPUs. % Use of the GPU % Use of PCIe Bandwidth % Use video engine At the moment I can not represent you in an outer panel GKrellm type. Greetings. Nvidia-Settings- Use % GPU... etc. | |
ID: 33679 | Rating: 0 | rate: / Reply Quote | |
Hello, I run GPUgrid exclusively on Linux so I'm afraid I have no idea what information MJH makes available in stderr. Linux has apps for measuring system temp and the Linux nvidia drivers provide, IMHO, all the info one could possibly want about nvidia GPUs running in the system. The nvidia-settings binary is the interface to driver. It reads and reports state (temp, %usage, freq, etc.) and allows altering parameters such as fan speed, freq, etc. The problem with nvidia-settings is that it is pretty much a CLI application. If you run it with no command line parameters it pops up a GUI interface but that interface is rather limited, IMHO. If you run it with command line parameters it does not invoke the GUI interface and stays in CLI mode. You could say it has a split-personality disorder. Also, the CLI mode is extremely cryptic. I'm not sure how everybody else feels about it but IMHO the biggest problems with the nvidia drivers on Linux and the nvidia-settings interface are: 1. You can use the interface to either overclock or control fan speed but not both. If you setup to overclock (with Coolbits 1) then nvidia takes away the fan speed part of the interface and runs the fan speed on auto. You can read the fan speed but not adjust it. If setup for manual fan speed adjustment (with Coolbits 4) then you can read freqs and volts but you cannot adjust them. 2. When the fan speed is in auto mode (i.e. you have used Coolbits 1 or have not set Coolbits at all) the user has no control over the threshold temperature (aka target temp). In auto mode the driver keeps the temp hovering between 82C to 85C and sometimes it spikes to 88C on my system. I would like it to run cooler, somewhere closer to 70C. To overcome 1, you can hack the card and remove the PWM control line to make the fan run at 100% all the time. Then you can set Coolbits 1, overclock, and hopefully not go above 80C temperature. Another way to overcome 1 is to build a programmable PWM fan speed controller and use it to control fan speed the way YOU want it controlled. I've been checking that out and it's doable for about $20 (parts only, no labor). The fixed threshold temperature I mention in 2 above can be overcome by setting Coolbits 4 and setting the fan to a high speed. That works if the extra noise is not a problem. If the noise is a problem then it is possible to write a front end to the nvidia-settings app that allows you to specify whatever threshold temp you wish and auto adjusts fan speed to maintain that threshold. I wrote such a front end in Python, it worked very well, but later I simply detached the fan from the onboard speed controller and let it run at 100%. I don't know what MJH has planned. Tell me what kind of info you would like to see and I might program it for you. Or, if you can code and want to roll your own then I might be able to get you pointed in the right direction re: tools, docs, etc. ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 33732 | Rating: 0 | rate: / Reply Quote | |
Hi $ nvidia-settings --help nvidia-smi (CLI-Mode) see $ nvidia-smi --help regards | |
ID: 33733 | Rating: 0 | rate: / Reply Quote | |
Hello Dagorath, in these times the last driver 331.17 Linux (Beta) and Nvidia-Settings set to "Coolbits 4" you have access to the necessary information from the GPU and a very good manual fan control with nothing to envy options in Windows. | |
ID: 33740 | Rating: 0 | rate: / Reply Quote | |
Hi Carlesa, | |
ID: 33749 | Rating: 0 | rate: / Reply Quote | |
Hello: The manual fan control by Coolbits 4 works perfectly, you can look at the thread "Coolbits" to review how to install. | |
ID: 33751 | Rating: 0 | rate: / Reply Quote | |
Hello: The manual fan control by Coolbits 4 works perfectly, you can look at the thread "Coolbits" to review how to install. You can manually reset or you can run a script that specifies a target temperature (eg. 70C) and calls nvidia-settings --query localhost:0[thermalsensor:0]/ThermalSensorReading to query the current temp. If current temp > target temp then call nvidia-settings --assign localhost:0[fan:0]/GPUCurrentFanSpeed=X where X = an interger > current fan speed If current temp < target temp then call nvidia-settings --assign localhost:0[fan:0]/GPUCurrentFanSpeed=X where X = an interger < current fan speed. You can programatically switch from manual fan control to auto fan control with nvidia-settings --assign localhost:0[gpu:0]/GPUFanControlState=1 Pass 0 instead of 1 to the above command to switch back to auto fan control. Run the commands in a loop for continuous auto temp control to your target temp not nvidia's target temp. edit: spelling correction, localhst to localhost ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 33753 | Rating: 0 | rate: / Reply Quote | |
Hello Dagorath thanks for the interesting information new to me, I will try to use it if I can. | |
ID: 33755 | Rating: 0 | rate: / Reply Quote | |
Hi Carlesa, | |
ID: 33756 | Rating: 0 | rate: / Reply Quote | |
Hi Carlesa, Hello: It would be great facilitates things, be good to use their experience, I will watch thanks. Greetings. | |
ID: 33758 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : GPUGRID and Linux