GTX 970 switching to default clock value (1152MHz) after a while
| Author | Message |
|---|---|
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
After I had a similar problem with one of my GTX750ti cards some 4 weeks ago, today my GTX970 switched back to its default clock of 1152MHz while crunching a PABLO_adaptive_goal_KIX task. Only after rebooting the PC could the clock value be increased via NVIDIA Inspector. However, after a few minutes, the GPU clock falls to 1152MHz again. At this clock, TDP usage is about 55% (whereas it was about 80-90% [depending on the specific task] when running at around 1380MHz). And again, the clock value of 1152MHz cannot be changed manually in either direction, except by rebooting the system. BTW, NVIDIA Inspector and GPU-Z show exactly the same values. Both the GTX750ti and the GTX970 have been crunching for about 1 year. Do I have to assume that the cards become defective after a crunching period of about 1 year? BTW, I always took good care of the temperature situation; neither GPU ever got warmer than 60-63°C. Has anyone else had the same kind of experience? |
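One way to put numbers on what NVIDIA Inspector and GPU-Z display is to log the same readings from the command line. The sketch below is not from the thread: it assumes the `nvidia-smi` tool that ships with the NVIDIA driver is on the PATH, and the field selection and the names `Sample`, `parse_sample`, and `read_sample` are illustrative choices, not anything the posters used.

```python
import math
import subprocess
from dataclasses import dataclass

# Fields requested from nvidia-smi. The selection is an assumption, chosen to
# mirror the values discussed in the post (clock, load, power, temperature).
QUERY_FIELDS = "clocks.sm,utilization.gpu,power.draw,temperature.gpu"


@dataclass
class Sample:
    sm_clock_mhz: float
    gpu_load_pct: float
    power_w: float
    temp_c: float


def _to_float(text: str) -> float:
    """Convert one CSV field; readings the card does not report become NaN."""
    try:
        return float(text)
    except ValueError:
        return math.nan


def parse_sample(csv_line: str) -> Sample:
    """Parse one line of `--format=csv,noheader,nounits` output."""
    parts = [p.strip() for p in csv_line.split(",")]
    if len(parts) != 4:
        raise ValueError(f"expected 4 fields, got {len(parts)}: {csv_line!r}")
    clock, load, power, temp = (_to_float(p) for p in parts)
    return Sample(clock, load, power, temp)


def read_sample(gpu_index: int = 0) -> Sample:
    """Query the driver once; needs an NVIDIA GPU and nvidia-smi installed."""
    out = subprocess.run(
        ["nvidia-smi", f"--id={gpu_index}",
         f"--query-gpu={QUERY_FIELDS}",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_sample(out.strip().splitlines()[0])
```

Calling `read_sample()` in a loop every few seconds and writing the results to a file would show when the clock falls to 1152MHz and what load, power, and temperature were at that moment. Note that `power.draw` is reported in watts, not as the percentage of TDP that NVIDIA Inspector shows.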
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
Yes, it is very intermittent and random, and it has occurred on several of my GPUs, all of them Maxwell. Both of my 970s have dropped to 1164 MHz multiple times, and sometimes they stay there until the computer is rebooted. I believe it has nothing to do with the GPU itself; it is probably a software issue that one of the scientists can investigate. |
|
Joined: 20 Apr 15 Posts: 285 Credit: 1,102,216,607 RAC: 0
|
Can this be the (same) reason why the older 980ti surpasses the 1080, which should be noticeably faster just from its specification? See also this posting: http://www.gpugrid.net/forum_thread.php?id=4494&nowrap=true#46410 I would love to see HCF1 protein folding and interaction simulations to help my little boy... someday. |
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
"Can this be the (same) reason why the older 980ti surpasses the 1080, which should be noticeably faster just from its specification?" Joerg, this thread is about the GPU core downclocking for an unknown reason. The difference between the 980ti and the 1080 is mostly the RAM. Pascal has a lower IPC than Maxwell, so even though its core runs at almost 2 GHz, its performance is very similar to that of the higher-IPC Maxwell. Because this project doesn't utilize the extra speed and bandwidth of the GDDR5X, it's mostly down to the core. The 980ti has 8 billion transistors, while the GP104 die (GTX 1070 and 1080) only has 7.2 billion. Even if the performance of each transistor is ever so slightly higher than on the Maxwell chip, it's hard to make up for 800 million transistors. That, and there could potentially be more software optimization in the CUDA 6.5 app because it's been around longer. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
"Yes, it is very intermittent and random, and it has occurred on several of my GPUs, all of them Maxwell. Both of my 970s have dropped to 1164 MHz multiple times, and sometimes they stay there until the computer is rebooted. I believe it has nothing to do with the GPU itself; it is probably a software issue that one of the scientists can investigate." It's interesting to read that this phenomenon obviously does not occur only with my cards (which would have surprised me anyway). When you say that the reduced GPU clock "sometimes" stays until the computer is rebooted, does this mean that sometimes you are able to raise the clock again without rebooting? In my case (with the GTX750ti as well as with the GTX970) there is no way to change the clock, neither up nor down, until the PC is rebooted. Further, after rebooting the system, can you go back to the same high clock as before and keep it for a lengthy time without the clock reverting to the default value? In my case, the clock reverts within a few minutes. Final question: for how many months have your cards been crunching? I am asking because, as said before, with both of my cards this behaviour started after about a year. It had never happened before, and this seems strange. |
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
"When you say that the reduced GPU clock 'sometimes' stays until the computer is rebooted, does this mean that sometimes you are able to raise the clock again without rebooting? In my case (with the GTX750ti as well as with the GTX970) there is no way to change the clock, neither up nor down, until the PC is rebooted." In my experience, it has returned to full boost on its own before, after a seemingly random amount of time. Often, though, I will wait 8+ hours and it still won't come back. "Further, after rebooting the system, can you go back to the same high clock as before and keep it for a lengthy time without the clock reverting to the default value?" Typically, when I restart, it will be at full boost instantly. "Final question: for how many months have your cards been crunching? I am asking because, as said before, with both of my cards this behaviour started after about a year. It had never happened before, and this seems strange." Both of these cards have been crunching, not just GPUGrid, for well over a year. Temps are typically in the 60s, and at most they barely hit 70. I truly think this is some type of software bug inside the application itself. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
"... I truly think this is some type of software bug inside the application itself." When this problem first happened to one of my GTX750ti cards four weeks ago, the card was crunching BNBS tasks, which are known to put an extremely heavy load on the GPU. When other tasks were running later on, the problem did not recur. Hence, I was even more surprised that the same thing now happened on the GTX970 with a PABLO-adaptive task. As said before, this has never occurred before with any of my 5 GPUs. Could it really be some bug in the recent GPUGRID applications? It would be great to find out whether other crunchers have experienced this problem, too. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
When I took a first look at my PCs this morning, I noticed the following situation on both the one with the GTX970 and the one with the GTX750ti (both crunching a PABLO_adaptive_goal_KIX): In NVIDIA Inspector, the GPU clock was down at 540 MHz(!), memory clock 2700MHz (default), GPU load 0, power between 84 and 89%. Changing the clock values with the sliders was not possible. In GPU-Z, no values were shown at all for GPU clock, memory clock, GPU load, or Video Engine load (which normally is 0 anyway); "no values" means a "-" in the fields where values (or "0") are normally shown. Power consumption shows the same values as in the Inspector. However, the "progress" column of the BOINC manager shows the percentage advancing, as it seems to me (but I might be mistaken) at about the same speed as usual. What's going on with these two cards? |
|
Joined: 21 Mar 16 Posts: 513 Credit: 4,673,458,277 RAC: 0
|
Is the problem still occurring? It keeps happening, but only to my 970s. They are doing a mix of Folding@home and GPUGrid, and they seem to randomly go to stock clock. When I restart the computer they almost always jump right back up to boost clock, but then sometime later they drop down again. The two cards are on different drivers, so it could be an error that persists across driver versions. Sometimes when I start folding on top of GPUGrid it raises the clock to boost, but sometimes it stays at stock clock. I am truly baffled. |
Retvari Zoltan Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0
|
Try to turn off graphic acceleration in your browser, and in Microsoft Excel too (if you have it). |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
"Try to turn off graphic acceleration in your browser, and in Microsoft Excel too (if you have it)." I could not find such a setting in the MS Edge browser (the only one on this system). Whenever crunching PABLO_adaptive_goal WUs, the card shows the same behaviour: the GPU clock falls back (not immediately, but after a while) to the default 1152MHz whenever it's overclocked beyond around 1250MHz :-( |
Retvari Zoltan Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0
|
Do you even browse the internet on this PC? :) "Try to turn off graphic acceleration in your browser, and in Microsoft Excel too (if you have it)." To turn off graphic acceleration in IE and Edge: 1. Press Windows key + R. 2. Type inetcpl.cpl and press Enter. 3. Click the Advanced tab. 4. Scroll down to the graphic acceleration settings. 5. Check "Use software rendering instead of GPU rendering". 6. Click OK. 7. Restart your PC. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
Yes, this is the PC on which I do most of my Internet browsing :-) So I have now changed the settings according to your instructions and restarted the machine. Let's wait and see whether this helps. I will report here. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
"Let's wait and see whether this helps. I will report here." Unfortunately, this change in the settings did not help here. While crunching a PABLO_adaptive_goal, I had the GPU at about 1320MHz, power was about 77-85%, and after 20 minutes or so the GPU clock dropped to the default value of 1152MHz. So either the GPU is defective, or this type of WU does something strange to the GPU. |
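To get a feel for what such a drop costs, here is a small worked calculation. It is only a rough sketch: it computes how far the core clock falls, and reading that as lost crunching speed assumes that throughput scales about linearly with the core clock, which nobody in the thread has measured. The function name `clock_loss_pct` is made up for illustration.

```python
def clock_loss_pct(before_mhz: float, after_mhz: float) -> float:
    """Percentage by which the core clock fell from before_mhz to after_mhz."""
    return (before_mhz - after_mhz) / before_mhz * 100.0


# 1320 MHz is the overclock quoted in this post; 1380 MHz is the value
# mentioned in the opening post. 1152 MHz is the default clock in both cases.
for before in (1320, 1380):
    loss = clock_loss_pct(before, 1152)
    print(f"{before} MHz -> 1152 MHz: clock is {loss:.1f}% lower")
```

Under that linear-scaling assumption, a task would take correspondingly longer at the default clock than at the overclocked setting.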
Retvari Zoltan Joined: 20 Jan 09 Posts: 2380 Credit: 16,897,957,044 RAC: 0
|
Have you tried to update the firmware (BIOS) of the card? |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
"Have you tried to update the firmware (BIOS) of the card?" No. |
|
Joined: 11 Oct 08 Posts: 1127 Credit: 1,901,927,545 RAC: 0
|
I've seen my GTX 980 Ti GPUs sometimes crunch at non-boost 3d clocks (like 1150), when they normally crunch at boost 3d clocks (like 1320). When they are in this "broken mode", while I'm crunching, if I run GPU-Z, then click the question mark "?" to do the "PCI Express Render Test", it shows a graphic, and the GPU clock ramps back up to boost 3d clocks. Then when I close the GPU-Z windows, it goes back down to non-boost 3d clocks, even though I'm still crunching GPUGrid tasks. Long story short: I believe the driver is not correctly recognizing that a 3d compute app wants full boost 3d clocks. I'm not sure if there's a setting the app can make to do that, but I suspect it's a driver bug. And it won't get fixed, unless you can supply a completely reproducible case, and then can report it and get them to listen. I've tried to get that reproducible example, but have never been able to get it. If you have one, please list the steps (all of them, details matter!) here. Thanks, Jacob |
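A reproducible case is easier to build from a timestamped log than from memory. The following is a minimal sketch of how logged readings could be scanned for drops out of boost while the GPU is still busy. The `Reading` type, the `find_clock_drops` function, the 50 MHz tolerance, and the 90% busy threshold are all assumptions made for illustration, and the sample data is invented, loosely modeled on the clocks described above.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Reading:
    seconds: float    # time since logging started
    clock_mhz: float  # reported GPU core clock
    load_pct: float   # reported GPU utilization


def find_clock_drops(readings: List[Reading],
                     boost_mhz: float,
                     tolerance_mhz: float = 50.0,
                     busy_pct: float = 90.0) -> List[Tuple[float, float, float]]:
    """Return (time, clock_before, clock_after) for each drop under load.

    A drop is counted when the previous reading was within tolerance of the
    expected boost clock, the current one is not, and the GPU is still busy.
    """
    threshold = boost_mhz - tolerance_mhz
    drops = []
    for prev, cur in zip(readings, readings[1:]):
        was_boosting = prev.clock_mhz >= threshold
        now_below = cur.clock_mhz < threshold
        still_busy = cur.load_pct >= busy_pct
        if was_boosting and now_below and still_busy:
            drops.append((cur.seconds, prev.clock_mhz, cur.clock_mhz))
    return drops


# Invented readings for illustration only.
log = [
    Reading(0, 1320, 99),
    Reading(60, 1320, 98),
    Reading(120, 1150, 99),  # drop while still crunching
    Reading(180, 1150, 99),
    Reading(240, 1320, 99),  # e.g. while the GPU-Z render test is open
    Reading(300, 1150, 97),  # drops again after it is closed
]
for when, before, after in find_clock_drops(log, boost_mhz=1320):
    print(f"t={when}s: {before} MHz -> {after} MHz under load")
```

Matching the reported drop times against what else happened on the machine at that moment (a browser starting, a task switching, a window closing) is the kind of detail a driver bug report would need.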
|
Joined: 28 Jul 12 Posts: 819 Credit: 1,591,285,971 RAC: 0
|
"When they are in this 'broken mode', while I'm crunching, if I run GPU-Z, then click the question mark '?' to do the 'PCI Express Render Test', it shows a graphic, and the GPU clock ramps back up to boost 3d clocks. Then when I close the GPU-Z windows, it goes back down to non-boost 3d clocks, even though I'm still crunching GPUGrid tasks." I seem to recall a similar case, though I don't remember the card. The fix for me was simply to run Nvidia Inspector (which is based on GPU-Z) and just allow the clocks to remain at the default speed. It prevented the downclocking. |
|
Joined: 1 Jan 15 Posts: 1166 Credit: 12,260,898,501 RAC: 1
|
"I've seen my GTX 980 Ti GPUs sometimes crunch at non-boost 3d clocks (like 1150), when they normally crunch at boost 3d clocks (like 1320)." A few minutes ago, the GPU clock again dropped to the default value, so I tried this thing with the "?" in GPU-Z. Same behaviour as described by Jacob. What comes to mind is that this problem only occurs with the GPUs in Windows 10 systems, never so far in XP. So either this is indeed a bug in the newer drivers for Windows 10, or it has to do with WDDM. |
|
Joined: 27 Aug 16 Posts: 16 Credit: 43,745,875 RAC: 0
|
For what it's worth, I am experiencing exactly the same issue with my 970 on win 10. Sometimes the clock comes down because I start Chrome (which I can understand) but sometimes it just drops the clock for no reason. GPU usage is still pegged at 100% however. I really have no clue. |
©2025 Universitat Pompeu Fabra