Large scale experiment: MDAD

davidBAM

Message 53580 - Posted: 31 Jan 2020, 19:07:56 UTC - in response to Message 53579.  

Thank you for the explanation. $9 is indeed a paltry amount.

Miklos M.

Message 53584 - Posted: 1 Feb 2020, 22:57:50 UTC

Got 3 today for 4 computers; I could use many more. It started great a few days back, but now I'm getting too few.

Miklos M.

Message 53585 - Posted: 1 Feb 2020, 23:17:43 UTC

Thank you Toni, just got one more.

robertmiles

Message 53586 - Posted: 2 Feb 2020, 1:25:18 UTC

The current situation at GPUGRID is definitely better than the situation at the Predictor@Home project for several months before it shut down. Their development team had split up. One part kept the server, the right to use the Predictor@Home name, and so on. The part that left took away the knowledge of how to create useful new workunits. The remainder of the team could only increase the number of failures each workunit was allowed every time a previous task for that workunit failed. For several months, this meant that very few tasks were available, and all of them failed.

Which of you, if any, would prefer that situation?

Jacosito

Message 53587 - Posted: 2 Feb 2020, 11:30:13 UTC

The message is:
02/02/2020 7:38:19 | GPUGRID | Message from server: New version of ACEMD needs 953.20MB more disk space. You currently have 2861.49 MB available and it needs 3814.70 MB.

The free space available to BOINC is 963.65 GB.

You currently have 2861.49 MB = 2.8 GB
available and it needs 3814.70 MB = 3.8 GB

So 3.8 GB > 963 GB???

That makes no sense to me.

Chris

Message 53588 - Posted: 2 Feb 2020, 13:31:28 UTC

Anyone having issues with the GPU work units crashing their Geforce RTX 2080 Ti's?

Before this series came out this month, my system was working like a champ.

Suddenly this month it seems like something is making my system overheat if I enable the GPU tasks.

Have a Ryzen 9 3900X that I've been running full tilt for like 6 months now, no problems.

Then suddenly system hangs, with all the fans (CPU, GPU, Chassis, etc) all off and the motherboard unresponsive to the reset buttons and the power button. The LED's on the Chipset and Motherboard remain lit.

To resolve I have to turn power off at the PSU, and then boot.

The only thing that comes up in the system error logs is when I turn the PSU power off, that there is an unexpected Kernel power failure at that time (in the Windows error logs).

Almost like it is in a sleep/suspend mode, but all that is off.

Last night I left the GPU disabled and the CPU only tasks worked fine.

Retvari Zoltan

Message 53589 - Posted: 2 Feb 2020, 14:07:52 UTC - in response to Message 53588.  

Anyone having issues with the GPU work units crashing their Geforce RTX 2080 Ti's?
They are working fine on my hosts.
Perhaps your RTX 2080Ti is overclocked (too much).
What PSU do you use?
Does it have two independent 8-pin PCI-E power connectors?
Are those connected to your RTX 2080Ti?

Aurum

Message 53590 - Posted: 2 Feb 2020, 14:20:31 UTC - in response to Message 53587.  

Jacosito, in the BOINC Manager look at Options/Computing Preferences/Disk & Memory tab. There are 3 check boxes. I uncheck the first two and only check the third. Mine says "Use no more than 80% of total." Make sure you give BOINC permission to use enough storage.
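
To illustrate how a preference cap could produce a small "available" figure even with hundreds of GB free on the disk, here is a minimal sketch. It assumes the client uses the tightest of the three Disk & Memory limits as its allowance. The function names, parameter names, and the example host values are hypothetical; they are not taken from BOINC's source code or from Jacosito's actual settings. Only the two MB figures come from the server message.

```python
def allowed_disk_mb(total_mb, free_mb, boinc_used_mb,
                    max_used_mb=None, min_free_mb=None, max_used_pct=None):
    """Return the disk allowance in MB as the tightest of the enabled limits.

    Assumption: an unchecked preference (None) is ignored and the smallest
    remaining limit wins. This mirrors the three check boxes on the
    Disk & Memory tab, not BOINC's actual implementation.
    """
    limits = []
    if max_used_mb is not None:      # "Use no more than X GB"
        limits.append(max_used_mb)
    if min_free_mb is not None:      # "Leave at least X GB free"
        limits.append(free_mb + boinc_used_mb - min_free_mb)
    if max_used_pct is not None:     # "Use no more than X% of total"
        limits.append(total_mb * max_used_pct / 100.0)
    if not limits:
        # No limit enabled: all free space plus what BOINC already holds.
        return free_mb + boinc_used_mb
    return max(0.0, min(limits))


def shortfall_mb(needed_mb, available_mb):
    """Extra space required, in MB (0 if there is already enough)."""
    return max(0.0, needed_mb - available_mb)


# Figures quoted in the server message.
print(round(shortfall_mb(3814.70, 2861.49), 2))

# Hypothetical host: 1 TB disk, about 963 GB free, but a small absolute cap.
cap = allowed_disk_mb(total_mb=1_000_000, free_mb=963_650, boinc_used_mb=500,
                      max_used_mb=3000, max_used_pct=80)
print(cap)
```

Under these assumptions the absolute cap, not the 80% limit or the free space, decides the allowance, which is why unchecking the first two boxes can clear the error.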

Aurum

Message 53591 - Posted: 2 Feb 2020, 14:36:33 UTC - in response to Message 53588.  

Then suddenly system hangs, with all the fans (CPU, GPU, Chassis, etc) all off and the motherboard unresponsive to the reset buttons and the power button. The LED's on the Chipset and Motherboard remain lit.

To resolve I have to turn power off at the PSU, and then boot.
This describes behavior I see occasionally with my 1080 Ti's but I don't recall it happening on my 2080 Ti's. I don't know why it happens; I just reboot and it goes away. I never overclock, and it's not specific to GPUGRID.

csbyseti

Message 53592 - Posted: 2 Feb 2020, 18:15:51 UTC - in response to Message 53588.  

Anyone having issues with the GPU work units crashing their Geforce RTX 2080 Ti's?

Before this series came out this month, my system was working like a champ.

Suddenly this month it seems like something is making my system overheat if I enable the GPU tasks.

Have a Ryzen 9 3900X that I've been running full tilt for like 6 months now, no problems.

Then suddenly system hangs, with all the fans (CPU, GPU, Chassis, etc) all off and the motherboard unresponsive to the reset buttons and the power button. The LED's on the Chipset and Motherboard remain lit.

To resolve I have to turn power off at the PSU, and then boot.

The only thing that comes up in the system error logs is when I turn the PSU power off, that there is an unexpected Kernel power failure at that time (in the Windows error logs).

Almost like it is in a sleep/suspend mode, but all that is off.

Last night I left the GPU disabled and the CPU only tasks worked fine.


If you have to switch off the power supply on the AC side, the power supply has shut itself down because of overcurrent or unstable DC voltage. Switching it off resets the 'electronic' fuse.

There can be different reasons: overcurrent in the power supply itself, overcurrent detected by the mainboard, or unstable AC input voltage.

Perhaps the RTX 2080 Ti has power load peaks. The magazine c't has measured peaks of 380 W for an RTX 2080 without overclocking, depending on the card model.
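
A rough way to sanity-check this is a peak power budget. The sketch below is only an illustration: the 380 W GPU peak is the figure quoted in this post, while the CPU draw, the rest-of-system draw, the PSU ratings, and the 80% headroom rule are assumptions, not measurements or specifications. The function names are made up for this example.

```python
def peak_load_w(components):
    """Sum the worst-case draw (in watts) of all components."""
    return sum(components.values())


def has_headroom(psu_rating_w, components, max_fraction=0.8):
    """True if the peak load stays within max_fraction of the PSU rating.

    The 0.8 fraction is a rule-of-thumb assumption, not a specification.
    """
    return peak_load_w(components) <= psu_rating_w * max_fraction


# Only the GPU peak comes from the post; the other values are placeholders.
system = {
    "gpu_peak": 380,           # peak figure quoted in this post
    "cpu": 150,                # assumed CPU draw under full load
    "board_drives_fans": 75,   # assumed rest-of-system draw
}

print(peak_load_w(system))         # 605
print(has_headroom(750, system))   # False: 605 W exceeds 80% of 750 W
print(has_headroom(1000, system))  # True
```

With these placeholder numbers a 750 W unit would be over the assumed headroom during GPU peaks, which is the kind of situation where overcurrent protection can trip.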

Nick Name

Message 53593 - Posted: 2 Feb 2020, 21:16:22 UTC - in response to Message 53588.  

...Then suddenly system hangs, with all the fans (CPU, GPU, Chassis, etc) all off and the motherboard unresponsive to the reset buttons and the power button. The LED's on the Chipset and Motherboard remain lit...

I had a similar problem last year. I started seeing invalid work across multiple projects, gradually increasing for a while until one day almost everything was failing. I found the power cables to the GPU had some burnt pins. Replacing them fixed it for a while, then I started having problems exactly like you describe. This time I found burnt pins in the PSU. I replaced the PSU and eventually had to RMA the GPU; I think the PSU problems broke something. Fortunately it was repaired under warranty and works great now.

If your PSU power cables and connections to the GPU are ok then I would suspect and test for a failing GPU. Trying another PSU is also a good idea if you have the option.

This assumes you haven't done anything to change the GPU behavior, like overclocking it or installing new monitoring software. I once had major problems with a certain manufacturer's GPU utility; now I stick to Afterburner or Nvidia Inspector. If you've changed something like this, revert it.
Team USA forum | Team USA page
Join us and #crunchforcures. We are now also folding: join team ID 236370!

Chris

Message 53614 - Posted: 5 Feb 2020, 1:17:07 UTC - in response to Message 53589.  

Yes, two 8-pin supplies go to my RTX 2080 Ti.

The power supply is a Corsair CX750M.

I am probably overtaxing it.

Nothing is overclocked beyond the factory OC (if any).

Thanks.

It was odd; before, it ran with everything fully loaded all night long with no problem.

Chris

Message 53615 - Posted: 5 Feb 2020, 1:18:29 UTC - in response to Message 53590.  

Thanks.

Yes, I tried that; it still did it.

Right now I am disabling GPU work units.

I am thinking my power supply is struggling, as others suggested.

Shayol Ghul

Message 53630 - Posted: 9 Feb 2020, 12:52:03 UTC

At least you're receiving work units. For the last two weeks I have not received any work units. All equipment is running well. Work units average five hours. Please send some work.

robertmiles

Message 53632 - Posted: 9 Feb 2020, 20:31:29 UTC - in response to Message 53630.  

At least you're receiving work units. For the last two weeks I have not received any work units. All equipment is running well. Work units average five hours. Please send some work.

Update your graphics drivers. You are using versions known to have problems with some OpenCL work.

Richard Haselgrove

Message 53633 - Posted: 9 Feb 2020, 21:34:54 UTC - in response to Message 53632.  

... some OpenCL work.

GPUGrid writes its apps in CUDA.

[VENETO] boboviz

Message 53635 - Posted: 9 Feb 2020, 22:34:15 UTC - in response to Message 53575.  

Supporting both is more for political-correctness reasons rather than need.

You're right. There is no work for CUDA, let alone for OpenCL.

klepel

Message 53641 - Posted: 11 Feb 2020, 21:03:08 UTC - in response to Message 53614.  

The powersupply is a Corsair CX750M.

I have the same power supply, a Corsair CX750M, and I have the same problem you describe:
Suddenly this month it seems like something is making my system overheat if I enable the GPU tasks.

Have a Ryzen 9 3900X that I've been running full tilt for like 6 months now, no problems.

Then suddenly system hangs, with all the fans (CPU, GPU, Chassis, etc) all off and the motherboard unresponsive to the reset buttons and the power button. The LED's on the Chipset and Motherboard remain lit.

To resolve I have to turn power off at the PSU, and then boot.

My system is an AMD 1700X with a GTX 1070. From the beginning I tried to resolve the problem by lowering the CPU clocks. What seems to help is lowering the GPU frequency by 120 MHz and increasing the fan speed to 97% on this particular GPU. But the computer still freezes frequently.
Lately I have been wondering if it might be the PSU as well, since a few years ago I solved a "bluescreen" problem on another computer with a certified, higher-wattage PSU.
So it seems to me that this might be a bad PSU design for 24/7 crunching.

Jacosito

Message 53642 - Posted: 12 Feb 2020, 2:47:39 UTC - in response to Message 53590.  
Last modified: 12 Feb 2020, 2:48:05 UTC

Same result: still no WUs.

Jacosito

Message 53643 - Posted: 12 Feb 2020, 2:49:50 UTC - in response to Message 53588.  

Can you send me your app_config.xml?

My GPU and CPU are both liquid-cooled.

Cheers

©2025 Universitat Pompeu Fabra