All acemd3 apps updated (210)

Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52852 - Posted: 16 Oct 2019, 10:47:28 UTC
Last modified: 16 Oct 2019, 10:58:14 UTC

Currently there should be no major *known* bugs. We should cover Win64 and Linux, with reasonably recent cards.

Unfortunately, an internal cleanup in the filenames will make *existing* WUs fail. Sorry about that. Will send new test WUs soon.

By the way, the scheduler for this app will base its decision simply on the CUDA version supported by your driver, rather than other heuristics.
Keith Myers

Joined: 13 Dec 17
Posts: 1387
Credit: 8,176,692,190
RAC: 6,609,403
Message 52853 - Posted: 16 Oct 2019, 14:41:05 UTC

Yes, I have had several task failures today, having never had any before. Validated three test tasks with the new 2.10 app and one normal task.
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52854 - Posted: 16 Oct 2019, 14:49:15 UTC - in response to Message 52853.  
Last modified: 16 Oct 2019, 14:50:12 UTC

The DHFR210 set was botched because old versions were still lurking around. I have now deprecated all the old apps. The 210a set was created after this, so it should be OK.
ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester

Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Message 52857 - Posted: 16 Oct 2019, 21:26:05 UTC - in response to Message 52852.  

Toni wrote: "CUDA version supported by your driver, rather than other heuristics"

Which version is recommended as the minimum now?

MrS
Scanning for our furry friends since Jan 2002
rod4x4

Joined: 4 Aug 14
Posts: 266
Credit: 2,219,935,054
RAC: 0
Message 52860 - Posted: 17 Oct 2019, 0:06:41 UTC - in response to Message 52857.  
Last modified: 17 Oct 2019, 0:07:53 UTC

Looking at the supported applications page - http://www.gpugrid.net/apps.php

Supported "New version of ACEMD" applications:
Linux - GPU/driver capable of CUDA80 or better
Windows - GPU/driver capable of CUDA92 or better
rod4x4

Joined: 4 Aug 14
Posts: 266
Credit: 2,219,935,054
RAC: 0
Message 52861 - Posted: 17 Oct 2019, 2:17:38 UTC - in response to Message 52857.  
Last modified: 17 Oct 2019, 2:18:08 UTC

ExtraTerrestrial Apes wrote: "Which version is recommended as the minimum now?"


As per Nvidia deployment documentation (previously posted by Keith Myers): https://docs.nvidia.com/deploy/cuda-compatibility/index.html

CUDA80 Minimum Driver r367.48 or higher
CUDA92 Minimum Driver r396.26 or higher
CUDA100 Minimum Driver r410.48 or higher
CUDA101 Minimum Driver r418.39 or higher
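
To make the table above easier to check against an installed driver, here is a minimal Python sketch. It only encodes the four minimums listed in this post; the function names (parse_driver, supported_cuda) are made up for illustration and are not part of BOINC or the GPUGRID scheduler. The figures are assumed to be the Linux minimums from the linked NVIDIA page, and Windows minimums may differ slightly. As noted elsewhere in the thread, which work units you actually receive can also depend on the card generation.

```python
from typing import List, Tuple

# Minimum driver release per CUDA label, copied from the table in this post.
MIN_DRIVER_FOR_CUDA = {
    "CUDA80": (367, 48),
    "CUDA92": (396, 26),
    "CUDA100": (410, 48),
    "CUDA101": (418, 39),
}


def parse_driver(version: str) -> Tuple[int, ...]:
    """Turn a driver string such as 'r418.39' or '430.26' into a comparable tuple."""
    return tuple(int(part) for part in version.lstrip("r").split("."))


def supported_cuda(driver: str) -> List[str]:
    """Return the CUDA labels whose minimum driver is met, in table order."""
    installed = parse_driver(driver)
    return [
        label
        for label, minimum in MIN_DRIVER_FOR_CUDA.items()
        if installed >= minimum
    ]


if __name__ == "__main__":
    # Driver 430.26 is the version mentioned later in this thread.
    print(supported_cuda("430.26"))
```

Under these assumptions, a driver at exactly r396.26 would satisfy the CUDA80 and CUDA92 rows but not CUDA100 or CUDA101.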
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52863 - Posted: 17 Oct 2019, 7:06:29 UTC - in response to Message 52861.  

More failures because the old app (206) is still being sent out despite being deprecated.
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52864 - Posted: 17 Oct 2019, 9:47:14 UTC - in response to Message 52861.  

rod4x4 wrote, in reply to "Which version is recommended as the minimum now?":

CUDA80 Minimum Driver r367.48 or higher
CUDA92 Minimum Driver r396.26 or higher
CUDA100 Minimum Driver r410.48 or higher
CUDA101 Minimum Driver r418.39 or higher


Exactly. Updated drivers are necessary for RTX users. They should go for r418.39 or higher.
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52866 - Posted: 17 Oct 2019, 12:26:51 UTC - in response to Message 52864.  

Seems to be working. I added a FAQ item. Old WUs may still fail.
Jim1348

Joined: 28 Jul 12
Posts: 819
Credit: 1,591,285,971
RAC: 0
Message 52867 - Posted: 17 Oct 2019, 12:49:36 UTC

It also depends on what generation of card you have. My GTX 980 with the 430.26 drivers running under Ubuntu 18.04 is getting the CUDA 100 work units. It ran for a total of 43 minutes on an a81-TONI_TESTTESTLONG210 task. But it uses a whole CPU core, so I just reserve one for it.

I will await the real work units before doing more.
Aurum

Joined: 12 Jul 17
Posts: 401
Credit: 17,242,399,587
RAC: 16,556,744
Message 52869 - Posted: 18 Oct 2019, 15:15:53 UTC

Now we need 10,000 WUs loaded.
klepel

Joined: 23 Dec 09
Posts: 189
Credit: 4,774,131,008
RAC: 1,062,741
Message 52870 - Posted: 18 Oct 2019, 15:41:48 UTC - in response to Message 52869.  

Aurum wrote: "Now we need 10,000 WUs loaded."

+1
STARBASEn

Joined: 17 Feb 09
Posts: 91
Credit: 1,603,303,394
RAC: 0
Message 52871 - Posted: 18 Oct 2019, 20:09:41 UTC

Over the past several days I have received 4 ACEMD 210 WUs on two Linux machines with 3 GTX 1060s, and all validated fine (3 x 7,500 and 1 x 75,000 points). The Linux machines are ready for production WUs anytime.
Jacob Klein

Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Message 52877 - Posted: 20 Oct 2019, 11:46:02 UTC
Last modified: 20 Oct 2019, 12:30:32 UTC

Could you please answer these 2 questions...
... and consider including their answers in the FAQ thread here:
http://www.gpugrid.net/forum_thread.php?id=5002

2 questions:

1) Is the new app capable of resuming on a different GPU? I ask because my 2 main crunching PCs are listed below, each with 3 different GPU types. You said "no major known bugs", but I thought I saw an earlier report saying that resuming on a different GPU type wasn't working yet.

2) Will this app work for my GTX 660 Ti that is in the same PC as newer generation GPUs? I believe mixing Maxwell and Pascal was a problem previously, which is why I'm asking about your new app.

Depending on your answers, both of these would be major problems for me, as I'm trying to keep these PCs stable while also working long-running RNA World tasks.

Please let me know,
Thanks,
Jacob Klein

PC 1: RTX 2080, GTX 980 Ti, GTX 980
PC 2: GTX 1050 Ti, GTX 970, GTX 660 Ti
Keith Myers

Joined: 13 Dec 17
Posts: 1387
Credit: 8,176,692,190
RAC: 6,609,403
Message 52879 - Posted: 20 Oct 2019, 22:33:51 UTC - in response to Message 52877.  

As far as I know, you still can't resume a task on a different card type. I worked around it by changing my "switch among apps" preference from the default 60 minutes to 360 minutes, so a task starts and finishes on the same card. I haven't seen any task need that long to finish yet, so this will probably be adequate until the app is promoted to Main and we start getting Long tasks again with the new apps.

If you have the same card type in a multiple card host, there is no issue starting on one card and finishing on another.
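
For anyone who prefers to set the switch interval locally rather than through the website preferences, the sketch below builds the text of a BOINC global_prefs_override.xml file in Python. It assumes the "switch among apps" setting corresponds to the cpu_scheduling_period_minutes preference tag; the helper name build_prefs_override is made up for illustration. Treat it as a starting point and check it against your own client's documentation before using it.

```python
import xml.etree.ElementTree as ET


def build_prefs_override(switch_minutes: int) -> str:
    """Return XML text for a global_prefs_override.xml that sets the switch interval.

    Assumption: cpu_scheduling_period_minutes is the tag behind the
    "switch among apps every N minutes" preference.
    """
    root = ET.Element("global_preferences")
    period = ET.SubElement(root, "cpu_scheduling_period_minutes")
    period.text = str(switch_minutes)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # 360 minutes is the value suggested in the post above.
    print(build_prefs_override(360))
```

The output is a single line of XML. It would normally be saved as global_prefs_override.xml in the BOINC data directory and picked up after the client re-reads its local preferences.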
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52880 - Posted: 21 Oct 2019, 10:13:24 UTC - in response to Message 52877.  
Last modified: 21 Oct 2019, 10:14:37 UTC

Sadly, one still can't restart on a different card type.

Regarding the mixing of cards: I don't see why it shouldn't work, but a real world test will confirm.



Jacob Klein

Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Message 52881 - Posted: 21 Oct 2019, 11:39:21 UTC

Are there any plans to allow the app to resume on a different GPU?
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52882 - Posted: 21 Oct 2019, 11:56:49 UTC - in response to Message 52881.  
Last modified: 21 Oct 2019, 11:57:08 UTC

We looked into it, but do not know if and when there will be progress on that front. For the time being, I've amended the FAQ with a pointer on GPU exclusion.
Jacob Klein

Joined: 11 Oct 08
Posts: 1127
Credit: 1,901,927,545
RAC: 0
Message 52883 - Posted: 21 Oct 2019, 12:08:55 UTC
Last modified: 21 Oct 2019, 12:13:32 UTC

Thank you. That is suitable, and I plan on implementing that approach shortly for one of my systems that gets suspended/resumed a lot. Did you know I'm responsible for exclude_gpu existing? ;)
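
For readers who have not used exclude_gpu before, here is a rough Python sketch that generates a cc_config.xml fragment excluding one GPU from this project. The device number (2), the app name ("acemd3"), and the helper name build_cc_config are assumptions chosen for illustration; check the device numbering in your own BOINC event log and the FAQ entry before applying anything like this.

```python
import xml.etree.ElementTree as ET
from typing import Optional


def build_cc_config(project_url: str, device_num: int, app: Optional[str] = None) -> str:
    """Return cc_config.xml text with a single exclude_gpu entry.

    device_num and app are placeholders; pick the values for your own host.
    """
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    exclude = ET.SubElement(options, "exclude_gpu")
    ET.SubElement(exclude, "url").text = project_url
    ET.SubElement(exclude, "device_num").text = str(device_num)
    ET.SubElement(exclude, "type").text = "NVIDIA"
    if app is not None:
        # Restrict the exclusion to one application instead of the whole project.
        ET.SubElement(exclude, "app").text = app
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical example: keep device 2 away from the acemd3 app.
    print(build_cc_config("http://www.gpugrid.net/", 2, app="acemd3"))
```

On a mixed-card host, one exclude_gpu entry per unwanted device would leave only matching cards eligible, so a suspended task should resume on the same card type. Merge the entry into any existing cc_config.xml rather than overwriting other options.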
Toni
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Joined: 9 Dec 08
Posts: 1006
Credit: 5,068,599
RAC: 0
Message 52884 - Posted: 21 Oct 2019, 12:11:19 UTC - in response to Message 52883.  

Jacob Klein wrote: "Thank you. That is suitable, and I plan on implementing that approach shortly for one of my systems that gets suspended/resumed a lot. Did you know I'm responsible for exclude_gpu existing? ;)"


No, I didn't, and let me add: well done :)

©2025 Universitat Pompeu Fabra