Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 109
Posts: 109   Pages: 11   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3330030 times and has 108 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

P.S. I've read elsewhere in past [before 7.0.40] that at times no horse could move a "waiting to run" task back to running state, even when there were idle resources... some scheduling bug persisting, or in the case of HCC1, the client having trouble to distinct GPU from CPU. Did you also suspend the GPU tasks?

BTW knreed is working on fixing the feeder... some clogging due to version number differences [for same science] http://www.worldcommunitygrid.org/forums/wcg/viewpostinthread?post=404338
----------------------------------------
[Edit 1 times, last edit by Former Member at Dec 10, 2012 5:44:42 PM]
[Dec 10, 2012 5:43:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1294
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

An HCC CPU task switching to "waiting to run" when a first HCC-GPU was received (if I understand correctly, you had none before left), would be the proper and expected. The HCC-GPU task will grab a CPU core. A HCC-CPU task going to WtR, when a new HCC-CPU was received... not right, lest it's a priority repair job and your queue is stretched.

Clarifying:
I never run HCC CPU, so I had no one running.
I had running 2 SIMAP's and 2 HCC GPU's.
Then new WCG HCC GPU tasks arrived and immediately 1 of the running HCC GPU-tasks changed to 'WtR' and a 3rd SIMAP started.
No reason to prioritize SIMAP, because it has the highest <rec> and no deadline panic.

No horse needed. Only a simple BOINC restart and again 2 HCC GPU-tasks were running concurrently.
----------------------------------------

[Dec 10, 2012 6:20:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

Has anyone who jumped on 7.0.40 actually tested app_config to see if it functions to control how many GPU tasks run concurrent on a task card, AND important too, how it behaves with multiple cards of different capability (without app_info to prevent conflict)?

Don't have multiple GPU's, but tested & reported bug, haven't heard about any fix yet...


As for the "Doesn't support anonymous platform", another code-change was for not sending anonymous work to homogeneous app-versions, and since WCG is probably the only project using this functionality it's the only project being denied work...
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Dec 10, 2012 8:25:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

I tried updating my BOINC client to version 7.0.40 and completed tasks would not upload. Message tab said anonymous platform not supported. Different message that you get now that is ignored so if you're using an app info file do not update. Switched back to 7.0.36 and all is working again.


This is because a bug was fixed in the BOINC client. World Community Grid has never supported the anonymous platform mechanism. However, because of two bugs, one in the client and one in the server, volunteers were able to fetch work using the anonymous platform at WCG.

The bug that is fixed in the BOINC client is that before 7.0.40, if you were using the app_info.xml file & your client supported an 'alt-platform', then the client would send 'anonymous' as the primary platform but also report an 'alt-platform' (which it should not have been sent).

The issue on the server side is as follows. When the server receives a request, it adds the primary platform and any alt-platforms that the project supports to an 'allowed platform' list. If the first platform on the allowed platform list is the anonymous platform, then the server code handles the request using the anonymous platform code path. If the list size is 0 then a message is reported back to the client that the platforms reported are not supported.

The server bug is that if the primary platform is the anonymous platform, then any alt-platforms should not be considered. Instead, the server code added the alt-platforms to the list anyway.

This is an issue for us because since we do not support the anonymous platform, that is discarded during processing and not added to the allowed platform list. However, the alt-platforms are added to the list. Since the list size is > 0, processing continues down the normal server side logic. It is this that caused the generation of the 'ignored app version' message. It also meant that if I user only specified hcc1 in their app_info.xml file, but they had opted into receiving work for more than hcc1, then they would be assigned work from those other projects and the client would immediately discard that work (and not report that fact to the server).

After discussing with BOINC, the decision was that volunteers should be able to have fine grain control over how many instances of a job are run on their GPU and that the anonymous platform mechanism is not the way that should be done. The decision was made to create the app_config.xml file so that volunteers could have control over this by making changes to the file which has been added in the 7.0.40 client. Volunteers at World Community Grid should be looking as this is stabilized as the way to run multiple copies of HCC1 GPU on your graphics card at one time.

Once the changes for the app_config.xml file are in a stable BOINC client release, we will disable the anonymous platform on the server side. We will provide warning in this forum before we do that and we will allow time for volunteers to upgrade to the stable client build before we disable the feature.
----------------------------------------
[Edit 1 times, last edit by knreed at Dec 11, 2012 4:19:37 PM]
[Dec 11, 2012 4:05:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

But after receiving new HCC tasks one of the running HCC tasks change the state to "Waiting to run".
Even suspending all CPU tasks couldn't force this 2nd task to resume again.
Before customizing the app_config I had the same experience, but wasn't sure, whether it was caused by a starting cep-task.
Now I knew it too was caused by receiving new GPU-tasks for WCG.


The download of new workunits for HCC1 GPU from the server causes the BOINC client to overwrite the modified settings read from the app_config.xml file. David is working on this fix and it should appear in the next version of the client.
[Dec 11, 2012 4:08:32 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

Saw a check-in note to reapply the config after each RPC. Fix could be in the just released test client 7.0.41.
David  10 Dec 2012
5235 - client: re-apply app configuration after each scheduler RPC;
5236 otherwise the changes get overwritten
5237 client/
5238 cs_scheduler.cpp

then a little later:

Rom 10 Dec 2012
5266 - Tag for 7.0.41 release, all platforms
5267 client_release_7.0.41
5268
5269 /
5270 configure.ac
5271 version.h

The official change note list will tell if it got included. Loaded the x86_64 version anyhow to see what else breaks/or does not. :D
[Dec 11, 2012 4:18:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
pirogue
Veteran Cruncher
USA
Joined: Dec 8, 2008
Post Count: 685
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning


Once the changes for the app_config.xml file are in a stable BOINC client release, we will disable the anonymous platform on the server side. We will provide warning in this forum before we do that and we will allow time for volunteers to upgrade to the stable client build before we disable the feature.

Will this change eliminate the need to have settings for each individual project, so that only HCC GPU needs to be in the file? It would be nice to only have to add "special" settings instead of figuring out how to have all projects listed.
----------------------------------------

[Dec 11, 2012 6:19:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

Exactly!

edit: With the added "major" gain that we can now control how many of a science run concurrent. I've been playing as have a few others, and now have 1 exclusive core that seamlessly processes CEP2 24/7 on an octo. There's a learning curve to go with it... searching the sweat-spot how big to set the buffer, how much to allow being buffered of CEP2. FTM, 4 "in progress" on a 1 day cache, with max_concurrent 1 in app_config. Works a treat... no more step 1 through 7 or whatever to make the slightest chance in mix selection. Client restart is though still required FAIK.
----------------------------------------
[Edit 1 times, last edit by Former Member at Dec 11, 2012 6:34:12 PM]
[Dec 11, 2012 6:23:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
pirogue
Veteran Cruncher
USA
Joined: Dec 8, 2008
Post Count: 685
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

Good. No more lost caches when trying to add a different project.
----------------------------------------

[Dec 11, 2012 7:20:27 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1294
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 7.0.40 warning

But after receiving new HCC tasks one of the running HCC tasks change the state to "Waiting to run".
Even suspending all CPU tasks couldn't force this 2nd task to resume again.
Before customizing the app_config I had the same experience, but wasn't sure, whether it was caused by a starting cep-task.
Now I knew it too was caused by receiving new GPU-tasks for WCG.


The download of new workunits for HCC1 GPU from the server causes the BOINC client to overwrite the modified settings read from the app_config.xml file. David is working on this fix and it should appear in the next version of the client.

Confirmed: Fixed in BOINC client version 7.0.41
----------------------------------------

[Dec 11, 2012 7:51:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 109   Pages: 11   [ Previous Page | 1 2 3 4 5 6 7 8 9 10 | Next Page ]
[ Jump to Last Post ]
Post new Thread