World Community Grid Forums
Category: Completed Research | Forum: Uncovering Genome Mysteries | Thread: Have you thought about running UGM1 as non-cpu intensive?
Thread Status: Active Total posts in this thread: 24
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7244 Status: Offline Project Badges: |
Quote: "I have given a URL to a picture of what I see in BoincTasks: http://screencast.com/t/iQSM5pC6mRsg As you can see, the wall-clock elapsed time is 1d,00:58:49, the CPU time used is 00:00:03 (3 secs), and the CPU percentage is .003. If the WUs are going to continue this way, then non-CPU-intensive should be used, as this will allow more WUs to be run because they don't have to compete with CPU-intensive projects for downloading and initiation. When one WU completes, another one is immediately downloaded and initiated, bypassing all competition."

OK, I see what you mean. I am guessing you are running some flavor of Linux. I have occasionally seen this happen where a WU will get "stuck." Merely suspending and resuming the WU will get it going again when the next available slot is open. I am not sure why it happens (it may have something to do with the wireless going to sleep).
Cheers
Sgt. Joe
*Minnesota Crunchers* |
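The figures quoted in the post above can be checked with a little arithmetic. A minimal Python sketch, assuming the time strings have the form shown in the post (`1d,00:58:49` and `00:00:03`); the helper names `parse_elapsed` and `cpu_percent` are made up for illustration and are not part of BoincTasks:

```python
import re

def parse_elapsed(text):
    """Convert a string like '1d,00:58:49' or '00:00:03' to whole seconds."""
    match = re.fullmatch(r"(?:(\d+)d,)?(\d+):(\d{2}):(\d{2})", text.strip())
    if match is None:
        raise ValueError(f"unrecognised time format: {text!r}")
    # The day group is optional, so treat a missing group as zero.
    days, hours, minutes, seconds = (int(g) if g else 0 for g in match.groups())
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds

def cpu_percent(cpu_time, wall_time):
    """CPU time as a percentage of wall-clock time."""
    wall = parse_elapsed(wall_time)
    if wall == 0:
        raise ValueError("wall-clock time must be non-zero")
    return 100.0 * parse_elapsed(cpu_time) / wall

# Values taken from the post: 3 seconds of CPU in just over a day of wall time.
print(f"{cpu_percent('00:00:03', '1d,00:58:49'):.4f} %")
```

The result is roughly 0.003 percent, which agrees with the ".003" reported in the post, so the displayed percentage is at least consistent with the displayed CPU time.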
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Non-CPU-intensive (NCI) applications are a special case for the CPU, as only one at a time is allowed on an installation. WUProp would be an example.
The screenshot is missing the checkpoint column. How often did the task checkpoint in the one hour? Also, the CPU indicator at the top right shows 100 percent. Given that progress is at 95.7 percent, this suggests that BoincTasks or the monitored agent is not recording the CPU time properly. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7244 Status: Offline Project Badges: |
Quote: "Given there's 95.7 percent progress there's suggestion that boinctasks or the monitored agent is not properly recording the cpu time."

Good point. I had not thought of that possibility.
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
idahofisherman
Cruncher United States Joined: Dec 23, 2004 Post Count: 20 Status: Offline Project Badges: |
I am running the Windows XP SP3 operating system. No checkpoints have been done.
----------------------------------------
Let's get off the couch, Potato Heads, and join the Idaho team.
----------------------------------------[Edit 1 times, last edit by idahofisherman at Nov 14, 2014 10:15:52 PM] |
||
|
idahofisherman
Cruncher United States Joined: Dec 23, 2004 Post Count: 20 Status: Offline Project Badges: |
The 100% you see in the upper right corner is for the computer that is running BoincTasks, which is called OFFICE. The UGM1 process is on another computer, called Boinc3.
|
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7244 Status: Offline Project Badges: |
Quote: "I am running XP3 operating system. There have been no checkpoints done."

I have never had a unit get "stuck" on any of the Windows flavors I have used over the years. That being said, I still think a suspend and resume is worth a try.
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
This one differs from the occasional 'never checkpointing' case we saw on UGM, which did still clock CPU time. I concur that it could be a stuck unit needing a suspend and resume, with 'leave application in memory when suspended' switched off for the operation.
As for XP SP3, it is no longer supported by BOINC. As time goes on, legacy pieces are being taken out, which leaves the BOINC wrapper/API used for the newer science apps without proven, stable compatibility. That leaves the choice of keeping an older agent version running, if it will run. |
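The suspend-and-resume suggestion can also be scripted with `boinccmd`, the command-line tool that ships with the BOINC client; its `--task` option takes a project URL, a task name, and an operation such as `suspend` or `resume`. A rough Python sketch, assuming `boinccmd` is on the PATH and can reach the client. The project URL and task name below are placeholders, and the helper names `task_command` and `suspend_and_resume` are hypothetical:

```python
import subprocess
import time

def task_command(project_url, task_name, operation):
    """Build the boinccmd argument list for one task operation."""
    if operation not in ("suspend", "resume"):
        raise ValueError(f"unsupported operation: {operation!r}")
    return ["boinccmd", "--task", project_url, task_name, operation]

def suspend_and_resume(project_url, task_name, pause_seconds=10, run=subprocess.run):
    """Suspend a task, wait briefly, then resume it.

    `run` is injectable so the sequence can be exercised without a BOINC client.
    """
    run(task_command(project_url, task_name, "suspend"), check=True)
    time.sleep(pause_seconds)
    run(task_command(project_url, task_name, "resume"), check=True)

# Placeholder values (assumptions): take the real ones from `boinccmd --get_tasks`.
PROJECT_URL = "http://example.invalid/project/"
TASK_NAME = "example_task_name_0"

# Only print the commands here; call suspend_and_resume() to actually run them.
for op in ("suspend", "resume"):
    print(" ".join(task_command(PROJECT_URL, TASK_NAME, op)))
```

This does not touch the 'leave application in memory when suspended' preference mentioned above; that setting would still need to be switched off separately for the task to be unloaded and restarted.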
||
|
idahofisherman
Cruncher United States Joined: Dec 23, 2004 Post Count: 20 Status: Offline Project Badges: |
I suspended and resumed the UGM1 WU, and it then ended with error code 1282.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Maybe you can post a copy of the result log, as the only 1282 I found is GPU related: http://stackoverflow.com/questions/15722803/opengl-shader-error-1282
With regard to XP SP3, it seems the platform is still supported from the BOINC development point of view, per the software download page: http://boinc.berkeley.edu/download_all.php |
||
|
idahofisherman
Cruncher United States Joined: Dec 23, 2004 Post Count: 20 Status: Offline Project Badges: |
Here is the result log:
----------------------------------------
Result Log
Result Name: ugm1_ ugm1_ 02482_ 0365_ 1--
<core_client_version>7.4.27</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 1282 (0x502)
</message>
<stderr_txt>
Unable to open checkpoint file starting from 0
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Illegal Instruction (0xc000001d) at address 0x00429718
Engaging BOINC Windows Runtime Debugger...
********************
BOINC Windows Runtime Debugger Version 7.2.4
Dump Timestamp : 11/15/14 14:36:19
Install Directory : C:\Program Files\BOINC\
Data Directory : C:\Documents and Settings\All Users\Application Data\BOINC
Project Symstore :
LoadLibraryA( C:\Program Files\BOINC\\dbghelp.dll ): GetLastError = 126
Loaded Library : dbghelp.dll
LoadLibraryA( C:\Program Files\BOINC\\symsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:\Program Files\BOINC\\srcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:\Program Files\BOINC\\version.dll ): GetLastError = 126
Loaded Library : version.dll
</stderr_txt>
]]>
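The codes in the result log can be pulled out and cross-checked mechanically. A small Python sketch, assuming the log text looks like the excerpt above; the function name `summarise` and the lookup table are illustrative only, and the table covers just the two standard Windows codes that appear in this log:

```python
import re

# Standard Windows codes seen in the log; anything else is reported as unknown.
KNOWN_CODES = {
    0xC000001D: "STATUS_ILLEGAL_INSTRUCTION",
    126: "ERROR_MOD_NOT_FOUND (the specified module could not be found)",
}

def summarise(log_text):
    """Extract the exit code, exception code, and loader errors from a result log."""
    summary = {}
    exit_match = re.search(r"exit code (\d+) \((0x[0-9a-fA-F]+)\)", log_text)
    if exit_match:
        summary["exit_code"] = int(exit_match.group(1))
        # The decimal and hexadecimal forms should describe the same number.
        summary["hex_matches"] = int(exit_match.group(2), 16) == summary["exit_code"]
    reason = re.search(r"Reason: (.+?) \((0x[0-9a-fA-F]+)\)", log_text)
    if reason:
        code = int(reason.group(2), 16)
        summary["exception"] = reason.group(1)
        summary["exception_code"] = code
        summary["exception_meaning"] = KNOWN_CODES.get(code, "unknown")
    # Distinct GetLastError values reported while loading the debugger libraries.
    errors = {int(n) for n in re.findall(r"GetLastError = (\d+)", log_text)}
    summary["load_errors"] = {n: KNOWN_CODES.get(n, "unknown") for n in sorted(errors)}
    return summary

# Excerpt copied from the result log in this post.
SAMPLE = """(unknown error) - exit code 1282 (0x502)
Reason: Illegal Instruction (0xc000001d) at address 0x00429718
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126"""

for key, value in summarise(SAMPLE).items():
    print(f"{key}: {value}")
```

Read this way, the log reports an illegal-instruction exception in the science application on the host, with the debugger additionally failing to find some of its helper libraries. The log by itself does not say which instruction was rejected or why.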