Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 34
Posts: 34   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 29703 times and has 33 replies Next Thread
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Think you're right on the money with that. Running CEP2 only on that client, times seen never seem to exceed the duration of a job within a task. If not heard from me again on this, I've confirmed.
That might explain why a task has to resume from the checkpoint which has just been taken as I have reported during the beta tests. Simply to keep track of the total CPU time: "Starting job xx, CPU time has been restored to nnnn.nnnnnn."

With the side effect of a rather long low-CPU-activity time clearly visible on BoincTasks when checkpoints are taken-reused.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Jul 10, 2010 12:31:14 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Whatever the weirdness, yesterday there were some Nautilus patches and today there was a small gloobus patch that also hooks into Nautilus Elementary and libtracker which also works to make Linux faster. Today I was doing quite a bit on the machine and kept 2 DDDT2 and 2 CEP2 concurrently running and several finished, one with 9:08 hours reported CPU time and "just" 27 minutes to add for system/user elapsed or 3 minutes per hour or 95% efficiency. That promises something for the coming nights/days when the system is kept full "crunch only" state. And the bonus, credit very close or better than claim.

E200265_ 510_ A.23.C17H8N4S2.128.4.set1d06_ 0-- 1292373 Valid 8/20/10 13:13:56 8/22/10 16:14:31 9.08 163.6 / 156.7
E200265_ 304_ A.23.C17H8N4OS.141.set1d06_ 0-- 1292373 Pending Validation 8/20/10 13:22:27 8/22/10 14:05:37 7.03 126.6 / 0.0
E200265_ 346_ A.23.C17H8N4OS.30.set1d06_ 0-- 1292373 Pending Validation 8/20/10 11:54:48 8/22/10 05:48:25 7.97 143.7 / 0.0
E200265_ 385_ A.23.C17H8N4OS.55.4.set1d06_ 0-- 1292373 Pending Validation 8/20/10 10:37:23 8/22/10 01:40:34 6.76 121.8 / 0.0
E200264_ 727_ A.23.C17H7N3OS2.136.1.set1d06_ 1-- 1292373 Pending Validation 8/20/10 05:04:52 8/21/10 21:14:42 6.55 118.1 / 0.0
E200263_ 879_ A.23.C17H12N2OS2Si.28.0.set1d06_ 0-- 1292373 Valid 8/20/10 04:51:29 8/21/10 13:07:11 6.40 115.4 / 118.9

Going for goldness... still needing 5 days per noon stats.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Aug 22, 2010 6:50:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Did all updates to my already clean lynx system yesterday and let it run through the night - my results still are at ~60% efficiency.
Guess I will interrupt CEP2 for the moment and wait for the windows version, energy is better spent for projects that use wallclock time well sad
[Aug 24, 2010 10:51:21 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

hi Ummon,

Of course your client is set to run 100% and nothing of background indexing is running secretly and all these other bits such as running off a HD with the fastest native file system (ext3 / ext4). Have you looked for those odd time losses as some have reported round about the checkpoints?

Efficiency varies here for unknown reason, some 95%, some 92%, maybe multiple concurrent causing bottlenecking. I continue to make sure to run mix and suspend jobs on the quad whenever I see more than 2 running and let anything else skip ahead, HCMD2/HFCC/C4CW now too. Can only imagine the resource battle that occurs when a quad with hyperthreading goes off to do 8 simultaneous. Seeing any high page faulting numbers and or delta there-off?
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Aug 24, 2010 11:18:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Thanks for the reply Sekerob

I checked for the ´normal´ problems which you also mentioned. It´s a Q8300 System ( no Hyperthreading ) with a new installation of mint9, nothing running in the background. Filesystem is ext3. I ruled out hardware probs since the winXP partition runs fine.

When looking at ´top´ only the 4 cep2 tasks have mentionable CPU usage - but do not stay at 100% all the time, there often are short drops.
During these drops, no other task is going up in CPU usage.

If by odd time losses you mean the "Starting job 1,CPU time has been restored to xxx" - line, I experience it at every checkpoint.

And finally - I´m afraid not to understand what you mean by high page faulting and delta, guess this is too advanced for me confused
[Aug 24, 2010 11:53:16 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Ummon, plz look in your BOINC manager and check if "Run Always" is selected in the Activity menu and or the Local Preferences the CPU Time % is set to 100. Probably is, but just wantto rule this pissue out where BOINC default is to process only at 60% load.

(soft) page faulting is when the CPU has to go out and fetch more data not already in memory and explained here: http://en.wikipedia.org/wiki/Page_fault

Some have more than others. When it runs in the tens or hundreds of thousands per second it affects throughput considerably.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Aug 24, 2010 12:07:19 PM]
[Aug 24, 2010 12:05:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Sorry, forgot to mention it in my last post.
Boinc is running based on preferences, CPU usage is set to 100%. There are no messages indicating that computation has been suspended due to preferences. Switched to "run always" now just to be sure, system is behaving identically.
Thanks for the wikipedia link, had already googled and read it. I´m just not sure how to check for this on my system to answer your question.
[Aug 24, 2010 12:16:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

OK, had to open up the books myself, walking the walk to go with talking the talk:

In a terminal window enter top, that opens an interactive process window. Hit Capital A i.e. shift A, which subdivides the view to show many fields. The columns to look for are nFLT and nDRT

To watch a specific process easier first do top and choose a CEP2 process, then hit q to quite top then enter

top -p PID

where PID is the process number (don't know yet how to isolate a specific process from within a running top).

then, shift A to see all fields

Also 'u' can be used to see only the boinc owned processes and then do shift A to see the detail. Easier on the eye.

Source of my newly learned:

http://www.linuxforums.org/articles/using-top-more-efficiently_89.html

I'm just seeing 1 or 2 nFLT on the CEP2 processes and nDRT is 0, so not an issue here. Above 10 seems to be bad.

What's frustrating with the hundreds of distros and versions is that something may work in 1 and not the other. Found something that peaked my interest on /dev/shm and some commands to mount something to speed things up and essentially requiring more book reading... I'm so addicted to the GUI way of things and hitting help to find an answer.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Aug 24, 2010 1:30:21 PM]
[Aug 24, 2010 1:27:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Thanks for the details, shift-A was new for me, quite interesting the details.

Anyway nFLT/nDRT is 6/0 for 1 process, the other 3 are at 0/0.
So I guess this is not the problem sad
[Aug 24, 2010 1:58:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Elapsed time weirdness

Closer examination, had 0 or 1 on CEP2 and 2 on C4CW for nFLT. The nDRT has continued to be zero for the whole time that TOP was up (long refresh time set to minimize load). 6 is 3x more than 2 and 6x more than 1, so not sure how many cycles go to those events. I suppose you might want to closely watch the checkpoint / CPU correlation. See posts JmBoullier as to what he did to determine 6-7 minutes going nowhere on his system at some checkpoints, when Linux loads up a new job. Certainly anything that uses OpenGL extensively is OFF and of course, if you have the graphics on screen that can seriously eat time.

Sorry to not be much of help getting closer to capping that black hole.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Aug 24, 2010 2:07:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 34   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread