Total posts in this thread: 11
This topic has been viewed 3356 times and has 10 replies.
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Abort eric_b099_ps0000_5 Yes or No Resolved

-
erlc_ b099_ ps0000_ 6-- - In Progress 3/9/10 17:17:57 3/15/10 07:41:57 0.00 0.0 / 0.0
erlc_ b099_ ps0000_ 7-- - In Progress 3/9/10 17:17:54 3/17/10 02:16:16 0.00 0.0 / 0.0
erlc_ b099_ ps0000_ 5-- - In Progress 3/9/10 15:30:11 3/15/10 05:54:11 0.00 0.0 / 0.0 < mine
erlc_ b099_ ps0000_ 4-- - In Progress 3/9/10 08:48:11 3/14/10 23:12:11 0.00 0.0 / 0.0
erlc_ b099_ ps0000_ 3-- 612 User Aborted 3/3/10 17:51:49 3/9/10 14:50:55 67.81 2,139.4 / 0.0
erlc_ b099_ ps0000_ 2-- - No Reply 3/3/10 17:51:25 3/9/10 08:15:25 0.00 0.0 / 0.0
erlc_ b099_ ps0000_ 1-- 612 User Aborted 2/17/10 17:57:44 3/9/10 15:47:35 0.00 0.0 / 0.0
erlc_ b099_ ps0000_ 0-- - No Reply 2/17/10 17:57:19 3/3/10 17:57:19 0.00 0.0 / 0.
This work unit has been running 5 Hrs.
Currently 0.040% complete.
Is there any hope for this work unit?
I will check back in 10 to 12 hrs. and make my decision then.
----------------------------------------
[Edit 2 times, last edit by Former Member at Mar 10, 2010 12:58:22 PM]
[Mar 10, 2010 2:50:30 AM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

Leave it running. User aborts don't usually mean anything, so let it run. It could have been bad luck on the previous hosts. Keep an eye on it, though, and let us know if you see anything strange with it.

-Uplinger
[Mar 10, 2010 3:21:51 AM]
gb009761
Master Cruncher
Scotland
Joined: Apr 6, 2005
Post Count: 2955
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

Well, a 'PS' type is an "A" type, and thus will run for an EXTREMELY long time, which may be why the users with '_1' and '_3' aborted it (i.e., they thought that something was wrong).

There were No Replies from '_0' and '_2', which may be due to their buffers being huge or their computers not crunching enough hours to finish in time. I personally don't see any reason why it should be aborted.

All I'd say is to make sure that it is running properly. The whole unit will probably take in excess of 40-50 hours, so the first checkpoint (at the 2% point) would take upwards of an hour to be made. After that, there's a checkpoint every 2% (i.e., 50 checkpoints in total). So be prepared to leave your machine on, crunching for as long as possible (which I suspect you do, having so many Ruby badges to your name).
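
As a rough sanity check on those numbers, here is a minimal Python sketch. It assumes progress is roughly linear in run time, which real work units need not follow, and the function names are made up for illustration; nothing here comes from the WCG or BOINC software.

```python
def projected_total_hours(elapsed_hours: float, percent_complete: float) -> float:
    """Linearly extrapolate total run time from the progress made so far.

    Assumption: the progress percentage grows linearly with run time.
    """
    if percent_complete <= 0:
        # No measurable progress yet, so no finite estimate is possible.
        return float("inf")
    return elapsed_hours * 100.0 / percent_complete


def hours_to_first_checkpoint(total_hours: float, step_percent: float = 2.0) -> float:
    """Time until the first checkpoint, given one checkpoint every step_percent."""
    return total_hours * step_percent / 100.0


# A 40-50 hour unit with a checkpoint every 2% (50 checkpoints in total).
print(hours_to_first_checkpoint(40.0), hours_to_first_checkpoint(50.0))

# The figures from the first post: 5 hours run, 0.040% complete.
print(projected_total_hours(5.0, 0.040))
```

With a 40-50 hour total, the first checkpoint lands at roughly 0.8 to 1 hour in. The 0.040% after 5 hours from the first post extrapolates to roughly 12,500 hours under the same linear assumption, so it is worth watching whether the rate picks up.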

Obviously, I'm no expert, so if a CA/WCG tech comes in and advises differently, take their word over mine.

Edit: Uplinger beat me to it.
----------------------------------------
[Edit 1 times, last edit by gb009761 at Mar 10, 2010 3:23:22 AM]
[Mar 10, 2010 3:22:39 AM]
HutchNYC
Advanced Cruncher
United States
Joined: Nov 27, 2005
Post Count: 97
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

One of my machines picked up copy _6 of this one. I let it run for over an hour and a half, and progress is still at 0.000%. I've suspended mine for now until a tech advises.
----------------------------------------
[Mar 10, 2010 3:23:59 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

You guys need to let them run.
[Mar 10, 2010 3:50:21 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

For the time being, this is still nothing abnormal if the machine is not a superfast one. Keep an eye on it from time to time, and tell us which processor it is using next time you post. Something that could seem abnormal for one of those 4+ GHz overclocked modern processors can be just normal for a poor old P4 doing what it can.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Mar 10, 2010 3:54:04 AM]
HutchNYC
Advanced Cruncher
United States
Joined: Nov 27, 2005
Post Count: 97
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

I should have mentioned: it is an i7-920. All other type A's show progress after 20-25 minutes at the most, with the first checkpoint (2%) usually about an hour into it.

With this one, still at 0.000% complete after over an hour and a half.

That's why it is out of the ordinary.

Edit

Digging a little deeper, I copied the slots directory and looked at the result.out log.

Some of the "normal" warnings and messages that exist in other WU's of this type are present, but what stands out is the constant loop repeat of:

*** LEVEL 1 WARNING *** BOMLEV IS -1
** ERROR IN SHAKEA ** DEVIATION IN SHAKE TOO LARGE

The result.out file is already 28,743k in size because this warning/error pair keeps repeating on each loop.

Other type-A WU's that are running fine have a result.out of only about 962k after several hours.

Hope this might help uplinger or one of the other techs.
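
For anyone who wants to run the same check on their own copy, here is a minimal Python sketch of it. It only assumes that result.out is a plain-text log and that the two messages quoted above appear verbatim; the function name scan_log and the example path are hypothetical, not part of BOINC or the science application.

```python
from pathlib import Path

# The two messages quoted above from the looping result.out.
MARKERS = (
    "BOMLEV IS -1",
    "DEVIATION IN SHAKE TOO LARGE",
)


def scan_log(path, markers=MARKERS):
    """Return (size in KiB, {marker: number of lines containing it})."""
    counts = {marker: 0 for marker in markers}
    # errors="replace" keeps the scan going if the log has odd bytes in it.
    with open(path, "r", errors="replace") as log:
        for line in log:
            for marker in markers:
                if marker in line:
                    counts[marker] += 1
    size_kib = Path(path).stat().st_size / 1024
    return size_kib, counts


# Example call on a copy of the slot's log (hypothetical path):
# size_kib, counts = scan_log("slots_copy/result.out")
# print(f"{size_kib:.0f}k", counts)
```

Work on a copy of the slots directory rather than the live one, as above. A file far larger than the one from a healthy work unit, together with counts that keep climbing between two scans, would match the looping described here.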
----------------------------------------
[Edit 1 times, last edit by HutchNYC at Mar 10, 2010 4:32:14 AM]
[Mar 10, 2010 4:06:11 AM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

HutchNYC, this does help a lot. I would say abort the work unit. We have seen this on one other work unit (in beta). There is no pattern yet, though I'm starting to think there might be one. I will test this work unit on my machines and see if anything comes of it.

-Uplinger
[Mar 10, 2010 4:37:09 AM]
HutchNYC
Advanced Cruncher
United States
Joined: Nov 27, 2005
Post Count: 97
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

Ok. Just aborted.

I'll keep the copy of the slots directory for a week or so in case you decide you want any of it.

Thanks for the quick reply!
----------------------------------------
[Mar 10, 2010 4:39:59 AM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Abort eric_b099_ps0000_5 Yes or No

Hutch, thanks for holding the file...give me about 48 hours to see if I encounter the same issue...

Thanks,
-Uplinger
[Mar 10, 2010 7:06:19 AM]