Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Support Forum: GPU Support Forum Thread: 26 pages of invalids |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 22
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just noticed I have 26 pages of invalid WU's currently.
Any idea what's going on here? GPU and CPU are stable , otherwise I would see errors i guess |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
After 26 pages, one would think the quota would have been cut down to just 1 per day. Unfortunately the daily quota starts off for a normally validating device at about 4000 or whatever the number is.
If you click on an "invalid" link, and post a copy, we or the techs might see something unusual. OCing or anything else non-standard <count>.0769230</count> for instance in an app_info.xml |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks ,I'll check my app_info file
Like this you mean ? Result Log Result Name: X0930076221368200610191522_ 0-- <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Commandline: projects/www.worldcommunitygrid.org/wcg_hcc1_img_7.05_windows_intelx86__ati_hcc1 --zipfile X0930076221368200610191522.zip --imagelist images.txt --device 0 <app_init_data> <major_version>7</major_version> <minor_version>0</minor_version> <release>28</release> <app_version>705</app_version> <app_name>hcc1</app_name> <project_preferences> <color_scheme>Tahiti Sunset</color_scheme> <max_frames_sec>3</max_frames_sec> <max_gfx_cpu_pct>5.0</max_gfx_cpu_pct> </project_preferences> <project_dir>C:\ProgramData\BOINC/projects/www.worldcommunitygrid.org</project_dir> <boinc_dir>C:\ProgramData\BOINC</boinc_dir> <wu_name>X0930076221368200610191522</wu_name> <result_name>X0930076221368200610191522_0</result_name> <comm_obj_name>boinc_1</comm_obj_name> <slot>1</slot> <wu_cpu_time>0.000000</wu_cpu_time> <starting_elapsed_time>0.000000</starting_elapsed_time> <using_sandbox>0</using_sandbox> <user_total_credit>31408879.187295</user_total_credit> <user_expavg_credit>11167.855886</user_expavg_credit> <host_total_credit>3574463.594169</host_total_credit> <host_expavg_credit>8959.008477</host_expavg_credit> <resource_share_fraction>1.000000</resource_share_fraction> <checkpoint_period>600.000000</checkpoint_period> <fraction_done_start>0.000000</fraction_done_start> <fraction_done_end>1.000000</fraction_done_end> <gpu_type>ATI</gpu_type> <gpu_device_num>0</gpu_device_num> <gpu_opencl_dev_index>0</gpu_opencl_dev_index> <ncpus>1.000000</ncpus> <rsc_fpops_est>26106628485363.000000</rsc_fpops_est> <rsc_fpops_bound>522132569707260.000000</rsc_fpops_bound> <rsc_memory_bound>78643200.000000</rsc_memory_bound> <rsc_disk_bound>50000000.000000</rsc_disk_bound> <computation_deadline>1353575110.000000</computation_deadline> <vbox_window>0</vbox_window> </app_init_data> INFO: gpu_type set in init_data.xml to ATI INFO: gpu_device_num set in init_data.xml to 0 Boinc requested ATI gpu device number0 Unzipping input images ../../projects/www.worldcommunitygrid.org/X0930076221368200610191522_X0930076221368200610191522.zip Processing jobdescription Number of Images defined in image list is 2 Found compute platform Advanced Micro Devices, Inc. Selecting this platform CL_DEVICE_NAME: Cypress CL_DEVICE_VENDOR: Advanced Micro Devices, Inc. CL_DEVICE_VERSION: 1084.2 (VM) CL_DEVICE_MAX_COMPUTE_UNITS: CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3 CL_DEVICE_MAX_WORK_ITEM_SIZES: 256 / 256 / 256 CL_DEVICE_MAX_WORK_GROUP_SIZE: 256 CL_DEVICE_MAX_CLOCK_FREQUENCY: 850 MHz CL_DEVICE_ADDRESS_BITS: 32 CL_DEVICE_MAX_MEM_ALLOC_SIZE: 512 MByte CL_DEVICE_GLOBAL_MEM_SIZE: 1024 MByte CL_DEVICE_ERROR_CORRECTION_SUPPORT: no CL_DEVICE_LOCAL_MEM_TYPE: local CL_DEVICE_LOCAL_MEM_SIZE: 32 KByte CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64 KByte CL_DEVICE_QUEUE_PROPERTIES: CL_QUEUE_PROFILING_ENABLE CL_DEVICE_EXTENSIONS: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_dx9_media_sharing Estimated kernel execution time = 0.36391 [sec] Starting analysis of X0930076221368200610191522.jp2... Extracting GLCM features... Total kernel time: 139.811203 (1026 kernel executions) Total memory transfer time: 59.201248 Average kernel time: 0.136268 Min kernel time: 0.126351 (dx=5 dy=25 sample_dist=24 ) Max kernel time: 0.146034 dx=1 dy=2 sample_dist=1 INFO: GPU calculations complete. Total time for X0930076221368200610191522.jp2: 365 seconds Finished Image #0, pctComplete = 0.500000 Starting analysis of X0930076220100200610191543.jp2... Extracting GLCM features... Total kernel time: 122.004700 (1026 kernel executions) Total memory transfer time: 95.851219 Average kernel time: 0.118913 Min kernel time: 0.112021 (dx=5 dy=25 sample_dist=24 ) Max kernel time: 0.128195 dx=1 dy=2 sample_dist=1 INFO: GPU calculations complete. Total time for X0930076220100200610191543.jp2: 306 seconds Finished Image #1, pctComplete = 1.000000 CPU time used = 223.518233 18:43:06 (1684): called boinc_finish </stderr_txt> ]]> |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
here's the app_info file i'm using :
<app_info> <app> <name>hcc1</name> <user_friendly_name>Help Conquer Cancer</user_friendly_name> </app> <file_info> <name>wcg_hcc1_img_7.05_windows_intelx86__ati_hcc1</name> <executable/> </file_info> <file_info> <name>hcckernel.cl.7.05</name> <executable/> </file_info> <app_version> <app_name>hcc1</app_name> <version_num>705</version_num> <platform>windows_intelx86</platform> <plan_class>ati_hcc1</plan_class> <avg_ncpus>1.0</avg_ncpus> <max_ncpus>1.0</max_ncpus> <coproc> <type>ATI</type> <count>.5</count> </coproc> <file_ref> <file_name>wcg_hcc1_img_7.05_windows_intelx86__ati_hcc1</file_name> <main_program/> </file_ref> <file_ref> <file_name>hcckernel.cl.7.05</file_name> <open_name>hcckernel.cl</open_name> </file_ref> </app_version> </app_info> |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: |
Can you give us the specs on the problem machine and how many tasks are you running at the same time?
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
specs :
Intel 980X HT off 143x29 (4147Mhz) Gigabyte X58 UD4P 6x2GB ram 715Mhz 7-7-6-21 1T Sapphire HD 5850 850Mhz core / 1000Mhz RAM 64GB Intel SLC SSD 1 TB Samsung hard disk running 2 GPU WU's at the same time. System is stable and the problems started with the longer GPU tasks. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
https://secure.worldcommunitygrid.org/forums/...ead,34252_offset,0#401135
SekeRob says it better than I could. Reported this to WCG staff. Lawrence |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: |
specs : Intel 980X HT off 143x29 (4147Mhz) Gigabyte X58 UD4P 6x2GB ram 715Mhz 7-7-6-21 1T Sapphire HD 5850 850Mhz core / 1000Mhz RAM 64GB Intel SLC SSD 1 TB Samsung hard disk running 2 GPU WU's at the same time. System is stable and the problems started with the longer GPU tasks. Just a guess but maybe the overclock on your card and the dual WU tasks aren't playing nice together. Try lowering your core clock and see what happens. And FWIW you can lower your ram clocks and lose no efficiency while saving power and heat. I drop mine by 50% at least.
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The core clock on my GPU has been changed to 850 Mhz yesterday.
Most invalids have a date before I changed the core clock, so I guess that's not the problem. Also , if there is any instability I think I would have errors on WU's which is not the case. |
||
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges: |
The core clock on my GPU has been changed to 850 Mhz yesterday. Most invalids have a date before I changed the core clock, so I guess that's not the problem. Also , if there is any instability I think I would have errors on WU's which is not the case. Not that it's related but I had a bunch invalids when I tried to run 2 5870s in the same machine. None of them showed any error messages. When I went to a single GPU on that machine the invalids stopped. I compared my invalid tasks with a wingman who had a validated task and I couldn't find any differences in the logs. I guess I missed something. Not that any of this helps you but invalids seem to show up for no apparant reason.
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
|
||
|
|