Author |
Message |
|
Looks like a new problem
Linux 64, 6.5.0, 177 driver, hostID16551 (Q6600, 4 GB, GTX260)
Was OK for a while with up to 4 WUs, but now I cannot get any new ones.
Server believes I wil not finish in time: BOINC runs 2.2% of time, computation enabled 100% of that.
On my 24/7 dedicated cruncher BOINC actually runs all the time. So looks to me as if the 2.2% refers to the time used for GPUGRID only (4 other WU - ABC, PG and Cosmology - use the rest).
Will try some more manual updates before my last WU finishes.
kind regards
Alain |
|
|
Beyond![Avatar](https://www.gravatar.com/avatar/026deda0a0d87168ee4e605155a8e102?s=100&d=identicon) Send message
Joined: 23 Nov 08 Posts: 1112 Credit: 6,162,416,256 RAC: 0 Level
![Tyrosine - More than 5B credits Tyr](img/badges/aa/badge_tyr.png) Scientific publications
![Top 25% (348th/2932) contribution to Buch et al, J. Chem. Inf. Model. 2010 wat](img/badges/papers/badge_pub_ruby.png) ![Top 25% (281st/2466) contribution to Sadiq et al, Proteins 2010 wat](img/badges/papers/badge_pub_ruby.png) ![Top 10% (165th/3118) contribution to Selent et al, PLoS Comput Biol 2010 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (133rd/4410) contribution to Buch et al, PNAS 2011 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (45th/2450) contribution to Giorgino et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (42nd/9662) contribution to Buch et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (119th/5798) contribution to Sadiq et al, PNAS 2012 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (17th/2163) contribution to Bisignano et al. JCIM 2014 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (19th/1283) contribution to Doerr et al. JCTC 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (19th/2838) contribution to Stanley et al, Nat Commun 2014 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (148th/3183) contribution to Lauro et al., JCIM 2014 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (49th/3611) contribution to Ferruz et al., JCIM 2015 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (8th/4128) contribution to Ferruz et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (124th/4815) contribution to Stanley et al., Sci Rep 2016 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (6th/4730) contribution to Noe et al., Nat Chem 2017 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (7th/1348) contribution to Doerr et al, JCTC 2017 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (9th/4634) contribution to Martinez-Rosell et al, JCIM 2018 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (9th/1656) contribution to Kapoor et al., Sci Rep 2017 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 1% (15th/1885) contribution to Ferruz et al., Sci Rep 2018 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (17th/1022) contribution to Wang et al., ACS Cent. Sci. 2019 wat](img/badges/papers/badge_pub_emerald.png) ![Top 25% (137th/672) contribution to Martinez-Rosell et al, JCIM 2020 wat](img/badges/papers/badge_pub_ruby.png) ![Top 10% (140th/1541) contribution to Rodriguez-Espigares et al., Nat Meth 2020 wat](img/badges/papers/badge_pub_emerald.png) ![Top 1% (11th/1450) contribution to Herrera-Nieto et al, Sci Rep 2020 wat](img/badges/papers/badge_pub_sapphire.png) ![Top 10% (71st/6232) contribution to Herrera-Nieto et al, JCIM 2020 wat](img/badges/papers/badge_pub_emerald.png) |
Have you tried setting the BOINC resource share for GPUGRID higher?
|
|
|
|
Many thanks, had not thought of that.
Raised from 100 to 500% (too bad for the already out of line long term debt) and got 2 WU with manual update (x2).
Thanks again
Alain |
|
|
|
Think I figured out what is happening.
This morning my on fraction had grown to 8.8%, so I still had to do a manual update to report 2 WU finished and to get a new one. This succeeded.
A bit later I tried to get another new WU to notice that on fraction was now less than before, even close to zero.
What happened? Being on leave I am playing around with my machines, including the Linux one (hostID 16551). So somewhat unusual I restarted this one a couple of times. And checking on the client_state.xml I noticed that with every restart of the BOINC client the ON_frac value is reset to zero!
I believe this is an unwanted feature of 6.5.0. On previous versions it was an average over time, which makes sense.
Can anyone please confirm this?
Many thanks.
Alain |
|
|
|
Not sure if this related to the aforementioned problem or not. But, it seems to be similar, so here goes... Since installing Boinc 6.5.0 I noticed that I am having trouble getting work from any project, I have the work cache set to 0.6 days. Currently running 1 'USPME type' GPU and 4 prime grid which take about 12 mins each. BOINC manager is only allowing me 1 or 2 extra PG tasks as cache when there should be about 100 or more.
Also noticed these messages when starting Boinc manager
Wed 24 Dec 2008 18:15:00 GMT||Starting BOINC client version 6.5.0 for x86_64-pc-linux-gnu
Wed 24 Dec 2008 18:15:00 GMT||[error] bad value -1.000000 of time stats connected_frac; ignoring
Wed 24 Dec 2008 18:15:00 GMT||[error] bad value -1.000000 of time stats active_frac; ignoring
Wed 24 Dec 2008 18:15:00 GMT||Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q9450 @ 2.66GHz [Family 6 Model 23 Stepping 7]
Wed 24 Dec 2008 18:15:00 GMT||Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall lm constant_tsc arch_perfmon pebs bts rep_good nopl pni monitor ds_cpl vmx smx est tm2 ssse3 cx16 xtpr sse4
Wed 24 Dec 2008 18:15:00 GMT||OS: Linux: 2.6.27-9-generic
Wed 24 Dec 2008 18:15:00 GMT||Memory: 3.87 GB physical, 953.66 MB virtual
Wed 24 Dec 2008 18:15:00 GMT||Disk: 26.58 GB total, 21.83 GB free
Also noted that my 'on' time was reading 0.0145....... after it had been on for 5 hours solid.
On another note... I noticed that the GPU task was waiting for memory, so I increased its allowance to 90% and memory usage climbed to 2.3GB. I have since rebooted and it is currently @ 380MB and climbing. Apps are left suspended in memory.
Hope some of this can help.
Mark |
|
|
mike047Send message
Joined: 21 Dec 08 Posts: 47 Credit: 7,330,049 RAC: 0 Level
![Serine - More than 5M credits Ser](img/badges/aa/badge_ser.png) Scientific publications
![Top 75% (176th/251) contribution to De Fabritiis et al, Proteins 2008 wat](img/badges/papers/badge_pub_silver.png) ![Top 10% (54th/2932) contribution to Buch et al, J. Chem. Inf. Model. 2010 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (33rd/2466) contribution to Sadiq et al, Proteins 2010 wat](img/badges/papers/badge_pub_emerald.png) ![Top 10% (194th/3118) contribution to Selent et al, PLoS Comput Biol 2010 wat](img/badges/papers/badge_pub_emerald.png) ![Top 90% (2024th/2450) contribution to Giorgino et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_bronze.png) ![Top 10% (831st/9662) contribution to Buch et al, J. Chem. Theory Comput. 2011 wat](img/badges/papers/badge_pub_emerald.png) ![Top 75% (2268th/3183) contribution to Lauro et al., JCIM 2014 wat](img/badges/papers/badge_pub_silver.png) |
If you will run the boinc benchmark, it will give you the memory back..you won't have to reboot. This is on 6.4.2.
mike |
|
|
|
If you will run the boinc benchmark, it will give you the memory back..you won't have to reboot. This is on 6.4.2.
mike
Thanks for that Mike. I'll remember that next time I use Linux. I'm currently back running GPU on Windoze with the Boinc 6.5.0 release and notice that it does not suffer the same problems as Linux does. Seems much more stable on Windoze and appears to use much less CPU.
Mark
|
|
|
|
Checked again this morning. on_frac reset only happens after restart on 6.5.0 Linux 64 bit.
After downgrade to 6.4.5 on_frac stays as it was and continues to increase.
My VISTA 32, unfortunately without a suitable GPU, is also OK even on 6.5.0.
So staying with 6.4.5 for now since my machines will have to be shutdown for the night till my visitors are gone (they sleep in my PC room).
Merry Christmas to all of you.
Alain |
|
|
|
Checked again this morning. on_frac reset only happens after restart on 6.5.0 Linux 64 bit.
After downgrade to 6.4.5 on_frac stays as it was and continues to increase.
My VISTA 32, unfortunately without a suitable GPU, is also OK even on 6.5.0.
So staying with 6.4.5 for now since my machines will have to be shutdown for the night till my visitors are gone (they sleep in my PC room).
Merry Christmas to all of you.
Alain
Confirmed by Dagorath (thank you) who put it also on the boinc-alpha list.
See BOINC forum
Kind regards
Alain |
|
|