Computation error: EXIT_TIME_LIMIT_EXCEEDED
Author | Message |
---|---|
Send message Joined: 29 Sep 20 Posts: 14 Credit: 64,341 RAC: 0 |
After 5 hours 26 min 52 sec runtime: http://150.254.66.104/gaiaathome/result.php?resultid=2291547 |
Send message Joined: 8 Apr 20 Posts: 13 Credit: 1,125,955 RAC: 15,672 |
I had 2 WUs that kept running past 6 hours (on different machines). I just aborted them. Hope I didn't do that prematurely. (I've only had one 2_Gaia WU complete successfully so far.) |
Send message Joined: 21 Aug 19 Posts: 110 Credit: 888,695 RAC: 8,066 |
I am waiting for the WUs in progress to finish, and then I will change the 2_Gaia app to a shorter run time per WU. |
Send message Joined: 21 Aug 19 Posts: 23 Credit: 91,637 RAC: 0 |
Increase the difficulty of the task (<workunit>) or decrease the application performance (<app_version>). By changing the size of the task you keep the proportion, but it is still too optimistic. Changing the difficulty of a task is the easiest option; performance is recalculated by the client after the test and task completion. |
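For readers following along, here is a rough sketch of the relationship this post seems to describe. It assumes the usual BOINC behaviour in which the client aborts a task with EXIT_TIME_LIMIT_EXCEEDED once elapsed time passes the workunit's `rsc_fpops_bound` divided by the app version's estimated `flops`. Reading "difficulty" as the fpops bound is an interpretation, and the numbers are invented for illustration, not this project's real settings.

```python
def time_limit_seconds(rsc_fpops_bound, app_flops):
    """Elapsed-time limit in seconds: operation bound / estimated speed."""
    if app_flops <= 0:
        raise ValueError("app_flops must be positive")
    return rsc_fpops_bound / app_flops


# Hypothetical values, chosen only to make the arithmetic easy to follow.
bound = 3.6e14   # floating-point operations allowed for the task
flops = 5.0e9    # estimated speed of the app version on this host

limit = time_limit_seconds(bound, flops)                  # baseline limit
limit_harder_task = time_limit_seconds(2 * bound, flops)  # "increase difficulty"
limit_slower_app = time_limit_seconds(bound, flops / 2)   # "decrease performance"

print(limit / 3600, limit_harder_task / 3600, limit_slower_app / 3600)
```

Either change doubles the allowed run time in this toy case, which matches the point that only the ratio matters.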
Send message Joined: 29 Sep 20 Posts: 4 Credit: 50,008 RAC: 0 |
@thread opener: Holy cow, 60K seconds and a lousy 8.33 points, for real!? |
Send message Joined: 8 Apr 20 Posts: 13 Credit: 1,125,955 RAC: 15,672 |
My validated 2_Gaia WUs haven't been that stingy on credit. Nothing to holler about either :). |
Send message Joined: 15 Oct 19 Posts: 11 Credit: 2,848,916 RAC: 0 |
The revised 2_Gaia tasks show a large drop in credits today compared to yesterday. I assume this change happened after a fix to stop the timeouts, as my machine is now running tasks longer. I am fine with 40-60 cr/hour on this machine; however, the current WUs are granted 9.15 credits regardless of run time, which works out to 2 to 3 cr/hour. Something got messed up somewhere. |
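The arithmetic behind those rates can be checked with a few lines. This is only a sketch using the figures quoted in the post (9.15 credits, 2 to 3 cr/hour); the function names are made up for the example.

```python
def credits_per_hour(credits, runtime_seconds):
    """Credit rate for one task."""
    if runtime_seconds <= 0:
        raise ValueError("runtime_seconds must be positive")
    return credits / (runtime_seconds / 3600.0)


def implied_runtime_hours(credits, rate_per_hour):
    """Run time that turns a fixed credit grant into the given rate."""
    if rate_per_hour <= 0:
        raise ValueError("rate_per_hour must be positive")
    return credits / rate_per_hour


FIXED_CREDIT = 9.15  # credits per WU, as reported in the post

longest = implied_runtime_hours(FIXED_CREDIT, 2.0)   # hours at 2 cr/hour
shortest = implied_runtime_hours(FIXED_CREDIT, 3.0)  # hours at 3 cr/hour
print(f"implied run time: {shortest:.2f} to {longest:.2f} hours")
```

So a flat 9.15 credits at 2 to 3 cr/hour corresponds to tasks running roughly 3 to 4.6 hours each.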
Send message Joined: 27 Apr 20 Posts: 40 Credit: 2,829,759 RAC: 58,401 |
I may run into this error soon, as I have 3 work units that are using CPU but have been running for 10, 12 and 15 hours. The time remaining has run out on two of them and the other has 3 seconds left, but it will take a long time to do those 3 seconds. Thanks, Conan |
Send message Joined: 29 Sep 20 Posts: 14 Credit: 64,341 RAC: 0 |
Finally the task http://150.254.66.104/gaiaathome/result.php?resultid=2338280 mentioned here ended with the error EXIT_TIME_LIMIT_EXCEEDED after almost 36 hours runtime. |
Send message Joined: 15 Oct 19 Posts: 11 Credit: 2,848,916 RAC: 0 |
Too many long running tasks that end in error...I have decided to set my machine to no new tasks for the time being until the current kinks are worked out. |
Send message Joined: 9 Oct 20 Posts: 3 Credit: 264,349 RAC: 0 |
Should we manually cancel tasks that have been running much longer than 2 hours, say those running over 5 hours? Are they in an infinite loop or something? I've got 2 tasks running over 10 hours and they will probably end with errors. |
Send message Joined: 21 Aug 19 Posts: 110 Credit: 888,695 RAC: 8,066 |
My previous solution was to limit calculations to 2 h. It does not work perfectly, because I checked the time only once per loop and sometimes a single loop takes a very long time to compute. I will change to a protection using kernel signals and terminate the process after at most 3 h (2 h is the normal time without problems). The app will then send a signal of correct process completion to BOINC and I will save the temporary results. I hope this solution will save your credits. |
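To illustrate the difference between checking the clock once per loop and a signal-based limit, here is a minimal sketch. It is not the project's code: it uses Python's `signal.setitimer` with `SIGALRM` as a stand-in for whatever the real app does, it only works on Unix-like systems in the main thread, and `run_with_time_limit` and `TimeLimitReached` are hypothetical names.

```python
import signal
import time


class TimeLimitReached(Exception):
    """Raised by the signal handler when the wall-clock limit expires."""


def _on_alarm(signum, frame):
    raise TimeLimitReached()


def run_with_time_limit(items, process, limit_seconds):
    """Process items in order; stop when the timer fires, even mid-item.

    Returns (results_so_far, finished). A real app would use something
    like 3 * 3600 seconds here; the demo below uses a fraction of a second.
    """
    results = []
    finished = False
    old_handler = signal.signal(signal.SIGALRM, _on_alarm)
    signal.setitimer(signal.ITIMER_REAL, limit_seconds)
    try:
        try:
            for item in items:
                results.append(process(item))
            finished = True
        finally:
            signal.setitimer(signal.ITIMER_REAL, 0)  # cancel the timer
    except TimeLimitReached:
        pass  # keep the partial results collected so far
    finally:
        signal.signal(signal.SIGALRM, old_handler)
    return results, finished


def slow_step(seconds):
    """Stand-in for one loop of real work."""
    time.sleep(seconds)
    return seconds


# The third item would take 5 s, but the timer interrupts it.
partial, done = run_with_time_limit([0.01, 0.01, 5.0], slow_step, 0.3)
print(partial, done)
```

The point of the design is that the limit fires inside a long iteration, so the results from earlier iterations can still be saved and reported.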
Send message Joined: 15 Oct 19 Posts: 11 Credit: 2,848,916 RAC: 0 |
I wonder whether something else might be causing problems. In this WU, http://150.254.66.104/gaiaathome/workunit.php?wuid=1164886, the first two tasks were aborted after long run times. However, the third task seemed to run for the normal length of time and was valid. So why were two tasks screwy while the WU still ended up with good results? Is there still some sort of calculation bug in the system that would send one set of hardware into an infinite loop while another set of hardware runs just fine? |
Send message Joined: 26 Feb 20 Posts: 17 Credit: 3,807,621 RAC: 50,596 |
"I had 2 WUs that kept running past 6 hours (on different machines). I just aborted them. Hope I didn't do that prematurely. (I've only had one 2_Gaia WU complete successfully so far.)" Hey, that's good for me passing you... ol' buddy!!! :-) :-))) |
Send message Joined: 21 Aug 19 Posts: 110 Credit: 888,695 RAC: 8,066 |
It is a problem of numerical integration. The nominal star travels at a large distance from the Sun (long integration step, short calculation time). The next 10 clones of the star travel at a similar distance (short calculation time). The following clones travel very close to the Sun (short integration step, long calculation time), and this situation is problematic. I plan to stop the calculation at that point and save the previous 10 clones. |
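A toy model may help show why close passes are so much slower. It assumes a straight-line flyby and a step length proportional to the current distance from the Sun, which is a simplification chosen for this sketch and not the integrator used by 2_Gaia. The names `count_steps` and `run_clones`, the units, and the step budget are all hypothetical.

```python
import math


def count_steps(impact_parameter, half_length=1000.0, eta=0.01, max_steps=None):
    """Count adaptive steps for a straight-line pass by the Sun at the origin.

    Each step has length eta * (current distance to the Sun), so steps
    shrink as the star gets closer. Returns None if max_steps is exceeded.
    """
    x = -half_length
    steps = 0
    while x < half_length:
        r = math.hypot(x, impact_parameter)  # distance to the Sun
        x += eta * r
        steps += 1
        if max_steps is not None and steps > max_steps:
            return None
    return steps


def run_clones(impact_parameters, step_budget):
    """Integrate clones in order; stop at the first one over budget
    and keep everything that finished before it."""
    completed = []
    for b in impact_parameters:
        steps = count_steps(b, max_steps=step_budget)
        if steps is None:
            break
        completed.append((b, steps))
    return completed


far = count_steps(100.0)    # distant pass: few, long steps
close = count_steps(0.001)  # close pass: many, short steps
saved = run_clones([100.0, 90.0, 0.001], step_budget=1000)
print(far, close, saved)
```

In this model the close clone needs several times more steps than the distant ones, and `run_clones` mirrors the plan in the post: stop at the problematic clone and keep the clones already computed.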
Send message Joined: 27 Sep 20 Posts: 10 Credit: 8,003,319 RAC: 38,434 |
Long running tasks are not necessarily bad, if they are needed to perform the calculations properly. But they should give more credits, based on the longer run times. And you really should add checkpoints so the tasks don't have to start over at 0% if they are paused or interrupted (computer reboot, for example). |
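For anyone wondering what checkpointing involves, here is a minimal sketch of the general idea: write progress to disk after each unit of work and resume from it on restart. This is plain Python with a JSON file, not the BOINC API and not the project's code; the file layout, the `stop_after` parameter (which simulates an interruption), and the "work" itself are invented for the example.

```python
import json
import os
import tempfile


def save_checkpoint(path, state):
    """Write the state to a temp file, then rename it over the old one,
    so an interruption never leaves a half-written checkpoint."""
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump(state, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp, path)


def load_checkpoint(path):
    """Return the saved state, or a fresh one if no checkpoint exists."""
    try:
        with open(path) as f:
            return json.load(f)
    except FileNotFoundError:
        return {"next_index": 0, "total": 0}


def run(path, n_items, stop_after=None):
    """Resume from the checkpoint and process the remaining items.

    stop_after simulates a pause or reboot after that many items.
    Returns (state, finished).
    """
    state = load_checkpoint(path)
    done_this_run = 0
    while state["next_index"] < n_items:
        if stop_after is not None and done_this_run >= stop_after:
            return state, False
        state["total"] += state["next_index"] ** 2  # stand-in for real work
        state["next_index"] += 1
        save_checkpoint(path, state)
        done_this_run += 1
    return state, True


workdir = tempfile.mkdtemp()
checkpoint_path = os.path.join(workdir, "checkpoint.json")
first_state, first_done = run(checkpoint_path, 10, stop_after=4)  # interrupted
final_state, final_done = run(checkpoint_path, 10)                # resumed
print(first_state, final_state)
```

The second call picks up at item 4 instead of starting over at 0%, which is the behaviour being requested here.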
Send message Joined: 19 May 20 Posts: 37 Credit: 31,365 RAC: 1,089 |
"And you really should add checkpoints so the tasks don't have to start over at 0% if they are paused or interrupted (computer reboot, for example)." +1 |
Send message Joined: 16 Sep 19 Posts: 4 Credit: 1,256,430 RAC: 0 |
"And you really should add checkpoints so the tasks don't have to start over at 0% if they are paused or interrupted (computer reboot, for example)." +1 |