Merge ~rbalint/autopkgtest-cloud:more-retries into autopkgtest-cloud:master
Proposed by Balint Reczey
Status: | Merged
---|---
Merge reported by: | Balint Reczey
Merged at revision: | bc4d6865bf81ffd8f5958b5dabc503fef9289f94
Proposed branch: | ~rbalint/autopkgtest-cloud:more-retries
Merge into: | autopkgtest-cloud:master
Diff against target: | 156 lines (+65/-53), 1 file modified: worker/worker (+65/-53)
Reviewer | Review Type | Date Requested | Status |
---|---|---|---|
Iain Lane (community) | Approve | ||
Thanks, but sorry, this isn't going to work as-is.
To understand why, find the code where FAIL_STRINGS_REGEX is used. It's used exclusively where a run has *temporarily* failed (exitcode == 16). What we had here was a permanent failure (code 4). We need to check that the exitcode is in (2, 4, 6, 8) and then grep for a different set of strings, factoring out the existing logic in the most sensible way so that it applies to both cases.
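A rough sketch of that factoring, for illustration only. The exit codes come from the comment above; the function name, the constant names, and both pattern strings are hypothetical placeholders, not what worker/worker actually contains.

```python
import re

# Placeholder patterns only; the real string lists live in worker/worker.
TEMPORARY_FAIL_REGEX = re.compile(r"example temporary failure")
PERMANENT_FAIL_REGEX = re.compile(r"example permanent failure")

# Exit codes as described in the review: 16 is a temporary failure,
# 2, 4, 6 and 8 are permanent failures.
TEMPORARY_EXIT_CODES = frozenset({16})
PERMANENT_EXIT_CODES = frozenset({2, 4, 6, 8})


def should_retry(exit_code: int, log_text: str) -> bool:
    """Pick the pattern set for this exit code, then search the log once."""
    if exit_code in TEMPORARY_EXIT_CODES:
        pattern = TEMPORARY_FAIL_REGEX
    elif exit_code in PERMANENT_EXIT_CODES:
        pattern = PERMANENT_FAIL_REGEX
    else:
        # Success or any other exit code: nothing to retry.
        return False
    return pattern.search(log_text) is not None
```

The point of the shape is that the log search is shared and only the choice of pattern set depends on the exit code, so a permanent-failure string never triggers a retry for a code 16 run, or the other way round.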
Also, the 'Temporary failure' string should not be restricted to a package: we can get this happening in runs of any test. The other ones are restricted to systemd* and linux* because those are packages which genuinely can, and do, break booting of the instances completely when they have a bug. autopkgtest isn't great at catching that, so we do it outside.
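To make the scoping concrete, here is a minimal sketch assuming the strings are kept as (package glob, substring) pairs. That data layout, the names `RETRY_RULES` and `matching_rules`, and the boot-failure placeholder string are assumptions for illustration; only 'Temporary failure', systemd* and linux* come from the comment above.

```python
import fnmatch

# (package glob, log substring) pairs. "*" applies the rule to every package.
# The boot-failure string is a placeholder, not a real worker pattern.
RETRY_RULES = [
    ("*", "Temporary failure"),
    ("systemd*", "example boot failure string"),
    ("linux*", "example boot failure string"),
]


def matching_rules(package: str, log_text: str) -> list:
    """Return the rules whose glob matches the package and whose string is in the log."""
    return [
        (glob, needle)
        for glob, needle in RETRY_RULES
        if fnmatch.fnmatchcase(package, glob) and needle in log_text
    ]
```

With this layout, a 'Temporary failure' in the log matches for any package, while the boot-failure string only counts for packages whose names start with systemd or linux.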
Out of interest, do you have any idea why the clock is moving backwards? That feels like something we should fix properly rather than retrying on, if we can understand why it happens and why provisioning hasn't given us the right time (or if our idea of time got out of step with reality...).