Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Reset downtime at resume, 2. try | Håkon Hallingstad | 2024-01-10 | 1 | -6/+23 |
| | |||||
* | Revert "Reset downtime at resume" | Harald Musum | 2024-01-06 | 1 | -23/+6 |
| | |||||
* | Reset downtime at resume | Håkon Hallingstad | 2024-01-05 | 1 | -6/+23 |
| | |||||
* | Add javadoc | Martin Polden | 2023-10-16 | 1 | -12/+13 |
| | |||||
* | Update copyright | Jon Bratseth | 2023-10-09 | 1 | -1/+1 |
| | |||||
* | Add enums for infrastructure and add to vespametricsset as needed for ↵ | yngveaasheim | 2023-07-31 | 1 | -2/+3 |
| | | | | infrastructure services. | ||||
* | Ensure correct lock order when failing tenant hosts | jonmv | 2023-07-14 | 1 | -25/+40 |
| | |||||
* | Add two TODOs about locks taken in the wrong order | jonmv | 2023-07-12 | 1 | -1/+1 |
| | |||||
* | Don't fail nodes undergoing CMR (#26743) | Ola Aunrønning | 2023-04-14 | 1 | -1/+15 |
| | |||||
* | maintainer success factor baseline deviation | bjormel | 2023-03-29 | 1 | -1/+1 |
| | |||||
* | Do not hold application lock while replacing failing node | Martin Polden | 2023-03-10 | 1 | -27/+36 |
| | |||||
* | Reduce NodeFailer activate timeout | Håkon Hallingstad | 2022-12-21 | 1 | -2/+4 |
| | |||||
* | Revert "Revert collect(Collectors.toList())" | Henning Baldersheim | 2022-12-04 | 1 | -1/+1 |
| | |||||
* | Revert collect(Collectors.toList()) | Henning Baldersheim | 2022-12-04 | 1 | -1/+1 |
| | |||||
* | Merge branch 'master' into bratseth/discard-warmup-metrics | Jon Bratseth | 2022-12-03 | 1 | -1/+1 |
|\ | |||||
| * | collect(Collectors.toList()) -> toList() | Henning Baldersheim | 2022-12-02 | 1 | -1/+1 |
| | | |||||
* | | Discard metrics right after restart | Jon Bratseth | 2022-12-03 | 1 | -10/+5 |
|/ | |||||
* | Allow 4% of nodes to fail before throttling | Martin Polden | 2022-12-01 | 1 | -1/+1 |
| | |||||
* | Reapply "Remove HostLivenessTracker" | Valerij Fredriksen | 2022-10-14 | 1 | -60/+3 |
| | | | | This reverts commit a5ed12b351806b187613457b58982ca67f537594. | ||||
* | Revert "Remove HostLivenessTracker" | Valerij Fredriksen | 2022-10-13 | 1 | -3/+60 |
| | |||||
* | Remove node failing for ready nodes | Valerij Fredriksen | 2022-10-13 | 1 | -60/+3 |
| | |||||
* | Only use wantToFail just before activate | Håkon Hallingstad | 2022-07-11 | 1 | -5/+10 |
| | |||||
* | Revert "Revert "Avoid the host lock while failing the children"" | Håkon Hallingstad | 2022-07-11 | 1 | -46/+54 |
| | |||||
* | Revert "Avoid the host lock while failing the children" | Håkon Hallingstad | 2022-07-11 | 1 | -54/+46 |
| | | | | This reverts commit 2cdaef56e18ace2ee2269d28f959f5a534bd68ee. | ||||
* | Revert update of comment | Håkon Hallingstad | 2022-07-11 | 1 | -2/+2 |
| | |||||
* | Define main-chain-graph flag | Håkon Hallingstad | 2022-07-08 | 1 | -2/+2 |
| | |||||
* | Avoid the host lock while failing the children | Håkon Hallingstad | 2022-07-05 | 1 | -46/+54 |
| | |||||
* | Read nodes less | Jon Bratseth | 2022-04-22 | 1 | -7/+4 |
| | |||||
* | Keep a chronological log of events per node | Martin Polden | 2022-04-19 | 1 | -1/+1 |
| | |||||
* | Revert "Preserve all node events" | Jon Bratseth | 2022-04-12 | 1 | -9/+4 |
| | |||||
* | Fix after review feedback | Martin Polden | 2022-04-12 | 1 | -4/+4 |
| | |||||
* | Preserve all node events | Martin Polden | 2022-04-12 | 1 | -1/+6 |
| | | | | Node events are now limited by a total size limit, instead of one per type. | ||||
* | Increase node failure throttling from 2 to 3 % | Jon Bratseth | 2022-04-08 | 1 | -1/+1 |
| | |||||
* | Do not allocate nodes to suspended hosts | Valerij Fredriksen | 2022-02-03 | 1 | -14/+3 |
| | |||||
* | Add Orchestrator to NodeRepository | Valerij Fredriksen | 2022-02-03 | 1 | -7/+3 |
| | |||||
* | Merge pull request #20938 from vespa-engine/bratseth/modular-profiles | Jon Bratseth | 2022-01-26 | 1 | -1/+1 |
|\ | | | | | Bratseth/modular profiles | ||||
| * | No functional changes | Jon Bratseth | 2022-01-25 | 1 | -1/+1 |
| | | |||||
* | | Increase down grace time while nodes are suspended | Jon Bratseth | 2022-01-25 | 1 | -28/+42 |
| | | |||||
* | | No functional changes | Jon Bratseth | 2022-01-25 | 1 | -48/+50 |
|/ | |||||
* | Remove dead code | Martin Polden | 2021-10-25 | 1 | -16/+0 |
| | |||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 1 | -1/+1 |
| | |||||
* | Update ↵ | Håkon Hallingstad | 2021-08-16 | 1 | -3/+0 |
| | | | | | node-repository/src/main/java/com/yahoo/vespa/hosted/provision/maintenance/NodeFailer.java Co-authored-by: Valerij Fredriksen <freva@users.noreply.github.com> | ||||
* | Do not fail ready nodes w/o recent config requests | Håkon Hallingstad | 2021-08-16 | 1 | -11/+10 |
| | | | | | | This code was used to support non-Docker tenant hosts, but now only affects ready cfg and proxy containers which may not even exist and cannot possibly issue config requests (when in ready). | ||||
* | Revert "Revert "Emit a success factor from maintainers"" | Jon Bratseth | 2021-06-06 | 1 | -3/+13 |
| | | | | This reverts commit cd1b747b4f65fa3a6ed6aace23235db7591638c5. | ||||
* | Revert "Emit a success factor from maintainers" | Arnstein Ressem | 2021-06-04 | 1 | -13/+3 |
| | |||||
* | Return success factor | Jon Bratseth | 2021-06-04 | 1 | -3/+13 |
| | |||||
* | Never throttle failing of children on failed hosts | Martin Polden | 2021-04-27 | 1 | -14/+15 |
| | |||||
* | Node failing improvements | Jon Bratseth | 2021-04-12 | 1 | -4/+21 |
| | | | | | - Fail hosts that wants to fail and do not have active children - Clear want to fail on failing in case the nodes are later reactivated | ||||
* | Move nodes to 'failed' during activate | Jon Bratseth | 2021-04-08 | 1 | -7/+10 |
| | |||||
* | Less Docker | Martin Polden | 2021-02-18 | 1 | -1/+1 |
| |