Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | jackson 2.16 changes some of its default settings so we consolidate our use ↵ | Henning Baldersheim | 2023-11-23 | 2 | -9/+9 |
| | | | | | | of the ObjectMapper. Unless special options are used, use a common instance, or create via factory metod. | ||||
* | Update copyright | Jon Bratseth | 2023-10-09 | 10 | -11/+11 |
| | |||||
* | Renames and minor refactorings, no funcational changes | Harald Musum | 2023-07-09 | 1 | -10/+10 |
| | |||||
* | Modernize | Harald Musum | 2023-06-07 | 1 | -3/+4 |
| | |||||
* | Move method | Harald Musum | 2023-05-12 | 1 | -4/+11 |
| | |||||
* | Minor cleanup | Harald Musum | 2023-05-08 | 1 | -1/+1 |
| | | | | GC dead code, optimize imports, fix unnecessary throws statements | ||||
* | Implement toString for implementations of UnitState | Harald Musum | 2023-04-20 | 1 | -0/+3 |
| | |||||
* | Simplify NodeStateChangeChecker constructor | Harald Musum | 2023-03-29 | 1 | -5/+5 |
| | |||||
* | Merge pull request #26606 from ↵ | Harald Musum | 2023-03-28 | 1 | -1/+2 |
|\ | | | | | | | | | vespa-engine/hmusum/add-config-for-max-number-of-groups-allowed-to-be-down Add config for max number of content groups allowed to be down | ||||
| * | Add config for max number of content groups allowed to be down | Harald Musum | 2023-03-28 | 1 | -1/+2 |
| | | |||||
* | | Simplify and add todo about removing disk_distribution in Vespa 9 | Harald Musum | 2023-03-28 | 1 | -8/+1 |
|/ | |||||
* | Move test code to test class | Harald Musum | 2023-03-26 | 1 | -2/+44 |
| | |||||
* | Fix more tests | jonmv | 2022-10-22 | 6 | -343/+436 |
| | |||||
* | Replace jettison with jackson | jonmv | 2022-10-21 | 1 | -79/+83 |
| | |||||
* | Remove unused and undocumented partition state request support | Harald Musum | 2022-09-07 | 3 | -85/+7 |
| | |||||
* | Convert clustercontroller-core to junit5 | Bjørn Christian Seime | 2022-07-29 | 9 | -425/+414 |
| | |||||
* | Trigger saveWantedState when nodes are removed or orphaned wanted states are ↵ | Håkon Hallingstad | 2022-04-20 | 2 | -10/+16 |
| | | | | loaded | ||||
* | GC use of deprecated junit assertThat and unify | Henning Baldersheim | 2021-12-21 | 1 | -0/+1 |
| | |||||
* | Update Verizon Media copyright notices. | gjoranv | 2021-10-07 | 5 | -5/+5 |
| | |||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 6 | -6/+6 |
| | |||||
* | Disallow >1 group to suspend | Håkon Hallingstad | 2021-04-16 | 1 | -10/+12 |
| | | | | | | | If there is more than one group, disallow suspending a node if there is a node in another group that has a user wanted state != UP. If there is 1 group, disallow suspending more than 1 node. | ||||
* | No longer allow suspension if in maintenance | Håkon Hallingstad | 2021-04-15 | 2 | -13/+12 |
| | | | | | | If a storage node falls out of Slobrok, it will change from UP to Maintenance after 60s, then after further 30s go to Down. Avoid allowing suspension in the 30s grace period just because it is Maintenance mode. | ||||
* | Revert "Revert "Avoid safe mutations in master moratorium and increase first ↵ | Håkon Hallingstad | 2021-03-24 | 3 | -2/+10 |
| | | | | cluster state broadcast deadline [run-systemtest]"" | ||||
* | Revert "Avoid safe mutations in master moratorium and increase first cluster ↵ | Håkon Hallingstad | 2021-03-24 | 3 | -10/+2 |
| | | | | state broadcast deadline [run-systemtest]" | ||||
* | Avoid safe-set-node-state in master moratorium | Håkon Hallingstad | 2021-03-24 | 3 | -2/+10 |
| | |||||
* | Include metrics always. | Henning Baldersheim | 2021-03-12 | 1 | -27/+0 |
| | |||||
* | Revert "Revert "GC unused DiskState and add the partition metrics to node ↵ | Henning Baldersheim | 2021-03-12 | 1 | -0/+10 |
| | | | | level."" | ||||
* | Revert "GC unused DiskState and add the partition metrics to node level." | Harald Musum | 2021-03-12 | 1 | -10/+0 |
| | |||||
* | GC unused DiskState and add the partition metrics to node level. | Henning Baldersheim | 2021-03-12 | 1 | -0/+10 |
| | |||||
* | Revert "GC unused DiskState" | Arnstein Ressem | 2021-03-12 | 3 | -0/+73 |
| | |||||
* | GC unused DiskState | Henning Baldersheim | 2021-03-11 | 3 | -73/+0 |
| | |||||
* | GC use of void DiskState. | Henning Baldersheim | 2021-03-11 | 1 | -3/+0 |
| | |||||
* | GC use of NodeState.getDiskCount and NodeState.getDiskStates. | Henning Baldersheim | 2021-03-11 | 2 | -47/+0 |
| | |||||
* | Remove unused aguments and methods | Harald Musum | 2021-02-21 | 2 | -23/+14 |
| | |||||
* | Mincor cleanup, no functional changs | Harald Musum | 2021-02-21 | 5 | -21/+19 |
| | |||||
* | Fail safe maintenance if other nodes are not up | Håkon Hallingstad | 2021-02-19 | 1 | -6/+8 |
| | |||||
* | Add test | Håkon Hallingstad | 2021-02-12 | 2 | -11/+49 |
| | |||||
* | Also deny maintenance when another node is in maintenance | Håkon Hallingstad | 2021-02-12 | 1 | -1/+1 |
| | | | | | | | The cluster controller today already denies setting a node X safely to maintenance M, if there is another node Y in another group that has wanted state M. Which means that if Y is in M but wanted state is not M, X is allowed to be set in M. This is an edge case which is rare. | ||||
* | Allows setting a node safely to maintenance in these two new circumstances: | Håkon Hallingstad | 2021-01-21 | 2 | -32/+134 |
| | | | | | | | | | | | | | 1. The node has state MAINTENANCE with (user) wanted state UP. 2. There are other nodes in the same hierarchical group that are set in MAINTENANCE with the same description. Also made the following change. 3. Deny a request for safe MAINTENANCE or DOWN, if the wanted state is already set but with a different description. If the descriptions are the same, it is assumed to be the same operator (e.g. Orchestrator) having changed its mind. | ||||
* | Remove stray parameter doc | Håkon Hallingstad | 2020-01-27 | 1 | -4/+2 |
| | |||||
* | Remove use-bucket-space-metric feature flag | Håkon Hallingstad | 2020-01-26 | 2 | -5/+5 |
| | | | | | | | | | | The flag controlled config read by the Cluster Controller. Therefore, I have left the ModelContextImpl.Properties method and implementation (now always returning true), but the model has stopped using that method internally, and the config is no longer used in the CC. The field in the fleetcontroller.def is left unchanged and documented as deprecated. | ||||
* | Test metric value for different dimensions, and more | Håkon Hallingstad | 2020-01-17 | 1 | -2/+4 |
| | |||||
* | Use bucket_space metric in retirement | Håkon Hallingstad | 2020-01-17 | 2 | -3/+3 |
| | | | | | | | | | | | | This makes the Cluster Controller use the vds.datastored.bucket_space.buckets_total, dimension bucketSpace=default, to determine whether a content node manages zero buckets, and if so, will allow the node to go permanently down. This is used when a node is retiring, and it is to be removed from the application. The change is guarded by the use-bucket-space-metric, default true. If the new metric doesn't work as expected, we can revert to using the current/old metric by flipping the flag. The flag can be controlled per application. | ||||
* | Add non-converged nodes to task deadline exceeded messages | Tor Brede Vekterli | 2019-11-04 | 1 | -4/+5 |
| | | | | | Makes it easier for an external observer to understand what set of nodes is causing the cluster state to not converge. | ||||
* | Use mockito-core 3.1.0 | Håkon Hallingstad | 2019-10-18 | 2 | -4/+4 |
| | |||||
* | Cleanup tests, no functional changes | Harald Musum | 2019-09-03 | 5 | -55/+58 |
| | |||||
* | Revert "Revert "Revert "Revert "Enforce CC timeouts in Orchestrator 4"""" | Håkon Hallingstad | 2018-11-01 | 2 | -10/+40 |
| | |||||
* | Revert "Revert "Revert "Enforce CC timeouts in Orchestrator 4""" | Håkon Hallingstad | 2018-11-01 | 2 | -40/+10 |
| | |||||
* | Revert "Revert "Enforce CC timeouts in Orchestrator [4]"" | Håkon Hallingstad | 2018-11-01 | 2 | -10/+40 |
| | |||||
* | Revert "Enforce CC timeouts in Orchestrator [4]" | Harald Musum | 2018-10-31 | 2 | -40/+10 |
| |