Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Update copyright | Jon Bratseth | 2023-10-09 | 81 | -80/+82 |
| | |||||
* | Consolidate hamcrest usage to 2.x and remove cthul-matchers | Bjørn Christian Seime | 2023-08-29 | 9 | -18/+1 |
| | |||||
* | Add content cluster name to generated feed block message | Tor Brede Vekterli | 2023-07-26 | 2 | -28/+30 |
| | Messages are now prefixed with the content cluster name to help disambiguate which cluster is exceeding its limits in multi-cluster deployments. Example message: ``` in content cluster 'my-cool-cluster': disk on node 1 [my-node-1.example.com] is 81.0% full (the configured limit is 80.0%). See https://docs.vespa.ai/en/operations/feed-block.html ``` | ||||
* | Make generated automatic feed block error messages more user-friendly | Tor Brede Vekterli | 2023-07-26 | 3 | -34/+49 |
| | Messages are generated centrally by the cluster controller and pushed to content nodes as part of a cluster state bundle; the distributor nodes merely repeat back what they have been told. This changes the cluster controller feed block error message code to be less ambiguous and to include a URL to our public documentation about feed blocks. Example of _old_ message: ``` disk on node 1 [storage.1.local] (0.510 > 0.500) ``` Same feed block with _new_ message: ``` disk on node 1 [storage.1.local] is 51.0% full (the configured limit is 50.0%). See https://docs.vespa.ai/en/operations/feed-block.html ``` | ||||
* | Check min replication seen from all distributors that are UP | Harald Musum | 2023-07-17 | 1 | -1/+1 |
| | This means we will also check distributors that are on the same node as a retired storage node | ||||
* | Fix minor issues after code reviews | Harald Musum | 2023-07-17 | 1 | -2/+0 |
| | |||||
* | Check redundancy also for groups that are up | Harald Musum | 2023-07-16 | 1 | -18/+27 |
| | When we allow several groups to go down for maintenance, we should check whether the nodes in the groups that are up have the required redundancy. They might be up but not yet have synced all buckets after coming up. We want to wait until that is done before allowing more nodes to be taken down. | ||||
* | Renames and minor refactorings, no functional changes | Harald Musum | 2023-07-09 | 2 | -109/+109 |
| | |||||
* | Simplify | Harald Musum | 2023-07-05 | 1 | -1/+1 |
| | |||||
* | Minor refactoring and start of some new test | Harald Musum | 2023-07-05 | 1 | -11/+32 |
| | |||||
* | Move fetchStatusPage | Harald Musum | 2023-06-19 | 1 | -0/+30 |
| | |||||
* | Modernize | Harald Musum | 2023-06-07 | 1 | -3/+4 |
| | |||||
* | Require non-null MetricUpdater | Harald Musum | 2023-06-05 | 2 | -16/+7 |
| | |||||
* | Add back some testing of getMaster() | Harald Musum | 2023-06-05 | 1 | -0/+60 |
| | |||||
* | Remove support for RPC method getMaster, only used in tests | Harald Musum | 2023-06-05 | 2 | -135/+1 |
| | |||||
* | Simplify and remove some test methods | Harald Musum | 2023-06-01 | 5 | -25/+16 |
| | |||||
* | ZooKeeper is always used, simplify | Harald Musum | 2023-06-01 | 3 | -35/+9 |
| | |||||
* | Require non-null zooKeeperServerAddress in FleetControllerOptions | Harald Musum | 2023-06-01 | 4 | -125/+134 |
| | |||||
* | Copy options when reconfiguring | Harald Musum | 2023-06-01 | 1 | -20/+10 |
| | |||||
* | Require distribution to be non-null and fix tests | Harald Musum | 2023-05-26 | 1 | -0/+1 |
| | |||||
* | Remove RPC method only used in tests | Harald Musum | 2023-05-26 | 1 | -44/+1 |
| | |||||
* | Simplify and minor cleanup | Harald Musum | 2023-05-15 | 1 | -125/+133 |
| | |||||
* | Avoid duplicating code | Harald Musum | 2023-05-13 | 3 | -23/+21 |
| | |||||
* | Set waiter in createFleetController | Harald Musum | 2023-05-13 | 1 | -4/+4 |
| | |||||
* | Create slobrok in constructor and simplify setup | Harald Musum | 2023-05-12 | 11 | -50/+45 |
| | |||||
* | Inject timer from test classes instead of inheriting | Harald Musum | 2023-05-12 | 13 | -124/+152 |
| | |||||
* | Move method | Harald Musum | 2023-05-12 | 2 | -10/+11 |
| | |||||
* | Fix exception message | Harald Musum | 2023-05-12 | 1 | -1/+1 |
| | |||||
* | Remove testname and logging related to starting and stopping | Harald Musum | 2023-05-12 | 10 | -59/+7 |
| | Not used, reintroduce using JUnit TestInfo class if needed | ||||
* | Remove advanceTime call | Harald Musum | 2023-05-12 | 1 | -1/+0 |
| | The timers used in fleetcontroller in these tests (except one in a disabled test) are instances of RealTimer, whereas the one used in this call is a FakeTimer from the superclass, so this call makes no sense IMHO. | ||||
* | Minor cleanup | Harald Musum | 2023-05-08 | 14 | -40/+20 |
| | GC dead code, optimize imports, fix unnecessary throws statements | ||||
* | Implement toString for implementations of UnitState | Harald Musum | 2023-04-20 | 1 | -0/+3 |
| | |||||
* | Fix retiredOrNotUpGroups() | Harald Musum | 2023-04-18 | 1 | -1/+1 |
| | |||||
* | Handle case where a node has another description for wanted state | Harald Musum | 2023-04-18 | 1 | -11/+24 |
| | Also add group indexes for disallow messages where relevant | ||||
* | Check state down later and simplify | Harald Musum | 2023-04-17 | 1 | -5/+28 |
| | |||||
* | Parameterize tests | Harald Musum | 2023-04-14 | 1 | -109/+140 |
| | |||||
* | Merge branch 'master' into hmusum/allow-groups-to-be-down | Harald Musum | 2023-04-14 | 1 | -1/+1 |
|\ | |||||
| * | Restore isControlledShutdown and simplify tests a bit | Harald Musum | 2023-04-13 | 1 | -9/+25 |
| | | |||||
| * | Fix bug in distribution config generation | Harald Musum | 2023-04-13 | 1 | -1/+2 |
| | | |||||
* | | Reimplement checking of other nodes and nodes in groups being not up | Harald Musum | 2023-04-12 | 1 | -3/+3 |
| | | |||||
* | | Restore isControlledShutdown and simplify tests a bit | Harald Musum | 2023-04-10 | 1 | -13/+14 |
| | | |||||
* | | Allow more than 1 group in a content cluster to be down at the same time | Harald Musum | 2023-03-30 | 1 | -11/+132 |
| | | Based on config; all functional changes are guarded by the config field max_number_of_groups_allowed_to_be_down in the fleetcontroller config | ||||
* | | Fix bug in distribution config generation | Harald Musum | 2023-03-30 | 1 | -1/+2 |
|/ | |||||
* | Minor changes after code review | Harald Musum | 2023-03-29 | 1 | -6/+5 |
| | |||||
* | Simplify NodeStateChangeChecker constructor | Harald Musum | 2023-03-29 | 2 | -13/+9 |
| | |||||
* | Simplify now that we can create HierarchicalGroupVisiting in constructor | Harald Musum | 2023-03-29 | 1 | -14/+5 |
| | |||||
* | Use correct redundancy and don't set active_per_leaf_group | Harald Musum | 2023-03-29 | 1 | -4/+3 |
| | |||||
* | Make sure to make correct config for flat clusters (1 group) | Harald Musum | 2023-03-29 | 1 | -3/+23 |
| | |||||
* | Create real distribution config for all tests | Harald Musum | 2023-03-29 | 1 | -52/+50 |
| | |||||
* | Merge pull request #26606 from vespa-engine/hmusum/add-config-for-max-number-of-groups-allowed-to-be-down | Harald Musum | 2023-03-28 | 3 | -14/+12 |
|\ | Add config for max number of content groups allowed to be down | ||||
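
The two 2023-07-26 commits in this log quote the new feed block message format. As a rough illustration of how such a message could be assembled, here is a minimal sketch. It is not the cluster controller's actual code: the class name `FeedBlockMessageSketch`, the method `describe`, and its parameters are hypothetical, and it assumes usage and limit are given as fractions in the range 0 to 1, as the old `(0.510 > 0.500)` message suggests.

```java
import java.util.Locale;

// Hypothetical sketch; names and signature are assumptions, not Vespa's API.
final class FeedBlockMessageSketch {

    // Documentation URL quoted in the commit messages.
    static final String DOCS_URL = "https://docs.vespa.ai/en/operations/feed-block.html";

    // Builds a message in the format quoted in the commit log.
    // usage and limit are assumed to be fractions in [0, 1].
    static String describe(String clusterName, String resource, int nodeIndex,
                           String hostname, double usage, double limit) {
        // Locale.ROOT keeps '.' as the decimal separator regardless of platform locale.
        return String.format(Locale.ROOT,
                "in content cluster '%s': %s on node %d [%s] is %.1f%% full "
                        + "(the configured limit is %.1f%%). See %s",
                clusterName, resource, nodeIndex, hostname,
                usage * 100.0, limit * 100.0, DOCS_URL);
    }

    public static void main(String[] args) {
        System.out.println(describe("my-cool-cluster", "disk", 1,
                "my-node-1.example.com", 0.81, 0.80));
    }
}
```

With the inputs shown in `main`, the sketch reproduces the example message from the 2023-07-26 commit that added the cluster name prefix.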