path: root/clustercontroller-core/src/main/java/com/yahoo
Commit message | Author | Age | Files | Lines
* Update copyright | Jon Bratseth | 2023-10-09 | 119 | -119/+119
* GC unused count and average | Henning Baldersheim | 2023-08-30 | 1 | -8/+0
* Revert "More logging at higher level, need more info" | Harald Musum | 2023-08-25 | 1 | -5/+3
* Add content cluster name to generated feed block message | Tor Brede Vekterli | 2023-07-26 | 2 | -6/+7

  Messages now prefixed with content cluster name to help disambiguate which cluster is exceeding its limits in multi-cluster deployments.

  Example message:

  ```
  in content cluster 'my-cool-cluster': disk on node 1 [my-node-1.example.com] is 81.0% full (the configured limit is 80.0%). See https://docs.vespa.ai/en/operations/feed-block.html
  ```

* Make generated automatic feed block error messages more user-friendly | Tor Brede Vekterli | 2023-07-26 | 2 | -4/+11

  Messages are generated centrally by the cluster controller and pushed to content nodes as part of a cluster state bundle; the distributor nodes merely repeat back what they have been told.

  This changes the cluster controller feed block error message code to be less ambiguous and to include a URL to our public documentation about feed blocks.

  Example of _old_ message:

  ```
  disk on node 1 [storage.1.local] (0.510 > 0.500)
  ```

  Same feed block with _new_ message:

  ```
  disk on node 1 [storage.1.local] is 51.0% full (the configured limit is 50.0%). See https://docs.vespa.ai/en/operations/feed-block.html
  ```

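The message change described in the two feed block entries above can be illustrated with a minimal Java sketch. This is not the cluster controller's actual code: the class name `FeedBlockMessageSketch` and its method names and parameters are hypothetical, chosen only to reproduce the example strings quoted in the commit messages.

```java
import java.util.Locale;

// Hypothetical illustration only; none of these names come from the Vespa code base.
final class FeedBlockMessageSketch {

    // URL quoted in the commit messages above.
    static final String DOCS_URL = "https://docs.vespa.ai/en/operations/feed-block.html";

    // Old style: raw usage and limit as fractions, e.g. "(0.510 > 0.500)".
    static String oldStyle(String resource, int nodeIndex, String hostname,
                           double usage, double limit) {
        return String.format(Locale.ROOT, "%s on node %d [%s] (%.3f > %.3f)",
                             resource, nodeIndex, hostname, usage, limit);
    }

    // New style: usage and limit as percentages, plus a link to the documentation.
    static String newStyle(String resource, int nodeIndex, String hostname,
                           double usage, double limit) {
        return String.format(Locale.ROOT,
                             "%s on node %d [%s] is %.1f%% full (the configured limit is %.1f%%). See %s",
                             resource, nodeIndex, hostname,
                             usage * 100.0, limit * 100.0, DOCS_URL);
    }

    // Cluster-name prefix, as in the "Add content cluster name" entry.
    static String withClusterPrefix(String clusterName, String message) {
        return "in content cluster '" + clusterName + "': " + message;
    }

    public static void main(String[] args) {
        System.out.println(oldStyle("disk", 1, "storage.1.local", 0.51, 0.50));
        System.out.println(newStyle("disk", 1, "storage.1.local", 0.51, 0.50));
        System.out.println(withClusterPrefix("my-cool-cluster",
                newStyle("disk", 1, "my-node-1.example.com", 0.81, 0.80)));
    }
}
```

`Locale.ROOT` is used so the decimal separator stays a period regardless of the JVM's default locale.
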
* More logging at higher level, need more info | Harald Musum | 2023-07-20 | 1 | -3/+5
* Use cluster state to check if distributors are UP | Harald Musum | 2023-07-18 | 1 | -5/+4
* Check min replication seen from all distributors that are UP | Harald Musum | 2023-07-17 | 1 | -28/+16

  This means we will also check distributors that are on same node as a retired storage node

* Fix minor issues after code reviews | Harald Musum | 2023-07-17 | 1 | -1/+1
* Avoid code duplication | Harald Musum | 2023-07-16 | 1 | -9/+7
* Check redundancy also for groups that are up | Harald Musum | 2023-07-16 | 1 | -0/+79

  When we allow several groups to go down for maintenance we should check nodes in the groups that are up if they have the required redundancy. They might be up but have not yet synced all buckets after coming up. We want to wait with allowing more nodes to be taken down until that is done.

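The redundancy check described above can be sketched roughly as follows. This is an assumption-laden illustration, not the logic in `NodeStateChangeChecker`: the class `UpGroupRedundancySketch`, the method name, and the shape of the input (a map from distributor index to the minimum replication it reports per storage node) are all hypothetical.

```java
import java.util.List;
import java.util.Map;
import java.util.Optional;

// Hypothetical illustration only; data shapes and names are assumptions.
final class UpGroupRedundancySketch {

    /**
     * Returns a reason to disallow taking more nodes down if any distributor that is
     * UP reports a minimum replication below the required redundancy for a storage
     * node in a group that is up; returns empty if all such nodes look fully synced.
     *
     * @param minReplicationByDistributor distributor index -> (storage node index ->
     *                                    minimum replication that distributor has seen)
     * @param storageNodesInUpGroups      storage node indexes in groups that are up
     * @param requiredRedundancy          the replication level each node must reach
     */
    static Optional<String> firstNodeBelowRedundancy(
            Map<Integer, Map<Integer, Integer>> minReplicationByDistributor,
            List<Integer> storageNodesInUpGroups,
            int requiredRedundancy) {
        for (int node : storageNodesInUpGroups) {
            for (Map.Entry<Integer, Map<Integer, Integer>> e : minReplicationByDistributor.entrySet()) {
                Integer seen = e.getValue().get(node);
                // A missing entry means this distributor reported nothing for the node.
                if (seen != null && seen < requiredRedundancy) {
                    return Optional.of("Distributor " + e.getKey()
                            + " reports min replication " + seen
                            + " for storage node " + node
                            + ", need " + requiredRedundancy);
                }
            }
        }
        return Optional.empty();
    }
}
```

A caller would treat a non-empty result as "wait": the node is up but may not have synced all buckets yet, so no further nodes should be taken down.
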
* Split out method for finding min replication per distributor | Harald Musum | 2023-07-14 | 1 | -15/+20
* Separate code at a higher level based on groups setup or not | Harald Musum | 2023-07-11 | 1 | -71/+66
* More minor changes | Harald Musum | 2023-07-09 | 1 | -22/+29
* Split out method to avoid repeating code | Harald Musum | 2023-07-09 | 1 | -69/+68
* Renames and minor refactorings, no functional changes | Harald Musum | 2023-07-09 | 3 | -149/+117
* Simplify | Harald Musum | 2023-07-09 | 2 | -35/+17
* Simplify | Harald Musum | 2023-07-05 | 4 | -15/+8
* Code cleanup, no functional changes | Harald Musum | 2023-07-05 | 1 | -5/+4
* Minor refactoring and start of some new test | Harald Musum | 2023-07-05 | 1 | -14/+14
* Replace metrics strings with the corresponding enum constants. | gjoranv | 2023-06-22 | 2 | -4/+6

  - Add 'metrics' as provided dep for clustercontroller-core

* Revert "Replace metrics strings with the corresponding enum constants." | Harald Musum | 2023-06-22 | 2 | -6/+4
* Replace metrics strings with the corresponding enum constants. | gjoranv | 2023-06-21 | 2 | -4/+6

  - Add 'metrics' as provided dep for clustercontroller-core

* Move fetchStatusPage | Harald Musum | 2023-06-19 | 2 | -42/+49
* Modernize | Harald Musum | 2023-06-07 | 2 | -20/+19
* Require non-null MetricUpdater | Harald Musum | 2023-06-05 | 1 | -6/+4
* Remove support for RPC method getMaster, only used in tests | Harald Musum | 2023-06-05 | 2 | -41/+1
* Inline and simplify | Harald Musum | 2023-06-05 | 1 | -15/+2
* Set nextInLineCount to 0 initially instead of checking fleetcontroller count | Harald Musum | 2023-06-02 | 2 | -2/+3
* ZooKeeper is always used, simplify | Harald Musum | 2023-06-01 | 4 | -61/+19
* Require non-null zooKeeperServerAddress in FleetControllerOptions | Harald Musum | 2023-06-01 | 1 | -1/+1
* Move into method that does reconfiguration | Harald Musum | 2023-06-01 | 2 | -1/+1
* Remove use of stateGatherCount config, simplify and deprecate config field | Harald Musum | 2023-05-30 | 4 | -22/+6
* Set stateGatherCount to 1 | Harald Musum | 2023-05-30 | 1 | -1/+1

  This is the default config value and changing it made running unit tests locally faster and stable

* Revert "Cosmetic changes" | Harald Musum | 2023-05-26 | 2 | -19/+15

  This reverts commit 0293a4fed6d1d6293e907e6a2c5bf38ecb48775c.

* Cosmetic changes | Harald Musum | 2023-05-26 | 2 | -15/+19
* Created modified bundle only once | Harald Musum | 2023-05-26 | 1 | -6/+5
* Require non-null arguments to AnnotatedClusterState | Harald Musum | 2023-05-26 | 2 | -11/+6

  Simplify method used in tests only

* Require distribution to be non-null and fix tests | Harald Musum | 2023-05-26 | 2 | -7/+4
* Remove RPC method only used in tests | Harald Musum | 2023-05-26 | 1 | -23/+2
* Simplify and minor cleanup | Harald Musum | 2023-05-15 | 1 | -0/+2
* Move pattern definition to handlers | Harald Musum | 2023-05-12 | 7 | -50/+21
* Fix typo | Harald Musum | 2023-05-08 | 1 | -1/+1
* Minor cleanup | Harald Musum | 2023-05-08 | 25 | -108/+38

  GC dead code, optimize imports, fix unnecessary throws statements

* Implement toString for implementations of UnitState | Harald Musum | 2023-04-20 | 1 | -0/+2
* Rename variables | Harald Musum | 2023-04-19 | 1 | -4/+4
* Update clustercontroller-core/src/main/java/com/yahoo/vespa/clustercontroller/core/NodeStateChangeChecker.java | Harald Musum | 2023-04-19 | 1 | -1/+1

  Co-authored-by: Tor Brede Vekterli <vekterli@yahooinc.com>

* Rename and fix logic | Harald Musum | 2023-04-18 | 1 | -12/+12
* Fix retiredOrNotUpGroups() | Harald Musum | 2023-04-18 | 1 | -4/+6
* Handle case where a node has another description for wanted state | Harald Musum | 2023-04-18 | 1 | -8/+66

  Also add group indexes for disallow messages where relevant