Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge pull request #22225 from vespa-engine/jonmv/clustercontroller-core-test-retries-and-random-cleanup | Håkon Hallingstad | 2022-04-22 | 3 | -27/+11 |
|\ | | | | | | | | | Jonmv/clustercontroller core test retries and random cleanup | ||||
| * | Unignore more tests—they look like all the others—and lower timeout | jonmv | 2022-04-22 | 2 | -5/+1 |
| | | |||||
| * | Use fake timer where needed, unignore those tests, and remove fake timer usage elsewhere | jonmv | 2022-04-22 | 1 | -22/+10 |
| | | |||||
* | | Fix reference to storeWantedStates | Håkon Hallingstad | 2022-04-21 | 1 | -2/+2 |
| | | |||||
* | | Improve test method name | Håkon Hallingstad | 2022-04-21 | 1 | -1/+1 |
| | | |||||
* | | Add unit test | Håkon Hallingstad | 2022-04-20 | 1 | -1/+55 |
| | | |||||
* | | Trigger saveWantedState when nodes are removed or orphaned wanted states are loaded | Håkon Hallingstad | 2022-04-20 | 9 | -36/+46 |
| | | |||||
* | | Remove redundant parameters from ZooKeeperDatabase constructor | Håkon Hallingstad | 2022-04-19 | 1 | -1/+1 |
| | | |||||
* | | Use plural for methods returning collection | Håkon Hallingstad | 2022-04-19 | 7 | -24/+24 |
|/ | |||||
* | Fix comparison between manually deployed package and not, and remove outdated safeguard | Jon Marius Venstad | 2022-01-13 | 1 | -1/+1 |
| | |||||
* | GC use of deprecated junit assertThat and unify | Henning Baldersheim | 2021-12-21 | 15 | -104/+84 |
| | |||||
* | Use FleetControllerContext in ZooKeeperDatabase | Håkon Hallingstad | 2021-12-13 | 9 | -19/+67 |
| | |||||
* | Remove config generation -1/0 from CC at :19050/status/<clustername>/config | Håkon Hallingstad | 2021-10-20 | 6 | -19/+15 |
| | |||||
* | Revert changes to config generation | Håkon Hallingstad | 2021-10-20 | 6 | -16/+21 |
| | |||||
* | Fixes after review round | Håkon Hallingstad | 2021-10-19 | 5 | -8/+8 |
| | |||||
* | Improve logging of FleetController and DatabaseHandler | Håkon Hallingstad | 2021-10-15 | 12 | -62/+79 |
| | |||||
* | Some optimizations of RpcServerTest | Håkon Hallingstad | 2021-10-14 | 3 | -21/+29 |
| | |||||
* | Reduce running time of MasterElectionTest from 28 to 12s | Håkon Hallingstad | 2021-10-14 | 1 | -0/+6 |
| | |||||
* | Update Verizon Media copyright notices. | gjoranv | 2021-10-07 | 30 | -30/+30 |
| | |||||
* | Update 2018 copyright notices. | gjoranv | 2021-10-07 | 19 | -19/+19 |
| | |||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 32 | -32/+32 |
| | |||||
* | More lazy debug log message generation | Jon Marius Venstad | 2021-04-28 | 2 | -25/+25 |
| | |||||
* | Reapply "add more logging" (new and updated slobrok logging) | Arne Juul | 2021-04-21 | 2 | -0/+2 |
| | | | | This reverts commit 9aa3d6fe6567e3eee9108d6fffbc50d5874e72e3. | ||||
* | Revert "add more logging" | Harald Musum | 2021-04-20 | 2 | -2/+0 |
| | |||||
* | track API change in mock | Arne Juul | 2021-04-19 | 2 | -0/+2 |
| | |||||
* | Improve test names | Håkon Hallingstad | 2021-04-16 | 1 | -4/+4 |
| | |||||
* | Disallow >1 group to suspend | Håkon Hallingstad | 2021-04-16 | 2 | -15/+176 |
| | | | | | | | If there is more than one group, disallow suspending a node if there is a node in another group that has a user wanted state != UP. If there is 1 group, disallow suspending more than 1 node. | ||||
* | No longer allow suspension if in maintenance | Håkon Hallingstad | 2021-04-15 | 2 | -13/+12 |
| | | | | | | If a storage node falls out of Slobrok, it will change from UP to Maintenance after 60s, then go to Down after a further 30s. Avoid allowing suspension during that 30s grace period just because the node is in Maintenance mode. | ||||
* | Merge branch 'master' into hmusum/cleanup-7 | Harald Musum | 2021-04-08 | 4 | -6/+8 |
|\ | |||||
| * | Add remote task queue size metric in cluster controller | Håkon Hallingstad | 2021-04-01 | 4 | -6/+8 |
| | | |||||
* | | Cleanup tests a bit | Harald Musum | 2021-04-08 | 3 | -43/+49 |
| | | |||||
* | | Fix typo in class name | Harald Musum | 2021-04-08 | 1 | -1/+1 |
|/ | |||||
* | Revert "Revert "Avoid safe mutations in master moratorium and increase first cluster state broadcast deadline [run-systemtest]"" | Håkon Hallingstad | 2021-03-24 | 4 | -5/+26 |
| | |||||
* | Revert "Avoid safe mutations in master moratorium and increase first cluster state broadcast deadline [run-systemtest]" | Håkon Hallingstad | 2021-03-24 | 4 | -26/+5 |
| | |||||
* | Merge pull request #17085 from vespa-engine/hakonhall/increase-the-minimum-time-before-first-cluster-state-broadcast-run-systemtest | Håkon Hallingstad | 2021-03-24 | 4 | -5/+26 |
|\ | | | | | | | | | Avoid safe mutations in master moratorium and increase first cluster state broadcast deadline [run-systemtest] | ||||
| * | Avoid safe-set-node-state in master moratorium | Håkon Hallingstad | 2021-03-24 | 4 | -5/+26 |
| | | |||||
* | | Revert deferred ZK connectivity for now | Tor Brede Vekterli | 2021-03-22 | 1 | -12/+1 |
| | | | | | | | | | | | | Instead, we'll want to create a more generalized solution that considers all sources of node information (Slobrok _and_ explicit health check RPCs) before potentially publishing a state or processing tasks. | ||||
* | | Inhibit ZooKeeper connections until our local Slobrok mirror is ready. | Tor Brede Vekterli | 2021-03-19 | 2 | -1/+17 |
|/ | | | | | | | | Otherwise, if there are transient Slobrok issues during CC startup and we end up winning the election, we risk publishing a cluster state where the entire cluster appears down (since we do not have any knowledge of Slobrok node mapping state). This will adversely affect availability for all the obvious reasons. | ||||
* | Guard against ever accidentally publishing a default constructed state | Tor Brede Vekterli | 2021-03-19 | 1 | -16/+16 |
| | | | | | Since version 0 states were ambiguous with the sentinel values for "not written to ZK/not tagged as official", this could be mis-interpreted. | ||||
* | use US locale | Kristian Aune | 2021-03-19 | 1 | -3/+4 |
| | |||||
* | Revert "Inhibit ZooKeeper connections until our local Slobrok mirror is ready." | Tor Brede Vekterli | 2021-03-18 | 2 | -17/+1 |
| | |||||
* | Merge pull request #17029 from vespa-engine/vekterli/inhibit-db-connectivity-until-slobrok-is-ready | Tor Brede Vekterli | 2021-03-18 | 2 | -1/+17 |
|\ | | | | | | | | | Inhibit ZooKeeper connections until our local Slobrok mirror is ready. | ||||
| * | Inhibit ZooKeeper connections until our local Slobrok mirror is ready. | Tor Brede Vekterli | 2021-03-18 | 2 | -1/+17 |
| | | | | | | | | | | | | | | | | Otherwise, if there are transient Slobrok issues during CC startup and we end up winning the election, we risk publishing a cluster state where the entire cluster appears down (since we do not have any knowledge of Slobrok node mapping state). This will adversely affect availability for all the obvious reasons. | ||||
* | | Include metrics always. | Henning Baldersheim | 2021-03-12 | 1 | -27/+0 |
| | | |||||
* | | Revert "Revert "GC unused DiskState and add the partition metrics to node level."" | Henning Baldersheim | 2021-03-12 | 1 | -0/+10 |
| | | |||||
* | | Revert "GC unused DiskState and add the partition metrics to node level." | Harald Musum | 2021-03-12 | 1 | -10/+0 |
| | | |||||
* | | GC unused DiskState and add the partition metrics to node level. | Henning Baldersheim | 2021-03-12 | 1 | -0/+10 |
| | | |||||
* | | GC unused import | Henning Baldersheim | 2021-03-12 | 2 | -2/+0 |
|/ | |||||
* | Merge pull request #16926 from vespa-engine/vekterli/dont-store-full-bundle-objects-in-state-history | Tor Brede Vekterli | 2021-03-12 | 1 | -9/+23 |
|\ | | | | | | | | | Don't store full bundle objects in state history | ||||
| * | Don't store full bundle objects in cluster state history | Tor Brede Vekterli | 2021-03-12 | 1 | -9/+23 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | Bundles have a lot of sub-objects per state, so in systems with a high amount of node entries, this adds unnecessary pressure on the heap. Instead, store the string representations of the bundle and the string representation of the diff to the previous state version (if any). This is also inherently faster than computing the diffs on-demand on every status page render. Also remove mutable `official` field from `ClusterState`. Not worth violating immutability of an object just to get some prettier (but with high likelihood actually more confusing) status page rendering. |
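
The last entry above ("Don't store full bundle objects in cluster state history") describes keeping only the string representation of each state and of its diff to the previous version. A minimal Python sketch of that idea follows. It is not the cluster controller's Java code: `StateHistory`, `HistoryEntry`, `describe_diff`, the dict-based bundle, and the bounded `max_entries` are all hypothetical names and assumptions chosen for illustration.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class HistoryEntry:
    """Lightweight record: strings only, no reference to the bundle object."""
    state_text: str
    diff_text: str  # empty string for the first recorded state


def describe_state(bundle: dict) -> str:
    """Hypothetical string form of a bundle: 'node:state' pairs in sorted order."""
    return " ".join(f"{node}:{state}" for node, state in sorted(bundle.items()))


def describe_diff(previous: dict, current: dict) -> str:
    """Hypothetical diff: list every node whose state changed."""
    changes = []
    for node in sorted(set(previous) | set(current)):
        old, new = previous.get(node), current.get(node)
        if old != new:
            changes.append(f"{node}: {old} -> {new}")
    return ", ".join(changes)


class StateHistory:
    """Keeps rendered strings per version; only the newest bundle stays alive."""

    def __init__(self, max_entries: int = 50):  # bound is an assumption
        self._entries = deque(maxlen=max_entries)
        self._last_bundle = None

    def add(self, bundle: dict) -> None:
        # The diff is computed once, at insertion time, not on every render.
        diff = "" if self._last_bundle is None else describe_diff(self._last_bundle, bundle)
        self._entries.append(HistoryEntry(describe_state(bundle), diff))
        self._last_bundle = dict(bundle)

    def entries(self) -> list:
        return list(self._entries)
```

Rendering a status page from such a history would only concatenate stored strings, which is the cost saving the commit message points to.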