Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Fix comparison between manually deployed package and not, and remove ↵ | Jon Marius Venstad | 2022-01-13 | 1 | -1/+1 |
| | | | | outdated safeguard | ||||
* | GC use of deprecated junit assertThat and unify | Henning Baldersheim | 2021-12-21 | 15 | -104/+84 |
| | |||||
* | Failing to find zk system state aborts tick [run-systemtest] | Håkon Hallingstad | 2021-12-14 | 2 | -14/+15 |
| | |||||
* | Use FleetControllerContext in ZooKeeperDatabase | Håkon Hallingstad | 2021-12-13 | 16 | -110/+195 |
| | |||||
* | Update 2019 Oath copyrights. | gjoranv | 2021-10-27 | 4 | -4/+4 |
| | |||||
* | Log version to complete remote task | Håkon Hallingstad | 2021-10-25 | 3 | -2/+12 |
| | | | | | | | | | | | | | | Normally, if a SetNodeStateRequest changes the state of a node, scheduleVersionDependentTasksForFutureCompletion(FleetController.java:1003) will ensure that the request waits for the successful publication of the next cluster state version before returning 200. There are reasons to believe there is an edge case, likely triggered by losing the ZooKeeper connection just prior to trying to set the new wanted state in ZK, that makes scheduleVersionDependentTasksForFutureCompletion() complete the request at the current version. This PR will make it possible to prove or disprove the theory. | ||||
* | Remove config generation -1/0 from CC at :19050/status/<clustername>/config | Håkon Hallingstad | 2021-10-20 | 9 | -43/+22 |
| | |||||
* | Revert changes to config generation | Håkon Hallingstad | 2021-10-20 | 9 | -29/+45 |
| | |||||
* | Fixes after review round | Håkon Hallingstad | 2021-10-19 | 12 | -95/+90 |
| | |||||
* | Improve logging of FleetController and DatabaseHandler | Håkon Hallingstad | 2021-10-15 | 20 | -255/+370 |
| | |||||
* | Some optimizations of RpcServerTest | Håkon Hallingstad | 2021-10-14 | 4 | -22/+31 |
| | |||||
* | Reduce running time of MasterElectionTest from 28 to 12s | Håkon Hallingstad | 2021-10-14 | 2 | -1/+11 |
| | |||||
* | Update Verizon Media copyright notices. | gjoranv | 2021-10-07 | 68 | -68/+68 |
| | |||||
* | Update 2018 copyright notices. | gjoranv | 2021-10-07 | 32 | -32/+32 |
| | |||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 99 | -99/+99 |
| | |||||
* | Revert "Revert "Avoid copying data just to compress them when it is not ↵ | Henning Baldersheim | 2021-08-30 | 1 | -3/+6 |
| | | | | necessary."" | ||||
* | Revert "Avoid copying data just to compress them when it is not necessary." | Henning Baldersheim | 2021-08-30 | 1 | -6/+3 |
| | |||||
* | Use explicit import. | Henning Baldersheim | 2021-08-30 | 1 | -1/+5 |
| | |||||
* | Update ↵ | Henning Baldersheim | 2021-08-30 | 1 | -1/+0 |
| | | | | | | | | clustercontroller-core/src/main/java/com/yahoo/vespa/clustercontroller/core/rpc/SlimeClusterStateBundleCodec.java Remove unused Co-authored-by: Harald Musum <musum@verizonmedia.com> | ||||
* | Avoid copying data just to compress them when it is not necessary. | Henning Baldersheim | 2021-08-30 | 1 | -1/+1 |
| | |||||
* | Add metric for didWork in FleetController tick | Håkon Hallingstad | 2021-06-25 | 5 | -64/+83 |
| | |||||
* | drop empty buffers instead of using small buffers | Håvard Pettersen | 2021-06-15 | 3 | -3/+3 |
| | |||||
* | No functional changes | Jon Bratseth | 2021-06-01 | 1 | -1/+1 |
| | |||||
* | GC some unused methods and simplify | Henning Baldersheim | 2021-05-23 | 1 | -2/+5 |
| | |||||
* | Set forkCount paramter for maven-surefire-plugin to speed up tests | gjoranv | 2021-05-14 | 1 | -0/+7 |
| | |||||
* | Let the supervisor owner set the small buffer option | Jon Marius Venstad | 2021-05-03 | 1 | -0/+1 |
| | |||||
* | One more lazy | Jon Marius Venstad | 2021-04-28 | 1 | -1/+2 |
| | |||||
* | More lazy debug log message generation | Jon Marius Venstad | 2021-04-28 | 14 | -133/+123 |
| | |||||
* | Reapply "add more logging" (new and updated slobrok logging) | Arne Juul | 2021-04-21 | 2 | -0/+2 |
| | | | | This reverts commit 9aa3d6fe6567e3eee9108d6fffbc50d5874e72e3. | ||||
* | Revert "add more logging" | Harald Musum | 2021-04-20 | 2 | -2/+0 |
| | |||||
* | track API change in mock | Arne Juul | 2021-04-19 | 2 | -0/+2 |
| | |||||
* | Improve test names | Håkon Hallingstad | 2021-04-16 | 1 | -4/+4 |
| | |||||
* | Disallow >1 group to suspend | Håkon Hallingstad | 2021-04-16 | 5 | -27/+277 |
| | | | | | | | If there is more than one group, disallow suspending a node if there is a node in another group that has a user wanted state != UP. If there is 1 group, disallow suspending more than 1 node. | ||||
* | No longer allow suspension if in maintenance | Håkon Hallingstad | 2021-04-15 | 3 | -17/+14 |
| | | | | | | If a storage node falls out of Slobrok, it will change from UP to Maintenance after 60s, then after further 30s go to Down. Avoid allowing suspension in the 30s grace period just because it is Maintenance mode. | ||||
* | Merge branch 'master' into hmusum/cleanup-7 | Harald Musum | 2021-04-08 | 6 | -22/+32 |
|\ | |||||
| * | Add remote task queue size metric in cluster controller | Håkon Hallingstad | 2021-04-01 | 6 | -22/+32 |
| | | |||||
* | | Cleanup tests a bit | Harald Musum | 2021-04-08 | 3 | -43/+49 |
| | | |||||
* | | Fix typo in class name | Harald Musum | 2021-04-08 | 1 | -1/+1 |
|/ | |||||
* | Log when transitioning out of CC moratorium | Håkon Hallingstad | 2021-03-26 | 1 | -6/+3 |
| | |||||
* | Make default deadline to first broadcast 30s | Håkon Hallingstad | 2021-03-24 | 3 | -3/+5 |
| | |||||
* | Revert "Revert "Avoid safe mutations in master moratorium and increase first ↵ | Håkon Hallingstad | 2021-03-24 | 13 | -17/+71 |
| | | | | cluster state broadcast deadline [run-systemtest]"" | ||||
* | Revert "Avoid safe mutations in master moratorium and increase first cluster ↵ | Håkon Hallingstad | 2021-03-24 | 13 | -71/+17 |
| | | | | state broadcast deadline [run-systemtest]" | ||||
* | Merge pull request #17085 from ↵ | Håkon Hallingstad | 2021-03-24 | 13 | -17/+71 |
|\ | | | | | | | | | vespa-engine/hakonhall/increase-the-minimum-time-before-first-cluster-state-broadcast-run-systemtest Avoid safe mutations in master moratorium and increase first cluster state broadcast deadline [run-systemtest] | ||||
| * | Avoid safe-set-node-state in master moratorium | Håkon Hallingstad | 2021-03-24 | 12 | -16/+68 |
| | | |||||
| * | Increase the minimum time before first cluster state broadcast [run-systemtest] | Håkon Hallingstad | 2021-03-19 | 1 | -1/+3 |
| | | |||||
* | | Revert deferred ZK connectivity for now | Tor Brede Vekterli | 2021-03-22 | 3 | -21/+2 |
| | | | | | | | | | | | | Instead, we'll want to create a more generalized solution that considers all sources of node information (Slobrok _and_ explicit health check RPCs) before potentially publishing a state or processing tasks. | ||||
* | | Make sure to reset any election shortcuts if we go from !ZK -> ZK | Tor Brede Vekterli | 2021-03-19 | 1 | -5/+13 |
| | | |||||
* | | Use local leader state for decisions rather than election handler | Tor Brede Vekterli | 2021-03-19 | 1 | -5/+7 |
| | | | | | | | | | | | | | | | | | | Avoids potentially publishing cluster states _before_ we have triggered our own leadership election edge handling code. Could happen if code called prior to the election edge logic checked the election handler state and erroneously thought we had performed the prerequisite actions we're supposed to do when assuming leadership (such as reading back current state from ZK). | ||||
* | | Don't allow short-circuiting election phase if only one node configured if ↵ | Tor Brede Vekterli | 2021-03-19 | 2 | -2/+10 |
| | | | | | | | | using ZK | ||||
* | | Inhibit ZooKeeper connections until our local Slobrok mirror is ready. | Tor Brede Vekterli | 2021-03-19 | 6 | -2/+41 |
|/ | | | | | | | | Otherwise, if there are transient Slobrok issues during CC startup and we end up winning the election, we risk publishing a cluster state where the entire cluster appears down (since we do not have any knowledge of Slobrok node mapping state). This will adversely affect availability for all the obvious reasons. |