Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Revert "Revert "Revert "Revert "Enforce CC timeouts in Orchestrator 4"""" | Håkon Hallingstad | 2018-11-01 | 5 | -21/+64 |
| | |||||
* | Revert "Revert "Revert "Enforce CC timeouts in Orchestrator 4""" | Håkon Hallingstad | 2018-11-01 | 5 | -64/+21 |
| | |||||
* | Revert "Revert "Enforce CC timeouts in Orchestrator [4]"" | Håkon Hallingstad | 2018-11-01 | 5 | -21/+64 |
| | |||||
* | Revert "Enforce CC timeouts in Orchestrator [4]" | Harald Musum | 2018-10-31 | 5 | -64/+21 |
| | |||||
* | Revert "Revert "Revert "Revert "Enforce CC timeouts in Orchestrator 2"""" | Håkon Hallingstad | 2018-10-30 | 5 | -21/+64 |
| | |||||
* | Revert "Revert "Revert "Enforce CC timeouts in Orchestrator 2""" | Håkon Hallingstad | 2018-10-30 | 5 | -64/+21 |
| | |||||
* | Revert "Revert "Enforce CC timeouts in Orchestrator 2"" | Håkon Hallingstad | 2018-10-29 | 5 | -21/+64 |
| | |||||
* | Revert "Enforce CC timeouts in Orchestrator 2" | Håkon Hallingstad | 2018-10-29 | 5 | -64/+21 |
| | |||||
* | Fixes after review round | Håkon Hallingstad | 2018-10-26 | 1 | -1/+1 |
| | |||||
* | set-node-state probing in CC | Håkon Hallingstad | 2018-10-24 | 5 | -21/+64 |
| | |||||
* | Minor fixes | Jon Bratseth | 2018-10-14 | 2 | -38/+39 |
| | |||||
* | Add copyright header | Jon Bratseth | 2018-10-01 | 2 | -0/+2 |
| | |||||
* | set-node-state timeout in CC | Håkon Hallingstad | 2018-06-22 | 6 | -5/+45 |
| | |||||
* | Warn on timeout | Håkon Hallingstad | 2018-06-15 | 1 | -1/+1 |
| | |||||
* | Do not wait for version ack for failed set-node-state | Håkon Hallingstad | 2018-06-13 | 5 | -6/+82 |
| | |||||
* | Remove support for ancient legacy node state protocol versions | Tor Brede Vekterli | 2018-06-11 | 3 | -83/+17 |
| | | | | | Protocol versions 0 and 1 haven't been in use for years. No point in maintaining complexity to support automatic downgrades to these. | ||||
* | Merge pull request #5766 from vespa-engine/vekterli/only-derive-default-space-node-states-when-global-doc-types-present | Geir Storli | 2018-05-03 | 3 | -3/+31 |
|\ | | | | | | | | | Only derive default bucket space node states when cluster has global docs | ||||
| * | Only derive default bucket space node states when cluster has global docs | Tor Brede Vekterli | 2018-05-02 | 3 | -3/+31 |
| | | | | | | | | | | | | | | Lets cluster controller use new protocols for sending compressed cluster state bundles, but without triggering implicit Maintenance edges for nodes in the default bucket space. Also allows for easy live reconfiguration when global document types are added or removed. | ||||
* | | Revert "Revert "Gjoranv/java9 prep 05"" | gjoranv | 2018-05-02 | 1 | -1/+1 |
|/ | |||||
* | Revert "Gjoranv/java9 prep 05" | gjoranv | 2018-05-02 | 1 | -1/+1 |
| | |||||
* | Java 9: Replace 'new Integer' with 'Integer.valueOf' | gjoranv | 2018-04-30 | 1 | -1/+1 |
| | |||||
* | Avoid candidate state racing with published state in tests | Tor Brede Vekterli | 2018-04-27 | 1 | -0/+15 |
| | | | | | | | | Since the tests using `StateWaiter` expect to observe _both_ versioned and unversioned (candidate) states, we ignore candidate states iff they are equal to the versioned state we have already observed. Otherwise, tests waiting for a _versioned_ state risk never observing the version number itself (only a candidate following it) and hang until they time out. | ||||
* | Merge pull request #5710 from vespa-engine/gjoranv/java9-prep-01 | Bjørn Christian Seime | 2018-04-25 | 1 | -9/+0 |
|\ | | | | | Gjoranv/java9 prep 01 | ||||
| * | Remove explicit maven-compiler-plugin config. Inherit from parent. | gjoranv | 2018-04-25 | 1 | -9/+0 |
| | | |||||
* | | Remove redundant task processing step | Tor Brede Vekterli | 2018-04-25 | 1 | -1/+0 |
| | | | | | | | | Already implicitly called by saveLatestClusterStateBundle() | ||||
* | | Update log message | Tor Brede Vekterli | 2018-04-25 | 1 | -1/+1 |
| | | |||||
* | | ZooKeeper-persist and load published cluster state bundles | Tor Brede Vekterli | 2018-04-24 | 23 | -125/+716 |
|/ | | | | | | | | | | | | | | | Store synchronously upon each new versioned state, load whenever controller is elected master. Effectively carries over visible node states from one controller's lifetime to the next. This removes the edge case where default bucket space content nodes are marked as in Maintenance until their global merge status is known. To avoid controller tripping over its own feet, state bundles are now _not_ versioned at all until the initial send time period has passed. This prevents overwriting the state persisted from a previous controller with a transient state where all nodes are down due to not having Slobrok contact yet. A new cluster state recompute+send edge has been added when the master passes its initial state send time period. | ||||
* | Add metric for node availability ratio | Tor Brede Vekterli | 2018-04-12 | 3 | -6/+62 |
| | | | | | Separate node type dimensions are used for distributors and storage nodes | ||||
* | Add missing test for existing node state metrics | Tor Brede Vekterli | 2018-04-12 | 2 | -0/+131 |
| | |||||
* | Fix grammar in Event message | Tor Brede Vekterli | 2018-04-06 | 2 | -2/+2 |
| | |||||
* | Add information about derived bucket space state in cluster state history. | Geir Storli | 2018-03-23 | 4 | -21/+71 |
| | |||||
* | Add buckets total to per bucket space stats on html status page. | Geir Storli | 2018-03-20 | 3 | -13/+33 |
| | |||||
* | Add config to clustercontroller for the min merge completion ratio of buckets in a bucket space before it is considered complete. | Geir Storli | 2018-03-20 | 11 | -38/+93 |
| | |||||
* | Fix ClusterStatsChangeTracker to handle case where current node is not found in previous state. | Geir Storli | 2018-03-19 | 3 | -6/+15 |
| | | | | | | Cluster stats may have changed in this case. | ||||
* | Extend cluster v2 to output the published distribution state. | Geir Storli | 2018-03-15 | 13 | -34/+92 |
| | | | | This is the baseline cluster state + per bucket space states. | ||||
* | Remove never used per storage node ops latencies in host info. | Geir Storli | 2018-03-13 | 3 | -69/+4 |
| | |||||
* | Merge pull request #5269 from vespa-engine/geirst/fix-cluster-stats-change-tracker-in-clustercontroller | Geir Storli | 2018-03-09 | 5 | -70/+104 |
|\ | | | | | | | | | Fix ClusterStatsChangeTracker to keep previous 'may have merges pendi… | ||||
| * | Fix ClusterStatsChangeTracker to keep previous 'may have merges pending' state per content node. | Geir Storli | 2018-03-08 | 5 | -70/+104 |
| | | |||||
* | | More descriptive test names | Tor Brede Vekterli | 2018-03-08 | 1 | -5/+5 |
| | | |||||
* | | Update constraint mock variable naming | Tor Brede Vekterli | 2018-03-08 | 1 | -4/+4 |
| | | |||||
* | | Must use published state from default bucket space, not baseline | Tor Brede Vekterli | 2018-03-08 | 1 | -1/+7 |
| | | |||||
* | | Unify style | Tor Brede Vekterli | 2018-03-08 | 1 | -6/+9 |
| | | |||||
* | | Add a simple maintenance state transition constraint for nodes in default space | Tor Brede Vekterli | 2018-03-08 | 6 | -14/+133 |
|/ | | | | | | | | Avoids transitioning an already up/init/retired node to maintenance mode when global merges are pending, but allows transitions when the node is considered down/maintenance in the already published state. Not stateful, so triggers false positives on cluster controller restart edges. | ||||
* | Simplifications. | Geir Storli | 2018-03-07 | 2 | -7/+3 |
| | |||||
* | Fix AggregatedStatsMergePendingChecker to take state of aggregated cluster stats into account. | Geir Storli | 2018-03-07 | 9 | -85/+134 |
| | | | | | | | | | | We may have merges pending if: (a) we don't yet have updates from all distributors; (b) we don't have stats for the content node in question. Also move mayHaveMergesPendingInGlobalSpace() to AggregatedStatsMergePendingChecker. | ||||
* | Also list derived bucket space states in html state. | Geir Storli | 2018-03-07 | 2 | -3/+13 |
| | |||||
* | Add bucket space column to html state. | Geir Storli | 2018-03-07 | 1 | -6/+20 |
| | |||||
* | Use computeIfAbsent() instead of explicit get() and put(). | Geir Storli | 2018-03-06 | 1 | -6/+1 |
| | |||||
* | Add factory functions for NodeEvent and improve some function names. | Geir Storli | 2018-03-06 | 8 | -35/+42 |
| | |||||
* | Extend EventDiffCalculator to emit events for changes in derived bucket space cluster states. | Geir Storli | 2018-03-02 | 8 | -80/+324 |
| | |||||