summaryrefslogtreecommitdiffstats
path: root/node-repository
Commit message (Collapse)AuthorAgeFilesLines
* Rely on markWantoToRetire to re-read nodeJon Bratseth2019-11-191-6/+5
|
* Fast-track cleanup after failed rebalancingJon Bratseth2019-11-193-14/+43
|
* SimplifyJon Marius Venstad2019-11-191-1/+1
|
* Skip tester owned nodes when considering best movesJon Marius Venstad2019-11-191-0/+1
|
* Simplify since we don't have node locksJon Bratseth2019-11-181-21/+25
|
* Lock node throughput as wellJon Bratseth2019-11-181-25/+23
|
* Lock application througoutJon Bratseth2019-11-181-14/+26
|
* Do rebalancing deployments in the RebalancerJon Bratseth2019-11-188-48/+180
| | | | | | | This avoids cases where the Rebalancer chooses a move which turns out to not be legal when actually making the redeployment, causing another node to be selected as target.
* Choose cheaper nodes with less skew regardless of parent host stateJon Bratseth2019-11-181-5/+5
|
* Merge pull request #11317 from vespa-engine/mortent/dev-noderepo-devhostMorten Tokle2019-11-152-0/+6
|\ | | | | Support dev hosts in node repo
| * Allow dev hosts with only ipv4Morten Tokle2019-11-151-0/+5
| |
| * Support infrastructure applications on devhostsMorten Tokle2019-11-151-0/+1
| |
* | Merge pull request #11314 from vespa-engine/mpolden/passthrough-lb-serviceMartin Polden2019-11-154-13/+85
|\ \ | |/ |/| Implement PassthroughLoadBalancerService
| * Implement PassthroughLoadBalancerServiceMartin Polden2019-11-154-13/+85
| | | | | | | | Intended for development purposes.
* | Merge pull request #11313 from ↵Martin Polden2019-11-152-3/+6
|\ \ | | | | | | | | | | | | vespa-engine/hakonhall/get-actual-number-of-config-servers Get actual number of config servers
| * | Get actual number of config serversHåkon Hallingstad2019-11-152-3/+6
| |/
* | Merge pull request #11300 from ↵Andreas Eriksen2019-11-155-14/+10
|\ \ | | | | | | | | | | | | vespa-engine/andreer/remove-direct-routing-feature-flags remove direct routing feature flags
| * | remove direct routing feature flagsandreer2019-11-145-14/+10
| | |
* | | Move compatible check to enumsJon Bratseth2019-11-141-3/+23
| |/ |/|
* | Merge pull request #11284 from ↵Håkon Hallingstad2019-11-144-70/+123
|\ \ | |/ |/| | | | | vespa-engine/hakonhall/allow-overriding-noderepositorymaintenance-durations-with-flag Add flag to control reboot interval
| * Do not use discrete probabilitiesHåkon Hallingstad2019-11-141-6/+3
| |
| * Read reboot-interval-in-days dynamicallyHåkon Hallingstad2019-11-134-51/+112
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | But also: Changes the distribution of the scheduling past 1x reboot interval: hosts will be scheduled for reboot evenly distributed in the whole 1x-2x range, and is by this guaranteed to be scheduled at latest at 2x. The expected time before a reboot was scheduled was 1.33 reboot intervals, while there was no guarantee of an upper time. The new algorithm has an expected time before reboot of 1.5 reboot intervals, bound to 2x. The old would have a higher probability of reboot passing the 1x boundary, while a lower probability than the new as one nears 2x. So I think the new algorithm also have the nice property of avoiding thundering herd, perhaps even more so than the old: For instance when most hosts are rebooted at the same time in a zone, they would tend to be rescheduled for reboot closer to each other with the old than with the new. And, enabling the new algoritm should also not lead to too many hosts suddenly having to reboot, or at least that's what I hope. I can sanity-check this before merge - I guess it would be dominated by the number of hosts in west/east that are beyond 2x.
| * Make flag only for NodeRebooter, and remove fetching from environmentHåkon Hallingstad2019-11-131-25/+15
| |
| * Allow overriding NodeRepositoryMaintenance durations with flagHåkon Hallingstad2019-11-131-17/+22
| |
* | Explicit NodeResources defaultsJon Bratseth2019-11-1322-23/+24
| |
* | Pass and receive remoteStorageJon Bratseth2019-11-131-1/+0
| |
* | Compute free resources using just numbersJon Bratseth2019-11-137-18/+15
| |
* | Add NodeResources.storageTypeJon Bratseth2019-11-1332-58/+142
|/
* Merge pull request #11251 from vespa-engine/hmusum/log-parent-hosts-not-readyHarald Musum2019-11-081-3/+10
|\ | | | | Log parent hosts that are not ready
| * Fix logger nameHarald Musum2019-11-081-1/+1
| |
| * Log parent hosts that are not readyHarald Musum2019-11-081-3/+10
| |
* | Merge pull request #11249 from ↵Andreas Eriksen2019-11-081-1/+3
|\ \ | |/ |/| | | | | vespa-engine/olaa/use-requested-resources-for-allocation-failures Use requested resources when finding allocation failures
| * Use requested resources when finding allocation failuresOla Aunrønning2019-11-081-1/+3
| |
* | make required disk speed patchableandreer2019-11-081-0/+12
|/
* Pass requestedResources through HostResourcesJon Bratseth2019-11-061-2/+2
|
* add metrics for application allocationsandreer2019-11-052-0/+44
|
* Preserve resources decided implicitly by policies as requestedJon Bratseth2019-11-042-6/+6
|
* Remove debug logValerij Fredriksen2019-10-311-2/+0
|
* -1 typoJon Bratseth2019-10-3121-21/+21
|
* Fix typosJon Bratseth2019-10-3121-22/+22
|
* Output requested node resourcesJon Bratseth2019-10-3122-0/+40
|
* Merge pull request #11174 from ↵Andreas Eriksen2019-10-311-1/+1
|\ | | | | | | | | vespa-engine/andreer/lower-capacity-report-interval lower capacity report interval
| * lower capacity report intervalandreer2019-10-311-1/+1
| | | | | | | | mainly in order to clear the alert faster once we get it fixed
* | Revert "Revert "Add devhost node type""Martin Polden2019-10-312-0/+4
| |
* | Check capacity by requested, not assigned resourcesJon Bratseth2019-10-301-16/+24
| |
* | Remember requested resources on nodesJon Bratseth2019-10-3018-72/+148
| | | | | | | | | | | | This may be different from assigned resources e.g in that requested resources may specify DiskSpeed.any while assigned resources always have a definite disk speed.
* | use node repo disk speed as-iskkraune2019-10-251-7/+2
| |
* | Don't rebalance in AWSJon Bratseth2019-10-244-3/+11
| |
* | Add metric hostedVespa.docker.skew to measure average host skewJon Bratseth2019-10-224-2/+31
| |
* | Merge pull request #11031 from ↵Harald Musum2019-10-211-0/+4
|\ \ | | | | | | | | | | | | vespa-engine/hakonhall/return-504-gateway-timeout-on-lock-timeout-from-orchestrator Return 504 Gateway Timeout on lock timeout from Orchestrator