Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Allow load balancer expiry when cluster nodes are deallocated | Martin Polden | 2019-08-20 | 3 | -27/+57 |
| | | | | | Previous check only allowed expiry when all nodes in the application were deallocated. | ||||
* | Deactivate surplus load balancers | Martin Polden | 2019-08-19 | 4 | -41/+89 |
| | |||||
* | Merge pull request #10293 from vespa-engine/bratseth/remove-allocation-by-flavor-take-2 | Jon Bratseth | 2019-08-15 | 25 | -396/+97 |
|\ | | | | | | | | | Revert "Merge pull request #10281 from vespa-engine/revert-10274-brat… | ||||
| * | Revert "Merge pull request #10281 from vespa-engine/revert-10274-bratseth/remove-allocation-by-flavor" | Jon Bratseth | 2019-08-14 | 25 | -396/+97 |
| | | | | | | | | | | | | | | This reverts commit 3e7642b5f236321b299a2496098c19e7fc55843c, reversing changes made to fc6ebf45c0ef126043eb9db4cf613958ce665411. | ||||
* | | Set data in zookeeper node instead of deleting and creating when path is unchanged | Harald Musum | 2019-08-15 | 1 | -2/+13 |
|/ | |||||
* | Revert "Bratseth/remove allocation by flavor" | Harald Musum | 2019-08-14 | 25 | -97/+396 |
| | |||||
* | Cleanup | Jon Bratseth | 2019-08-14 | 3 | -52/+1 |
| | |||||
* | Remove canonical flavor | Jon Bratseth | 2019-08-14 | 3 | -3/+3 |
| | |||||
* | Cleanup | Jon Bratseth | 2019-08-14 | 5 | -12/+5 |
| | |||||
* | Remove handling of legacy flavor allocations | Jon Bratseth | 2019-08-14 | 13 | -72/+36 |
| | |||||
* | Remove support for legacy flavor strings | Jon Bratseth | 2019-08-14 | 4 | -257/+52 |
| | |||||
* | Do not try to deploy after operator changes on all config servers | Harald Musum | 2019-08-14 | 1 | -5/+14 |
| | | | | | | If application is not deployed locally on a config server, the last deployment time used will be epoch. Change this so that deployment will only be attempted on the server where the application has been deployed locally. | ||||
* | Less resources is not compatible with more resources | Valerij Fredriksen | 2019-08-13 | 1 | -3/+3 |
| | |||||
* | Allocate nodes through the host for tests in dev | Valerij Fredriksen | 2019-08-13 | 4 | -33/+37 |
| | |||||
* | Fix off-by-one error in tester | Valerij Fredriksen | 2019-08-13 | 2 | -2/+2 |
| | |||||
* | Always prepare with Capacity rather than count | Valerij Fredriksen | 2019-08-13 | 2 | -13/+10 |
| | |||||
* | flavor -> nodeResources | Valerij Fredriksen | 2019-08-13 | 1 | -10/+10 |
| | |||||
* | Use flag for node resource memory for admin nodes | Harald Musum | 2019-08-12 | 3 | -4/+8 |
| | |||||
* | Add throttled host metric | Valerij Fredriksen | 2019-08-07 | 2 | -6/+13 |
| | |||||
* | Nonfunctional changes only | Jon Bratseth | 2019-08-05 | 1 | -4/+4 |
| | |||||
* | Remove code that handles node resources based on flavors for cd | Harald Musum | 2019-08-02 | 1 | -15/+8 |
| | | | | | We no longer use flavors in CD tests, only node resources, so remove the unused code | ||||
* | Return 409 with error code LOAD_BALANCER_NOT_READY if deployment fails due to that | Harald Musum | 2019-08-01 | 2 | -18/+1 |
| | | | | | | | | | Move LoadBalancerServiceException to config-provisioning so that it can be used from config server as well. Return 409 and LOAD_BALANCER_NOT_READY as error code if we get a LoadBalancerServiceException. Retry deploy on LOAD_BALANCER_NOT_READY. | ||||
* | Revert "Return 409 with error code TRANSIENT_ERROR when getting TransientExce…" | Harald Musum | 2019-08-01 | 12 | -11/+28 |
| | |||||
* | Move LoadBalancerServiceException to config-provisioning module | Harald Musum | 2019-08-01 | 2 | -18/+1 |
| | |||||
* | Move some exceptions to their own package (making them not part of public API) | Harald Musum | 2019-08-01 | 11 | -11/+11 |
| | |||||
* | Set memory for admin cluster node to 2 Gb in CD | Harald Musum | 2019-07-31 | 1 | -1/+6 |
| | |||||
* | Use 2.5 Gb memory for nodes in admin cluster in CD | Harald Musum | 2019-07-29 | 1 | -1/+1 |
| | | | | | | We have reduced max heap sizes for logserver-container and metricsproxy-container since we started using 3 Gb, so it should be possible to reduce memory for these nodes, trying with .5 Gb less | ||||
* | Remove noop code (does not return) | Harald Musum | 2019-07-26 | 1 | -3/+0 |
| | | | | | Since we haven't noticed any issues with this not working, remove to make code simpler | ||||
* | Remove unused fields from API response | Martin Polden | 2019-07-19 | 3 | -13/+3 |
| | |||||
* | Made the capacity endpoint accept a comma-separated list instead of JSON. | mgimle | 2019-07-18 | 3 | -17/+8 |
| | | | | Made the endpoint accept calls which do not end with a trailing slash. | ||||
* | Implemented suggested changes to CapacityChecker. | mgimle | 2019-07-15 | 3 | -60/+61 |
| | |||||
* | Added endpoint for explaining the conclusions of the host capacity checker. | mgimle | 2019-07-15 | 6 | -0/+273 |
| | |||||
* | Separated out the capacity checking part of the capacity report maintainer. | mgimle | 2019-07-15 | 5 | -429/+568 |
| | | | | | Excluded AWS zones from the capacity calculation because the metric is nonsensical for an automatically scaling zone. Fixed some missing renames. | ||||
* | Removed Agent enums pertaining to the now-removed NodeRetirer | mgimle | 2019-07-12 | 2 | -3/+1 |
| | |||||
* | Replace 'jdisc' with 'container' in node-repository | gjoranv | 2019-07-11 | 2 | -4/+4 |
| | |||||
* | Reduced size of node repository dump. | mgimle | 2019-07-09 | 1 | -1/+1 |
| | |||||
* | Renamed from NodeAlerter to CapacityReportMaintainer. | mgimle | 2019-07-09 | 5 | -34/+33 |
| | |||||
* | Incorporated suggested changes to the NodeAlerter. | mgimle | 2019-07-08 | 1 | -104/+109 |
| | |||||
* | Added more comments, made private members private, | mgimle | 2019-07-08 | 3 | -107/+171 |
| | | | | | | | | made static inner class candidates static, moved filtering of uninteresting hosts and tenants to the alerter from the test case generator, added a test for overcommitted nodes, logically segmented the generated test cases, removed explicit null checks in favour of Optionals, changed some inner class initialization behaviour. | ||||
* | Implemented the Node Alerter maintainer and metric. | mgimle | 2019-07-08 | 6 | -0/+848 |
| | | | | | | | | | | | | | | | This metric attempts to determine how many hosts it's possible to lose before there's no place to fit their tenants, by finding a "shortest path to failure". Since finding the actual path is NP-hard, this maintainer constructs a heuristic based on "repeated removals", and greedily finds a path to failure with it. The Node Alerter also exposes the "overcommittedNodes" metric, counting how many hosts have children expecting more resources than the host can provide. Finally, this commit adds an obfuscated dump of data from ZooKeeper, useful for running tests which require a node repository which reflects reality. | ||||
* | Update javadoc | Martin Polden | 2019-07-03 | 1 | -3/+1 |
| | |||||
* | Merge pull request #9931 from vespa-engine/freva/move-serializer-to-non-public-api | Martin Polden | 2019-07-01 | 2 | -2/+2 |
|\ | | | | | | | | | Move serializers to non public api | ||||
| * | Move NetworkPortsSerializer to non-public-api serialization package | Valerij Fredriksen | 2019-07-01 | 2 | -2/+2 |
| | | |||||
* | | Reduced log for transient errors in ApplicationMaintainer | Valerij Fredriksen | 2019-06-30 | 1 | -0/+4 |
| | | |||||
* | | Simplify OperatorChangeApplicationMaintainer | Valerij Fredriksen | 2019-06-30 | 2 | -23/+17 |
| | | |||||
* | | Simplify PeriodicApplicationMaintainer | Valerij Fredriksen | 2019-06-30 | 2 | -16/+7 |
| | | |||||
* | | Always activate through MockDeployer | Valerij Fredriksen | 2019-06-30 | 3 | -76/+51 |
| | | |||||
* | | Ignore TransientException in NodeFailer and RetiredExpirer | Valerij Fredriksen | 2019-06-29 | 2 | -2/+14 |
|/ | |||||
* | Fix TODO | Martin Polden | 2019-06-27 | 1 | -2/+0 |
| | |||||
* | Read and write all load balancers on construction | Martin Polden | 2019-06-27 | 2 | -7/+19 |
| |
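
The "repeated removals" heuristic and the "overcommittedNodes" count described in the Node Alerter commit (2019-07-08) can be illustrated with a rough sketch. This is not the code from that commit: the `Host` class, the single-number resource model, the removal order (always lose the host carrying the most allocated load) and the first-fit placement are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """Hypothetical host model: one capacity number plus the sizes of its child nodes."""
    name: str
    capacity: int
    children: list = field(default_factory=list)

    def free(self):
        return self.capacity - sum(self.children)


def overcommitted(hosts):
    """Count hosts whose children expect more resources than the host provides."""
    return sum(1 for h in hosts if h.free() < 0)


def try_rehome(children, hosts):
    """Place children on the remaining hosts, largest first, first fit.

    Returns True only if every child found a host with enough free capacity.
    """
    for size in sorted(children, reverse=True):
        target = next((h for h in hosts if h.free() >= size), None)
        if target is None:
            return False
        target.children.append(size)
    return True


def hosts_until_failure(hosts):
    """Greedily remove hosts until the children of a removed host no longer fit.

    Returns how many hosts were removed and fully re-homed before a removal failed.
    Being greedy, this finds one path to failure, not necessarily the shortest one.
    """
    # Work on copies so the caller's hosts are left untouched.
    remaining = [Host(h.name, h.capacity, list(h.children)) for h in hosts]
    removed = 0
    while remaining:
        # Assumed removal order: lose the host with the most allocated resources.
        victim = max(remaining, key=lambda h: sum(h.children))
        remaining.remove(victim)
        if not try_rehome(victim.children, remaining):
            return removed
        removed += 1
    return removed
```

For three hosts of capacity 10 that each carry one child of size 4, `hosts_until_failure` returns 1: the first lost host's child fits elsewhere, but losing the host that then carries two children leaves only one host with room for a single child.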