Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Do not try to deploy after operator changes on all config servers | Harald Musum | 2019-08-14 | 1 | -5/+14 |
| | | | | | | If application is not deployed locally on a config server, the last deployment time used will be epoch. Change this so that deployment will only be attempted on the server where the application has been deployed locally. | ||||
* | Ress resources is not compatible with more resources | Valerij Fredriksen | 2019-08-13 | 1 | -3/+3 |
| | |||||
* | Allocate nodes through the host for tests in dev | Valerij Fredriksen | 2019-08-13 | 4 | -33/+37 |
| | |||||
* | Fix off-by-one error in tester | Valerij Fredriksen | 2019-08-13 | 2 | -2/+2 |
| | |||||
* | Always prepare with Capacity rather than count | Valerij Fredriksen | 2019-08-13 | 2 | -13/+10 |
| | |||||
* | flavor -> nodeResources | Valerij Fredriksen | 2019-08-13 | 1 | -10/+10 |
| | |||||
* | Use flag for node resource memory for admin nodes | Harald Musum | 2019-08-12 | 3 | -4/+8 |
| | |||||
* | Add throttled host metric | Valerij Fredriksen | 2019-08-07 | 2 | -6/+13 |
| | |||||
* | Nonfunctional changes only | Jon Bratseth | 2019-08-05 | 1 | -4/+4 |
| | |||||
* | Remove code that handles node resources based on flavors for cd | Harald Musum | 2019-08-02 | 1 | -15/+8 |
| | | | | | We don't use flavors in cd tests any more, only node resources, remove unused code | ||||
* | Return 409 with error code LOAD_BALANCER_NOT_READY if deployment fails due ↵ | Harald Musum | 2019-08-01 | 2 | -18/+1 |
| | | | | | | | | | to that Move LoadBalancerServiceException to config-provisioning so that it can be used from config server as well. Return 409 and LOAD_BALANCER_NOT_READY as error code if we get a LoadBalancerServiceException. Retry deploy on LOAD_BALANCER_NOT_READY. | ||||
* | Revert "Return 409 with error code TRANSIENT_ERROR when getting ↵ | Harald Musum | 2019-08-01 | 12 | -11/+28 |
| | | | | TransientExce…" | ||||
* | Move LoadBalancerServiceException to config-provisioning module | Harald Musum | 2019-08-01 | 2 | -18/+1 |
| | |||||
* | Move some exceptions to its own package (making them not part of public API) | Harald Musum | 2019-08-01 | 11 | -11/+11 |
| | |||||
* | Set memory for admin cluster node to 2 Gb in CD | Harald Musum | 2019-07-31 | 1 | -1/+6 |
| | |||||
* | Use 2.5 Gb memory for nodes in admin cluster in CD | Harald Musum | 2019-07-29 | 1 | -1/+1 |
| | | | | | | We have reduced max heap sizes for logserver-container and metricsproxy-container since we started using 3 Gb, so it should be possible to reduce memory for these nodes, trying with .5 Gb less | ||||
* | Remove noop code (does not return) | Harald Musum | 2019-07-26 | 1 | -3/+0 |
| | | | | | Since we haven't noticed any issues with this not working, remove to make code simpler | ||||
* | Remove unused fields from API response | Martin Polden | 2019-07-19 | 3 | -13/+3 |
| | |||||
* | Made the capacity endpoint accept a comma-separated list instead of JSON. | mgimle | 2019-07-18 | 3 | -17/+8 |
| | | | | Made the endpoint accept calls which do not end with a trailing slash. | ||||
* | Implemented suggested changes to CapacityChecker. | mgimle | 2019-07-15 | 3 | -60/+61 |
| | |||||
* | Added endpoint for explaining the conclusions of the host capacity checker. | mgimle | 2019-07-15 | 6 | -0/+273 |
| | |||||
* | Separated out the capacity checking part of the capacity report maintainer. | mgimle | 2019-07-15 | 5 | -429/+568 |
| | | | | | Made aws zones excluded from the capacity calculation because the metric is nonsensical for an automatically scaling zone. Fixed some missing renames. | ||||
* | Removed Agent enums pertaining to the now-removed NodeRetirer | mgimle | 2019-07-12 | 2 | -3/+1 |
| | |||||
* | Replace 'jdisc' with 'container' in node-repository | gjoranv | 2019-07-11 | 2 | -4/+4 |
| | |||||
* | Reduced size of node repository dump. | mgimle | 2019-07-09 | 1 | -1/+1 |
| | |||||
* | Renamed from NodeAlerter to CapacityReportMaintainer. | mgimle | 2019-07-09 | 5 | -34/+33 |
| | |||||
* | Incorporated suggested changes to the NodeAlerter. | mgimle | 2019-07-08 | 1 | -104/+109 |
| | |||||
* | Added more comments, made private members private, | mgimle | 2019-07-08 | 3 | -107/+171 |
| | | | | | | | | made static inner class candidates static, moved filtering of uninteresting hosts and tenants to the alerter from the test case generator, added a test for overcommitted nodes, logically segmented the generated test cases, removed explicit null checks in favour of Optionals, changed some inner class initialization behaviour. | ||||
* | Implemented the Node Alerter maintainer and metric. | mgimle | 2019-07-08 | 6 | -0/+848 |
| | | | | | | | | | | | | | | | This metric attempts to determine how many hosts it's possible to lose before there's no place to fit its tenants, by finding a "shortest path to failure". Since finding the actual path is np-hard, this maintainer constructs a heuristic based on "repeated removals", and finds greedily finds a path to failure with it. The Node Alerter also exposes the "overcommittedNodes" metric, counting how many hosts have children expecting more resources than it can provide. Finally, this commit adds an obfuscated dump of data from zookeeper, useful for running tests which require a node repository which reflects reality. | ||||
* | Update javadoc | Martin Polden | 2019-07-03 | 1 | -3/+1 |
| | |||||
* | Merge pull request #9931 from ↵ | Martin Polden | 2019-07-01 | 2 | -2/+2 |
|\ | | | | | | | | | vespa-engine/freva/move-serializer-to-non-public-api Move serializers to non public api | ||||
| * | Move NetworkPortsSerializer to non-public-api serialization package | Valerij Fredriksen | 2019-07-01 | 2 | -2/+2 |
| | | |||||
* | | Reduced log for transient errors in ApplicationMaintainer | Valerij Fredriksen | 2019-06-30 | 1 | -0/+4 |
| | | |||||
* | | Simplify OperatorChangeApplicationMaintainer | Valerij Fredriksen | 2019-06-30 | 2 | -23/+17 |
| | | |||||
* | | Simplify PeriodicApplicationMaintainer | Valerij Fredriksen | 2019-06-30 | 2 | -16/+7 |
| | | |||||
* | | Always activate through MockDeployer | Valerij Fredriksen | 2019-06-30 | 3 | -76/+51 |
| | | |||||
* | | Ignore TransientException in NodeFailer and RetiredExpirer | Valerij Fredriksen | 2019-06-29 | 2 | -2/+14 |
|/ | |||||
* | Fix TODO | Martin Polden | 2019-06-27 | 1 | -2/+0 |
| | |||||
* | Read and write all load balancers on construction | Martin Polden | 2019-06-27 | 2 | -7/+19 |
| | |||||
* | Avoid reading all load balancers on prepare | Martin Polden | 2019-06-27 | 2 | -9/+10 |
| | |||||
* | Remove unused rotations parameter | Martin Polden | 2019-06-26 | 20 | -72/+56 |
| | |||||
* | Remove support for legacy serialization format | Martin Polden | 2019-06-25 | 3 | -48/+5 |
| | | | | New format is now used everywhere. | ||||
* | Revert "Do not require changeAt to be after previous value" | Martin Polden | 2019-06-25 | 1 | -0/+4 |
| | | | | This reverts commit a1bd5a9f0e9a72f24b96e08755deb6c17e0e7b41. | ||||
* | Force load balancer reconfig when in non-active state | Martin Polden | 2019-06-25 | 6 | -13/+81 |
| | |||||
* | Fix exception message | Martin Polden | 2019-06-25 | 1 | -1/+1 |
| | |||||
* | Log reals | Martin Polden | 2019-06-25 | 1 | -2/+2 |
| | |||||
* | Rename test | Martin Polden | 2019-06-24 | 1 | -1/+1 |
| | |||||
* | Never provision a new LB in activate | Martin Polden | 2019-06-24 | 2 | -4/+29 |
| | |||||
* | Fix name | Martin Polden | 2019-06-24 | 1 | -3/+3 |
| | |||||
* | Use common exception class for transient provisioning exceptions | Martin Polden | 2019-06-21 | 2 | -1/+25 |
| |