path: root/node-repository/src/main/java/com/yahoo/vespa/hosted/provision/maintenance/NodeFailer.java
Commit message (Author, Age, Files, Lines)
...
* getNode(...) -> node(...) (Jon Bratseth, 2021-02-10, 1, -4/+3)
* Remove getNodes(...): Always use list(...) (Jon Bratseth, 2021-02-10, 1, -1/+1)
* Remove getNodes(...): Always use list(...) (Jon Bratseth, 2021-02-09, 1, -4/+4)
* Revert "Revert "Move node operations into a separate class"" (Jon Bratseth, 2021-02-09, 1, -16/+16)
* Revert "Move node operations into a separate class" (Jon Bratseth, 2021-02-09, 1, -16/+16)
* Move node operations into a separate class (Jon Bratseth, 2021-02-08, 1, -16/+16)
* Allow a grace period after node re-activation (Valerij Fredriksen, 2020-12-02, 1, -1/+3)
* Don't make changes when there are zone-wide problems (Jon Bratseth, 2020-11-13, 1, -0/+2)
* Use node repository clock (Jon Bratseth, 2020-11-12, 1, -11/+6)
* Separate node failure maintenance from failing (Jon Bratseth, 2020-11-12, 1, -100/+7)
* Store scaling events in ZooKeeper (Jon Bratseth, 2020-10-22, 1, -1/+1)
* Use default timeout (Martin Polden, 2020-10-09, 1, -1/+1)
* Catch correct exception in NodeFailer (Martin Polden, 2020-10-09, 1, -3/+4)
* Fix comment (Martin Polden, 2020-09-29, 1, -1/+1)
* Reduce lock timeout (Martin Polden, 2020-09-29, 1, -1/+1)
* Only lock application if node status changed (Martin Polden, 2020-09-29, 1, -21/+31)
* Run NodeFailer with larger interval (Harald Musum, 2020-09-25, 1, -3/+3)
* Emit QoS metric for all maintainers (Martin Polden, 2020-07-16, 1, -3/+5)
* isDockerHost -> isHost (Jon Bratseth, 2020-07-01, 1, -9/+12)
* Continue on exception (Jon Bratseth, 2020-06-29, 1, -0/+4)
* Handle node disappearing after taking lock (Martin Polden, 2020-05-27, 1, -28/+21)
* only throttle node failures when nodes are still in state "failed" (andreer, 2020-05-22, 1, -0/+1)
* Use vespajlib maintenance package in node-repository (Martin Polden, 2020-04-29, 1, -2/+1)
* LogLevel.INFO -> Level.INFO (gjoranv, 2020-04-25, 1, -1/+1)
* Import java.util.logging.Level instead of com.yahoo.log.LogLevel (gjoranv, 2020-04-25, 1, -1/+1)
* Avoid building lots of ApplicationInstances (Håkon Hallingstad, 2020-03-08, 1, -1/+1)
* Moved to more specific methods on ServiceMonitor (Håkon Hallingstad, 2020-02-28, 1, -2/+1)
* Prepare for setting PERMANENTLY_DOWN (Håkon Hallingstad, 2020-01-30, 1, -1/+1)
* Record the specific change agent in the node history (Jon Bratseth, 2020-01-23, 1, -1/+1)
* Unreserve hosts with allocations (Jon Bratseth, 2020-01-22, 1, -1/+1)
* Remove mitigation for "NodeFailer" agent (Harald Musum, 2020-01-08, 1, -5/+5)
* Use static factory method instead of constructor to signal copying (Martin Polden, 2020-01-03, 1, -1/+1)
* Remove hardwareFailure and hardwareDivergence from node-repo maintainers (Valerij Fredriksen, 2019-09-19, 1, -20/+8)
* Fail readying a node with a hard fail report (Håkon Hallingstad, 2019-09-11, 1, -1/+1)
* Add throttled host metric (Valerij Fredriksen, 2019-08-07, 1, -4/+11)
* Nonfunctional changes only (Jon Bratseth, 2019-08-05, 1, -4/+4)
* Revert "Return 409 with error code TRANSIENT_ERROR when getting TransientExce... (Harald Musum, 2019-08-01, 1, -1/+1)
* Move some exceptions to its own package (making them not part of public API) (Harald Musum, 2019-08-01, 1, -1/+1)
* Ignore TransientException in NodeFailer and RetiredExpirer (Valerij Fredriksen, 2019-06-29, 1, -2/+8)
* Remove nodeAdminInContainer from configserver.def (Valerij Fredriksen, 2019-06-01, 1, -6/+2)
* Require lock reference for all write operations (Martin Polden, 2019-05-15, 1, -5/+5)
* Disallow failing config/controller(hosts) (Valerij Fredriksen, 2019-05-09, 1, -5/+13)
* Non-functional cleanup (Valerij Fredriksen, 2019-05-06, 1, -4/+4)
* Remove unused variable (Valerij Fredriksen, 2019-05-06, 1, -4/+0)
* Move JobControl and InfrastructureVersions to NodeRepository (Valerij Fredriksen, 2019-05-06, 1, -2/+1)
* Use the type of the node report (Håkon Hallingstad, 2019-02-28, 1, -21/+8)
* Merge pull request #8545 from vespa-engine/hakonhall/stop-using-agentnodefail... (Jon Bratseth, 2019-02-18, 1, -5/+5)
|\
| * Stop using Agent.NodeFailer until v6 is gone (Håkon Hallingstad, 2019-02-18, 1, -5/+5)
* | Require all child nodes to be suspended in NodeFailer (Håkon Hallingstad, 2019-02-18, 1, -1/+11)
|/
* Only fail tenant host nodes with failure reports (Håkon Hallingstad, 2019-02-18, 1, -6/+8)