index
:
vespa
6
7
andreer/permanent-enclave-flag
aressem/test-dummy
aressem/test-pr-bk
aressem/test-pr-build-3
aressem/test-valgrind
arnej/cosmetic-message-fix
arnej/golang-slime-port-1
arnej/remove-convert-in-calculator
arnej/use-our-shell-quote
arnej/wip-sand-fixups
balder/apply-termwise-filters-on-match-phase-2
balder/cpu-specific-compiles-for-bit-operations
balder/deinline
balder/hosted-always-convert-percentages-in-config-model
balder/no-longer-need-commit-and-wait
balder/prepare-for-hw-specialized-hamming-distance
balder/thread-local-jetty-bytebuffer-pool
balder/update-defaults-for-use-xxx-fetch-postings
balder/zncurve
bjormel/aws-main-controller
bjormel/aws-main-controller-take2
bratseth/grouping-trace
bratseth/linguistics-context-rebased
bratseth/more-exclusive-take-2
bratseth/stem-prefixes
bratseth/streamed-fill
freva/secrets
hakonhall/enumerate-all-prod-regions
hakonhall/fix-remembertoupdatesystemflagsdataarchive-javadoc
havardpe/enable-nested-ctf-meta-data
havardpe/extract-default-query-feature-values
havardpe/protoc-gen-csi
interns/languageserver
interns/theodorkl/congocc
jdk21-preparations
jonmv/allow-private-endpoints-in-dev-perf
jonmv/dependency-inversion-for-mbus-config
jvenstad/utils
kkraune/ci-warning
ldalves/querybuilder
leandroalves/prod-controller
lesters/bert-testing
lesters/external-llms
lesters/stateless-onnx-eval-once
master
mortent/calypso
mortent/new-public-cd-endpoint
mpolden/update-abi
olaa/delete-flags
olaa/otel-config-model
renovate/junit5-monorepo
renovate/major-protobuf.vespa.version
renovate/maven-shade-plugin.vespa.version
revert-26576-revert-26567-bjorncs/cloud-app-validation
revert-26584-revert-26578-bjorncs/tlsv13
revert-27857-bjorncs/tls13
revert-28660-revert-28656-hmusum/fix-onnx-model-cost
revert-30559-toregge/require-vespa-build-dependencies-for-vespa-devel
vekterli/change-test-and-set-update-not-found-semantics
yngveaasheim/skeleton-for-component-in-metrics-enum
An engine for low-latency computation over large data sets
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
node-repository
/
src
/
main
/
java
/
com
/
yahoo
/
vespa
/
hosted
/
provision
/
maintenance
/
NodeFailer.java
Commit message (
Expand
)
Author
Age
Files
Lines
*
Handle node disappearing after taking lock
Martin Polden
2020-05-27
1
-28
/
+21
*
only throttle node failures when nodes are still in state "failed"
andreer
2020-05-22
1
-0
/
+1
*
Use vespajlib maintenance package in node-repository
Martin Polden
2020-04-29
1
-2
/
+1
*
LogLevel.INFO -> Level.INFO
gjoranv
2020-04-25
1
-1
/
+1
*
Import java.util.logging.Level instead of com.yahoo.log.LogLevel
gjoranv
2020-04-25
1
-1
/
+1
*
Avoid building lots of ApplicationInstances
Håkon Hallingstad
2020-03-08
1
-1
/
+1
*
Moved to more specific methods on ServiceMonitor
Håkon Hallingstad
2020-02-28
1
-2
/
+1
*
Prepare for setting PERMANENTLY_DOWN
Håkon Hallingstad
2020-01-30
1
-1
/
+1
*
Record the specific change agent in the node history
Jon Bratseth
2020-01-23
1
-1
/
+1
*
Unreserve hosts with allocations
Jon Bratseth
2020-01-22
1
-1
/
+1
*
Remove mitigation for "NodeFailer" agent
Harald Musum
2020-01-08
1
-5
/
+5
*
Use static factory method instead of constructor to signal copying
Martin Polden
2020-01-03
1
-1
/
+1
*
Remove hardwareFailure and hardwareDivergence from node-repo maintainers
Valerij Fredriksen
2019-09-19
1
-20
/
+8
*
Fail readying a node with a hard fail report
Håkon Hallingstad
2019-09-11
1
-1
/
+1
*
Add throttled host metric
Valerij Fredriksen
2019-08-07
1
-4
/
+11
*
Nonfunctional changes only
Jon Bratseth
2019-08-05
1
-4
/
+4
*
Revert "Return 409 with error code TRANSIENT_ERROR when getting TransientExce...
Harald Musum
2019-08-01
1
-1
/
+1
*
Move some exceptions to its own package (making them not part of public API)
Harald Musum
2019-08-01
1
-1
/
+1
*
Ignore TransientException in NodeFailer and RetiredExpirer
Valerij Fredriksen
2019-06-29
1
-2
/
+8
*
Remove nodeAdminInContainer from configserver.def
Valerij Fredriksen
2019-06-01
1
-6
/
+2
*
Require lock reference for all write operations
Martin Polden
2019-05-15
1
-5
/
+5
*
Disallow failing config/controller(hosts)
Valerij Fredriksen
2019-05-09
1
-5
/
+13
*
Non-functional cleanup
Valerij Fredriksen
2019-05-06
1
-4
/
+4
*
Remove unused variable
Valerij Fredriksen
2019-05-06
1
-4
/
+0
*
Move JobControl and InfrastructureVersions to NodeRepository
Valerij Fredriksen
2019-05-06
1
-2
/
+1
*
Use the type of the node report
Håkon Hallingstad
2019-02-28
1
-21
/
+8
*
Merge pull request #8545 from vespa-engine/hakonhall/stop-using-agentnodefail...
Jon Bratseth
2019-02-18
1
-5
/
+5
|
\
|
*
Stop using Agent.NodeFailer until v6 is gone
Håkon Hallingstad
2019-02-18
1
-5
/
+5
*
|
Require all child nodes to be suspended in NodeFailer
Håkon Hallingstad
2019-02-18
1
-1
/
+11
|
/
*
Only fail tenant host nodes with failure reports
Håkon Hallingstad
2019-02-18
1
-6
/
+8
*
Remove hardwareDivergence from node-admin
Håkon Hallingstad
2019-02-18
1
-0
/
+1
*
Fail instead of retire on failure report in NodeFailer
Håkon Hallingstad
2019-02-15
1
-64
/
+14
*
Use valerijs super stream
Håkon Hallingstad
2019-02-13
1
-14
/
+18
*
10s timeout
Håkon Hallingstad
2019-02-13
1
-1
/
+1
*
Rename to activeNodes
Håkon Hallingstad
2019-02-13
1
-3
/
+3
*
Max 1 active host with wantToRetire, and fix NodeFailer.hasHardwareIssue
Håkon Hallingstad
2019-02-12
1
-11
/
+18
*
Also fail on badDiskType, badInterfaceSpeed, badCpuCount
Håkon Hallingstad
2019-02-12
1
-1
/
+7
*
Retire/fail hosts with failure reports
Håkon Hallingstad
2019-02-12
1
-10
/
+101
*
Nonfunctional changes only
Jon Bratseth
2019-02-05
1
-1
/
+1
*
Implement Iterable in NodeList
Valerij Fredriksen
2019-01-30
1
-1
/
+1
*
Remove duplicated child node filtering
Martin Polden
2019-01-14
1
-1
/
+1
*
Clarify physical nodes
Martin Polden
2019-01-03
1
-1
/
+1
*
Increase allowed to fail fraction
Martin Polden
2019-01-03
1
-1
/
+1
*
Always allow 2 parent hosts to fail in a 24 hour period
Martin Polden
2019-01-03
1
-19
/
+29
*
Include throttled active nodes
Martin Polden
2018-12-06
1
-6
/
+15
*
Emit metric for throttled node failures
Martin Polden
2018-12-06
1
-5
/
+17
*
Fail nodes because of hardware failure
Valerij Fredriksen
2018-08-21
1
-0
/
+14
*
Return active nodes that should be failed with reason
Valerij Fredriksen
2018-08-21
1
-31
/
+38
*
Simplify common history check
Valerij Fredriksen
2018-08-21
1
-15
/
+8
*
Nonfunctional changes only
Jon Bratseth
2018-03-19
1
-2
/
+3
[next]