summaryrefslogtreecommitdiffstats
path: root/zkfacade/src/main/java/com
Commit message (Collapse)AuthorAgeFilesLines
...
* Move lock metrics to MetricsReporterHåkon Hallingstad2020-10-0312-186/+423
| | | | | | | | | | | | | | | Adds two new metrics: - The load of acquiring each lock path: The average number of threads waiting to acquire the lock within the last minute (or unit of time). Aka the lock queue (depth). - The load of the lock for each lock path: The average number of threads holding the lock within the last minute (or unit of time). This is always <= 1. Aka the lock utilization. Changes the LockCounters to LockMetrics, and exporting those once every minute through MetricReporter which is designed for this.
* Merge pull request #14657 from ↵Håkon Hallingstad2020-10-015-32/+102
|\ | | | | | | | | vespa-engine/hakonhall/add-metrics-to-lock-attempts Add metrics to lock attempts
| * Add metrics to lock attemptsHåkon Hallingstad2020-10-015-32/+102
| |
* | Merge pull request #14655 from vespa-engine/mpolden/fix-agentValerij Fredriksen2020-10-011-1/+1
|\ \ | |/ |/| Store correct agent when adding nodes
| * Store correct agent when adding nodesMartin Polden2020-10-011-1/+1
| |
* | Record locks taken for external deploysHåkon Hallingstad2020-09-306-55/+191
|/ | | | | | | | | | | | - Information about a lock attempt now includes a list of lock attempts done while holding the lock, forming a tree (forest) structure. - Records the duration and locking attempts done as part of an external deploy, forming a tree of locks with timing info. The currently active external deploys are shown in an "ongoing-recording" field of /nodes/v2/locks. - The 3 longest external deploys are kept in "recordings" in /nodes/v2/locks. - Extracts the global process-wide parts of ThreadLockStats into separate class for clarity.
* Add count of failed releasesHåkon Hallingstad2020-09-284-10/+18
|
* More info -> attempt renamesHåkon Hallingstad2020-09-282-10/+10
|
* LockInfo -> LockAttempt, ThreadLockInfo -> ThreadLockStats, and moreHåkon Hallingstad2020-09-284-67/+67
|
* Use deque as stackHåkon Hallingstad2020-09-282-14/+9
|
* Mock lock path from thread to per-lock (bug)Håkon Hallingstad2020-09-264-79/+164
|
* Dump stack trace once per threadHåkon Hallingstad2020-09-262-27/+29
|
* Adds method name to stack trace and adds timeout count and testHåkon Hallingstad2020-09-254-4/+50
|
* Remove reentrant lock no longer neededHåkon Hallingstad2020-09-254-50/+30
|
* Add duration of acquire, in locked, and totalHåkon Hallingstad2020-09-252-3/+18
|
* Avoid double iterationHåkon Hallingstad2020-09-251-9/+8
|
* Make stacktraces for active locks during request handlingHåkon Hallingstad2020-09-242-26/+30
|
* Also show the longest-living historical locks, with stack traceHåkon Hallingstad2020-09-242-11/+59
|
* Count events per zk path and move to separate packageHåkon Hallingstad2020-09-245-37/+72
|
* Expose locks info in REST APIHåkon Hallingstad2020-09-243-2/+203
|
* Avoid unnecesary logging: Reduce log level or removeHarald Musum2020-08-311-1/+1
|
* Revert "Upgrade to Curator 4"Harald Musum2020-08-172-448/+45
|
* Upgrade to Curator 4Harald Musum2020-08-162-45/+448
|
* Actually don't create parentsJon Marius Venstad2020-08-111-1/+1
|
* Avoid creating session path when creating waitersJon Marius Venstad2020-08-101-1/+1
|
* Revert "Revert "Reapply "Upgrade to Curator 2.13.0"""Harald Musum2020-08-031-38/+76
|
* Revert "Reapply "Upgrade to Curator 2.13.0""Harald Musum2020-07-301-76/+38
|
* Revert "Revert "Upgrade to Curator 2.13.0""Harald Musum2020-07-301-38/+76
|
* Revert "Upgrade to Curator 2.13.0"Harald Musum2020-07-301-76/+38
|
* Remove stray fileHarald Musum2020-07-291-1199/+0
|
* Upgrade to Curator 2.13.0Harald Musum2020-07-292-38/+1275
|
* Do not wait longer for more participants in barrierHarald Musum2020-05-251-11/+2
|
* Wait longer for servers to reach barrierHarald Musum2020-04-293-20/+49
| | | | | | | 1. Wait up to 2 seconds for all to reach barrier. 2. If not, wait up to 4 seconds for the server that waits for the barrier to be one of the respondents AND a majority of servers have reached barrier. 3. If not, wait for a majority of servers to have reached barrier.
* LogLevel.DEBUG -> Level.FINEgjoranv2020-04-251-2/+2
|
* Import java.util.logging.Level instead of com.yahoo.log.LogLevelgjoranv2020-04-251-1/+1
|
* Let Curator own re-entrant locksMartin Polden2020-04-142-1/+17
|
* Use Duration for timeoutsMartin Polden2020-04-141-18/+14
|
* Merge pull request #11815 from ↵Jon Marius Venstad2020-03-201-6/+22
|\ | | | | | | | | vespa-engine/jvenstad/wrap-curator-mutex-with-reentarnt-lock Hold a JVM-wide reentrant lock to grab mutex — helps ZK stale reads?
| * Be less stupoidJon Marius Venstad2020-01-161-1/+4
| |
| * Differentiate between failing to acquire the two locksJon Marius Venstad2020-01-161-10/+7
| |
| * Swap order of locks, to avoid doubling timeout durationJon Marius Venstad2020-01-161-6/+14
| |
| * Hold a JVM-wide reentrant lock to grab mutex — helps ZK stale reads?Jon Marius Venstad2020-01-161-1/+9
| |
* | Update Curator.javaJon Marius Venstad2020-01-201-3/+2
| |
* | Log curator state changes SUSPENDED, RECONNECTED and LOSTJon Marius Venstad2020-01-201-1/+8
|/
* Create zookeeper client config file only when necessaryHarald Musum2020-01-091-16/+18
|
* Revert "Reapply "Upgrade to Curator 2.13.0""Harald Musum2020-01-091-76/+38
|
* Revert "Revert "Upgrade to Curator 2.13.0""Harald Musum2020-01-071-38/+76
|
* Cache historic runs (with mock spuport for ZK node versions)Jon Marius Venstad2019-12-203-5/+18
|
* Revert "Upgrade to Curator 2.13.0"Harald Musum2019-12-201-76/+38
|
* Upgrade to Curator 2.13.0Harald Musum2019-12-191-38/+76
|