vespa - An engine for low-latency computation over large data sets

	Commit message (Collapse)	Author	Age	Files	Lines
*	Remove rather pointless metrics stress test	Tor Brede Vekterli	2024-05-13	2	-138/+0
\| \| \| \| \|	Test has not served much of a purpose other than burning CPU cycles and seemingly greatly confusing Valgrind's thread scheduler.
*	Add embedder metrics to vespa9 metricset	Yngve Aasheim	2024-05-10	2	-4/+8
\|
*	Include BILLING_WEBHOOK_FAILURES in infrastructure metric set	Ola Aunronning	2024-04-17	1	-0/+1
\|
*	Make metrics generic to webhook, not filter	Bjørn Christian Seime	2024-04-15	1	-2/+2
\|
*	Unify on List.of	Henning Baldersheim	2024-04-11	1	-3/+2
\|
*	Support pipelining (batching) of mutating ops to same bucket	Tor Brede Vekterli	2024-04-09	2	-8/+25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Bucket operations require either exclusive (single writer) or shared (multiple readers) access. Prior to this commit, this means that many enqueued feed operations to the same bucket introduce pipeline stalls due to each operation having to wait for all prior operations to the bucket to complete entirely (including fsync of WAL append). This is a likely scenario when feeding a document set that was previously acquired through visiting, as such documents will inherently be output in bucket-order. With this commit, a configurable number of feed operations (put, remove and update) bound for the exact same bucket may be sent asynchronously to the persistence provider in the context of the _same_ write lock. This mirrors how merge operations work for puts and removes. Batching is fairly conservative, and will _not_ batch across further messages when any of the following holds: * A non-feed operation is encountered * More than one mutating operation is encountered for the same document ID * No more persistence throttler tokens can be acquired * Max batch size has been reached Updating the bucket DB, assigning bucket info and sending replies is deferred until _all_ batched operations complete. Max batch size is (re-)configurable live and defaults to a batch size of 1, which shall have the exact same semantics as the legacy behavior. Additionally, clock sampling for persistence threads have been abstracted away to allow for mocking in tests (no need for sleep!).
*	Emit suspended seconds. Update metrics for non-active nodes	Ola Aunronning	2024-03-27	1	-0/+1
\|
*	Wire Prometheus metric export to state V1 APIs	Tor Brede Vekterli	2024-03-21	3	-24/+64
\| \| \| \| \| \| \| \| \| \|	Extends metric producer classes with the requested exposition format. As a consequence, the State API server has been changed to allow emitting other content types than just `application/json`. Add custom Prometheus rendering for Slobrok, as it does its own domain-specific metric tracking. However, since it has non-destructive sampling properties, we can actually use proper `counter` types.
*	Simplify sample emplacement	Tor Brede Vekterli	2024-03-19	1	-5/+5
\|
*	Support internal metric rendering in Prometheus text format in C++	Tor Brede Vekterli	2024-03-19	6	-31/+531
\| \| \| \| \| \| \| \| \| \| \|	Maps all internal metrics to one or more labelled time series. Due to poor compatibility between the data model (and sampling strategy) of the legacy metrics framework and that of Prometheus, all time series are emitted as `untyped` metrics. This is a stop-gap solution on the way to "properly" supporting Prometheus exposition, and the output of this renderer should therefore only be used for internal purposes.
*	Expiry metrics are counters, not gauges	Ola Aunronning	2024-03-12	1	-5/+5
\|
*	Update metrics/src/main/java/ai/vespa/metrics/Labels.java	Yngve Aasheim	2024-02-13	1	-1/+1
\| \| \|	Co-authored-by: Ola Aunrønning <olaa@yahooinc.com>
*	Add legacy names to label enum	Yngve Aasheim	2024-02-12	1	-18/+29
\|
*	Add enum for coredumps.processed	Yngve Aasheim	2024-02-12	1	-0/+1
\|
*	Add missing dependency.	Yngve Aasheim	2024-02-12	1	-0/+1
\|
*	Add metric needed for the stand-up dashboard	Yngve Aasheim	2024-02-12	1	-0/+2
\|
*	Merge pull request #30178 from vespa-engine/bjormel/expose_expire_metrics	Bjørn Meland	2024-02-05	2	-0/+10
\|\ \| \| \| \|	Expose expire metrics
\| *	Now, really expose the metrics	bjormel	2024-02-05	1	-0/+5
\| \|
\| *	Expose expire metrics	bjormel	2024-02-05	1	-0/+5
\| \|
* \|	Change ai.vespa.instance_id to ai.vespa.instance also	Yngve Aasheim	2024-02-05	1	-1/+1
\| \|
* \|	Add ai.vespa.node	Ola Aunronning	2024-02-05	1	-0/+1
\|/
*	Use stored entry count rather than bucket count for (dis-)allowing permanent ↵	Tor Brede Vekterli	2024-01-26	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	node down edge The stored entry count encompasses both visible documents and tombstones. Using this count rather than bucket count avoids any issues where a node only containing empty buckets (i.e. no actual data) is prohibited from being marked as permanently down. Entry count is cross-checked with the visible document count; if the former is zero, the latter should always be zero as well. Since entry/doc counts were only recently introduced as part of the HostInfo payload, we have to handle the case where these do not exist. If entry count is not present, the decision to allow or disallow the transition falls back to the bucket count check.
*	Add enum skeleton for labels	Yngve Aasheim	2024-01-23	1	-0/+52
\|
*	Track correct metric	Martin Polden	2024-01-18	1	-1/+1
\|
*	Add temporary metric for tracking grid usage	Bjørn Christian Seime	2024-01-16	1	-0/+2
\|
*	Expose clusterAutoscaled metric	Yngve Aasheim	2024-01-03	1	-0/+1
\|
*	Emit metric counting autoscale events	Martin Polden	2023-12-11	1	-0/+1
\|
*	Merge pull request #29571 from vespa-engine/mpolden/detect-redist	Jon Bratseth	2023-12-06	1	-1/+5
\|\ \| \| \| \|	Let distributor metric decide cluster stability
\| *	Add merge pending metric	Martin Polden	2023-12-06	1	-1/+5
\| \|
* \|	Update ClusterControllerMetrics.java	Yngve Aasheim	2023-12-05	1	-2/+2
\| \|
* \|	Add enums for kinesislogger metrics	Yngve Aasheim	2023-12-05	1	-1/+7
\|/
*	Add metric for job runner executor size, to compute util	jonmv	2023-11-30	2	-0/+2
\|
*	Add deployment job duration metric	jonmv	2023-11-27	2	-1/+3
\|
*	Merge pull request #29447 from ↵	Henning Baldersheim	2023-11-23	3	-0/+9
\|\ \| \| \| \| \| \| \| \|	vespa-engine/vekterli/expose-remove-by-gid-metrics Expose `remove_by_gid` persistence-level metrics
\| *	Expose `remove_by_gid` persistence-level metrics	Tor Brede Vekterli	2023-11-23	3	-0/+9
\| \|
* \|	Add metrics for billing webhook filter	Bjørn Christian Seime	2023-11-22	1	-0/+2
\| \|
* \|	Add new metric	Øyvind Grønnesby	2023-11-21	1	-0/+1
\|/
*	Include empty exclusive hosts metric	Ola Aunronning	2023-11-07	1	-0/+1
\|
*	Export estimated merge memory usage metric	Tor Brede Vekterli	2023-11-03	3	-0/+3
\| \| \| \| \|	Having visibility of this number will make it easier to choose sensible defaults based on observations of existing systems.
*	Merge pull request #28972 from ↵	Ola Aunrønning	2023-10-17	1	-1/+7
\|\ \| \| \| \| \| \| \| \|	vespa-engine/yngveaasheim/add-description-to-metric-set-reference-doc Add a short description to metric set reference documentation
\| *	Add a short description to metric set reference documentation	Yngve Aasheim	2023-10-17	1	-1/+7
\| \|
* \|	Introduce metrics for mail sending	Bjørn Christian Seime	2023-10-16	2	-1/+10
\|/
*	Add .min suffix for singleton.is_active	Yngve Aasheim	2023-10-12	1	-1/+1
\|
*	Add .min suffix for singleton.is_active	Yngve Aasheim	2023-10-12	1	-1/+1
\|
*	Revert "Merge pull request #28879 from ↵	jonmv	2023-10-11	2	-0/+4
\| \| \| \| \| \| \|	vespa-engine/revert-28869-jonmv/job-runner-thread-metrics" This reverts commit 67351aa3e2adbbb4872097ed799f1ca837f35e6d, reversing changes made to aed7902ee0371efb89747d467c4a2f8124ddc08d.
*	Revert "Jonmv/job runner thread metrics"	Harald Musum	2023-10-11	2	-4/+0
\|
*	Add metrics for job-runner threads	jonmv	2023-10-11	2	-0/+4
\|
*	Correct copyright headers	Jon Bratseth	2023-10-09	4	-5/+4
\|
*	Update copyright	Jon Bratseth	2023-10-09	95	-73/+95
\|
*	Remove some metrics from Vesa9vespa metricset.	yngveaasheim	2023-10-06	1	-5/+1
\|