vespa - An engine for low-latency computation over large data sets

	Commit message	Author	Age	Files	Lines
*	Revert changes to config generation	Håkon Hallingstad	2021-10-20	10	-30/+46
\|
*	Fixes after review round	Håkon Hallingstad	2021-10-19	12	-95/+90
\|
*	Improve logging of FleetController and DatabaseHandler	Håkon Hallingstad	2021-10-15	21	-256/+371
\|
*	Merge pull request #19566 from vespa-engine/hakonhall/some-optimizations-of-rpcservertest	Håkon Hallingstad	2021-10-14	4	-22/+31
\|\	Some optimizations of RpcServerTest
\| *	Some optimizations of RpcServerTest	Håkon Hallingstad	2021-10-14	4	-22/+31
\| \|
* \|	Merge pull request #19564 from vespa-engine/freva/prepare-container-fs	Valerij Fredriksen	2021-10-14	27	-175/+166
\|\ \	Prepare to use ContainerPaths
\| * \|	Update default container storage root	Valerij Fredriksen	2021-10-14	4	-11/+11
\| \| \|
\| * \|	Use String instead of Path where possible	Valerij Fredriksen	2021-10-14	5	-33/+28
\| \| \|
\| * \|	Simplify with UnixPath	Valerij Fredriksen	2021-10-14	8	-64/+61
\| \| \|
\| * \|	Create factory method for NodeAgentContext builder	Valerij Fredriksen	2021-10-14	12	-29/+33
\| \| \|
\| * \|	Create factory method for ContainerFileSystem	Valerij Fredriksen	2021-10-14	5	-38/+33
\| \| \|
* \| \|	Merge pull request #19544 from vespa-engine/container-config-improvements	gjoranv	2021-10-14	10	-142/+163
\|\ \ \	Container config improvements [run-systemtest]
\| * \| \|	Init the config generation to 1 instead of 0.	gjoranv	2021-10-14	1	-1/+1
\| \| \| \|	- An initial value of 0 generated the config generation sequence 1,1,2,3,..., causing an exception in Container.getConfigAndCreateGraph when it got bootstrap configs with generation=1 twice.
\| * \| \|	Rename config retriever field.	gjoranv	2021-10-14	1	-7/+7
\| \| \| \|
\| * \| \|	Allow exceptions from the config system to propagate up.	gjoranv	2021-10-13	1	-9/+2
\| \| \| \|
\| * \| \|	Simplify and improve config retrieval.	gjoranv	2021-10-13	1	-34/+27
\| \| \| \|	- Retrieve bootstrap snapshot first when the system is in the stable state. When bootstrap is newer than components, retrieve the new components generation. This avoids getting exceptions from the config system when a component that takes a config with missing default value has been removed.
\| \| \| \|	- Do not close set up empty component subscriber after bootstrap, should be unnecessary as it's always done when component config keys are changed.
\| \| \| \|	- Declare getConfigsOnce private.
\| \| \| \|	- Improve debug logging
\| * \| \|	Improve debug logging.	gjoranv	2021-10-13	1	-4/+5
\| \| \| \|
\| * \| \|	minor: rearrange fields.	gjoranv	2021-10-08	1	-1/+2
\| \| \| \|
\| * \| \|	Improve debugging of CloudSubscriber by adding a name.	gjoranv	2021-10-08	7	-18/+21
\| \| \| \|
\| * \| \|	Simplify by taking a SubscriberFactory instead of a Function.	gjoranv	2021-10-08	3	-11/+10
\| \| \| \|
\| * \| \|	Move CloudSubscriber to separate class file.	gjoranv	2021-10-08	2	-75/+101
\| \| \| \|
\| * \| \|	Add more debug log for config generations.	gjoranv	2021-10-08	2	-3/+7
\| \| \| \|
\| * \| \|	Use correct method name in log message.	gjoranv	2021-10-08	1	-1/+1
\| \| \| \|
\| * \| \|	Improve comment	gjoranv	2021-10-08	1	-1/+2
\| \| \| \|
* \| \| \|	Merge pull request #19565 from vespa-engine/hmusum/config-cleanup-1	Henning Baldersheim	2021-10-14	3	-25/+7
\|\ \ \ \	Cleanup, no functional changes
\| * \| \| \|	Cleanup, no functional changes	Harald Musum	2021-10-14	3	-25/+7
\| \| \| \| \|
* \| \| \| \|	Merge pull request #19559 from vespa-engine/hmusum/upgrade-to-curator-5.2.0	Håkon Hallingstad	2021-10-14	13	-13/+24
\|\ \ \ \ \	Upgrade to Curator 5.2.0 [run-systemtest]
\| * \| \| \| \|	Upgrade to Curator 5.2.0	Harald Musum	2021-10-14	13	-13/+24
\| \| \|_\|/ /
\| \|/\| \| \|
* \| \| \| \|	Merge pull request #19554 from vespa-engine/hmusum/application-package-maintainer-changes	Henning Baldersheim	2021-10-14	4	-5/+9
\|\ \ \ \ \	Improve download of application package in maintainer [run-systemtest]
\| \|_\|/ / /
\|/\| \| \| \|
\| * \| \| \|	Improve download of application package in maintainer	Harald Musum	2021-10-14	4	-5/+9
\| \| \| \| \|	Set downloadFromOtherSourceIfNotFound to false, so that the receiving config server that gets the request doesn't try to download a file reference. This will be done by the ApplicationPackageMaintainer on the other server anyway.
* \| \| \| \|	Merge pull request #19562 from vespa-engine/balder/prevent-division-by-zero	Jon Bratseth	2021-10-14	2	-11/+18
\|\ \ \ \ \	Prevent division by zero
\| * \| \| \| \|	Update metrics-proxy/src/main/java/ai/vespa/metricsproxy/service/SystemPoller.java	Jon Bratseth	2021-10-14	1	-1/+1
\| \| \| \| \| \|
\| * \| \| \| \|	Prevent division by zero	Henning Baldersheim	2021-10-14	2	-11/+18
\| \| \| \| \| \|
* \| \| \| \| \|	Merge pull request #19560 from vespa-engine/vekterli/add-distributor-enhanced-maintenance-scheduling-feature-flag	Tor Brede Vekterli	2021-10-14	6	-2/+43
\|\ \ \ \ \ \	Add feature flag for enhanced distributor maintenance scheduling
\| \|/ / / / /
\|/\| \| \| \| \|
\| * \| \| \| \|	Add feature flag for enhanced distributor maintenance scheduling	Tor Brede Vekterli	2021-10-14	6	-2/+43
\| \| \|/ / /
\| \|/\| \| \|
* \| \| \| \|	Merge pull request #19556 from vespa-engine/vekterli/add-metric-for-max-time-since-bucket-gc	Tor Brede Vekterli	2021-10-14	9	-16/+89
\|\ \ \ \ \	Add metric for max time since bucket GC was last run
\| * \| \| \| \|	Add metric for max time since bucket GC was last run	Tor Brede Vekterli	2021-10-14	9	-16/+89
\| \| \|_\|_\|/	Max time is aggregated across all buckets. If this metric value grows substantially larger than the configured GC period it indicates that GC is being starved.
\| \|/\| \| \|
* \| \| \| \|	Merge pull request #19547 from vespa-engine/toregge/add-detailed-metrics-for-failed-merge-operations	Geir Storli	2021-10-14	5	-33/+111
\|\ \ \ \ \	Add detailed metrics for failed merge operations.
\| \|_\|/ / /
\|/\| \| \| \|
\| * \| \| \|	Use ASSERT_NO_FATAL_FAILURE() to propagate fatal failures.	Tor Egge	2021-10-14	1	-6/+6
\| \| \| \| \|
\| * \| \| \|	Add detailed metrics for failed merge operations.	Tor Egge	2021-10-14	5	-33/+111
\| \| \| \| \|
* \| \| \| \|	Merge pull request #19379 from vespa-engine/vekterli/avoid-stalling-maintenance-scheduling-if-single-op-blocked	Tor Brede Vekterli	2021-10-14	13	-81/+252
\|\ \ \ \ \	Don't let a blocked maintenance operation inhibit remaining maintenance queue [run-systemtest]
\| \|_\|/ / /
\|/\| \| \| \|
\| * \| \| \|	Use blocking scheduling semantics for bucket activation maintenance	Tor Brede Vekterli	2021-10-14	3	-4/+23
\| \| \| \| \|	We consider bucket maintenance so latency critical that we'll prefer to stall scheduling of subsequent buckets instead of risking having to re-scan the DB to encounter the bucket again.
\| * \| \| \|	Make implicit bucket priority DB clearing on scheduling configurable	Tor Brede Vekterli	2021-10-14	8	-16/+90
\| \| \| \| \|
\| * \| \| \|	Don't let a blocked maintenance operation inhibit remaining maintenance queue	Tor Brede Vekterli	2021-10-14	9	-72/+150
\|/ / / /	The old maintenance scheduler behavior is to only remove a bucket from the priority DB if its maintenance operation was successfully started. Failing to start an operation could happen from both max pending throttling as well as operation/bucket-specific blocking behavior.
\| \| \| \|	Since the scheduler would encounter the same bucket as the one previously blocked upon its next tick invocation, a single blocked bucket would run the risk of head-of-line stalling the rest of the remaining maintenance queue (assuming the ongoing DB scan did not encounter any higher priority buckets).
\| \| \| \|	This commit changes the following aspects of maintenance scheduling:
\| \| \| \|	* Always clear entries from the priority DB before trying to start an operation. A blocked operation will be retried the next time the regular bucket DB scan encounters the bucket.
\| \| \| \|	* Avoid trying to start (and clear) inherently doomed operations by _not_ trying to schedule any operations if they would be blocked due to too many pending maintenance operations anyway. Introduces a new `PendingWindowChecker` interface for this purpose.
\| \| \| \|	* Explicitly inhibit all maintenance scheduling if a pending cluster state is present. Operations are already _implicitly_ blocked from starting if there's a pending cluster state, but this would cause the priority DB to be pointlessly cleared.
* \| \| \|	Merge pull request #19553 from vespa-engine/balder/test-system-metrics	Jon Bratseth	2021-10-14	4	-55/+210
\|\ \ \ \	Balder/test system metrics
\| \|_\|/ /
\|/\| \| \|
\| * \| \|	cpu.util -> cpu_util	Henning Baldersheim	2021-10-14	2	-5/+9
\| \| \| \|
\| * \| \|	Make system metrics testable.	Henning Baldersheim	2021-10-14	4	-53/+204
\| \| \| \|
* \| \| \|	Merge pull request #19548 from vespa-engine/hakonhall/reduce-running-time-of-masterelectiontest-from-28-to-12s	Håkon Hallingstad	2021-10-14	2	-1/+11
\|\ \ \ \	Reduce running time of MasterElectionTest from 28 to 12s
\| \|/ / /
\|/\| \| \|
\| * \| \|	Reduce running time of MasterElectionTest from 28 to 12s	Håkon Hallingstad	2021-10-14	2	-1/+11
\| \| \| \|
* \| \| \|	Merge pull request #19551 from vespa-engine/mpolden/image-selection-cleanup	Martin Polden	2021-10-14	17	-202/+102
\|\ \ \ \	Stop reading container images from ZK