| Commit message | Author | Age | Files | Lines |
|
We already require 3 config server (and controller) nodes, but that is not
sufficient to protect the hosts from being left with only 1 healthy host: say
the config server host application contains 2 nodes. An upgrade of host-admin
on one of those nodes is allowed, since only the host is suspended and neither
of the 2 nodes is down. This is fixed by handling config server hosts similarly
to config servers: assume 3 nodes.
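The rule above can be sketched as follows. This is an illustrative sketch, not
the actual Orchestrator code, and the class and method names are hypothetical:
a config server host cluster is treated as if it always had 3 nodes, of which
a majority (2) must stay healthy, so at most one host may be out at a time.

```java
public class ConfigServerHostPolicy {
    // Assumption from the commit message: treat the cluster as if it has
    // 3 nodes, regardless of how many hosts the application actually has.
    private static final int ASSUMED_CLUSTER_SIZE = 3;

    // Keep a majority (2 of 3) healthy: at most one node may be down.
    private static final int MAX_DOWN = ASSUMED_CLUSTER_SIZE - 2;

    /**
     * Returns true if one more host may suspend, given how many hosts are
     * already down or suspended. With a 2-host application, a second host
     * is denied because the assumed 3-node cluster already has one out.
     */
    public static boolean maySuspendAnotherHost(int hostsAlreadyDown) {
        return hostsAlreadyDown + 1 <= MAX_DOWN;
    }
}
```

With this sketch, suspending the first host succeeds, while suspending a
second host while one is already out is denied.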
|
This PR introduces a new flag, group-suspension, which when true enables:
- Instead of allowing at most one storage node to suspend at any given time,
the Orchestrator will now ignore the storagenode, searchnode, and distributor
service clusters and rely on the cluster controller to allow or deny the
request to suspend. This will increase the load on the cluster controllers.
Combined with earlier changes to the cluster controller, this new flag
effectively guards the feature of allowing all nodes within a hierarchical
group to suspend concurrently.
I also took the opportunity to tune related policies:
- Allow at most one config server and one controller to be down at any given
time. This is actually a no-op, since it was effectively equal to the older
policy of 10% down.
- Allow 20% of all host-admins to be down, not just tenant host-admins. This
is effectively equal to the old policy of 10%, except that it may allow 2
proxy host-admins to go down at the same time. That should be fine.
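The two tuned limits above can be sketched like this. The class and method
names are hypothetical, and the rounding behavior (20% rounded down, but
always allowing at least one) is an assumption, not taken from the actual
policy code:

```java
public class SuspensionLimits {
    /** At most one config server or controller may be down at any given time. */
    public static boolean configServerMaySuspend(int alreadyDown) {
        return alreadyDown == 0;
    }

    /**
     * At most 20% of all host-admins may be down at once. Integer division
     * rounds down; the Math.max guarantees at least one is always allowed.
     */
    public static boolean hostAdminMaySuspend(int totalHostAdmins, int alreadyDown) {
        int maxDown = Math.max(1, totalHostAdmins / 5); // 20%, rounded down
        return alreadyDown < maxDown;
    }
}
```

For example, with 10 host-admins the 20% limit allows 2 to be down at once,
which matches the "2 proxy host-admins" observation in the message above.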
|
application-model/src/main/java/com/yahoo/vespa/applicationmodel/ClusterId.java
Co-authored-by: Harald Musum <musum@verizonmedia.com>
|
Adds a new REST API, /orchestrator/v1/health/<ApplicationId>, that shows the
list of services that are monitored for health. This information is currently
a bit difficult to infer from
/orchestrator/v1/instances/<ApplicationInstanceReference>, since that is the
combined view of health and Slobrok. There are already APIs for Slobrok.
Example content:
$ curl -s localhost:19071/orchestrator/v1/health/hosted-vespa:zone-config-servers:default | jq .
{
  "services": [
    {
      "clusterId": "zone-config-servers",
      "serviceType": "configserver",
      "configId": "zone-config-servers/cfg6",
      "status": {
        "serviceStatus": "UP",
        "lastChecked": 1548939111.708718,
        "since": 1548939051.686223,
        "endpoint": "http://cfg4.prod.cd-us-central-1.vespahosted.ne1.yahoo.com:19071/state/v1/health"
      }
    },
    ...
  ]
}
This view is slightly different from the application model view, simply
because that is exactly how the health monitoring is structured (individual
monitors against endpoints).
The "endpoint" information will also be added to /instances if the status
comes from health rather than Slobrok.
|
The service monitor uses /state/v1/health to monitor the config servers and
the host admins (but not yet the tenant host admins).
This commit adds some metadata about the status of a service:
- the time the status was last checked
- the time the status changed to its current value
This can be used, for example, to make more intelligent decisions in the
Orchestrator, such as only allowing a service to suspend if it has been DOWN
for longer than X seconds (to avoid spurious DOWN statuses breaking redundancy
and uptime guarantees).
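The kind of decision this metadata enables can be sketched as follows. The
class and method names are hypothetical; the epoch-second doubles mirror the
"since"/"lastChecked" format used by the health REST API elsewhere in this
log:

```java
public class DownGracePolicy {
    /**
     * Returns true if a service that went DOWN at downSinceEpochSec has
     * stayed DOWN for longer than graceSeconds as of nowEpochSec. A service
     * that only recently turned DOWN is not yet eligible to suspend, which
     * filters out spurious DOWN reports.
     */
    public static boolean hasBeenDownLongEnough(double downSinceEpochSec,
                                                double nowEpochSec,
                                                double graceSeconds) {
        return nowEpochSec - downSinceEpochSec > graceSeconds;
    }
}
```

With a 30-second grace period, a service DOWN for a minute qualifies, while
one DOWN for only a few seconds does not.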
|
DuperModel is (or will be) responsible for both active tenant applications
(through the SuperModel) and infrastructure applications. This PR is one step
in that direction:
- All infrastructure applications (config, confighost, controller,
controllerhost, and proxyhost) are owned and managed by the DuperModel.
- The InfrastructureProvisioner retrieves all possible infra apps from the
DuperModel (through a reduced API), and "activates" each of them if a
target is set and there are any nodes, etc.
- The InfrastructureProvisioner then notifies the DuperModel of which
apps have been activated, and with which hosts.
- The DuperModel can then artificially create an ApplicationInfo, which gets
translated into the application model, and finally the service model.
- The resulting service model has NOT_CHECKED for each host-admin service
instance. This is sufficient for goal 1 of this sprint.
- The config server application currently has health, so that is kept as-is
for now.
- Feature flags have been tried and work; they allow 1. disabling the addition
of the infra apps in the DuperModel, and 2. enabling the infra config server
instead of the currently created config server with health.
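The interaction described above can be sketched as a minimal in-memory model.
All names here are assumptions for illustration, not the actual Vespa API: the
DuperModel owns the set of infra applications, and the
InfrastructureProvisioner reports back which hosts each activated application
received.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

class DuperModelSketch {
    // The infra applications owned by the DuperModel (from the list above).
    private final List<String> infraApplications =
            List.of("config", "confighost", "controller", "controllerhost", "proxyhost");

    // Which hosts each activated application received.
    private final Map<String, List<String>> activatedHosts = new HashMap<>();

    /** The reduced API the InfrastructureProvisioner reads from. */
    List<String> getInfraApplications() {
        return infraApplications;
    }

    /** Called by the InfrastructureProvisioner after activating an application. */
    void infraApplicationActivated(String application, List<String> hostnames) {
        activatedHosts.put(application, List.copyOf(hostnames));
    }

    /** Hosts of an activated application, if any; the basis for building ApplicationInfo. */
    Optional<List<String>> hostsOf(String application) {
        return Optional.ofNullable(activatedHosts.get(application));
    }
}
```

An application that has not been activated yields an empty Optional, so no
ApplicationInfo would be created for it.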
|
With this PR, if the nodeAdminInContainer ConfigserverConfig has been set, the
service monitor will always report the node admin container service as UP,
thereby avoiding issues related to the standalone node admin seemingly being
down when not running as part of the application.
This postpones checking /state/v1/health until later.
|
- Add missing dependencies so that all provided non-yahoo jars
are listed in container-dependency-versions.
- Add relativePath for all child poms of parent.
|
- Add missing dependencies so that all provided non-yahoo jars
are listed in container-dependency-versions.
- Add relativePath for all child poms of parent.
|
- Add missing dependencies so that all provided non-yahoo jars
are listed in container-dependency-versions.
- Add relativePath for all child poms of parent.
|
This should be a no-op. The only changes that could actually have an impact
are those to how the cluster controllers are retrieved, but they should be
functionally equivalent.
This PR will make it easier to change the Orchestrator policy to allow
suspending several nodes (a NodeGroup) in an application on a single Docker
host.