vespa - An engine for low-latency computation over large data sets

	Commit message (Collapse)	Author	Age	Files	Lines
*	Revert "Revert "Unify access to assets needed during rank-setup.""	Henning Baldersheim	2022-09-07	1	-2/+2
\|
*	Revert "Unify access to assets needed during rank-setup."	Tor Egge	2022-09-07	1	-2/+2
\|
*	Unify access to assets needed during rank-setup.	Henning Baldersheim	2022-09-06	1	-2/+2
\|
*	remove unused doxygen setup files	Arne Juul	2022-08-29	1	-994/+0
\|
*	Add support for two-phase document garbage collection	Tor Brede Vekterli	2022-08-17	2	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If enabled, garbage collection is performed in two phases (metadata gathering and deletion) instead of just a single phase. Two-phase GC allows for ensuring the same set of documents is deleted across all nodes and explicitly takes write locks on the distributor to prevent concurrent feed ops to GC'd documents from potentially creating inconsistencies. Two-phase GC is only used _iff_ all replica content nodes support the feature _and_ it's enabled in config. An additional field has been added to the feature negotiation functionality to communicate support from content nodes to distributors.
*	Don't add tombstone in dummy persistence when a newer entry already exists	Tor Brede Vekterli	2022-08-17	2	-5/+52
\| \| \| \| \| \| \|	This better mirrors how Proton actually works, since it's not a multi version store. Since only the highest timestamped entry for a document is the one that is ever considered on a node, there's no point in storing an explicit tombstone that cannot be referenced.
*	Add wrapper for <doc id, timestamp> tuple and update APIs to use this	Tor Brede Vekterli	2022-07-07	9	-12/+69
\| \| \| \| \|	Feels more intuitive to have a tuple that implies "document foo at timestamp bar" rather than the current inverse of "timestamp bar with document foo".
*	Collapse persistencetypes into persistence	Henning Baldersheim	2022-05-18	4	-1/+121
\|
*	- Move persitence/spi/types.h under to persitence/spi/types.h	Henning Baldersheim	2022-05-18	3	-3/+3
\| \| \| \|	- Cut dependency to persistencetypes for searchlib.
*	GC unused code and dependencies	Henning Baldersheim	2022-05-14	1	-5/+0
\|
*	Don't attempt to actually execute document moves from a cancelled bucket mover	Tor Brede Vekterli	2022-05-12	2	-3/+63
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This prevents the following race condition where the bucket mover logic fails to notify the content layer that the bucket sub DB status has changed for a particular bucket: 1. Bucket state is changed over SPI, a mover is created and registered and a BucketTask is scheduled onto the persistence queues to actually do the document reads and finalize the move. 2. Before the bucket task is executed, bucket state is changed again over the SPI. A new mover is created, the old one is cancelled (tagging mover as not consistent) and another BucketTask is scheduled onto the persistence queues. Note: the old task still remains. 3. Old bucket task is executed and performs the actual document moving despite being cancelled. No notification is done towards the content layer since the mover was tagged as not being consistent. 4. New bucket task is executed and tries to move the same document set as the old mover. Since the documents are no longer present in the source document DB, the moves fail. This tags the mover as inconsistent and no notification is done. Bucket is automatically rechecked, but since all docs are already moved away there is nothing more to do and no subsequent mover is created. This means the "should notify?" edge is not triggered and the content layer remains blissfully unaware of any sub DB changes. This commit simply changes cancellation to actually inhibit document moves from taking place. This lets the preempting mover successfully complete its moves, thus triggering the notify-edge as expected.
*	GC unused Context parameter	Henning Baldersheim	2022-03-31	10	-323/+259
\|
*	Remove copy constructors.	Henning Baldersheim	2022-03-28	1	-108/+46
\|
*	Use both lvalue and rvalue specifier to avoid explicit std::move()	Henning Baldersheim	2022-03-28	1	-3/+1
\|
*	Avoid the need for clone by using unique_ptr.	Henning Baldersheim	2022-03-28	1	-2/+1
\|
*	Avoid need to copy/clone FieldUpdate	Henning Baldersheim	2022-03-27	1	-49/+16
\|
*	Move BucketIdListResult	Henning Baldersheim	2022-03-09	4	-20/+25
\|
*	Reduce visibility of document::Document	Henning Baldersheim	2022-03-07	1	-0/+1
\|
*	Reduce use of Identifiable for document::DatatType	Henning Baldersheim	2022-03-03	2	-0/+5
\|
*	Since we schedule the last chunk for commit in triggerSyncNow, we can assert ↵	Henning Baldersheim	2022-03-02	1	-2/+2
\| \| \| \| \| \|	that we will be fully synced on the next pull when it happens in the singleCommitter thread. That allows for further simplification.
*	Revert "Revert "Balder/refactor docentry""	Henning Baldersheim	2022-01-07	11	-230/+320
\|
*	Revert "Balder/refactor docentry"	Arnstein Ressem	2022-01-07	11	-320/+230
\|
*	- Flags -> Enum.	Henning Baldersheim	2022-01-06	7	-71/+65
\| \| \| \|	- Consistently use DocEntryList as type for std::vector<spi::DocEntry::UP>
*	Only care about size of payload. Also add payload containing only doctype ↵	Henning Baldersheim	2022-01-06	6	-30/+74
\| \| \| \|	and gid
*	Use enum class for the flags.	Henning Baldersheim	2022-01-06	6	-47/+44
\|
*	Simplify by avoid both DocumentSize and PersistedDocumentSize. That is the same.	Henning Baldersheim	2022-01-06	7	-164/+153
\|
*	Simplify DocEntry to get a clean interface with multiple implementations, ↵	Henning Baldersheim	2022-01-06	10	-90/+156
\| \| \| \| \| \|	instead of an mutant. Also add tests for the different variations a DocEntry can have.
*	Declare noexcept move constructor and assignment for storage::spi::Result.	Tor Egge	2021-12-11	2	-0/+4
\|
*	more descriptive name for header file	Arne H Juul	2021-12-02	1	-1/+1
\|
*	track namespace move in documenttypes.def	Arne H Juul	2021-12-02	2	-4/+2
\| \| \| \| \| \| \|	* For C++ code this introduces a "document::config" namespace, which will sometimes conflict with the global "config" namespace. * Move all forward-declarations of the types DocumenttypesConfig and DocumenttypesConfigBuilder to a common header file.
*	Handle case where bucket spaces have differing maintenance state for a node	Tor Brede Vekterli	2021-11-24	3	-25/+29
\| \| \| \| \| \| \| \| \| \| \|	Only skip deactivating buckets if the entire _node_ is marked as maintenance state, i.e. the node has maintenance state across all bucket spaces provided in the bundle. Otherwise treat the state transition as if the node goes down, deactivating all buckets. Also ensure that the bucket deactivation logic above the SPI is identical to that within Proton. This avoids bucket DBs getting out of sync between the two.
*	Continue serving search queries when in Maintenance node state	Tor Brede Vekterli	2021-11-24	3	-10/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, entering maintenance state would implicitly deactivate all buckets on the searchnode and cause empty responses to be returned for searches. However, container query dispatch uses async health pings to decide which nodes to route queries to, so it would be possible for a node to still be used for queries for a few seconds until the ping discovered that the node should not be used. In the case of multiple groups without multiple ready replicas within the group, this would cause transient coverage loss since the dispatcher would not realize it should route queries to other groups instead. With this commit, maintenance edge behavior is changed as follows: - Buckets are _not_ deactivated when going from an available state to the maintenance state. However, they _are_ deactivate when going from maintenance state to an available state in order to avoid transient query duplicates immediately after the change. - Searches are executed as normal instead of returning empty replies when the node is in maintenance state. The following behavior is intentionally _not_ changed: - The search interface is still marked as offline when in maintenance state, as this signals that the node should be taken out of rotation. In particular, it's critical that the RPC health ping response is explicitly tagged as having zero active docs when the search interface is offline, even though many buckets may now actually be active. Otherwise, queries would not be gracefully drained from the node.
*	Revert "Continue serving search queries when in Maintenance node state ↵	Henning Baldersheim	2021-11-23	3	-58/+18
\| \| \| \|	[run-systemtest]"
*	Handle case where bucket spaces have differing maintenance state for a node	Tor Brede Vekterli	2021-11-23	3	-25/+29
\| \| \| \| \| \| \| \| \| \| \|	Only skip deactivating buckets if the entire _node_ is marked as maintenance state, i.e. the node has maintenance state across all bucket spaces provided in the bundle. Otherwise treat the state transition as if the node goes down, deactivating all buckets. Also ensure that the bucket deactivation logic above the SPI is identical to that within Proton. This avoids bucket DBs getting out of sync between the two.
*	Continue serving search queries when in Maintenance node state	Tor Brede Vekterli	2021-11-22	3	-10/+46
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, entering maintenance state would implicitly deactivate all buckets on the searchnode and cause empty responses to be returned for searches. However, container query dispatch uses async health pings to decide which nodes to route queries to, so it would be possible for a node to still be used for queries for a few seconds until the ping discovered that the node should not be used. In the case of multiple groups without multiple ready replicas within the group, this would cause transient coverage loss since the dispatcher would not realize it should route queries to other groups instead. With this commit, maintenance edge behavior is changed as follows: - Buckets are _not_ deactivated when going from an available state to the maintenance state. However, they _are_ deactivate when going from maintenance state to an available state in order to avoid transient query duplicates immediately after the change. - Searches are executed as normal instead of returning empty replies when the node is in maintenance state. The following behavior is intentionally _not_ changed: - The search interface is still marked as offline when in maintenance state, as this signals that the node should be taken out of rotation. In particular, it's critical that the RPC health ping response is explicitly tagged as having zero active docs when the search interface is offline, even though many buckets may now actually be active. Otherwise, queries would not be gracefully drained from the node.
*	Let removeAsync handle list of documents.	Henning Baldersheim	2021-11-18	7	-28/+68
\|
*	Move removeLocation over to Asynchandler and issue all removes for one ↵	Henning Baldersheim	2021-11-17	1	-17/+17
\| \| \| \| \| \|	bucket before waiting for the replies. Prepare RemoveResult to contain more replies.
*	Adjust dummy persistence spi semantics towards proton spi semantics when	Tor Egge	2021-10-27	4	-36/+96
\| \| \| \| \|	bucket doesn't exist: setActiveState(), put(), remove() creates bucket if it doesn't already exist.
*	Adjust dummy persistence spi semantics towards proton spi semantics when	Tor Egge	2021-10-25	3	-7/+56
\| \| \| \| \| \| \|	bucket doesn't exist: * getBucketInfo() returns success with empty bucket info * createIterator() returns success * iterate() returns empty complete result.
*	Undo auto format	Henning Baldersheim	2021-10-25	1	-230/+217
\|
*	create/delete bucket will never throw.	Henning Baldersheim	2021-10-25	4	-247/+244
\|
*	Async createBucket	Henning Baldersheim	2021-10-25	5	-6/+14
\|
*	Add noexcept specifier to operation complete callback.	Tor Egge	2021-10-22	3	-3/+3
\|
*	Only keep async variant to simplify what to implement and what fallback ↵	Henning Baldersheim	2021-10-18	6	-130/+63
\| \| \| \|	there are.
*	Implement async delete bucket.	Henning Baldersheim	2021-10-18	6	-31/+20
\|
*	Make setActiveState async.	Henning Baldersheim	2021-10-17	6	-10/+24
\|
*	Factor out CatchResult	Henning Baldersheim	2021-10-15	4	-19/+49
\|
*	Revert "- Refactor and use CatchResult in the PersistenceEngine in ↵	Henning Baldersheim	2021-10-15	4	-49/+19
\| \| \| \|	preparatio…"
*	- Refactor and use CatchResult in the PersistenceEngine in preparation for ↵	Henning Baldersheim	2021-10-15	4	-19/+49
\| \| \| \|	making more moretaions async.
*	Update Verizon Media copyright notices.	gjoranv	2021-10-07	13	-13/+13
\|