The existing state unification logic presumably existed to ensure that the
various distributor availability states were treated as if they were simply
Up. However, the distributor has not been able to even _be_ in any available
state other than Up for many years, so the logic is effectively pointless.
Remove unification entirely and instead require the distributor and content
node to be mutually in sync on the exact cluster state version.
It was possible for a distributor bucket fetch request to be processed
_after_ a cluster state was enabled (and internally propagated) on the
content node, but _before_ all side effects of this enabling were complete
and fully visible. This could cause inconsistent information to be returned
to the distributor, leaving nodes with out-of-sync bucket metadata.
This commit handles such transition periods by introducing an implicit
barrier between observing the incoming command and the outgoing reply for a
particular cluster state version. Upon observing the reply for a version,
all side effects must already be visible, since the reply is only sent
once internal state processing is complete (both above and below the SPI).
Until the initiated and completed versions converge, requests are rejected
and are transparently retried by the distributors.
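
A rough sketch of the convergence check described above; this is
illustrative only, and ClusterStateBarrier and its method names are
hypothetical rather than the actual Vespa classes:

    #include <atomic>
    #include <cstdint>

    // Hypothetical sketch: tracks the state version whose enabling has been
    // initiated vs. the version whose side effects are fully visible (i.e.
    // the version for which the outgoing reply has been observed).
    class ClusterStateBarrier {
        std::atomic<uint32_t> _initiated{0};
        std::atomic<uint32_t> _completed{0};
    public:
        void on_set_cluster_state(uint32_t version) {
            _initiated.store(version, std::memory_order_release);
        }
        // Called when the reply for a version is observed; by then all side
        // effects must be visible (both above and below the SPI).
        void on_state_reply(uint32_t version) {
            _completed.store(version, std::memory_order_release);
        }
        // Bucket fetch requests are processed only once the versions have
        // converged; otherwise the request is rejected and the distributor
        // transparently retries.
        bool may_process(uint32_t request_version) const {
            uint32_t initiated = _initiated.load(std::memory_order_acquire);
            uint32_t completed = _completed.load(std::memory_order_acquire);
            return initiated == completed && request_version == completed;
        }
    };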
Was used to handle rolling upgrades between versions with different
semantics, a long time ago on the 7 branch.
proton.
* For C++ code this introduces a "document::config" namespace, which will
sometimes conflict with the global "config" namespace.
* Move all forward-declarations of the types DocumenttypesConfig and
DocumenttypesConfigBuilder to a common header file.
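
A small illustration of the kind of conflict this refers to (sketch only;
ConfigGetter is a made-up class name):

    namespace config { class ConfigGetter; }  // pre-existing global namespace
    namespace document { namespace config { class DocumenttypesConfig; } }

    namespace document {
    // Inside namespace document, an unqualified "config" now resolves to
    // document::config, so code referring to the global namespace must
    // qualify it explicitly with "::":
    using Getter = ::config::ConfigGetter;
    }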
version
There's a tiny window of time between when the bucket manager observes a new
state version and when that state version is actually visible in the rest of
the process. We must ensure that we don't process requests while these two
differ, or we might erroneously process requests for version X using a state
that is only valid for version Y < X.
Adds a configurable max number of groups (default 0) whose replica
activation is inhibited if a replica's bucket info is out of sync with a
majority of the other replicas.
Intended for the case where a group comes back up after transient
unavailability with its nodes out of sync; such replicas should preferably
not be activated until merging has completed.
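
A hedged sketch of the majority comparison this implies; the names are
illustrative, and the accounting of inhibited groups against the configured
max is elided:

    #include <algorithm>
    #include <cstddef>
    #include <cstdint>
    #include <vector>

    struct Replica {
        uint32_t checksum; // stand-in for the replica's bucket info
    };

    // Illustrative: a candidate replica whose bucket info disagrees with a
    // majority of the other replicas should not be activated yet; it should
    // first be brought in sync by merging.
    bool majority_in_sync(const Replica& candidate,
                          const std::vector<Replica>& others) {
        std::size_t in_sync = static_cast<std::size_t>(
            std::count_if(others.begin(), others.end(),
                [&](const Replica& r) { return r.checksum == candidate.checksum; }));
        return in_sync * 2 >= others.size();
    }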
Use an array of buffer types in the array class.
that require it.
- Clean up some old members and code that are no longer used.
Abstracts away multiple underlying B-tree DBs that each hold a subset
of the super bucket space. Offers ordered iteration via a priority-queue
based view over the sub DBs.
Not yet ready for prime time, as the striping inherently requires an
absolute lower bound on the bucket bits used in the system, which is
currently not enforced.
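
The ordered iteration can be sketched as a classic k-way merge over the
per-stripe iterators; this is illustrative only, with std::map standing in
for the B-tree sub DBs:

    #include <cstdint>
    #include <functional>
    #include <map>
    #include <queue>
    #include <vector>

    // Each stripe holds a disjoint subset of the bucket key space and is
    // internally ordered (std::map as a stand-in for a B-tree here).
    using SubDb = std::map<uint64_t, int>;

    // Visit all entries across all stripes in globally ascending key order.
    void ordered_iteration(const std::vector<SubDb>& stripes,
                           const std::function<void(uint64_t, int)>& visit) {
        struct Cursor { SubDb::const_iterator pos; SubDb::const_iterator end; };
        auto greater = [](const Cursor& a, const Cursor& b) {
            return a.pos->first > b.pos->first; // min-heap on current key
        };
        std::priority_queue<Cursor, std::vector<Cursor>, decltype(greater)> heap(greater);
        for (const auto& db : stripes) {
            if (!db.empty()) heap.push({db.begin(), db.end()});
        }
        while (!heap.empty()) {
            Cursor c = heap.top();
            heap.pop();
            visit(c.pos->first, c.pos->second);
            if (++c.pos != c.end) heap.push(c); // re-insert advanced cursor
        }
    }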
The legacy bucket DB initialization logic was designed for the case
where bucket information was spread across potentially millions of
files residing on spinning rust drives. It was therefore asynchronous and
ran in parallel with client operations, adding much complexity in order to
deal with a myriad of concurrency edge cases.
Replace this with a very simple, synchronous init method that expects the
provider to have the required information readily and cheaply available.
This effectively removes the concept of a node's "initializing" state,
moving directly from reported state Down to Up.
Even though a node still technically starts up in Initializing state,
we never end up reporting this to the Cluster Controller as the DB init
completes before the RPC server stack is set up.
Legacy bucket DB initializer code will be removed in a separate pass.
Also simplify the bucket DB interface contract for mutating iteration,
stating that buckets are visited in an unspecified order.
Merge: vespa-engine/vekterli/basic-snapshot-support-for-content-node-bucket-db
Vekterli/basic snapshot support for content node bucket db
* Add working B-tree snapshot read guard impl
* Add placeholder wrapper read guard for legacy DB
* Enforce value const-ness of existing for_each_chunked iteration API
* Return read guard entries by value instead of modifying ref argument
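
A hedged sketch of the read guard idea, assuming the underlying B-tree can
hand out a frozen, immutable view; all names here are illustrative:

    #include <cstdint>
    #include <map>
    #include <memory>

    // Readers pin an immutable snapshot for the duration of the guard, so
    // concurrent writers can install new DB versions without invalidating
    // in-flight reads.
    class BucketDbReadGuard {
        std::shared_ptr<const std::map<uint64_t, uint32_t>> _frozen;
    public:
        explicit BucketDbReadGuard(
                std::shared_ptr<const std::map<uint64_t, uint32_t>> frozen)
            : _frozen(std::move(frozen)) {}

        template <typename Func>
        void for_each_chunked(Func&& f) const {
            for (const auto& kv : *_frozen) {
                f(kv.first, kv.second); // entries observed by value, as const
            }
        }
    };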
Merge: vespa-engine/vekterli/btree-bucket-db-support-on-content-node
Create generic B-tree bucket DB and content node DB implementation
Also rewrite some GMock macros that triggered Valgrind warnings
due to default test object printers accessing uninitialized memory.
This is the first stage of removing the legacy DB implementation.
Support for B-tree specific functionality such as lock-free snapshot
reads will be added soon. This commit is just for feature parity.
Abstract away the actual database implementation to allow it to be chosen
dynamically at startup. This abstraction does incur some overhead via call
indirection and type erasure of callbacks, so it will likely be removed once
the transition to the new B-tree DB has been completed.
Since the algorithms used for bucket key operations are so similar between
the content node and the distributor, a generic B-tree backed bucket
database has been created. The distributor DB will be rewritten around this
code very soon.
Due to the strong coupling between bucket locking and actual DB
implementation details, the new bucket DB has a fairly significant
code overlap with the legacy implementation. This is to avoid
spending time abstracting away and factoring out code for a
legacy implementation that is to be removed entirely anyway.
Remove existing LockableMap functionality that is unused or only used by
tests.
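
A hedged sketch of what such a dynamically chosen DB abstraction can look
like; the interface and names are illustrative, not the actual Vespa code:

    #include <cstdint>
    #include <functional>
    #include <map>
    #include <memory>

    struct BucketEntry { uint64_t key; uint32_t info; }; // illustrative entry

    // Abstract interface so the concrete DB (legacy vs. B-tree) can be
    // chosen at startup. Iteration takes a type-erased callback, which is
    // where the call-indirection overhead mentioned above comes from.
    class BucketDatabase {
    public:
        virtual ~BucketDatabase() = default;
        virtual void update(const BucketEntry& entry) = 0;
        virtual void for_each(
                const std::function<void(const BucketEntry&)>& fn) const = 0;
    };

    // Toy stand-in for the B-tree backed implementation.
    class BTreeBucketDatabase : public BucketDatabase {
        std::map<uint64_t, BucketEntry> _entries; // ordered, like a B-tree
    public:
        void update(const BucketEntry& e) override { _entries[e.key] = e; }
        void for_each(
                const std::function<void(const BucketEntry&)>& fn) const override {
            for (const auto& kv : _entries) fn(kv.second);
        }
    };

    // Chosen once at startup; callers only ever see the interface.
    std::unique_ptr<BucketDatabase> make_bucket_db(bool use_btree) {
        if (use_btree) return std::make_unique<BTreeBucketDatabase>();
        return std::make_unique<BTreeBucketDatabase>(); // legacy impl elided
    }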
- Let default bucket iteration work in smaller chunks with shorter waits.
Needs to be rewritten or discarded.
Simulates added request latency caused by the BucketManager computing bucket
ownership for a very large number of buckets.
Fetched at BucketManager init only, so not a dynamic config. This is only
meant for internal testing, so it should not have any practical
consequences.
Move test config helpers out of cppunit submodule.
Move base message sender stub out to common test module to
avoid artificial dependency from persistence tests to the
distributor tests.
Remove convoluted thread stress test which didn't actually _verify_
any kind of correctness (aside from the test not outright crashing).
Still some residual vdstestlib CppUnit traces that will need
cleaning up later.
This makes it possible to run storage tests in parallel.