vespa - An engine for low-latency computation over large data sets

	Commit message	Author	Age	Files	Lines
*	Rename id -> localChunkId	Henning Baldersheim	2023-10-09	2	-4/+4
\|
*	Add test for BucketIndexStore and the iterator.	Henning Baldersheim	2023-10-09	3	-10/+44
\|
*	Factor out the lid to bucket mapping.	Henning Baldersheim	2023-10-09	2	-65/+99
\|
*	Explicit destructor.	Henning Baldersheim	2023-10-09	2	-5/+7
\|
*	- Use a single store for mapping lid to its data that are split into ↵	Henning Baldersheim	2023-10-06	5	-78/+149
\| \| \| \| \| \|	partitions and chunks. - This enables memory to be released after compaction is done.
*	- Avoid holding a bucketizer guard. Just get it every time you need it.	Henning Baldersheim	2023-10-05	2	-25/+3
\| \| \| \| \|	- Max hold time is often above 2-3 seconds. This makes it very likely that a sudden buildup might add a lot of memory to onhold.
*	Use ConstBufferRef and add some noexcept	Henning Baldersheim	2023-10-05	16	-76/+79
\|
*	Merge pull request #28801 from ↵	Henning Baldersheim	2023-10-05	2	-0/+6
\|\ \| \| \| \| \| \| \| \|	vespa-engine/balder/disable-cache-for-removed-subdb Disable cache for removed only docsubdb.
\| *	Add test for disabling of cache in removed db	Henning Baldersheim	2023-10-05	2	-0/+5
\| \|
\| *	Disable cache for removed only docsubdb.	Henning Baldersheim	2023-10-05	1	-0/+1
\| \|
* \|	Merge pull request #28800 from ↵	Henning Baldersheim	2023-10-05	2	-6/+6
\|\ \ \| \|/ \|/\| \| \| \| \|	vespa-engine/balder/reduce-max-number-of-lids-2-8m - Reduce max lids per file and max file size to 4M and 256M during un…
\| *	- Reduce max lids per file and max file size to 4M and 256M during unit testing.	Henning Baldersheim	2023-10-05	2	-6/+6
\| \| \| \| \| \| \| \|	- Reduce max lids from 40M to 8M as default configuration.
* \|	Merge branch 'master' into balder/refactor-for-clarity	Henning Baldersheim	2023-10-05	4	-20/+35
\|\ \
\| * \|	- Instead of keeping a map of bucketId => lids, just append everything to a ↵	Henning Baldersheim	2023-10-04	4	-21/+36
\| \|/ \| \| \| \| \| \| \| \| \| \| \| \|	vector and sort when complete. - This significantly improves memory usage during compaction. Instead of many heap allocations - You now get fewer mmapped allocations that are dropped when done.
* /	- Number of partitions is fixed compile time => use std::array.	Henning Baldersheim	2023-10-05	4	-22/+25
\|/ \| \| \|	- Use unique_ptr on outer object instead of unique_ptr on multiple non-movable inner objects.
*	GC unused include	Henning Baldersheim	2023-10-04	1	-2/+0
\|
*	Process idx file in streaming fashion instead of first reading all and then ↵	Henning Baldersheim	2023-10-04	2	-73/+48
\| \| \| \|	processing.
*	GC unused and non computed return value.	Henning Baldersheim	2023-10-04	4	-46/+53
\| \| \| \|	Refactor to prepare for streaming read.
*	Use large allocator and control size of TmpChunkMeta.	Henning Baldersheim	2023-10-04	1	-1/+2
\|
*	Merge pull request #28776 from ↵	Tor Egge	2023-10-03	1	-3/+4
\|\ \| \| \| \| \| \| \| \|	vespa-engine/toregge/avoid-unaligned-read-while-decoding-serialized-query-stack-dump Avoid unaligned read while decoding serialized query stack dump.
\| *	Avoid unaligned read while decoding serialized query stack dump.	Tor Egge	2023-10-03	1	-3/+4
\| \|
* \|	Merge pull request #28773 from ↵	Henning Baldersheim	2023-10-03	3	-5/+6
\|\ \ \| \|/ \|/\| \| \| \| \|	vespa-engine/geirst/dfa-table-as-default-fuzzy-matching-algorithm Use DfaTable as default fuzzy matching algorithm for maxEditDistance …
\| *	Use DfaTable as default fuzzy matching algorithm for maxEditDistance <= 2.	Geir Storli	2023-10-03	3	-5/+6
\| \|
* \|	Prevent eternal loop if bit vectors are shorter than docid limit	Henning Baldersheim	2023-10-03	3	-8/+8
\| \|
* \|	Add disabled test to prove eternal loop.	Henning Baldersheim	2023-10-03	1	-4/+35
\| \|
* \|	Add test counting seeks	Henning Baldersheim	2023-10-03	1	-0/+16
\| \|
* \|	Refactor test	Henning Baldersheim	2023-10-03	1	-127/+90
\|/
*	Revert "Use DfaTable as default fuzzy matching algorithm for maxEditDistance ↵	Henning Baldersheim	2023-10-02	2	-2/+2
\| \| \| \|	…"
*	Merge pull request #28765 from ↵	Geir Storli	2023-10-02	2	-2/+2
\|\ \| \| \| \| \| \| \| \|	vespa-engine/geirst/dfa-table-as-default-fuzzy-matching-algorithm Use DfaTable as default fuzzy matching algorithm for maxEditDistance …
\| *	Use DfaTable as default fuzzy matching algorithm for maxEditDistance <= 2.	Geir Storli	2023-10-02	2	-2/+2
\| \|
* \|	Merge pull request #28736 from ↵	Henning Baldersheim	2023-10-02	4	-16/+34
\|\ \ \| \| \| \| \| \| \| \| \| \| \| \|	vespa-engine/balder/use-as-bitvector-api-instead-of-casting - Use asBitVectorIterator instead of isBitVector + casting to present…
\| * \|	Expose only necessary meta information for bitvector, not the iterator interface	Henning Baldersheim	2023-10-02	4	-19/+32
\| \| \|
\| * \|	- Use asBitVectorIterator instead of isBitVector + casting to present a ↵	Henning Baldersheim	2023-09-29	4	-10/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	BitVectorIterator interface. - Allow Filter wrapper to expose underlying BitVector. - This ensures that the bitvectors are handled first during termwise evaluation, as they have a constant cost and will reduce the cost for the ones coming later on.
* \| \|	Merge pull request #28723 from ↵	Henning Baldersheim	2023-10-02	4	-36/+73
\|\ \ \ \| \|_\|/ \|/\| \| \| \| \| \| \| \|	vespa-engine/balder/lift-out-single-leaf-iterators-from-ws Lift out single iterators if they are leafs and tfmd is not needed.
\| * \|	Use new scoped if syntax.	Henning Baldersheim	2023-10-02	1	-2/+1
\| \| \|
\| * \|	If there is a single child in the ws, that also is a leaf, it will be ↵	Henning Baldersheim	2023-09-29	3	-5/+19
\| \| \| \| \| \| \| \| \| \| \| \|	lifted out directly.
\| * \|	Add test for single term wsets	Henning Baldersheim	2023-09-29	1	-12/+32
\| \| \|
\| * \|	Use braced initializers	Henning Baldersheim	2023-09-29	1	-21/+19
\| \| \|
\| * \|	Lift out single iterators if they are leafs and tfmd is not needed.	Henning Baldersheim	2023-09-29	2	-3/+9
\| \|/
* \|	Normalize class names in attribute weighted set blueprint test.	Tor Egge	2023-09-29	1	-4/+27
\| \|
* \|	Merge pull request #28737 from vespa-engine/geirst/fuzzy-posting-list-fallback	Geir Storli	2023-09-29	2	-3/+43
\|\ \ \| \|/ \|/\|	Add fallback to using posting list when fuzzy and being non-strict.
\| *	Add fallback to using posting list when fuzzy and being non-strict.	Geir Storli	2023-09-29	2	-3/+43
\| \|
* \|	Reduce code duplication between fillArray and fillBitVector in	Tor Egge	2023-09-29	2	-23/+35
\|/ \| \| \|	PostingListFoldedSearchContextT.
*	- Resolve (!field_is_filter && !_tmd.isNotNeeded()) once upfront.	Henning Baldersheim	2023-09-29	1	-5/+5
\| \| \| \|	- Lift out single items if filter or match data not needed.
*	Lift out single iterators if either field is filter, or termfieldmatchdata ↵	Henning Baldersheim	2023-09-28	1	-1/+1
\| \| \| \|	is not needed.
*	Add noexcept	Henning Baldersheim	2023-09-28	1	-32/+34
\|
*	Merge pull request #28687 from ↵	Geir Storli	2023-09-28	4	-45/+153
\|\ \| \| \| \| \| \| \| \|	vespa-engine/toregge/avoid-unneeded-counting-of-hits Avoid counting hits in range multiple times.
\| *	Store a limited number of posting list indexes in countHits() to	Tor Egge	2023-09-27	4	-10/+70
\| \| \| \| \| \| \| \| \| \|	reduce amount of dictionary entry filtering in fillArray() and fillBitVector() for regexp search and fuzzy search.
\| *	Avoid counting hits in range multiple times.	Tor Egge	2023-09-27	2	-43/+91
\| \|
* \|	Merge pull request #28691 from ↵	Henning Baldersheim	2023-09-27	2	-5/+3
\|\ \ \| \| \| \| \| \| \| \| \| \| \| \|	vespa-engine/vekterli/preserve-successor-prefix-during-matching Preserve prefix of input DFA successor string