summaryrefslogtreecommitdiffstats
path: root/searchlib
Commit message (Collapse)AuthorAgeFilesLines
* Add attribute tokens dfw.Tor Egge2023-10-303-0/+14
|
* - Reduce peak memory usage for unit tests from 5G to 0.5G.Henning Baldersheim2023-10-271-7/+8
|
* Use the entire docid range as approx hits with non-btree dictionary.Geir Storli2023-10-261-1/+3
| | | | | This fixes a bug introduced in the improved modeling of match strategies to use in numeric range search.
* Simplify after review feedback.Geir Storli2023-10-261-9/+7
|
* Improve modelling of match strategies to use in numeric range search.Geir Storli2023-10-263-113/+98
| | | | | | | | This should improve the performance by choosing the strategy that is most optimal in different scenarios: lookup-based filter matching vs posting lists merging. The modelling is based on results from the range search performance test: https://github.com/vespa-engine/system-test/blob/master/tests/performance/range_search/.
* Revert "Improve modelling of match strategies to use in numeric range search."Tor Egge2023-10-193-96/+113
|
* Simplify after review feedback.Geir Storli2023-10-191-9/+7
|
* Improve modelling of match strategies to use in numeric range search.Geir Storli2023-10-193-113/+98
| | | | | | | | This should improve the performance by choosing the strategy that is most optimal in different scenarios: lookup-based filter matching vs posting lists merging. The modelling is based on results from the range search performance test: https://github.com/vespa-engine/system-test/blob/master/tests/performance/range_search/.
* Merge pull request #29005 from vespa-engine/toregge/remove-obsolete-todoHenning Baldersheim2023-10-181-1/+0
|\ | | | | Remove obsolete TODO.
| * Remove obsolete TODO.Tor Egge2023-10-181-1/+0
| |
* | getBloatCount => getErasedCountHenning Baldersheim2023-10-183-4/+4
| |
* | - Modifications are guarded by external lock.Henning Baldersheim2023-10-174-26/+31
|/ | | | - Use relaxed atomics to ensure reader visibility.
* Merge pull request #28950 from ↵Henning Baldersheim2023-10-171-0/+56
|\ | | | | | | | | vespa-engine/toregge/add-info-about-evolution-of-disk-index-dictionary-format Add some info about evolution of disk index dictionary format.
| * Add some info about evolution of disk index dictionary format.Tor Egge2023-10-161-0/+56
| |
* | fix some trailing whitespaceArne Juul2023-10-165-10/+10
|/
* Move more checks to TokenExtractor.Tor Egge2023-10-127-90/+324
|
* Avoid reflection by providing explicit constructionHenning Baldersheim2023-10-1164-265/+125
|
* - Use a Supplier<Identifable> to construct objects from id.Henning Baldersheim2023-10-1121-48/+37
| | | | | - Provide lambda for default constructing frequently used objects to avoid reflection.
* Pass empty vector to TokenExtractor::extract.Tor Egge2023-10-102-1/+1
|
* Consolidate extraction of tokens from annotated string field value.Tor Egge2023-10-108-87/+143
|
* Merge pull request #28836 from vespa-engine/bratseth/copyright-2Jon Bratseth2023-10-102457-2472/+2472
|\ | | | | Update copyright
| * Correct copyright headersJon Bratseth2023-10-0914-14/+14
| |
| * Update copyrightJon Bratseth2023-10-092456-2458/+2458
| |
* | Rename id -> localChunkIdHenning Baldersheim2023-10-092-4/+4
| |
* | Add test for BucketIndexStore and the iterator.Henning Baldersheim2023-10-093-10/+44
| |
* | Factor out the lid to bucket mapping.Henning Baldersheim2023-10-092-65/+99
| |
* | Explicit destructor.Henning Baldersheim2023-10-092-5/+7
| |
* | - Use a single store for mapping lid to its data that are split into ↵Henning Baldersheim2023-10-065-78/+149
|/ | | | | | partitions and chunks. - This enable memory to be released after compaction is done.
* - Avoid holding a bucketizer guard. Just get it everytime you need it.Henning Baldersheim2023-10-052-25/+3
| | | | | - Max hold time is often above 2-3 seconds. This makes it very likely that a sudden buildup might add l ot of memory to onhold.
* Use ConstBufferRef and add some noexceptHenning Baldersheim2023-10-0516-76/+79
|
* Merge pull request #28801 from ↵Henning Baldersheim2023-10-052-0/+6
|\ | | | | | | | | vespa-engine/balder/disable-cache-for-removed-subdb Disable cache for removed only docsubdb.
| * Add test for disabling of cache in removed dbHenning Baldersheim2023-10-052-0/+5
| |
| * Disable cache for removed only docsubdb.Henning Baldersheim2023-10-051-0/+1
| |
* | Merge pull request #28800 from ↵Henning Baldersheim2023-10-052-6/+6
|\ \ | |/ |/| | | | | vespa-engine/balder/reduce-max-number-of-lids-2-8m - Reduce max lids per file and max file size to 4M and 256M during un…
| * - Reduce max lids per file and max file size to 4M and 256M during unit testing.Henning Baldersheim2023-10-052-6/+6
| | | | | | | | - Reduce max lids from 40M to 8M as default configuration.
* | Merge branch 'master' into balder/refactor-for-clarityHenning Baldersheim2023-10-054-20/+35
|\ \
| * | - Instead of keeping a map of bucketId => lids, just append everything to a ↵Henning Baldersheim2023-10-044-21/+36
| |/ | | | | | | | | | | | | vector and sort when complete. - This significantly improves memory usage during compaction. Instead of many heap allocations - You now get fewer mmapped allocations that are dropped when done.
* / - Number of partitions is fixed compile time => use std::array.Henning Baldersheim2023-10-054-22/+25
|/ | | | - Use unique_ptr on outer object instead of unique_ptr on multiple non-movable inner objects.
* GC unused includeHenning Baldersheim2023-10-041-2/+0
|
* Process idx file in streaming fashion instead of first reading all and then ↵Henning Baldersheim2023-10-042-73/+48
| | | | process.
* GC unused and non computed return value.Henning Baldersheim2023-10-044-46/+53
| | | | Refactor to prepare for streaming read.
* Use large allocator and control size of TmpChunkMeta.Henning Baldersheim2023-10-041-1/+2
|
* Merge pull request #28776 from ↵Tor Egge2023-10-031-3/+4
|\ | | | | | | | | vespa-engine/toregge/avoid-unaligned-read-while-decoding-serialized-query-stack-dump Avoid unaligned read while decoding serialized query stack dump.
| * Avoid unaligned read while decoding serialized query stack dump.Tor Egge2023-10-031-3/+4
| |
* | Merge pull request #28773 from ↵Henning Baldersheim2023-10-033-5/+6
|\ \ | |/ |/| | | | | vespa-engine/geirst/dfa-table-as-default-fuzzy-matching-algorithm Use DfaTable as default fuzzy matching algorithm for maxEditDistance …
| * Use DfaTable as default fuzzy matching algorithm for maxEditDistance <= 2.Geir Storli2023-10-033-5/+6
| |
* | Prevent eternal loop if bit vectors are shorter than docid limitHenning Baldersheim2023-10-033-8/+8
| |
* | Add disabled test to prove eternal loop.Henning Baldersheim2023-10-031-4/+35
| |
* | Add test counting seeksHenning Baldersheim2023-10-031-0/+16
| |
* | Refactor testHenning Baldersheim2023-10-031-127/+90
|/