path: root/streamingvisitors/src/tests
Commit message [Author, Age, Files, Lines]
* Treat regex and fuzzy whole-field matching as 1 logical word [Tor Brede Vekterli, 2024-01-22, 1 file, -0/+16]
|     We have concluded that this is the most semantically correct way of reporting the count, and as a bonus it avoids having to do a separate pass over the string buffer.
* Adjust search::streaming::Hit to better match search::fef::TermFieldMatchDataPosition. [Tor Egge, 2024-01-22, 2 files, -2/+3]
|
* Support fuzzy term matching in streaming search [Tor Brede Vekterli, 2024-01-18, 1 file, -28/+192]
|     Uses a DFA-based matcher for max edits in {1, 2} and falls back to the legacy non-DFA matcher for all other values (including 0).
|     Currently only supports fuzzy matching across the full field string, i.e. there's no implicit tokenization or whitespace removal. This matches the semantics we currently have for fuzzy search over attributes in a non-streaming case
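The whole-field semantics described in the commit body above can be illustrated with a minimal sketch. It is not the DFA-based matcher the commit refers to: it substitutes a plain dynamic-programming Levenshtein distance over bytes, and the names `edit_distance` and `fuzzy_matches_whole_field` are hypothetical. The point is only that the term is compared against the entire field string, with no tokenization.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string_view>
#include <vector>

// Hypothetical helper: classic Levenshtein distance over bytes, using two
// rolling rows of the DP table. Stand-in for a DFA-based matcher.
std::size_t edit_distance(std::string_view a, std::string_view b) {
    std::vector<std::size_t> prev(b.size() + 1), cur(b.size() + 1);
    for (std::size_t j = 0; j <= b.size(); ++j) {
        prev[j] = j;  // distance from the empty prefix of a
    }
    for (std::size_t i = 1; i <= a.size(); ++i) {
        cur[0] = i;  // distance to the empty prefix of b
        for (std::size_t j = 1; j <= b.size(); ++j) {
            std::size_t subst = prev[j - 1] + (a[i - 1] == b[j - 1] ? 0 : 1);
            cur[j] = std::min({subst, prev[j] + 1, cur[j - 1] + 1});
        }
        std::swap(prev, cur);
    }
    return prev[b.size()];
}

// Assumption: the term must match the *entire* field within max_edits;
// the field is never split into words first.
bool fuzzy_matches_whole_field(std::string_view term, std::string_view field,
                               std::size_t max_edits) {
    return edit_distance(term, field) <= max_edits;
}
```

Under these assumptions, `fuzzy_matches_whole_field("hello", "hallo", 1)` holds, while `fuzzy_matches_whole_field("hello", "hello world", 2)` does not, because the trailing " world" counts as six edits against the full field.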
* Propagate normalizing mode and max field length to new searcher [Tor Brede Vekterli, 2024-01-16, 1 file, -0/+12]
|     Needed to avoid default normalizing mode/max field length being used in the reconfigured searcher instance.
* Add regular expression support to streaming search [Tor Brede Vekterli, 2024-01-15, 1 file, -2/+47]
|     Introduces an explicit regex query term node (which wraps an RE2 regex instance internally) and extends the existing UTF-8 flexible string searcher to use this query node. Regex matching is optionally case (in)sensitive depending on the normalization mode used.
|     Note on `searcher/searcher_test.cpp`: this adds a magic sentinel `#` char prefix to query term parsing in the test to let a query term be interpreted as a regex rather than exact/prefix/suffix/substring match.
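As a rough illustration of case sensitivity following the normalization mode, here is a minimal sketch. It uses `std::regex` as a stand-in so the example stays self-contained, whereas the commit wraps an RE2 instance. The `Normalizing` enum, the function name, and the choice of partial (search) rather than full matching are all assumptions made for the example.

```cpp
#include <cassert>
#include <regex>
#include <string>

// Hypothetical normalization modes; the real set of modes is not shown in the log.
enum class Normalizing { NONE, LOWERCASE };

// Assumption: any normalizing mode other than NONE makes the regex
// case-insensitive. std::regex_search gives partial-match semantics.
bool regex_matches_field(const std::string& pattern, const std::string& field,
                         Normalizing mode) {
    auto flags = std::regex::ECMAScript;
    if (mode != Normalizing::NONE) {
        flags |= std::regex::icase;
    }
    std::regex re(pattern, flags);
    return std::regex_search(field, re);
}
```

With this sketch, `regex_matches_field("^foo.*bar$", "FOO and BAR", Normalizing::LOWERCASE)` matches, and the same call with `Normalizing::NONE` does not.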
* Just use normalize_mode directly from searcher. [Henning Baldersheim, 2024-01-12, 1 file, -2/+2]
|
* Split out tokenizer and test it explicit. [Henning Baldersheim, 2024-01-11, 1 file, -0/+21]
|
* Use the normalize_mode config. [Henning Baldersheim, 2024-01-10, 1 file, -13/+13]
|
* Simplify ancient carefully hand optimized code in favour of simple readable code [Henning Baldersheim, 2024-01-10, 1 file, -4/+9]
|
* Code cleanup [Henning Baldersheim, 2024-01-10, 2 files, -6/+5]
|
* - Fold query for streaming search based on either query item type, or field definition. [Henning Baldersheim, 2024-01-05, 1 file, -8/+11]
|     - This ensures that query processing and document processing is symmetric for streaming search.
|       No longer rely on java query processing being symmetric with backend c++ variant.
|     - Indexed search does no normalization in backend and uses query as is.
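The symmetry requirement in the commit above (query terms and document text folded the same way) can be sketched as follows. This is only an illustration under simplifying assumptions: `fold` does ASCII lowercasing and nothing else, and both function names are hypothetical, not taken from the code base.

```cpp
#include <cassert>
#include <cctype>
#include <string>
#include <string_view>

// Hypothetical folding step: ASCII lowercase only. Real folding would also
// cover accents and other Unicode normalization.
std::string fold(std::string_view text) {
    std::string out;
    out.reserve(text.size());
    for (char c : text) {
        out.push_back(static_cast<char>(std::tolower(static_cast<unsigned char>(c))));
    }
    return out;
}

// Applying the same fold to both sides keeps query processing and document
// processing symmetric, so neither depends on how the other was prepared.
bool term_matches(std::string_view query_term, std::string_view field_token) {
    return fold(query_term) == fold(field_token);
}
```

Because one function folds both inputs, `term_matches("Vespa", "vESPA")` holds regardless of how either side was cased on arrival.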
* - Modernize code [Henning Baldersheim, 2024-01-04, 1 file, -24/+24]
|     - Unify some conversion tables.
* Revert "Revert "Balder/only rewrite numeric terms for text fields"" [Henning Baldersheim, 2024-01-03, 1 file, -1/+1]
|
* Revert "Balder/only rewrite numeric terms for text fields" [Henning Baldersheim, 2024-01-03, 1 file, -1/+1]
|
* Only rewrite numeric terms when searching text fields. [Henning Baldersheim, 2024-01-02, 1 file, -1/+1]
|
* Standard plural of leaf is leaves. [Tor Egge, 2023-11-30, 1 file, -1/+1]
|
* Update copyright [Jon Bratseth, 2023-10-09, 26 files, -24/+26]
|
* Use "_test" suffix for unit test cpp files. [Geir Storli, 2023-08-30, 8 files, -4/+4]
|
* Use WordFolder as helper instead of inheriting static stuff. [Henning Baldersheim, 2023-07-25, 1 file, -1/+1]
|
* Unpack interleaved features for streaming search. [Tor Egge, 2023-07-19, 1 file, -0/+59]
|
* Setup search visitor without proton process. [Tor Egge, 2023-05-10, 1 file, -4/+2]
|
* Pass transport and file distributor connection spec to SearchEnvironment [Tor Egge, 2023-05-10, 1 file, -2/+5]
|     in preparation for using RankingAssetsBuilder when handling config in streaming search.
* Add SearchEnvironmentSnapshot for streaming search. [Tor Egge, 2023-05-05, 1 file, -2/+6]
|
* Test match features returned in streaming search result. [Geir Storli, 2023-04-28, 5 files, -28/+59]
|
* Merge pull request #26893 from vespa-engine/arnej/remove-unused-distance-functions-3 [Arne H Juul, 2023-04-27, 1 file, -3/+1]
|\
| |   remove unused distance functions
| * remove unused distance functions [Arne Juul, 2023-04-27, 1 file, -3/+1]
| |
* | Merge pull request #26897 from vespa-engine/geirst/search-visitor-query-execution-test [Geir Storli, 2023-04-27, 20 files, -560/+352]
|\ \
| | |   Test basic query execution in streaming search visitor.
| * | Test basic query execution in search visitor. [Geir Storli, 2023-04-27, 20 files, -560/+352]
| |/
* / Populate match features in search result for streaming search. [Tor Egge, 2023-04-27, 1 file, -0/+60]
|/
* Rewrite searchvisitor test to GTest. [Geir Storli, 2023-04-27, 2 files, -35/+17]
|
* Rewrite streamingvisitors hit collector unit test to use gtest. [Tor Egge, 2023-04-26, 2 files, -49/+27]
|
* Merge pull request #26850 from vespa-engine/geirst/nearest-neighbor-target-hits-in-streaming [Geir Storli, 2023-04-25, 2 files, -14/+59]
|\
| |   Use targetHits in nearestNeighbor streaming searcher.
| * Use targetHits in nearestNeighbor streaming searcher. [Geir Storli, 2023-04-25, 2 files, -14/+59]
| |   A distance heap is used to limit the number of produced document matches.
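The idea of a distance heap bounded by targetHits can be sketched as a fixed-capacity max-heap that keeps the closest documents seen so far. This is a minimal illustration, not the class used in the commit. `DistanceHeap`, `offer`, and `threshold` are hypothetical names, and the policy of rejecting ties with the current worst distance is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <queue>
#include <utility>

// Hypothetical bounded heap: retains at most target_hits (distance, docid)
// pairs, with the largest retained distance on top.
class DistanceHeap {
public:
    explicit DistanceHeap(std::size_t target_hits) : _target_hits(target_hits) {}

    // Returns true if the document is accepted as one of the current best.
    bool offer(double distance, std::uint32_t docid) {
        if (_target_hits == 0) {
            return false;
        }
        if (_heap.size() < _target_hits) {
            _heap.emplace(distance, docid);
            return true;
        }
        if (distance < _heap.top().first) {
            _heap.pop();  // evict the current worst match
            _heap.emplace(distance, docid);
            return true;
        }
        return false;
    }

    // Distance a new document must beat once the heap is full.
    double threshold() const {
        return (_heap.size() < _target_hits)
                   ? std::numeric_limits<double>::infinity()
                   : _heap.top().first;
    }

    std::size_t size() const { return _heap.size(); }

private:
    std::size_t _target_hits;
    std::priority_queue<std::pair<double, std::uint32_t>> _heap;
};
```

Once the heap holds targetHits entries, any document farther away than `threshold()` can be rejected without being reported as a match, which is how the number of produced matches stays bounded.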
* | Move search::FeatureValues to vespalib::FeatureValues in preparation for extending vdslib::SearchResult. [Tor Egge, 2023-04-25, 1 file, -1/+1]
|/
* Provide FieldPathMap and IQueryEnvironment when preparing streaming searchers. [Geir Storli, 2023-04-20, 5 files, -41/+40]
|     This is required to prepare the NearestNeighborFieldSearcher.
* Add exact nearest neighbor searcher over the streamed values of a tensor field. [Geir Storli, 2023-04-20, 2 files, -0/+164]
|     Note: Integration into the searchvisitor remains.
* Unpack match data for nearest neighbor query node in streaming search. [Tor Egge, 2023-04-19, 2 files, -0/+104]
|
* Reduce creation of Document instances without DocumentTypeRepo. [Geir Storli, 2023-03-13, 3 files, -4/+4]
|
* Rename KeywordExtractor to QueryTermFilter. [Tor Egge, 2023-01-25, 4 files, -132/+132]
|
* Empty index name means default index. [Tor Egge, 2023-01-25, 1 file, -0/+7]
|
* Add new KeywordExtractor with two factories (one each for indexed search and streaming search). [Tor Egge, 2023-01-24, 2 files, -0/+125]
|
* Expose SameElement query terms to ranking. [Geir Storli, 2023-01-12, 1 file, -1/+1]
|     A TermFieldMatchData is allocated per SameElement term, and this is used to signal matching docids in doUnpack() on the SameElement search iterator. This allows using the matches() rank feature on a field (virtual) that is searched using a SameElement term.
* Change from typedef to using in streamingvisitors C++ code. [Geir Storli, 2022-12-21, 3 files, -16/+16]
|
* fix typo invokation -> invocation [Thinh Bui, 2022-11-14, 2 files, -3/+3]
|
* Use SlimeFiller instead of SlimeFieldWriter for streaming search. [Tor Egge, 2022-09-19, 1 file, -155/+1]
|
* Reduce usage of RawBuf. [Henning Baldersheim, 2022-08-29, 1 file, -5/+17]
|     Remove some unused code.
* perform feature renaming in streaming also [Arne H Juul, 2022-06-22, 1 file, -2/+4]
|
* Revert "Revert "Collapse vsm into streamingvisitors"" [Henning Baldersheim, 2022-05-15, 23 files, -4/+2131]
|
* Revert "Collapse vsm into streamingvisitors" [Henning Baldersheim, 2022-05-15, 23 files, -2131/+4]
|
* Collapse vsm into streamingvisitors [Henning Baldersheim, 2022-05-14, 23 files, -4/+2131]
|