Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge pull request #29667 from vespa-engine/jobergum/splade-embedder | Jo Kristian Bergum | 2024-01-04 | 1 | -6/+61 |
|\ | | | | | Add SPLADE embedder | ||||
| * | Add test coverage of mapped tensor in indexing embed | Jo Kristian Bergum | 2023-12-19 | 1 | -6/+61 |
| | | |||||
* | | Enable setting max-occurrences in field match. | Tor Egge | 2024-01-04 | 1 | -0/+10 |
|/ | |||||
* | If we index the original in addition to stems, lowercase it | Jon Bratseth | 2023-11-20 | 1 | -4/+5 |
| | |||||
* | Revert "Merge pull request #29328 from vespa-engine/revert-29314-bratseth/casing-take-2" | Jon Bratseth | 2023-11-14 | 2 | -38/+55 |
| | | | | | | | This reverts commit a72e949533a46d665440a9c72ca2b8fb58f3a9c3, reversing changes made to 944d635d00e165166508ef23399e9ed65a87a9c8. | ||||
* | Revert "Bratseth/casing take 2" | Harald Musum | 2023-11-13 | 2 | -55/+38 |
| | |||||
* | Cleanup | Jon Bratseth | 2023-11-10 | 1 | -1/+0 |
| | |||||
* | Prefer first stem to original if non equal | Jon Bratseth | 2023-11-10 | 1 | -34/+52 |
| | |||||
* | Revert "Revert "Don't lowercase linguistics annotations"" | Jon Bratseth | 2023-11-09 | 2 | -5/+5 |
| | | | | This reverts commit 0dfd4fe4c6ddbded490da36e71f27c4b70aa4226. | ||||
* | Revert "Don't lowercase linguistics annotations" | Jon Bratseth | 2023-11-09 | 2 | -5/+5 |
| | |||||
* | Test that casing is preserved | Jon Bratseth | 2023-11-09 | 1 | -3/+3 |
| | |||||
* | Don't lowercase linguistics annotations | Jon Bratseth | 2023-11-09 | 2 | -3/+3 |
| | | | | | | Tokens are already lowercased by our bundled linguistics components. Lowercasing again when annotating precludes plugging in a linguistics component which preserves casing. | ||||
* | Update copyright | Jon Bratseth | 2023-10-09 | 96 | -96/+96 |
| | |||||
* | Return the expected output | Jon Bratseth | 2023-09-27 | 59 | -215/+215 |
| | | | | | | | | | | | In if-else expressions, return the output of the executed branch rather than the input. The current behavior was undocumented and quite unexpected, so I suggest we treat that as a bug. Also return the last executed expression in a script as its output (rather than nothing). In addition, improve some error messages. | ||||
* | - Add utility to do substring extraction by codepoints, instead of Java char index. | Henning Baldersheim | 2023-09-15 | 1 | -3/+3 |
| | | | | | | - Test and use it in SubstringExpression in the indexing language. | ||||
* | remove test duplicate | Jo Kristian Bergum | 2023-08-16 | 1 | -6/+0 |
| | |||||
* | Add support for converting iso-8601 date strings to epoch time | Jo Kristian Bergum | 2023-08-16 | 1 | -0/+57 |
| | |||||
* | Resolve parent before children | Jon Bratseth | 2023-04-14 | 1 | -0/+25 |
| | |||||
* | Replace reflection by visitor | Jon Bratseth | 2023-03-31 | 1 | -35/+0 |
| | |||||
* | More understandable errors, and implement inner convert | Jon Bratseth | 2023-03-31 | 1 | -0/+26 |
| | |||||
* | Retrieve execution value explicitly by '_' | Jon Bratseth | 2023-03-24 | 1 | -0/+39 |
| | |||||
* | Handle missing values | Jon Bratseth | 2023-02-07 | 1 | -5/+12 |
| | |||||
* | Deprecate xml methods | Henning Baldersheim | 2023-01-27 | 4 | -0/+4 |
| | |||||
* | Support embedding an array to a mixed 2d tensor | Jon Bratseth | 2023-01-27 | 1 | -0/+34 |
| | |||||
* | Validate rank profiles early | Jon Bratseth | 2023-01-25 | 1 | -1/+1 |
| | |||||
* | Add headers | Jon Bratseth | 2023-01-23 | 1 | -0/+1 |
| | |||||
* | Skip statements on partial updates only | Jon Bratseth | 2023-01-23 | 1 | -0/+5 |
| | |||||
* | More tests | Jon Bratseth | 2023-01-20 | 2 | -14/+47 |
| | |||||
* | Support choice expressions | Jon Bratseth | 2023-01-20 | 4 | -11/+52 |
| | |||||
* | Expect the correct exceptions | Jon Bratseth | 2023-01-19 | 1 | -2/+2 |
| | |||||
* | Improve test | Jon Bratseth | 2023-01-09 | 1 | -23/+37 |
| | |||||
* | Revert "Revert collect(Collectors.toList())" | Henning Baldersheim | 2022-12-04 | 1 | -1/+1 |
| | |||||
* | Revert collect(Collectors.toList()) | Henning Baldersheim | 2022-12-04 | 1 | -1/+1 |
| | |||||
* | collect(Collectors.toList()) -> toList() | Henning Baldersheim | 2022-12-02 | 1 | -1/+1 |
| | |||||
* | switch to new-style config | Arne H Juul | 2022-04-12 | 4 | -1342/+1108 |
| | |||||
* | Add embedder selection argument to indexing language | Lester Solbakken | 2022-03-21 | 4 | -28/+69 |
| | |||||
* | Type inference where the output type is an array | Jon Bratseth | 2022-02-09 | 1 | -0/+64 |
| | |||||
* | Cleanup | Jon Bratseth | 2022-02-06 | 1 | -3/+0 |
| | |||||
* | Add hash function | Jon Bratseth | 2022-02-04 | 1 | -1/+46 |
| | |||||
* | Update Verizon Media copyright notices. | gjoranv | 2021-10-07 | 2 | -2/+2 |
| | |||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 92 | -92/+92 |
| | |||||
* | Encapsulate in a context | Jon Bratseth | 2021-10-01 | 1 | -3/+3 |
| | |||||
* | Pass destination | Jon Bratseth | 2021-09-30 | 1 | -6/+14 |
| | | | | | This allows embedders to switch on it to enable bucket testing and similar. | ||||
* | encode -> embed | Jon Bratseth | 2021-09-28 | 4 | -19/+13 |
| | |||||
* | Add 'encode' expression | Jon Bratseth | 2021-09-19 | 5 | -7/+73 |
| | |||||
* | Non-functional changes only | Jon Bratseth | 2021-09-17 | 9 | -17/+17 |
| | |||||
* | Non-functional changes only | Jon Bratseth | 2021-09-17 | 3 | -5/+5 |
| | |||||
* | we want to compare Linguistics objects for equivalence | Arne Juul | 2021-08-04 | 1 | -0/+3 |
| | |||||
* | don't call accentDrop at all for empty input | Arne Juul | 2021-07-16 | 1 | -2/+25 |
| | |||||
* | try to trap spurious failure | Arne Juul | 2021-07-13 | 1 | -0/+35 |
| | | | | | | * we have seen spurious failures when verifying output from accent dropping; so far nothing reproducible, so add some extra logging and retry once if it happens (in case it's some kind of race-condition glitch). |
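
The 2023-09-15 entry above mentions extracting substrings by codepoints rather than by Java char index. The sketch below illustrates why the distinction matters: a character outside the Basic Multilingual Plane occupies two `char` values in a Java `String`, so a char-index substring can cut it in half. The class name `CodepointSubstring` and its `substring` method are hypothetical names chosen for this illustration; they are not taken from the commit, and the actual utility may look different.

```java
// Hypothetical illustration only; not the utility added in the commit.
public final class CodepointSubstring {

    private CodepointSubstring() {}

    /**
     * Returns the part of s covering code points [from, to),
     * counting whole Unicode code points instead of UTF-16 chars.
     */
    public static String substring(String s, int from, int to) {
        int total = s.codePointCount(0, s.length());
        if (from < 0 || to < from || to > total) {
            throw new IndexOutOfBoundsException(
                    "from=" + from + ", to=" + to + ", code points=" + total);
        }
        // Translate code point positions into char offsets.
        int start = s.offsetByCodePoints(0, from);
        int end = s.offsetByCodePoints(start, to - from);
        return s.substring(start, end);
    }

    public static void main(String[] args) {
        // 'a', U+1F600 (stored as a surrogate pair), 'b'
        String text = "a\uD83D\uDE00b";

        // Code point based: yields the complete surrogate pair.
        System.out.println(substring(text, 1, 2).equals("\uD83D\uDE00"));

        // Char-index based: yields only the high surrogate.
        System.out.println(Character.isHighSurrogate(text.substring(1, 2).charAt(0)));
    }
}
```

Both lines in `main` print `true`: the code point version returns the full character, while `String.substring(1, 2)` returns a lone high surrogate.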