Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Update copyright | Jon Bratseth | 2023-10-09 | 73 | -73/+73 |
| | |||||
* | Use Guice 6.0 | Bjørn Christian Seime | 2023-09-04 | 1 | -1/+1 |
| | | | | | | https://github.com/google/guice/wiki/Guice600 We cannot upgrade to 7.x as we export javax.inject from container. 6.x supports both the old javax.inject and the new jakarta.inject replacement. | ||||
* | Allow sampling of fractional millis | Bjørn Christian Seime | 2023-08-25 | 2 | -4/+3 |
| | |||||
* | Add generic metrics for embedders | Bjørn Christian Seime | 2023-08-04 | 2 | -1/+56 |
| | |||||
* | Add necessary options to use failOnWarnings | gjoranv | 2023-06-05 | 1 | -0/+1 |
| | |||||
* | Don't remove indexable symbols when stemming | Jon Bratseth | 2023-06-02 | 5 | -8/+17 |
| | |||||
* | Add bundle type to all CORE bundles. | gjoranv | 2023-05-25 | 1 | -0/+3 |
| | |||||
* | Update ABI spec | Jon Bratseth | 2023-05-22 | 1 | -0/+1 |
| | |||||
* | Always treat each symbol as a separate token | Jon Bratseth | 2023-05-22 | 4 | -20/+56 |
| | |||||
* | Threat 'other symbols' as letters | Jon Bratseth | 2023-05-22 | 2 | -2/+10 |
| | | | | | The unicode class 'other symbols' contains emojis, math symbols, etc. Treat these as letter characters to support searching for them. | ||||
* | Use dollar and hour base units | Jon Bratseth | 2023-05-19 | 1 | -2/+2 |
| | |||||
* | Use metric enums everywhere | Jon Bratseth | 2023-03-06 | 1 | -1/+1 |
| | |||||
* | Add abi spec | Lester Solbakken | 2023-02-10 | 1 | -0/+1 |
| | |||||
* | Add decoding of sentencepiece token sequence to text | Lester Solbakken | 2023-02-10 | 1 | -0/+11 |
| | |||||
* | Compute code points in whole string only when needed | jonmv | 2022-12-06 | 2 | -6/+17 |
| | |||||
* | Split out opennlp-linguistics | Henning Baldersheim | 2022-11-26 | 14 | -783/+0 |
| | |||||
* | Update ABI spec format, and update all specs | jonmv | 2022-10-25 | 1 | -198/+198 |
| | |||||
* | much simpler CharSequenceNormalizer | Arne Juul | 2022-10-06 | 3 | -9/+100 |
| | |||||
* | Merge pull request #24007 from vespa-engine/bratseth/cleanup-082 | Jon Bratseth | 2022-09-25 | 2 | -13/+11 |
|\ | | | | | No functional changes | ||||
| * | No functional changes | Jon Bratseth | 2022-09-11 | 2 | -13/+11 |
| | | |||||
* | | Make validation messages clearer given multiple instances | Jon Bratseth | 2022-09-15 | 1 | -2/+0 |
|/ | |||||
* | bump protoc version | Arne Juul | 2022-08-27 | 1 | -4/+0 |
| | |||||
* | Determine token types considering all characters | Jon Bratseth | 2022-08-16 | 6 | -119/+133 |
| | |||||
* | Set project version to 8-SNAPSHOT | gjoranv | 2022-06-08 | 1 | -2/+2 |
| | |||||
* | Remove on Vespa 8 | Jon Bratseth | 2022-06-08 | 2 | -10/+1 |
| | |||||
* | Use '@Inject' from 'annotations' in multiple bundles | Bjørn Christian Seime | 2022-05-06 | 2 | -2/+2 |
| | |||||
* | Resolve rank profile inputs | Jon Bratseth | 2022-04-21 | 1 | -1/+1 |
| | |||||
* | Update abi-spec | Lester Solbakken | 2022-03-22 | 1 | -1/+1 |
| | |||||
* | Rename defaultEmbedderName to defaultEmbedderId | Lester Solbakken | 2022-03-22 | 1 | -2/+2 |
| | |||||
* | Add convenience function to represent embedder as map | Lester Solbakken | 2022-03-21 | 2 | -3/+30 |
| | |||||
* | Stem by linguistics in rule bases | Jon Bratseth | 2022-01-10 | 2 | -3/+21 |
| | | | | Also add a @language directive to stem in other languages than english. | ||||
* | unify java warnings (use compiler args from parent) | Arne H Juul | 2022-01-06 | 1 | -8/+0 |
| | |||||
* | annotate intentional switch fallthrough | Arne H Juul | 2022-01-06 | 1 | -0/+1 |
| | |||||
* | Specify how the class is actually loaded | Jon Marius Venstad | 2021-12-21 | 1 | -1/+1 |
| | |||||
* | Provide array of correct size. | Jon Marius Venstad | 2021-12-20 | 1 | -1/+1 |
| | |||||
* | Override ngram creation with something less silly | Jon Marius Venstad | 2021-12-20 | 2 | -1/+32 |
| | |||||
* | Use smaller chunks for faster detection | Jon Marius Venstad | 2021-12-20 | 1 | -2/+2 |
| | |||||
* | Expand test case for language detection | Jon Marius Venstad | 2021-12-20 | 1 | -3/+28 |
| | |||||
* | Upper bound on input size, and use opennlp before simple detector | Jon Marius Venstad | 2021-12-20 | 1 | -6/+3 |
| | |||||
* | Avoid putting nulls in languange map | Jon Marius Venstad | 2021-12-20 | 1 | -2/+5 |
| | |||||
* | Revert "Merge pull request #20578 from ↵ | Jon Marius Venstad | 2021-12-20 | 13 | -176/+246 |
| | | | | | | | vespa-engine/revert-20568-jonmv/replace-optimaize-with-lingua" This reverts commit 5476504932cd90eb2dad82dbab633e3ffa2034c3, reversing changes made to 235a78cc4707f78d18c6818a577de1b7507f5e40. | ||||
* | Revert "Replace optimaize with OpenNLP language detector [run-systemtest]" | Jon Marius Venstad | 2021-12-18 | 13 | -246/+176 |
| | |||||
* | Re-add files | Jon Marius Venstad | 2021-12-18 | 5 | -0/+142 |
| | |||||
* | Move model to module where it is needed, to simplify, at the cost of larger ↵ | Jon Marius Venstad | 2021-12-18 | 3 | -22/+21 |
| | | | | bundles | ||||
* | Replace UrlcharSequenceNormalizer with one with an improved regex | Jon Marius Venstad | 2021-12-17 | 1 | -6/+0 |
| | |||||
* | Mockito test scope | Jon Marius Venstad | 2021-12-17 | 1 | -0/+1 |
| | |||||
* | Add some javadoc, and no need to handle null return for model | Jon Marius Venstad | 2021-12-17 | 2 | -2/+4 |
| | |||||
* | Replace optimaize with OpenNLP language detector | Jon Marius Venstad | 2021-12-17 | 8 | -170/+102 |
| | |||||
* | Add a BERT embedder | Jon Bratseth | 2021-12-16 | 1 | -2/+3 |
| | |||||
* | Time out requests after 200s | Jon Marius Venstad | 2021-12-13 | 1 | -1/+0 |
| |