Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Determine token types considering all characters | Jon Bratseth | 2022-08-16 | 6 | -119/+133 |
| | |||||
* | Set project version to 8-SNAPSHOT | gjoranv | 2022-06-08 | 1 | -2/+2 |
| | |||||
* | Remove on Vespa 8 | Jon Bratseth | 2022-06-08 | 2 | -10/+1 |
| | |||||
* | Use '@Inject' from 'annotations' in multiple bundles | Bjørn Christian Seime | 2022-05-06 | 2 | -2/+2 |
| | |||||
* | Resolve rank profile inputs | Jon Bratseth | 2022-04-21 | 1 | -1/+1 |
| | |||||
* | Update abi-spec | Lester Solbakken | 2022-03-22 | 1 | -1/+1 |
| | |||||
* | Rename defaultEmbedderName to defaultEmbedderId | Lester Solbakken | 2022-03-22 | 1 | -2/+2 |
| | |||||
* | Add convenience function to represent embedder as map | Lester Solbakken | 2022-03-21 | 2 | -3/+30 |
| | |||||
* | Stem by linguistics in rule bases | Jon Bratseth | 2022-01-10 | 2 | -3/+21 |
| | | | | Also add a @language directive to stem in other languages than english. | ||||
* | unify java warnings (use compiler args from parent) | Arne H Juul | 2022-01-06 | 1 | -8/+0 |
| | |||||
* | annotate intentional switch fallthrough | Arne H Juul | 2022-01-06 | 1 | -0/+1 |
| | |||||
* | Specify how the class is actually loaded | Jon Marius Venstad | 2021-12-21 | 1 | -1/+1 |
| | |||||
* | Provide array of correct size. | Jon Marius Venstad | 2021-12-20 | 1 | -1/+1 |
| | |||||
* | Override ngram creation with something less silly | Jon Marius Venstad | 2021-12-20 | 2 | -1/+32 |
| | |||||
* | Use smaller chunks for faster detection | Jon Marius Venstad | 2021-12-20 | 1 | -2/+2 |
| | |||||
* | Expand test case for language detection | Jon Marius Venstad | 2021-12-20 | 1 | -3/+28 |
| | |||||
* | Upper bound on input size, and use opennlp before simple detector | Jon Marius Venstad | 2021-12-20 | 1 | -6/+3 |
| | |||||
* | Avoid putting nulls in languange map | Jon Marius Venstad | 2021-12-20 | 1 | -2/+5 |
| | |||||
* | Revert "Merge pull request #20578 from ↵ | Jon Marius Venstad | 2021-12-20 | 13 | -176/+246 |
| | | | | | | | vespa-engine/revert-20568-jonmv/replace-optimaize-with-lingua" This reverts commit 5476504932cd90eb2dad82dbab633e3ffa2034c3, reversing changes made to 235a78cc4707f78d18c6818a577de1b7507f5e40. | ||||
* | Revert "Replace optimaize with OpenNLP language detector [run-systemtest]" | Jon Marius Venstad | 2021-12-18 | 13 | -246/+176 |
| | |||||
* | Re-add files | Jon Marius Venstad | 2021-12-18 | 5 | -0/+142 |
| | |||||
* | Move model to module where it is needed, to simplify, at the cost of larger ↵ | Jon Marius Venstad | 2021-12-18 | 3 | -22/+21 |
| | | | | bundles | ||||
* | Replace UrlcharSequenceNormalizer with one with an improved regex | Jon Marius Venstad | 2021-12-17 | 1 | -6/+0 |
| | |||||
* | Mockito test scope | Jon Marius Venstad | 2021-12-17 | 1 | -0/+1 |
| | |||||
* | Add some javadoc, and no need to handle null return for model | Jon Marius Venstad | 2021-12-17 | 2 | -2/+4 |
| | |||||
* | Replace optimaize with OpenNLP language detector | Jon Marius Venstad | 2021-12-17 | 8 | -170/+102 |
| | |||||
* | Add a BERT embedder | Jon Bratseth | 2021-12-16 | 1 | -2/+3 |
| | |||||
* | Time out requests after 200s | Jon Marius Venstad | 2021-12-13 | 1 | -1/+0 |
| | |||||
* | Update 2020 Oath copyrights. | gjoranv | 2021-10-27 | 2 | -2/+2 |
| | |||||
* | Update 2019 Oath copyrights. | gjoranv | 2021-10-27 | 1 | -1/+1 |
| | |||||
* | Update Verizon Media copyright notices. | gjoranv | 2021-10-07 | 3 | -3/+3 |
| | |||||
* | Update 2018 copyright notices. | gjoranv | 2021-10-07 | 3 | -3/+3 |
| | |||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 70 | -70/+70 |
| | |||||
* | Encapsulate in a context | Jon Bratseth | 2021-10-01 | 2 | -16/+65 |
| | |||||
* | Pass destination | Jon Bratseth | 2021-09-30 | 2 | -8/+14 |
| | | | | | This allows embedders to switch on it to enable bucket testing and similar. | ||||
* | encode -> embed | Jon Bratseth | 2021-09-28 | 3 | -64/+64 |
| | |||||
* | Separate component from linguistics | Jon Bratseth | 2021-09-25 | 17 | -1206/+0 |
| | |||||
* | Linguistics cleanup | Jon Bratseth | 2021-09-21 | 18 | -45/+29 |
| | |||||
* | Add 'encode' expression | Jon Bratseth | 2021-09-19 | 2 | -1/+35 |
| | |||||
* | Provide a (non-working) encoder by default | Jon Bratseth | 2021-09-17 | 1 | -1/+1 |
| | |||||
* | Update ABI spec | Jon Bratseth | 2021-09-17 | 1 | -20/+46 |
| | |||||
* | Cleanup | Jon Bratseth | 2021-09-17 | 5 | -9/+2 |
| | |||||
* | Refactor to separate classes | Jon Bratseth | 2021-09-17 | 8 | -203/+279 |
| | |||||
* | Encoder interface | Jon Bratseth | 2021-09-16 | 3 | -4/+55 |
| | |||||
* | Encode to sparse tensor | Jon Bratseth | 2021-09-16 | 3 | -0/+17 |
| | |||||
* | Encode to dense tensor | Jon Bratseth | 2021-09-16 | 3 | -4/+36 |
| | |||||
* | Use a result builder | Jon Bratseth | 2021-09-16 | 1 | -21/+53 |
| | |||||
* | Make SentencePieceEncoder configurable | Jon Bratseth | 2021-09-16 | 6 | -56/+283 |
| | |||||
* | Merge pull request #19130 from vespa-engine/bratseth/sp-export | Jo Kristian Bergum | 2021-09-14 | 2 | -0/+79 |
|\ | | | | | Make public | ||||
| * | Add to abi spec | Jon Bratseth | 2021-09-14 | 1 | -0/+72 |
| | |