Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | Expand test case for language detection | Jon Marius Venstad | 2021-12-20 | 1 | -3/+28 | |
| | ||||||
* | Upper bound on input size, and use opennlp before simple detector | Jon Marius Venstad | 2021-12-20 | 1 | -6/+3 | |
| | ||||||
* | Avoid putting nulls in languange map | Jon Marius Venstad | 2021-12-20 | 1 | -2/+5 | |
| | ||||||
* | Revert "Merge pull request #20578 from ↵ | Jon Marius Venstad | 2021-12-20 | 13 | -176/+246 | |
| | | | | | | | vespa-engine/revert-20568-jonmv/replace-optimaize-with-lingua" This reverts commit 5476504932cd90eb2dad82dbab633e3ffa2034c3, reversing changes made to 235a78cc4707f78d18c6818a577de1b7507f5e40. | |||||
* | Revert "Replace optimaize with OpenNLP language detector [run-systemtest]" | Jon Marius Venstad | 2021-12-18 | 13 | -246/+176 | |
| | ||||||
* | Re-add files | Jon Marius Venstad | 2021-12-18 | 5 | -0/+142 | |
| | ||||||
* | Move model to module where it is needed, to simplify, at the cost of larger ↵ | Jon Marius Venstad | 2021-12-18 | 3 | -22/+21 | |
| | | | | bundles | |||||
* | Replace UrlcharSequenceNormalizer with one with an improved regex | Jon Marius Venstad | 2021-12-17 | 1 | -6/+0 | |
| | ||||||
* | Mockito test scope | Jon Marius Venstad | 2021-12-17 | 1 | -0/+1 | |
| | ||||||
* | Add some javadoc, and no need to handle null return for model | Jon Marius Venstad | 2021-12-17 | 2 | -2/+4 | |
| | ||||||
* | Replace optimaize with OpenNLP language detector | Jon Marius Venstad | 2021-12-17 | 8 | -170/+102 | |
| | ||||||
* | Add a BERT embedder | Jon Bratseth | 2021-12-16 | 1 | -2/+3 | |
| | ||||||
* | Time out requests after 200s | Jon Marius Venstad | 2021-12-13 | 1 | -1/+0 | |
| | ||||||
* | Update 2020 Oath copyrights. | gjoranv | 2021-10-27 | 2 | -2/+2 | |
| | ||||||
* | Update 2019 Oath copyrights. | gjoranv | 2021-10-27 | 1 | -1/+1 | |
| | ||||||
* | Update Verizon Media copyright notices. | gjoranv | 2021-10-07 | 3 | -3/+3 | |
| | ||||||
* | Update 2018 copyright notices. | gjoranv | 2021-10-07 | 3 | -3/+3 | |
| | ||||||
* | Update 2017 copyright notices. | gjoranv | 2021-10-07 | 70 | -70/+70 | |
| | ||||||
* | Encapsulate in a context | Jon Bratseth | 2021-10-01 | 2 | -16/+65 | |
| | ||||||
* | Pass destination | Jon Bratseth | 2021-09-30 | 2 | -8/+14 | |
| | | | | | This allows embedders to switch on it to enable bucket testing and similar. | |||||
* | encode -> embed | Jon Bratseth | 2021-09-28 | 3 | -64/+64 | |
| | ||||||
* | Separate component from linguistics | Jon Bratseth | 2021-09-25 | 17 | -1206/+0 | |
| | ||||||
* | Linguistics cleanup | Jon Bratseth | 2021-09-21 | 18 | -45/+29 | |
| | ||||||
* | Add 'encode' expression | Jon Bratseth | 2021-09-19 | 2 | -1/+35 | |
| | ||||||
* | Provide a (non-working) encoder by default | Jon Bratseth | 2021-09-17 | 1 | -1/+1 | |
| | ||||||
* | Update ABI spec | Jon Bratseth | 2021-09-17 | 1 | -20/+46 | |
| | ||||||
* | Cleanup | Jon Bratseth | 2021-09-17 | 5 | -9/+2 | |
| | ||||||
* | Refactor to separate classes | Jon Bratseth | 2021-09-17 | 8 | -203/+279 | |
| | ||||||
* | Encoder interface | Jon Bratseth | 2021-09-16 | 3 | -4/+55 | |
| | ||||||
* | Encode to sparse tensor | Jon Bratseth | 2021-09-16 | 3 | -0/+17 | |
| | ||||||
* | Encode to dense tensor | Jon Bratseth | 2021-09-16 | 3 | -4/+36 | |
| | ||||||
* | Use a result builder | Jon Bratseth | 2021-09-16 | 1 | -21/+53 | |
| | ||||||
* | Make SentencePieceEncoder configurable | Jon Bratseth | 2021-09-16 | 6 | -56/+283 | |
| | ||||||
* | Merge pull request #19130 from vespa-engine/bratseth/sp-export | Jo Kristian Bergum | 2021-09-14 | 2 | -0/+79 | |
|\ | | | | | Make public | |||||
| * | Add to abi spec | Jon Bratseth | 2021-09-14 | 1 | -0/+72 | |
| | | ||||||
| * | Make public | Jon Bratseth | 2021-09-14 | 1 | -0/+7 | |
| | | ||||||
* | | Merge pull request #19131 from vespa-engine/bratseth/sp-simplify | Jon Bratseth | 2021-09-14 | 1 | -13/+7 | |
|\ \ | | | | | | | Slight algorithm simplification | |||||
| * | | Slight algorithm simplification | Jon Bratseth | 2021-09-14 | 1 | -6/+4 | |
| | | | ||||||
| * | | Slight algorithm simplification | Jon Bratseth | 2021-09-14 | 1 | -6/+3 | |
| | | | ||||||
| * | | Slight algorithm simplification | Jon Bratseth | 2021-09-14 | 1 | -11/+10 | |
| |/ | ||||||
* / | More unit tests | Jon Bratseth | 2021-09-14 | 1 | -1/+20 | |
|/ | ||||||
* | Pure Java sentencepiece implementation | Jon Bratseth | 2021-09-13 | 7 | -2/+731 | |
| | ||||||
* | we want to compare Linguistics objects for equivalence | Arne Juul | 2021-08-04 | 4 | -1/+9 | |
| | ||||||
* | Require replacements to be applied during tokenization | Jon Bratseth | 2021-06-15 | 3 | -12/+11 | |
| | ||||||
* | Revert "Merge pull request #17754 from ↵ | Jon Bratseth | 2021-05-05 | 8 | -13/+336 | |
| | | | | | | | vespa-engine/revert-17747-bratseth/special-tokens-take-2" This reverts commit a2c9cd4bc04f1a3eaa31524b3970b96be5c2eda9, reversing changes made to 8c61a373af0066fbdf1cca354c24b197c7347321. | |||||
* | Revert "Reapply "Bratseth/special tokens"" | Jon Bratseth | 2021-05-05 | 8 | -336/+13 | |
| | ||||||
* | Revert "Merge pull request #17746 from ↵ | Jon Bratseth | 2021-05-05 | 8 | -13/+336 | |
| | | | | | | | vespa-engine/revert-17738-revert-17737-revert-17736-bratseth/special-tokens" This reverts commit 491856b396d003885e159345fe3f533f0fa35933, reversing changes made to 3720186303f4aef1d185525eaf61092097a64ec9. | |||||
* | Revert "Revert "Revert "Bratseth/special tokens""" | Jon Bratseth | 2021-05-05 | 8 | -336/+13 | |
| | ||||||
* | Revert "Revert "Bratseth/special tokens"" | Jon Bratseth | 2021-05-04 | 8 | -13/+336 | |
| | ||||||
* | Revert "Bratseth/special tokens" | Jon Bratseth | 2021-05-04 | 8 | -336/+13 | |
| |