Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Construct array right away instead of going via a single element list and the... | Henning Baldersheim | 2024-01-18 | 1 | -2/+3 |
* | Update copyright | Jon Bratseth | 2023-10-09 | 18 | -21/+21 |
* | HuggingFace Tokenizer expects path to be a directory | Bjørn Christian Seime | 2023-08-31 | 1 | -2/+18 |
* | Prefer truncation configuration from tokenizer model | Bjørn Christian Seime | 2023-06-12 | 2 | -13/+104 |
* | Test padding with truncation | Bjørn Christian Seime | 2023-06-08 | 1 | -1/+1 |
* | Disable padding and make it configurable | Bjørn Christian Seime | 2023-06-08 | 1 | -3/+7 |
* | Introduce services.xml syntax for configuring HuggingFace embedders | Bjørn Christian Seime | 2023-06-02 | 2 | -13/+1 |
* | Make truncation and max length configurable | Bjørn Christian Seime | 2023-05-26 | 2 | -5/+17 |
* | Implement deconstruct | Bjørn Christian Seime | 2023-05-16 | 1 | -0/+1 |
* | Change parameter type to 'model' | Bjørn Christian Seime | 2023-05-12 | 1 | -1/+1 |
* | Revert "Revert "Bjorncs/huggingface tokenizer"" | Bjørn Christian Seime | 2023-05-12 | 4 | -0/+183 |
* | Revert "Bjorncs/huggingface tokenizer" | Arnstein Ressem | 2023-05-12 | 4 | -183/+0 |
* | Disable special tokens by default | Bjørn Christian Seime | 2023-05-11 | 2 | -12/+9 |
* | Mark HF integration as beta | Bjørn Christian Seime | 2023-05-11 | 2 | -0/+5 |
* | Make HF tokenizer a separate embedder | Bjørn Christian Seime | 2023-05-11 | 4 | -0/+181 |
* | Add skipping of control tokens | Lester Solbakken | 2023-02-10 | 2 | -6/+22 |
* | Add decoding of sentencepiece token sequence to text | Lester Solbakken | 2023-02-10 | 3 | -2/+24 |
* | Revert "Revert collect(Collectors.toList())" | Henning Baldersheim | 2022-12-04 | 1 | -1/+1 |
* | Revert collect(Collectors.toList()) | Henning Baldersheim | 2022-12-04 | 1 | -1/+1 |
* | collect(Collectors.toList()) -> toList() | Henning Baldersheim | 2022-12-02 | 1 | -1/+1 |
* | Use '@Inject' from 'annotations' in multiple bundles | Bjørn Christian Seime | 2022-05-06 | 2 | -2/+2 |
* | BERT -> WordPiece, make subword prefix configurable | Jon Bratseth | 2021-12-17 | 5 | -33/+65 |
* | Add a BERT embedder | Jon Bratseth | 2021-12-16 | 6 | -37/+305 |
* | Add custom `@Beta` annotation | Bjørn Christian Seime | 2021-12-03 | 1 | -1/+1 |
* | Correct copyright headers | Jon Bratseth | 2021-10-20 | 1 | -1/+0 |
* | Add missiung copyrights | Jon Bratseth | 2021-10-20 | 1 | -0/+1 |
* | Encapsulate in a context | Jon Bratseth | 2021-10-01 | 1 | -9/+7 |
* | Update linguisticvs-components | Jon Bratseth | 2021-09-30 | 1 | -3/+9 |
* | encode -> embed | Jon Bratseth | 2021-09-28 | 2 | -16/+15 |
* | Use full filename | Jon Bratseth | 2021-09-27 | 1 | -0/+0 |
* | Separate component from linguistics | Jon Bratseth | 2021-09-25 | 10 | -0/+818 |