Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Update copyright | Jon Bratseth | 2023-10-09 | 2 | -3/+3 |
* | Introduce services.xml syntax for configuring HuggingFace embedders | Bjørn Christian Seime | 2023-06-02 | 1 | -13/+0 |
* | Make truncation and max length configurable | Bjørn Christian Seime | 2023-05-26 | 1 | -1/+3 |
* | Change parameter type to 'model' | Bjørn Christian Seime | 2023-05-12 | 1 | -1/+1 |
* | Revert "Revert "Bjorncs/huggingface tokenizer"" | Bjørn Christian Seime | 2023-05-12 | 1 | -0/+11 |
* | Revert "Bjorncs/huggingface tokenizer" | Arnstein Ressem | 2023-05-12 | 1 | -11/+0 |
* | Disable special tokens by default | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+2 |
* | Make HF tokenizer a separate embedder | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+9 |
* | BERT -> WordPiece, make subword prefix configurable | Jon Bratseth | 2021-12-17 | 1 | -2/+5 |
* | Add a BERT embedder | Jon Bratseth | 2021-12-16 | 1 | -0/+11 |
* | encode -> embed | Jon Bratseth | 2021-09-28 | 1 | -1/+1 |
* | Use full filename | Jon Bratseth | 2021-09-27 | 1 | -0/+0 |
* | Separate component from linguistics | Jon Bratseth | 2021-09-25 | 1 | -0/+18 |