Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Update copyright | Jon Bratseth | 2023-10-09 | 1 | -2/+2 |
* | Prefer truncation configuration from tokenizer model | Bjørn Christian Seime | 2023-06-12 | 1 | -1/+8 |
* | Test padding with truncation | Bjørn Christian Seime | 2023-06-08 | 1 | -2/+3 |
* | Verify presence of special token | Bjørn Christian Seime | 2023-06-08 | 1 | -2/+7 |
* | Disable padding and make it configurable | Bjørn Christian Seime | 2023-06-08 | 1 | -9/+23 |
* | Make truncation and max length configurable | Bjørn Christian Seime | 2023-05-26 | 1 | -2/+28 |
* | Revert "Revert "Bjorncs/huggingface tokenizer"" | Bjørn Christian Seime | 2023-05-12 | 1 | -0/+88 |
* | Revert "Bjorncs/huggingface tokenizer" | Arnstein Ressem | 2023-05-12 | 1 | -88/+0 |
* | Disable special tokens by default | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+1 |
* | Make HF tokenizer a separate embedder | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+87 |