path: root/linguistics-components/src/main/resources
Commit message | Author | Age | Files | Lines
* Update copyright | Jon Bratseth | 2023-10-09 | 2 | -3/+3
* Introduce services.xml syntax for configuring HuggingFace embedders | Bjørn Christian Seime | 2023-06-02 | 1 | -13/+0
* Make truncation and max length configurable | Bjørn Christian Seime | 2023-05-26 | 1 | -1/+3
* Change parameter type to 'model' | Bjørn Christian Seime | 2023-05-12 | 1 | -1/+1
* Revert "Revert "Bjorncs/huggingface tokenizer"" | Bjørn Christian Seime | 2023-05-12 | 1 | -0/+11
* Revert "Bjorncs/huggingface tokenizer" | Arnstein Ressem | 2023-05-12 | 1 | -11/+0
* Disable special tokens by default | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+2
* Make HF tokenizer a separate embedder | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+9
* BERT -> WordPiece, make subword prefix configurable | Jon Bratseth | 2021-12-17 | 1 | -2/+5
* Add a BERT embedder | Jon Bratseth | 2021-12-16 | 1 | -0/+11
* encode -> embed | Jon Bratseth | 2021-09-28 | 1 | -1/+1
* Use full filename | Jon Bratseth | 2021-09-27 | 1 | -0/+0
* Separate component from linguistics | Jon Bratseth | 2021-09-25 | 1 | -0/+18