summaryrefslogtreecommitdiffstats
path: root/linguistics-components/src/main/java
Commit message (Expand)AuthorAgeFilesLines
* Test padding with truncationBjørn Christian Seime2023-06-081-1/+1
* Disable padding and make it configurableBjørn Christian Seime2023-06-081-3/+7
* Introduce services.xml syntax for configuring HuggingFace embeddersBjørn Christian Seime2023-06-021-0/+1
* Make truncation and max length configurableBjørn Christian Seime2023-05-261-4/+14
* Implement deconstructBjørn Christian Seime2023-05-161-0/+1
* Revert "Revert "Bjorncs/huggingface tokenizer""Bjørn Christian Seime2023-05-123-0/+172
* Revert "Bjorncs/huggingface tokenizer"Arnstein Ressem2023-05-123-172/+0
* Disable special tokens by defaultBjørn Christian Seime2023-05-111-12/+7
* Mark HF integration as betaBjørn Christian Seime2023-05-112-0/+5
* Make HF tokenizer a separate embedderBjørn Christian Seime2023-05-113-0/+172
* Add skipping of control tokensLester Solbakken2023-02-102-6/+22
* Add decoding of sentencepiece token sequence to textLester Solbakken2023-02-103-2/+24
* Revert "Revert collect(Collectors.toList())"Henning Baldersheim2022-12-041-1/+1
* Revert collect(Collectors.toList())Henning Baldersheim2022-12-041-1/+1
* collect(Collectors.toList()) -> toList()Henning Baldersheim2022-12-021-1/+1
* Use '@Inject' from 'annotations' in multiple bundlesBjørn Christian Seime2022-05-062-2/+2
* BERT -> WordPiece, make subword prefix configurableJon Bratseth2021-12-174-31/+60
* Add a BERT embedderJon Bratseth2021-12-165-37/+294
* Add custom `@Beta` annotationBjørn Christian Seime2021-12-031-1/+1
* Correct copyright headersJon Bratseth2021-10-201-1/+0
* Add missiung copyrightsJon Bratseth2021-10-201-0/+1
* Encapsulate in a contextJon Bratseth2021-10-011-9/+7
* Update linguisticvs-componentsJon Bratseth2021-09-301-3/+9
* encode -> embedJon Bratseth2021-09-281-15/+14
* Separate component from linguisticsJon Bratseth2021-09-258-0/+490