path: root/model-integration/src/main/java/ai/vespa
Commit message  Author  Age  Files  Lines
* Less verbose logging when failing to find CUDA and it is optional  Jo Kristian Bergum  2023-10-26  1  -2/+2
* Disable CPU arena allocator for ONNX  Bjørn Christian Seime  2023-10-19  1  -0/+1
* Update copyright  Jon Bratseth  2023-10-09  83  -86/+90
* Don't index PAD and re-factoring  Jo Kristian Bergum  2023-09-26  1  -32/+28
* Add config options + license  Jo Kristian Bergum  2023-09-21  1  -0/+1
* Ensure Onnx/Hugginface resources are cleaned up on deconstruction  Bjørn Christian Seime  2023-09-21  1  -0/+6
* Add ColBERT embedder  Jo Kristian Bergum  2023-09-21  1  -0/+299
* - Use equals when comparing Optional<Long>  Henning Baldersheim  2023-09-13  2  -4/+4
* Use thread safe hash map  Bjørn Christian Seime  2023-08-31  1  -2/+2
* Merge pull request #27969 from vespa-engine/bjorncs/embedder-metrics  Jon Bratseth  2023-08-31  3  -7/+80
|\
| * Allow sampling of fractional millis  Bjørn Christian Seime  2023-08-25  3  -15/+10
| * Add generic metrics for embedders  Bjørn Christian Seime  2023-08-04  3  -7/+85
* | Better error message when importing models with illegal names  Lester Solbakken  2023-08-29  1  -0/+25
|/
* Log when GPU configuration is successful  Martin Polden  2023-07-19  1  -3/+8
* Log warning when failing to use GPU  Martin Polden  2023-07-19  1  -1/+6
* update onnx.proto  Arne Juul  2023-06-23  2  -5/+7
* Prefer truncation configuration from tokenizer model  Bjørn Christian Seime  2023-06-12  1  -6/+19
* Add missing wiring of pooling strategy  Bjørn Christian Seime  2023-06-08  1  -11/+1
* Disable padding and make it configurable  Bjørn Christian Seime  2023-06-08  1  -0/+1
* Make pooling strategy configurable for Huggingface embedder  Bjørn Christian Seime  2023-06-05  3  -17/+54
* Properly ignore token type ids from tokenizer if disabled  Bjørn Christian Seime  2023-05-30  1  -2/+2
* Remove dead code  Bjørn Christian Seime  2023-05-26  1  -7/+0
* Make truncation and max length configurable  Bjørn Christian Seime  2023-05-26  1  -12/+3
* Revert "Revert "Bjorncs/huggingface tokenizer""  Bjørn Christian Seime  2023-05-12  4  -190/+28
* Revert "Bjorncs/huggingface tokenizer"  Arnstein Ressem  2023-05-12  4  -28/+190
* Handle models requiring token type ids  Bjørn Christian Seime  2023-05-11  1  -13/+19
* Don't lower case  Bjørn Christian Seime  2023-05-11  1  -1/+1
* Disable special tokens by default  Bjørn Christian Seime  2023-05-11  1  -0/+1
* Mark HF integration as beta  Bjørn Christian Seime  2023-05-11  1  -0/+2
* Make HF tokenizer a separate embedder  Bjørn Christian Seime  2023-05-11  4  -177/+6
* Don't specify both package and namespace  Bjørn Christian Seime  2023-05-11  1  -0/+1
* Reapply "Bjorncs/embedder onnx gpu"  Bjørn Christian Seime  2023-05-09  4  -6/+40
* Revert "Bjorncs/embedder onnx gpu"  Geir Storli  2023-05-08  4  -40/+6
* Handle nulls  Bjørn Christian Seime  2023-05-08  1  -0/+4
* fixup! Require GPU when requested and available for Bert + HF embedders  Bjørn Christian Seime  2023-05-08  1  -1/+1
* Require GPU when requested and available for Bert + HF embedders  Bjørn Christian Seime  2023-05-08  3  -1/+4
* Require GPU when available for ONNX evaluation in global-phase and embedders  Bjørn Christian Seime  2023-05-08  2  -5/+36
* Make thread pool size configurable  Bjørn Christian Seime  2023-05-05  4  -17/+19
* Make normalization optional  Bjørn Christian Seime  2023-05-05  1  -1/+4
* Allow for manual configuration of GPU  Bjørn Christian Seime  2023-05-05  1  -1/+5
* Move config to same package as component  Bjørn Christian Seime  2023-05-05  1  -1/+0
* Split out HF Tokenizer  Bjørn Christian Seime  2023-05-05  4  -23/+174
* Put the openai client in a separate component  Jon Bratseth  2023-04-25  11  -279/+4
* Export API  Jon Bratseth  2023-04-20  3  -0/+33
* Merge pull request #26777 from vespa-engine/bratseth/openai-client  Jon Bratseth  2023-04-19  8  -0/+258
|\
| * Use record and use default record toString  Jon Bratseth  2023-04-19  1  -9/+1
| * Fix typo  Jon Bratseth  2023-04-19  1  -1/+1
| * Avoid ignored directory name  Jon Bratseth  2023-04-19  1  -0/+44
| * Llm completion abstraction and OpenAi implementation  Jon Bratseth  2023-04-19  7  -0/+222
* | Merge pull request #26753 from vespa-engine/bjorncs/global-phase  Arne H Juul  2023-04-19  1  -5/+9
|\ \
| |/
|/|