Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | Handle models requiring token type ids | Bjørn Christian Seime | 2023-05-11 | 2 | -13/+20 | |
| | ||||||
* | Don't lower case | Bjørn Christian Seime | 2023-05-11 | 1 | -1/+1 | |
| | ||||||
* | Disable special tokens by default | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+1 | |
| | ||||||
* | Mark HF integration as beta | Bjørn Christian Seime | 2023-05-11 | 1 | -0/+2 | |
| | ||||||
* | Make HF tokenizer a separate embedder | Bjørn Christian Seime | 2023-05-11 | 5 | -197/+6 | |
| | ||||||
* | Don't specify both package and namespace | Bjørn Christian Seime | 2023-05-11 | 2 | -1/+1 | |
| | ||||||
* | Upgrade HF Tokenizer to 0.22.1 | Bjørn Christian Seime | 2023-05-08 | 1 | -1/+1 | |
| | ||||||
* | Handle nulls | Bjørn Christian Seime | 2023-05-08 | 1 | -0/+4 | |
| | ||||||
* | fixup! Require GPU when requested and available for Bert + HF embedders | Bjørn Christian Seime | 2023-05-08 | 1 | -1/+1 | |
| | ||||||
* | Require GPU when requested and available for Bert + HF embedders | Bjørn Christian Seime | 2023-05-08 | 5 | -5/+6 | |
| | ||||||
* | Require GPU when available for ONNX evaluation in global-phase and embedders | Bjørn Christian Seime | 2023-05-08 | 3 | -5/+42 | |
| | ||||||
* | Make thread pool size configurable | Bjørn Christian Seime | 2023-05-05 | 5 | -17/+24 | |
| | ||||||
* | Make normalization optional | Bjørn Christian Seime | 2023-05-05 | 2 | -2/+8 | |
| | ||||||
* | Allow for manual configuration of GPU | Bjørn Christian Seime | 2023-05-05 | 2 | -1/+8 | |
| | ||||||
* | Move config to same package as component | Bjørn Christian Seime | 2023-05-05 | 2 | -1/+1 | |
| | ||||||
* | Split out HF Tokenizer | Bjørn Christian Seime | 2023-05-05 | 4 | -23/+174 | |
| | ||||||
* | Put the openai client in a separate component | Jon Bratseth | 2023-04-25 | 15 | -482/+20 | |
| | ||||||
* | Export API | Jon Bratseth | 2023-04-20 | 4 | -1/+218 | |
| | ||||||
* | Merge pull request #26777 from vespa-engine/bratseth/openai-client | Jon Bratseth | 2023-04-19 | 10 | -0/+312 | |
|\ | | | | | Llm completion abstraction and OpenAi implementation | |||||
| * | Use record and use default record toString | Jon Bratseth | 2023-04-19 | 1 | -9/+1 | |
| | | ||||||
| * | Fix typo | Jon Bratseth | 2023-04-19 | 1 | -1/+1 | |
| | | ||||||
| * | Avoid ignored directory name | Jon Bratseth | 2023-04-19 | 2 | -2/+45 | |
| | | ||||||
| * | Llm completion abstraction and OpenAi implementation | Jon Bratseth | 2023-04-19 | 9 | -0/+277 | |
| | | ||||||
* | | Merge pull request #26753 from vespa-engine/bjorncs/global-phase | Arne H Juul | 2023-04-19 | 1 | -5/+9 | |
|\ \ | |/ |/| | Use quarter vcpu by default if execution mode is parallel | |||||
| * | Use quarter vcpu by default if execution mode is parallel | Bjørn Christian Seime | 2023-04-17 | 1 | -5/+9 | |
| | | ||||||
* | | Merge pull request #26754 from vespa-engine/bratseth/jdk20 | Jon Bratseth | 2023-04-18 | 1 | -2/+2 | |
|\ \ | | | | | | | Build with jdk20 | |||||
| * | | Build with jdk20 | Jon Bratseth | 2023-04-17 | 1 | -2/+2 | |
| | | | ||||||
* | | | Pull endtoken | Jon Bratseth | 2023-04-18 | 1 | -1/+1 | |
| | | | ||||||
* | | | Revert "Merge pull request #26744 from vespa-engine/revert-26708-allow-start-end-sequence-tokens-as-args-bertbaseembedder" | Jon Bratseth | 2023-04-18 | 2 | -8/+14 | |
| |/ |/| | | | | | | | | | | | This reverts commit d025a93015e66efc0027d81a64e70530d6cb240e, reversing changes made to 4f2f29e1459b900d4b074f5cfc4c126837c54bfd. | |||||
* | | Revert "Allow start end sequence tokens as args bertbaseembedder" | Jon Bratseth | 2023-04-14 | 2 | -14/+8 | |
|/ | ||||||
* | Include createTokenTypeIds | connell gough | 2023-04-13 | 1 | -0/+1 | |
| | ||||||
* | Remove separator input and fix spelling error | connell gough | 2023-04-13 | 1 | -3/+2 | |
| | ||||||
* | Add default special tokens | connell gough | 2023-04-13 | 1 | -2/+2 | |
| | ||||||
* | Add special tokens as arguments and allow tokenTypeIds to be null | connell gough | 2023-04-13 | 2 | -7/+13 | |
| | ||||||
* | Add lz4-java for xxhash | Bjørn Christian Seime | 2023-03-31 | 1 | -0/+5 | |
| | ||||||
* | Support loading ONNX models through byte array | Bjørn Christian Seime | 2023-03-30 | 4 | -36/+162 | |
| | | | | Rewrite OnnxRuntimeTest to test through its public API | |||||
* | Don't reuse runtime between methods | Bjørn Christian Seime | 2023-03-30 | 1 | -21/+18 | |
| | | | | Caching evaluators between test methods may have unwanted side effects | |||||
* | Revert "Arnej/unify cell type conversion" | Henning Baldersheim | 2023-03-12 | 3 | -44/+21 | |
| | ||||||
* | cell-type conversions should match | Arne Juul | 2023-03-10 | 2 | -20/+39 | |
| | ||||||
* | add BFLOAT16 from newer onnx.proto version | Arne Juul | 2023-03-10 | 1 | -1/+5 | |
| | ||||||
* | Revert "Deopend on component to get AbstractComponent" | Bjørn Christian Seime | 2023-02-28 | 1 | -6/+0 | |
| | ||||||
* | Deopend on component to get AbstractComponent | Jon Bratseth | 2023-02-28 | 1 | -0/+6 | |
| | ||||||
* | Extend AbstractComponent not AbstractResource | Bjørn Christian Seime | 2023-02-28 | 3 | -9/+9 | |
| | ||||||
* | Replace `OnnxEvaluatorCache` with OnnxRuntime | Bjørn Christian Seime | 2023-02-27 | 14 | -228/+325 | |
| | | | | | | Require an `OnnxRuntime` instance to create `OnnxEvaluator` instances. Cache underlying `OrtSession` instead of `OnnxEvaluator`. Move static helpers for checking Onnx runtime availability from `OnnxEvaluator` to `OnnxRuntime`. | |||||
* | handle non-identifier onnx input/output names: instead of the conflicting | Arne Juul | 2023-02-22 | 1 | -0/+30 | |
| | | | | | ad-hoc code in OnnxEvaluator, do it as part of general input/output mapping in OnnxModel. | |||||
* | Cache Onnx model instances | Bjørn Christian Seime | 2023-02-22 | 3 | -0/+141 | |
| | | | | | | Manage lifecycle of OnnxEvaluator instances explicitly to allow instances to be cached without using WeakHashMap/finalizers. Inject shared Onnx model cache in ModelsEvaluator. | |||||
* | Make OnnxEvaluator closable | Bjørn Christian Seime | 2023-02-22 | 1 | -1/+12 | |
| | ||||||
* | Implement equals()/hashCode() | Bjørn Christian Seime | 2023-02-22 | 1 | -1/+17 | |
| | | | | Required for using instances as key in HashMap | |||||
* | Add initial text generator component | Lester Solbakken | 2023-02-18 | 9 | -2/+410 | |
| | ||||||
* | ensure outputs with names as promised by getOutputInfo() | Arne Juul | 2023-02-10 | 3 | -1/+123 | |
| |