Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Export API | Jon Bratseth | 2023-04-20 | 4 | -1/+218 |
| | |||||
* | Merge pull request #26777 from vespa-engine/bratseth/openai-client | Jon Bratseth | 2023-04-19 | 10 | -0/+312 |
|\ | | | | | Llm completion abstraction and OpenAi implementation | ||||
| * | Use record and use default record toString | Jon Bratseth | 2023-04-19 | 1 | -9/+1 |
| | | |||||
| * | Fix typo | Jon Bratseth | 2023-04-19 | 1 | -1/+1 |
| | | |||||
| * | Avoid ignored directory name | Jon Bratseth | 2023-04-19 | 2 | -2/+45 |
| | | |||||
| * | Llm completion abstraction and OpenAi implementation | Jon Bratseth | 2023-04-19 | 9 | -0/+277 |
| | | |||||
* | | Merge pull request #26753 from vespa-engine/bjorncs/global-phase | Arne H Juul | 2023-04-19 | 1 | -5/+9 |
|\ \ | |/ |/| | Use quarter vcpu by default if execution mode is parallel | ||||
| * | Use quarter vcpu by default if execution mode is parallel | Bjørn Christian Seime | 2023-04-17 | 1 | -5/+9 |
| | | |||||
* | | Merge pull request #26754 from vespa-engine/bratseth/jdk20 | Jon Bratseth | 2023-04-18 | 1 | -2/+2 |
|\ \ | | | | | | | Build with jdk20 | ||||
| * | | Build with jdk20 | Jon Bratseth | 2023-04-17 | 1 | -2/+2 |
| | | | |||||
* | | | Pull endtoken | Jon Bratseth | 2023-04-18 | 1 | -1/+1 |
| | | | |||||
* | | | Revert "Merge pull request #26744 from ↵ | Jon Bratseth | 2023-04-18 | 2 | -8/+14 |
| |/ |/| | | | | | | | | | | | vespa-engine/revert-26708-allow-start-end-sequence-tokens-as-args-bertbaseembedder" This reverts commit d025a93015e66efc0027d81a64e70530d6cb240e, reversing changes made to 4f2f29e1459b900d4b074f5cfc4c126837c54bfd. | ||||
* | | Revert "Allow start end sequence tokens as args bertbaseembedder" | Jon Bratseth | 2023-04-14 | 2 | -14/+8 |
|/ | |||||
* | Include createTokenTypeIds | connell gough | 2023-04-13 | 1 | -0/+1 |
| | |||||
* | Remove separator input and fix spelling error | connell gough | 2023-04-13 | 1 | -3/+2 |
| | |||||
* | Add default special tokens | connell gough | 2023-04-13 | 1 | -2/+2 |
| | |||||
* | Add special tokens as arguments and allow tokenTypeIds to be null | connell gough | 2023-04-13 | 2 | -7/+13 |
| | |||||
* | Add lz4-java for xxhash | Bjørn Christian Seime | 2023-03-31 | 1 | -0/+5 |
| | |||||
* | Support loading ONNX models through byte array | Bjørn Christian Seime | 2023-03-30 | 4 | -36/+162 |
| | | | | Rewrite OnnxRuntimeTest to test through it's public API | ||||
* | Don't reuse runtime between methods | Bjørn Christian Seime | 2023-03-30 | 1 | -21/+18 |
| | | | | Caching evaluators between test methods may have unwanted side effects | ||||
* | Revert "Arnej/unify cell type conversion" | Henning Baldersheim | 2023-03-12 | 3 | -44/+21 |
| | |||||
* | cell-type conversions should match | Arne Juul | 2023-03-10 | 2 | -20/+39 |
| | |||||
* | add BFLOAT16 from newer onnx.proto version | Arne Juul | 2023-03-10 | 1 | -1/+5 |
| | |||||
* | Revert "Deopend on component to get AbstractComponent" | Bjørn Christian Seime | 2023-02-28 | 1 | -6/+0 |
| | |||||
* | Deopend on component to get AbstractComponent | Jon Bratseth | 2023-02-28 | 1 | -0/+6 |
| | |||||
* | Extend AbstractComponent not AbstractResource | Bjørn Christian Seime | 2023-02-28 | 3 | -9/+9 |
| | |||||
* | Replace `OnnxEvaluatorCache` with OnnxRuntime | Bjørn Christian Seime | 2023-02-27 | 14 | -228/+325 |
| | | | | | | Require an `OnnxRuntime` instance to create `OnnxEvaluator` instances. Cache underlying `OrtSession` instead of `OnnxEvaluator`. Move static helpers for checking Onnx runtime availability from `OnnxEvaluator` to `OnnxRuntime`. | ||||
* | handle non-identifier onnx input/output names: instead of the conflicting | Arne Juul | 2023-02-22 | 1 | -0/+30 |
| | | | | | ad-hoc code in OnnxEvaluator, do it as part of general input/output mapping in OnnxModel. | ||||
* | Cache Onnx model instances | Bjørn Christian Seime | 2023-02-22 | 3 | -0/+141 |
| | | | | | | Manage lifecycle of OnnxEvaluator instances explicitly to allow instances to be cached without use WeakHashmap/finalizers. Inject shared Onnx model cache in ModelsEvaluator. | ||||
* | Make OnnxEvaluator closable | Bjørn Christian Seime | 2023-02-22 | 1 | -1/+12 |
| | |||||
* | Implement equals()/hashCode() | Bjørn Christian Seime | 2023-02-22 | 1 | -1/+17 |
| | | | | Required for using instances as key in HashMap | ||||
* | Add initial text generator component | Lester Solbakken | 2023-02-18 | 9 | -2/+410 |
| | |||||
* | ensure outputs with names as promised by getOutputInfo() | Arne Juul | 2023-02-10 | 3 | -1/+123 |
| | |||||
* | Fix GPU detection | Martin Polden | 2023-02-09 | 1 | -1/+1 |
| | |||||
* | Allow fallback to CPU if nodes are provisioned without GPU | Martin Polden | 2023-02-08 | 2 | -13/+24 |
| | |||||
* | Remove 'required' attribute | Martin Polden | 2023-01-26 | 2 | -22/+14 |
| | |||||
* | Skip CUDA entirely if GPU device is optional | Martin Polden | 2023-01-26 | 2 | -24/+39 |
| | | | | CUDA may fail after its library is loaded, e.g. when the session is created. | ||||
* | Do not filter on error code | Martin Polden | 2023-01-24 | 1 | -4/+1 |
| | |||||
* | Support configuration of GPU device to use in ONNX model | Martin Polden | 2023-01-23 | 1 | -1/+24 |
| | |||||
* | Add JNA in lib/jars + export as packages from jdisc_core | Bjørn Christian Seime | 2023-01-20 | 1 | -1/+21 |
| | | | | Jdisc-core will embed JNA. The JNA in lib/jars is used by fatjars only. | ||||
* | Unify on Streams.toList() | Henning Baldersheim | 2023-01-17 | 2 | -4/+3 |
| | |||||
* | Exclude in fat-deps | Jon Bratseth | 2023-01-09 | 1 | -1/+4 |
| | |||||
* | Huggingface not needed in config model | Jon Bratseth | 2023-01-09 | 1 | -0/+1 |
| | |||||
* | Allow model-integration:huggingface dependencies | Jon Bratseth | 2023-01-06 | 1 | -0/+6 |
| | |||||
* | DJL-based HuggingFaceEmbedder prototype | Andrii Yurkiv | 2023-01-04 | 4 | -0/+232 |
| | |||||
* | trying to make an OnnxEvaluator with empty path would fail | Arne Juul | 2022-12-12 | 1 | -0/+8 |
| | |||||
* | Merge pull request #25075 from vespa-engine/arnej/onnxruntime-as-bundle | Arne H Juul | 2022-12-05 | 1 | -3/+5 |
|\ | | | | | add container-onnxruntime-bundle | ||||
| * | rename to just "container-onnxruntime" | Arne Juul | 2022-12-02 | 1 | -1/+1 |
| | | |||||
| * | add container-onnxruntime-bundle | Arne Juul | 2022-12-02 | 1 | -3/+5 |
| | | |||||
* | | Revert "Revert collect(Collectors.toList())" | Henning Baldersheim | 2022-12-04 | 4 | -4/+4 |
| | |