aboutsummaryrefslogtreecommitdiffstats
path: root/model-integration
Commit message (Collapse)AuthorAgeFilesLines
* Export APIJon Bratseth2023-04-204-1/+218
|
* Merge pull request #26777 from vespa-engine/bratseth/openai-clientJon Bratseth2023-04-1910-0/+312
|\ | | | | Llm completion abstraction and OpenAi implementation
| * Use record and use default record toStringJon Bratseth2023-04-191-9/+1
| |
| * Fix typoJon Bratseth2023-04-191-1/+1
| |
| * Avoid ignored directory nameJon Bratseth2023-04-192-2/+45
| |
| * Llm completion abstraction and OpenAi implementationJon Bratseth2023-04-199-0/+277
| |
* | Merge pull request #26753 from vespa-engine/bjorncs/global-phaseArne H Juul2023-04-191-5/+9
|\ \ | |/ |/| Use quarter vcpu by default if execution mode is parallel
| * Use quarter vcpu by default if execution mode is parallelBjørn Christian Seime2023-04-171-5/+9
| |
* | Merge pull request #26754 from vespa-engine/bratseth/jdk20Jon Bratseth2023-04-181-2/+2
|\ \ | | | | | | Build with jdk20
| * | Build with jdk20Jon Bratseth2023-04-171-2/+2
| | |
* | | Pull endtokenJon Bratseth2023-04-181-1/+1
| | |
* | | Revert "Merge pull request #26744 from ↵Jon Bratseth2023-04-182-8/+14
| |/ |/| | | | | | | | | | | vespa-engine/revert-26708-allow-start-end-sequence-tokens-as-args-bertbaseembedder" This reverts commit d025a93015e66efc0027d81a64e70530d6cb240e, reversing changes made to 4f2f29e1459b900d4b074f5cfc4c126837c54bfd.
* | Revert "Allow start end sequence tokens as args bertbaseembedder"Jon Bratseth2023-04-142-14/+8
|/
* Include createTokenTypeIdsconnell gough2023-04-131-0/+1
|
* Remove separator input and fix spelling errorconnell gough2023-04-131-3/+2
|
* Add default special tokensconnell gough2023-04-131-2/+2
|
* Add special tokens as arguments and allow tokenTypeIds to be nullconnell gough2023-04-132-7/+13
|
* Add lz4-java for xxhashBjørn Christian Seime2023-03-311-0/+5
|
* Support loading ONNX models through byte arrayBjørn Christian Seime2023-03-304-36/+162
| | | | Rewrite OnnxRuntimeTest to test through it's public API
* Don't reuse runtime between methodsBjørn Christian Seime2023-03-301-21/+18
| | | | Caching evaluators between test methods may have unwanted side effects
* Revert "Arnej/unify cell type conversion"Henning Baldersheim2023-03-123-44/+21
|
* cell-type conversions should matchArne Juul2023-03-102-20/+39
|
* add BFLOAT16 from newer onnx.proto versionArne Juul2023-03-101-1/+5
|
* Revert "Deopend on component to get AbstractComponent"Bjørn Christian Seime2023-02-281-6/+0
|
* Deopend on component to get AbstractComponentJon Bratseth2023-02-281-0/+6
|
* Extend AbstractComponent not AbstractResourceBjørn Christian Seime2023-02-283-9/+9
|
* Replace `OnnxEvaluatorCache` with OnnxRuntimeBjørn Christian Seime2023-02-2714-228/+325
| | | | | | Require an `OnnxRuntime` instance to create `OnnxEvaluator` instances. Cache underlying `OrtSession` instead of `OnnxEvaluator`. Move static helpers for checking Onnx runtime availability from `OnnxEvaluator` to `OnnxRuntime`.
* handle non-identifier onnx input/output names: instead of the conflictingArne Juul2023-02-221-0/+30
| | | | | ad-hoc code in OnnxEvaluator, do it as part of general input/output mapping in OnnxModel.
* Cache Onnx model instancesBjørn Christian Seime2023-02-223-0/+141
| | | | | | Manage lifecycle of OnnxEvaluator instances explicitly to allow instances to be cached without use WeakHashmap/finalizers. Inject shared Onnx model cache in ModelsEvaluator.
* Make OnnxEvaluator closableBjørn Christian Seime2023-02-221-1/+12
|
* Implement equals()/hashCode()Bjørn Christian Seime2023-02-221-1/+17
| | | | Required for using instances as key in HashMap
* Add initial text generator componentLester Solbakken2023-02-189-2/+410
|
* ensure outputs with names as promised by getOutputInfo()Arne Juul2023-02-103-1/+123
|
* Fix GPU detectionMartin Polden2023-02-091-1/+1
|
* Allow fallback to CPU if nodes are provisioned without GPUMartin Polden2023-02-082-13/+24
|
* Remove 'required' attributeMartin Polden2023-01-262-22/+14
|
* Skip CUDA entirely if GPU device is optionalMartin Polden2023-01-262-24/+39
| | | | CUDA may fail after its library is loaded, e.g. when the session is created.
* Do not filter on error codeMartin Polden2023-01-241-4/+1
|
* Support configuration of GPU device to use in ONNX modelMartin Polden2023-01-231-1/+24
|
* Add JNA in lib/jars + export as packages from jdisc_coreBjørn Christian Seime2023-01-201-1/+21
| | | | Jdisc-core will embed JNA. The JNA in lib/jars is used by fatjars only.
* Unify on Streams.toList()Henning Baldersheim2023-01-172-4/+3
|
* Exclude in fat-depsJon Bratseth2023-01-091-1/+4
|
* Huggingface not needed in config modelJon Bratseth2023-01-091-0/+1
|
* Allow model-integration:huggingface dependenciesJon Bratseth2023-01-061-0/+6
|
* DJL-based HuggingFaceEmbedder prototypeAndrii Yurkiv2023-01-044-0/+232
|
* trying to make an OnnxEvaluator with empty path would failArne Juul2022-12-121-0/+8
|
* Merge pull request #25075 from vespa-engine/arnej/onnxruntime-as-bundleArne H Juul2022-12-051-3/+5
|\ | | | | add container-onnxruntime-bundle
| * rename to just "container-onnxruntime"Arne Juul2022-12-021-1/+1
| |
| * add container-onnxruntime-bundleArne Juul2022-12-021-3/+5
| |
* | Revert "Revert collect(Collectors.toList())"Henning Baldersheim2022-12-044-4/+4
| |