aboutsummaryrefslogtreecommitdiffstats
path: root/model-integration
Commit message (Expand)AuthorAgeFilesLines
* Put the openai client in a separate componentJon Bratseth2023-04-2515-482/+20
* Export APIJon Bratseth2023-04-204-1/+218
* Merge pull request #26777 from vespa-engine/bratseth/openai-clientJon Bratseth2023-04-1910-0/+312
|\
| * Use record and use default record toStringJon Bratseth2023-04-191-9/+1
| * Fix typoJon Bratseth2023-04-191-1/+1
| * Avoid ignored directory nameJon Bratseth2023-04-192-2/+45
| * Llm completion abstraction and OpenAi implementationJon Bratseth2023-04-199-0/+277
* | Merge pull request #26753 from vespa-engine/bjorncs/global-phaseArne H Juul2023-04-191-5/+9
|\ \ | |/ |/|
| * Use quarter vcpu by default if execution mode is parallelBjørn Christian Seime2023-04-171-5/+9
* | Merge pull request #26754 from vespa-engine/bratseth/jdk20Jon Bratseth2023-04-181-2/+2
|\ \
| * | Build with jdk20Jon Bratseth2023-04-171-2/+2
* | | Pull endtokenJon Bratseth2023-04-181-1/+1
* | | Revert "Merge pull request #26744 from vespa-engine/revert-26708-allow-start-...Jon Bratseth2023-04-182-8/+14
| |/ |/|
* | Revert "Allow start end sequence tokens as args bertbaseembedder"Jon Bratseth2023-04-142-14/+8
|/
* Include createTokenTypeIdsconnell gough2023-04-131-0/+1
* Remove separator input and fix spelling errorconnell gough2023-04-131-3/+2
* Add default special tokensconnell gough2023-04-131-2/+2
* Add special tokens as arguments and allow tokenTypeIds to be nullconnell gough2023-04-132-7/+13
* Add lz4-java for xxhashBjørn Christian Seime2023-03-311-0/+5
* Support loading ONNX models through byte arrayBjørn Christian Seime2023-03-304-36/+162
* Don't reuse runtime between methodsBjørn Christian Seime2023-03-301-21/+18
* Revert "Arnej/unify cell type conversion"Henning Baldersheim2023-03-123-44/+21
* cell-type conversions should matchArne Juul2023-03-102-20/+39
* add BFLOAT16 from newer onnx.proto versionArne Juul2023-03-101-1/+5
* Revert "Deopend on component to get AbstractComponent"Bjørn Christian Seime2023-02-281-6/+0
* Deopend on component to get AbstractComponentJon Bratseth2023-02-281-0/+6
* Extend AbstractComponent not AbstractResourceBjørn Christian Seime2023-02-283-9/+9
* Replace `OnnxEvaluatorCache` with OnnxRuntimeBjørn Christian Seime2023-02-2714-228/+325
* handle non-identifier onnx input/output names: instead of the conflictingArne Juul2023-02-221-0/+30
* Cache Onnx model instancesBjørn Christian Seime2023-02-223-0/+141
* Make OnnxEvaluator closableBjørn Christian Seime2023-02-221-1/+12
* Implement equals()/hashCode()Bjørn Christian Seime2023-02-221-1/+17
* Add initial text generator componentLester Solbakken2023-02-189-2/+410
* ensure outputs with names as promised by getOutputInfo()Arne Juul2023-02-103-1/+123
* Fix GPU detectionMartin Polden2023-02-091-1/+1
* Allow fallback to CPU if nodes are provisioned without GPUMartin Polden2023-02-082-13/+24
* Remove 'required' attributeMartin Polden2023-01-262-22/+14
* Skip CUDA entirely if GPU device is optionalMartin Polden2023-01-262-24/+39
* Do not filter on error codeMartin Polden2023-01-241-4/+1
* Support configuration of GPU device to use in ONNX modelMartin Polden2023-01-231-1/+24
* Add JNA in lib/jars + export as packages from jdisc_coreBjørn Christian Seime2023-01-201-1/+21
* Unify on Streams.toList()Henning Baldersheim2023-01-172-4/+3
* Exclude in fat-depsJon Bratseth2023-01-091-1/+4
* Huggingface not needed in config modelJon Bratseth2023-01-091-0/+1
* Allow model-integration:huggingface dependenciesJon Bratseth2023-01-061-0/+6
* DJL-based HuggingFaceEmbedder prototypeAndrii Yurkiv2023-01-044-0/+232
* trying to make an OnnxEvaluator with empty path would failArne Juul2022-12-121-0/+8
* Merge pull request #25075 from vespa-engine/arnej/onnxruntime-as-bundleArne H Juul2022-12-051-3/+5
|\
| * rename to just "container-onnxruntime"Arne Juul2022-12-021-1/+1
| * add container-onnxruntime-bundleArne Juul2022-12-021-3/+5