aboutsummaryrefslogtreecommitdiffstats
path: root/model-integration
Commit message (Expand)AuthorAgeFilesLines
* Avoid methods deprecated in jackson 2.17.1Henning Baldersheim6 days1-2/+2
* Revert "Update jackson2.vespa.version to v2.17.0"Henning Baldersheim7 days1-2/+2
* Merge pull request #31120 from vespa-engine/lesters/local-llm-timeoutHarald Musum7 days4-13/+57
|\
| * Update ABI specLester Solbakken7 days1-0/+2
| * Add timeout for requests waiting to start local llm inferenceLester Solbakken7 days3-13/+55
* | Avoid deprecated methods.Henning Baldersheim7 days1-2/+2
|/
* Merge pull request #31049 from vespa-engine/jobergum/add-prepend-embedder-sup...Bjørn Christian Seime2024-04-262-1/+64
|\
| * add prepend supportJo Kristian Bergum2024-04-252-1/+64
* | Update defaults for local LLM configLester Solbakken2024-04-241-3/+3
|/
* Revert "Specifically set number of threads to use in llama unit test"Harald Musum2024-04-221-4/+5
* Specifically set number of threads to use in llama unit testLester Solbakken2024-04-221-5/+4
* Remove unneccessary importLester Solbakken2024-04-221-1/+0
* Set minimum number of threads to 1Lester Solbakken2024-04-221-1/+1
* Disable local LLM unit testsLester Solbakken2024-04-161-1/+6
* Reapply "Lesters/add local llms 2"Lester Solbakken2024-04-1613-0/+957
* Revert "Lesters/add local llms 2"Harald Musum2024-04-1513-957/+0
* Reapply "Lesters/add local llms"Lester Solbakken2024-04-1513-0/+957
* Revert "Lesters/add local llms"Lester Solbakken2024-04-1513-957/+0
* Merge branch 'master' into lesters/add-local-llmsLester Solbakken2024-04-129-23/+15
|\
| * Unify on List.ofHenning Baldersheim2024-04-117-17/+11
| * Unify on Map.ofHenning Baldersheim2024-04-111-3/+2
* | Move LLM client stuff from container-search to model-integrationLester Solbakken2024-04-1213-0/+958
|/
* cache more and re-factorJo Kristian Bergum2024-04-082-68/+109
* Key by embedder id and don't recompute inputsJon Bratseth2024-04-072-65/+73
* Add equivalent to `Map.computeIfAbsent()` to simplify typical usage of the cacheBjørn Christian Seime2024-04-042-20/+3
* Add caching of onnx inference output using Context cacheJo Kristian Bergum2024-04-042-18/+55
* Support for dimensionality flexbility and caching onnx inference output using...Jo Kristian Bergum2024-04-042-53/+131
* Add some more tests on the binarizationJo Kristian Bergum2024-03-302-2/+39
* relax testing on float strings due to small inference differences in platformsJo Kristian Bergum2024-03-291-5/+10
* fix unwanted importJo Kristian Bergum2024-03-291-1/+0
* Add support for binarization and matryoshka for hf-embedderJo Kristian Bergum2024-03-293-5/+140
* All embedders are the sameJon Bratseth2024-02-091-2/+2
* Support embedding into rank 3 tensorsJon Bratseth2024-02-023-29/+42
* - Add alternative sparsify implementation using generic tensor.reduce/map.Henning Baldersheim2024-01-312-9/+52
* - Put the inner loops in separate methods. This improves ability to inline.Henning Baldersheim2024-01-202-54/+52
* Rename getIndex => getDirectIndexHenning Baldersheim2024-01-201-1/+1
* Add a class for assist efficient traversal of dimensions in an IndexedTensor.Henning Baldersheim2024-01-192-4/+9
* Cache sizes.totalSize() in variable to prevent recomputation.Henning Baldersheim2024-01-181-20/+19
* Since both value and log(value) are monotonically increasing for value >= 1,Henning Baldersheim2024-01-181-8/+8
* Construct array right away instead of going via a single element list and the...Henning Baldersheim2024-01-181-5/+15
* Avoid generic reduce and keep PAD token embeddingJo Kristian Bergum2024-01-152-24/+47
* remove extra spaceJo Kristian Bergum2024-01-111-1/+1
* address reviewJo Kristian Bergum2024-01-112-43/+25
* Avoid generic reduce to reduce gc pressureJo Kristian Bergum2024-01-112-19/+61
* finalJo Kristian Bergum2024-01-061-1/+1
* handle multilingual models betterJo Kristian Bergum2024-01-063-65/+147
* Allow mapped 1d tensor for embed expressionsJo Kristian Bergum2023-12-172-13/+13
* Add a splade embedder implementationJo Kristian Bergum2023-12-155-0/+30962
* Move Jackson util from vespajlib to container-core.Henning Baldersheim2023-11-243-3/+3
* jackson 2.16 changes some of its default settings so we consolidate our use o...Henning Baldersheim2023-11-233-8/+7