index
:
vespa
6
7
andreer/permanent-enclave-flag
aressem/test-dummy
aressem/test-pr-build-3
arnej/add-metric-renames
arnej/cosmetic-message-fix
arnej/golang-slime-port-1
arnej/remove-convert-in-calculator
arnej/use-our-shell-quote
balder/apply-termwise-filters-on-match-phase-2
balder/cpu-specific-compiles-for-bit-operations
balder/deinline
balder/hosted-always-convert-percentages-in-config-model
balder/no-longer-need-commit-and-wait
balder/prepare-for-hw-specialized-hamming-distance
balder/thread-local-jetty-bytebuffer-pool
balder/update-defaults-for-use-xxx-fetch-postings
balder/zncurve
bjormel/aws-main-controller
bjormel/aws-main-controller-take2
bratseth/linguistics-context-rebased
bratseth/more-exclusive-take-2
bratseth/stem-prefixes
bratseth/streamed-fill
freva/delete-unused
freva/fix-zk-version
hakonhall/enumerate-all-prod-regions
hakonhall/fix-remembertoupdatesystemflagsdataarchive-javadoc
havardpe/enable-nested-ctf-meta-data
havardpe/extract-default-query-feature-values
havardpe/protoc-gen-csi
jdk21-preparations
jonmv/dependency-inversion-for-mbus-config
jvenstad/utils
kkraune/ci-warning
kkraune/logs
ldalves/querybuilder
leandroalves/prod-controller
lesters/bert-testing
lesters/external-llms
lesters/stateless-onnx-eval-once
marius/add-more-significance-searcher-tests
marius/add-significance-model-tool
master
mortent/calypso
mortent/new-public-cd-endpoint
mpolden/update-abi
olaa/delete-flags
olaa/otel-config-model
renovate/jackson2.vespa.version
renovate/junit5-monorepo
revert-26576-revert-26567-bjorncs/cloud-app-validation
revert-26584-revert-26578-bjorncs/tlsv13
revert-27857-bjorncs/tls13
revert-28660-revert-28656-hmusum/fix-onnx-model-cost
revert-30559-toregge/require-vespa-build-dependencies-for-vespa-devel
toregge/dont-promote-vla-warning-to-error-when-compiling-with-clang-18
toregge/non-const-evaluate-and-evaluate-hits-member-functions-in-streaming-search-query-nodes
vekterli/change-test-and-set-update-not-found-semantics
yngveaasheim/skeleton-for-component-in-metrics-enum
An engine for low-latency computation over large data sets
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
model-integration
Commit message (
Expand
)
Author
Age
Files
Lines
*
Avoid methods deprecated in jackson 2.17.1
Henning Baldersheim
6 days
1
-2
/
+2
*
Revert "Update jackson2.vespa.version to v2.17.0"
Henning Baldersheim
7 days
1
-2
/
+2
*
Merge pull request #31120 from vespa-engine/lesters/local-llm-timeout
Harald Musum
7 days
4
-13
/
+57
|
\
|
*
Update ABI spec
Lester Solbakken
7 days
1
-0
/
+2
|
*
Add timeout for requests waiting to start local llm inference
Lester Solbakken
7 days
3
-13
/
+55
*
|
Avoid deprecated methods.
Henning Baldersheim
7 days
1
-2
/
+2
|
/
*
Merge pull request #31049 from vespa-engine/jobergum/add-prepend-embedder-sup...
Bjørn Christian Seime
2024-04-26
2
-1
/
+64
|
\
|
*
add prepend support
Jo Kristian Bergum
2024-04-25
2
-1
/
+64
*
|
Update defaults for local LLM config
Lester Solbakken
2024-04-24
1
-3
/
+3
|
/
*
Revert "Specifically set number of threads to use in llama unit test"
Harald Musum
2024-04-22
1
-4
/
+5
*
Specifically set number of threads to use in llama unit test
Lester Solbakken
2024-04-22
1
-5
/
+4
*
Remove unneccessary import
Lester Solbakken
2024-04-22
1
-1
/
+0
*
Set minimum number of threads to 1
Lester Solbakken
2024-04-22
1
-1
/
+1
*
Disable local LLM unit tests
Lester Solbakken
2024-04-16
1
-1
/
+6
*
Reapply "Lesters/add local llms 2"
Lester Solbakken
2024-04-16
13
-0
/
+957
*
Revert "Lesters/add local llms 2"
Harald Musum
2024-04-15
13
-957
/
+0
*
Reapply "Lesters/add local llms"
Lester Solbakken
2024-04-15
13
-0
/
+957
*
Revert "Lesters/add local llms"
Lester Solbakken
2024-04-15
13
-957
/
+0
*
Merge branch 'master' into lesters/add-local-llms
Lester Solbakken
2024-04-12
9
-23
/
+15
|
\
|
*
Unify on List.of
Henning Baldersheim
2024-04-11
7
-17
/
+11
|
*
Unify on Map.of
Henning Baldersheim
2024-04-11
1
-3
/
+2
*
|
Move LLM client stuff from container-search to model-integration
Lester Solbakken
2024-04-12
13
-0
/
+958
|
/
*
cache more and re-factor
Jo Kristian Bergum
2024-04-08
2
-68
/
+109
*
Key by embedder id and don't recompute inputs
Jon Bratseth
2024-04-07
2
-65
/
+73
*
Add equivalent to `Map.computeIfAbsent()` to simplify typical usage of the cache
Bjørn Christian Seime
2024-04-04
2
-20
/
+3
*
Add caching of onnx inference output using Context cache
Jo Kristian Bergum
2024-04-04
2
-18
/
+55
*
Support for dimensionality flexbility and caching onnx inference output using...
Jo Kristian Bergum
2024-04-04
2
-53
/
+131
*
Add some more tests on the binarization
Jo Kristian Bergum
2024-03-30
2
-2
/
+39
*
relax testing on float strings due to small inference differences in platforms
Jo Kristian Bergum
2024-03-29
1
-5
/
+10
*
fix unwanted import
Jo Kristian Bergum
2024-03-29
1
-1
/
+0
*
Add support for binarization and matryoshka for hf-embedder
Jo Kristian Bergum
2024-03-29
3
-5
/
+140
*
All embedders are the same
Jon Bratseth
2024-02-09
1
-2
/
+2
*
Support embedding into rank 3 tensors
Jon Bratseth
2024-02-02
3
-29
/
+42
*
- Add alternative sparsify implementation using generic tensor.reduce/map.
Henning Baldersheim
2024-01-31
2
-9
/
+52
*
- Put the inner loops in separate methods. This improves ability to inline.
Henning Baldersheim
2024-01-20
2
-54
/
+52
*
Rename getIndex => getDirectIndex
Henning Baldersheim
2024-01-20
1
-1
/
+1
*
Add a class for assist efficient traversal of dimensions in an IndexedTensor.
Henning Baldersheim
2024-01-19
2
-4
/
+9
*
Cache sizes.totalSize() in variable to prevent recomputation.
Henning Baldersheim
2024-01-18
1
-20
/
+19
*
Since both value and log(value) are monotonically increasing for value >= 1,
Henning Baldersheim
2024-01-18
1
-8
/
+8
*
Construct array right away instead of going via a single element list and the...
Henning Baldersheim
2024-01-18
1
-5
/
+15
*
Avoid generic reduce and keep PAD token embedding
Jo Kristian Bergum
2024-01-15
2
-24
/
+47
*
remove extra space
Jo Kristian Bergum
2024-01-11
1
-1
/
+1
*
address review
Jo Kristian Bergum
2024-01-11
2
-43
/
+25
*
Avoid generic reduce to reduce gc pressure
Jo Kristian Bergum
2024-01-11
2
-19
/
+61
*
final
Jo Kristian Bergum
2024-01-06
1
-1
/
+1
*
handle multilingual models better
Jo Kristian Bergum
2024-01-06
3
-65
/
+147
*
Allow mapped 1d tensor for embed expressions
Jo Kristian Bergum
2023-12-17
2
-13
/
+13
*
Add a splade embedder implementation
Jo Kristian Bergum
2023-12-15
5
-0
/
+30962
*
Move Jackson util from vespajlib to container-core.
Henning Baldersheim
2023-11-24
3
-3
/
+3
*
jackson 2.16 changes some of its default settings so we consolidate our use o...
Henning Baldersheim
2023-11-23
3
-8
/
+7
[next]