index
:
vespa
6
7
andreer/permanent-enclave-flag
aressem/test-dummy
aressem/test-pr-bk
aressem/test-pr-build-3
aressem/test-valgrind
arnej/add-feature-flag
arnej/cosmetic-message-fix
arnej/golang-slime-port-1
arnej/remove-convert-in-calculator
arnej/use-our-shell-quote
arnej/wip-sand-fixups
balder/apply-termwise-filters-on-match-phase-2
balder/cpu-specific-compiles-for-bit-operations
balder/deinline
balder/enable-std-stding-as-default
balder/hosted-always-convert-percentages-in-config-model
balder/no-longer-need-commit-and-wait
balder/prepare-for-hw-specialized-hamming-distance
balder/prepare-for-string_view-1
balder/thread-local-jetty-bytebuffer-pool
balder/update-defaults-for-use-xxx-fetch-postings
balder/zncurve
bjormel/aws-main-controller
bjormel/aws-main-controller-take2
bratseth/grouping-trace
bratseth/linguistics-context-rebased
bratseth/more-exclusive-take-2
bratseth/stem-prefixes
bratseth/streamed-fill
hakonhall/enumerate-all-prod-regions
hakonhall/fix-remembertoupdatesystemflagsdataarchive-javadoc
havardpe/enable-nested-ctf-meta-data
havardpe/extract-default-query-feature-values
havardpe/protoc-gen-csi
interns/languageserver
interns/magnus/symbols
jdk21-preparations
jonmv/allow-private-endpoints-in-dev-perf
jonmv/dependency-inversion-for-mbus-config
jvenstad/utils
kkraune/ci-warning
ldalves/querybuilder
leandroalves/prod-controller
lesters/bert-testing
lesters/external-llms
lesters/stateless-onnx-eval-once
master
mortent/calypso
mortent/new-public-cd-endpoint
mpolden/update-abi
olaa/delete-flags
olaa/otel-config-model
renovate/junit5-monorepo
renovate/major-protobuf.vespa.version
renovate/maven-shade-plugin.vespa.version
renovate/plexus-archiver.vespa.version
revert-26576-revert-26567-bjorncs/cloud-app-validation
revert-26584-revert-26578-bjorncs/tlsv13
revert-27857-bjorncs/tls13
revert-28660-revert-28656-hmusum/fix-onnx-model-cost
revert-30559-toregge/require-vespa-build-dependencies-for-vespa-devel
toregge/port-to-appleclang
vekterli/change-test-and-set-update-not-found-semantics
yngveaasheim/skeleton-for-component-in-metrics-enum
An engine for low-latency computation over large data sets
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
model-integration
Commit message (
Expand
)
Author
Age
Files
Lines
*
Use var type to handle renaming of jllama interface classes.
Henning Baldersheim
2024-05-16
1
-1
/
+1
*
Avoid methods deprecated in jackson 2.17.1
Henning Baldersheim
2024-05-06
1
-2
/
+2
*
Revert "Update jackson2.vespa.version to v2.17.0"
Henning Baldersheim
2024-05-06
1
-2
/
+2
*
Merge pull request #31120 from vespa-engine/lesters/local-llm-timeout
Harald Musum
2024-05-06
4
-13
/
+57
|
\
|
*
Update ABI spec
Lester Solbakken
2024-05-06
1
-0
/
+2
|
*
Add timeout for requests waiting to start local llm inference
Lester Solbakken
2024-05-06
3
-13
/
+55
*
|
Avoid deprecated methods.
Henning Baldersheim
2024-05-06
1
-2
/
+2
|
/
*
Merge pull request #31049 from vespa-engine/jobergum/add-prepend-embedder-sup...
Bjørn Christian Seime
2024-04-26
2
-1
/
+64
|
\
|
*
add prepend support
Jo Kristian Bergum
2024-04-25
2
-1
/
+64
*
|
Update defaults for local LLM config
Lester Solbakken
2024-04-24
1
-3
/
+3
|
/
*
Revert "Specifically set number of threads to use in llama unit test"
Harald Musum
2024-04-22
1
-4
/
+5
*
Specifically set number of threads to use in llama unit test
Lester Solbakken
2024-04-22
1
-5
/
+4
*
Remove unneccessary import
Lester Solbakken
2024-04-22
1
-1
/
+0
*
Set minimum number of threads to 1
Lester Solbakken
2024-04-22
1
-1
/
+1
*
Disable local LLM unit tests
Lester Solbakken
2024-04-16
1
-1
/
+6
*
Reapply "Lesters/add local llms 2"
Lester Solbakken
2024-04-16
13
-0
/
+957
*
Revert "Lesters/add local llms 2"
Harald Musum
2024-04-15
13
-957
/
+0
*
Reapply "Lesters/add local llms"
Lester Solbakken
2024-04-15
13
-0
/
+957
*
Revert "Lesters/add local llms"
Lester Solbakken
2024-04-15
13
-957
/
+0
*
Merge branch 'master' into lesters/add-local-llms
Lester Solbakken
2024-04-12
9
-23
/
+15
|
\
|
*
Unify on List.of
Henning Baldersheim
2024-04-11
7
-17
/
+11
|
*
Unify on Map.of
Henning Baldersheim
2024-04-11
1
-3
/
+2
*
|
Move LLM client stuff from container-search to model-integration
Lester Solbakken
2024-04-12
13
-0
/
+958
|
/
*
cache more and re-factor
Jo Kristian Bergum
2024-04-08
2
-68
/
+109
*
Key by embedder id and don't recompute inputs
Jon Bratseth
2024-04-07
2
-65
/
+73
*
Add equivalent to `Map.computeIfAbsent()` to simplify typical usage of the cache
Bjørn Christian Seime
2024-04-04
2
-20
/
+3
*
Add caching of onnx inference output using Context cache
Jo Kristian Bergum
2024-04-04
2
-18
/
+55
*
Support for dimensionality flexbility and caching onnx inference output using...
Jo Kristian Bergum
2024-04-04
2
-53
/
+131
*
Add some more tests on the binarization
Jo Kristian Bergum
2024-03-30
2
-2
/
+39
*
relax testing on float strings due to small inference differences in platforms
Jo Kristian Bergum
2024-03-29
1
-5
/
+10
*
fix unwanted import
Jo Kristian Bergum
2024-03-29
1
-1
/
+0
*
Add support for binarization and matryoshka for hf-embedder
Jo Kristian Bergum
2024-03-29
3
-5
/
+140
*
All embedders are the same
Jon Bratseth
2024-02-09
1
-2
/
+2
*
Support embedding into rank 3 tensors
Jon Bratseth
2024-02-02
3
-29
/
+42
*
- Add alternative sparsify implementation using generic tensor.reduce/map.
Henning Baldersheim
2024-01-31
2
-9
/
+52
*
- Put the inner loops in separate methods. This improves ability to inline.
Henning Baldersheim
2024-01-20
2
-54
/
+52
*
Rename getIndex => getDirectIndex
Henning Baldersheim
2024-01-20
1
-1
/
+1
*
Add a class for assist efficient traversal of dimensions in an IndexedTensor.
Henning Baldersheim
2024-01-19
2
-4
/
+9
*
Cache sizes.totalSize() in variable to prevent recomputation.
Henning Baldersheim
2024-01-18
1
-20
/
+19
*
Since both value and log(value) are monotonically increasing for value >= 1,
Henning Baldersheim
2024-01-18
1
-8
/
+8
*
Construct array right away instead of going via a single element list and the...
Henning Baldersheim
2024-01-18
1
-5
/
+15
*
Avoid generic reduce and keep PAD token embedding
Jo Kristian Bergum
2024-01-15
2
-24
/
+47
*
remove extra space
Jo Kristian Bergum
2024-01-11
1
-1
/
+1
*
address review
Jo Kristian Bergum
2024-01-11
2
-43
/
+25
*
Avoid generic reduce to reduce gc pressure
Jo Kristian Bergum
2024-01-11
2
-19
/
+61
*
final
Jo Kristian Bergum
2024-01-06
1
-1
/
+1
*
handle multilingual models better
Jo Kristian Bergum
2024-01-06
3
-65
/
+147
*
Allow mapped 1d tensor for embed expressions
Jo Kristian Bergum
2023-12-17
2
-13
/
+13
*
Add a splade embedder implementation
Jo Kristian Bergum
2023-12-15
5
-0
/
+30962
*
Move Jackson util from vespajlib to container-core.
Henning Baldersheim
2023-11-24
3
-3
/
+3
[next]