index
:
vespa
6
7
andreer/permanent-enclave-flag
aressem/test-dummy
aressem/test-pr-bk
aressem/test-pr-build-3
aressem/test-valgrind
arnej/add-feature-flag
arnej/cosmetic-message-fix
arnej/golang-slime-port-1
arnej/remove-convert-in-calculator
arnej/use-our-shell-quote
arnej/wip-sand-fixups
balder/apply-termwise-filters-on-match-phase-2
balder/cpu-specific-compiles-for-bit-operations
balder/deinline
balder/hosted-always-convert-percentages-in-config-model
balder/no-longer-need-commit-and-wait
balder/prepare-for-hw-specialized-hamming-distance
balder/prepare-for-string_view-1
balder/thread-local-jetty-bytebuffer-pool
balder/update-defaults-for-use-xxx-fetch-postings
balder/zncurve
bjormel/aws-main-controller
bjormel/aws-main-controller-take2
bratseth/grouping-trace
bratseth/linguistics-context-rebased
bratseth/more-exclusive-take-2
bratseth/stem-prefixes
bratseth/streamed-fill
hakonhall/enumerate-all-prod-regions
hakonhall/fix-remembertoupdatesystemflagsdataarchive-javadoc
havardpe/enable-nested-ctf-meta-data
havardpe/extract-default-query-feature-values
havardpe/protoc-gen-csi
interns/languageserver
interns/magnus/symbols
jdk21-preparations
jonmv/allow-private-endpoints-in-dev-perf
jonmv/dependency-inversion-for-mbus-config
jvenstad/utils
kkraune/ci-warning
ldalves/querybuilder
leandroalves/prod-controller
lesters/bert-testing
lesters/external-llms
lesters/stateless-onnx-eval-once
master
mortent/calypso
mortent/new-public-cd-endpoint
mpolden/update-abi
olaa/delete-flags
olaa/otel-config-model
renovate/junit5-monorepo
renovate/major-protobuf.vespa.version
renovate/maven-shade-plugin.vespa.version
renovate/plexus-archiver.vespa.version
revert-26576-revert-26567-bjorncs/cloud-app-validation
revert-26584-revert-26578-bjorncs/tlsv13
revert-27857-bjorncs/tls13
revert-28660-revert-28656-hmusum/fix-onnx-model-cost
revert-30559-toregge/require-vespa-build-dependencies-for-vespa-devel
revert-31846-balder/explicit-string-from-view
vekterli/change-test-and-set-update-not-found-semantics
yngveaasheim/skeleton-for-component-in-metrics-enum
An engine for low-latency computation over large data sets
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
linguistics
/
src
/
test
/
java
/
com
/
yahoo
Commit message (
Expand
)
Author
Age
Files
Lines
*
Compute code points in whole string only when needed
jonmv
2022-12-06
1
-1
/
+14
*
Split out opennlp-linguistics
Henning Baldersheim
2022-11-26
3
-365
/
+0
*
No functional changes
Jon Bratseth
2022-09-11
1
-2
/
+2
*
Determine token types considering all characters
Jon Bratseth
2022-08-16
2
-11
/
+44
*
Expand test case for language detection
Jon Marius Venstad
2021-12-20
1
-3
/
+28
*
Revert "Merge pull request #20578 from vespa-engine/revert-20568-jonmv/replac...
Jon Marius Venstad
2021-12-20
3
-35
/
+82
*
Revert "Replace optimaize with OpenNLP language detector [run-systemtest]"
Jon Marius Venstad
2021-12-18
3
-82
/
+35
*
Re-add files
Jon Marius Venstad
2021-12-18
2
-0
/
+82
*
Replace optimaize with OpenNLP language detector
Jon Marius Venstad
2021-12-17
1
-35
/
+0
*
Time out requests after 200s
Jon Marius Venstad
2021-12-13
1
-1
/
+0
*
Update 2020 Oath copyrights.
gjoranv
2021-10-27
1
-1
/
+1
*
Update Verizon Media copyright notices.
gjoranv
2021-10-07
1
-1
/
+1
*
Update 2017 copyright notices.
gjoranv
2021-10-07
20
-20
/
+20
*
Separate component from linguistics
Jon Bratseth
2021-09-25
3
-197
/
+0
*
Refactor to separate classes
Jon Bratseth
2021-09-17
1
-2
/
+1
*
Encode to sparse tensor
Jon Bratseth
2021-09-16
1
-0
/
+6
*
Encode to dense tensor
Jon Bratseth
2021-09-16
2
-0
/
+23
*
Make SentencePieceEncoder configurable
Jon Bratseth
2021-09-16
3
-30
/
+102
*
More unit tests
Jon Bratseth
2021-09-14
1
-1
/
+20
*
Pure Java sentencepiece implementation
Jon Bratseth
2021-09-13
1
-0
/
+78
*
Revert "Merge pull request #17754 from vespa-engine/revert-17747-bratseth/spe...
Jon Bratseth
2021-05-05
1
-0
/
+40
*
Revert "Reapply "Bratseth/special tokens""
Jon Bratseth
2021-05-05
1
-40
/
+0
*
Revert "Merge pull request #17746 from vespa-engine/revert-17738-revert-17737...
Jon Bratseth
2021-05-05
1
-0
/
+40
*
Revert "Revert "Revert "Bratseth/special tokens"""
Jon Bratseth
2021-05-05
1
-40
/
+0
*
Revert "Revert "Bratseth/special tokens""
Jon Bratseth
2021-05-04
1
-0
/
+40
*
Revert "Bratseth/special tokens"
Jon Bratseth
2021-05-04
1
-40
/
+0
*
Expose tokens as map
Jon Bratseth
2021-05-04
1
-5
/
+3
*
Move specialtokens to linguistics
Jon Bratseth
2021-05-04
1
-0
/
+42
*
No functional changes
Jon Bratseth
2021-04-14
1
-37
/
+26
*
No functional changes
Jon Bratseth
2021-04-14
8
-19
/
+16
*
No functional changes
Jon Bratseth
2021-02-03
1
-0
/
+19
*
handle plugin tokenizer returning tokens with empty original string
Arne Juul
2020-08-24
1
-0
/
+51
*
Minor unification of tests.
Henning Baldersheim
2020-08-12
2
-20
/
+36
*
Surrogate aware gram splitting
Jon Bratseth
2020-06-25
1
-9
/
+37
*
Add/corect copyright headers
Jon Bratseth
2020-01-03
1
-0
/
+1
*
Remove deprecated apis in linguistics.
gjoranv
2019-01-21
1
-27
/
+0
*
Deprecated methods and add OptimaizeDetector
Jon Bratseth
2018-11-01
2
-6
/
+36
*
use com.optimaize.langdetect for lang detection
Jefim Matskin
2018-07-24
1
-0
/
+5
*
add opennlp stemmers - revert previous changes
Jefim Matskin
2018-07-18
3
-5
/
+238
*
add lang detection and opennlp stemmers
Jefim Matskin
2018-07-17
2
-0
/
+6
*
Fix author tag for Simon
Bjørn Christian Seime
2018-07-05
12
-12
/
+12
*
Update copyright headers
Jon Bratseth
2017-06-14
20
-20
/
+20
*
Revert "Update copyright headers"
Jon Bratseth
2017-06-14
20
-20
/
+20
*
Update copyright headers
Jon Bratseth
2017-06-14
20
-20
/
+20
*
Remove carriage return
Jon Bratseth
2017-06-14
1
-1
/
+1
*
Revert "Copyright header"
Jon Bratseth
2017-06-13
20
-21
/
+21
*
Copyright header
Jon Bratseth
2017-06-13
20
-21
/
+21
*
Publish
Jon Bratseth
2016-06-15
20
-0
/
+1485