summaryrefslogtreecommitdiffstats
path: root/linguistics
Commit message (Expand)AuthorAgeFilesLines
* Update ABI spec format, and update all specsjonmv2022-10-251-198/+198
* much simpler CharSequenceNormalizerArne Juul2022-10-063-9/+100
* Merge pull request #24007 from vespa-engine/bratseth/cleanup-082Jon Bratseth2022-09-252-13/+11
|\
| * No functional changesJon Bratseth2022-09-112-13/+11
* | Make validation messages clearer given multiple instancesJon Bratseth2022-09-151-2/+0
|/
* bump protoc versionArne Juul2022-08-271-4/+0
* Determine token types considering all charactersJon Bratseth2022-08-166-119/+133
* Set project version to 8-SNAPSHOTgjoranv2022-06-081-2/+2
* Remove on Vespa 8Jon Bratseth2022-06-082-10/+1
* Use '@Inject' from 'annotations' in multiple bundlesBjørn Christian Seime2022-05-062-2/+2
* Resolve rank profile inputsJon Bratseth2022-04-211-1/+1
* Update abi-specLester Solbakken2022-03-221-1/+1
* Rename defaultEmbedderName to defaultEmbedderIdLester Solbakken2022-03-221-2/+2
* Add convenience function to represent embedder as mapLester Solbakken2022-03-212-3/+30
* Stem by linguistics in rule basesJon Bratseth2022-01-102-3/+21
* unify java warnings (use compiler args from parent)Arne H Juul2022-01-061-8/+0
* annotate intentional switch fallthroughArne H Juul2022-01-061-0/+1
* Specify how the class is actually loadedJon Marius Venstad2021-12-211-1/+1
* Provide array of correct size.Jon Marius Venstad2021-12-201-1/+1
* Override ngram creation with something less sillyJon Marius Venstad2021-12-202-1/+32
* Use smaller chunks for faster detectionJon Marius Venstad2021-12-201-2/+2
* Expand test case for language detectionJon Marius Venstad2021-12-201-3/+28
* Upper bound on input size, and use opennlp before simple detectorJon Marius Venstad2021-12-201-6/+3
* Avoid putting nulls in languange mapJon Marius Venstad2021-12-201-2/+5
* Revert "Merge pull request #20578 from vespa-engine/revert-20568-jonmv/replac...Jon Marius Venstad2021-12-2013-176/+246
* Revert "Replace optimaize with OpenNLP language detector [run-systemtest]"Jon Marius Venstad2021-12-1813-246/+176
* Re-add filesJon Marius Venstad2021-12-185-0/+142
* Move model to module where it is needed, to simplify, at the cost of larger b...Jon Marius Venstad2021-12-183-22/+21
* Replace UrlcharSequenceNormalizer with one with an improved regexJon Marius Venstad2021-12-171-6/+0
* Mockito test scopeJon Marius Venstad2021-12-171-0/+1
* Add some javadoc, and no need to handle null return for modelJon Marius Venstad2021-12-172-2/+4
* Replace optimaize with OpenNLP language detectorJon Marius Venstad2021-12-178-170/+102
* Add a BERT embedderJon Bratseth2021-12-161-2/+3
* Time out requests after 200sJon Marius Venstad2021-12-131-1/+0
* Update 2020 Oath copyrights.gjoranv2021-10-272-2/+2
* Update 2019 Oath copyrights.gjoranv2021-10-271-1/+1
* Update Verizon Media copyright notices.gjoranv2021-10-073-3/+3
* Update 2018 copyright notices.gjoranv2021-10-073-3/+3
* Update 2017 copyright notices.gjoranv2021-10-0770-70/+70
* Encapsulate in a contextJon Bratseth2021-10-012-16/+65
* Pass destinationJon Bratseth2021-09-302-8/+14
* encode -> embedJon Bratseth2021-09-283-64/+64
* Separate component from linguisticsJon Bratseth2021-09-2517-1206/+0
* Linguistics cleanupJon Bratseth2021-09-2118-45/+29
* Add 'encode' expressionJon Bratseth2021-09-192-1/+35
* Provide a (non-working) encoder by defaultJon Bratseth2021-09-171-1/+1
* Update ABI specJon Bratseth2021-09-171-20/+46
* CleanupJon Bratseth2021-09-175-9/+2
* Refactor to separate classesJon Bratseth2021-09-178-203/+279
* Encoder interfaceJon Bratseth2021-09-163-4/+55