summaryrefslogtreecommitdiffstats
path: root/linguistics
Commit message (Collapse)AuthorAgeFilesLines
...
* Encode to sparse tensorJon Bratseth2021-09-163-0/+17
|
* Encode to dense tensorJon Bratseth2021-09-163-4/+36
|
* Use a result builderJon Bratseth2021-09-161-21/+53
|
* Make SentencePieceEncoder configurableJon Bratseth2021-09-166-56/+283
|
* Merge pull request #19130 from vespa-engine/bratseth/sp-exportJo Kristian Bergum2021-09-142-0/+79
|\ | | | | Make public
| * Add to abi specJon Bratseth2021-09-141-0/+72
| |
| * Make publicJon Bratseth2021-09-141-0/+7
| |
* | Merge pull request #19131 from vespa-engine/bratseth/sp-simplifyJon Bratseth2021-09-141-13/+7
|\ \ | | | | | | Slight algorithm simplification
| * | Slight algorithm simplificationJon Bratseth2021-09-141-6/+4
| | |
| * | Slight algorithm simplificationJon Bratseth2021-09-141-6/+3
| | |
| * | Slight algorithm simplificationJon Bratseth2021-09-141-11/+10
| |/
* / More unit testsJon Bratseth2021-09-141-1/+20
|/
* Pure Java sentencepiece implementationJon Bratseth2021-09-137-2/+731
|
* we want to compare Linguistics objects for equivalenceArne Juul2021-08-044-1/+9
|
* Require replacements to be applied during tokenizationJon Bratseth2021-06-153-12/+11
|
* Revert "Merge pull request #17754 from ↵Jon Bratseth2021-05-058-13/+336
| | | | | | | vespa-engine/revert-17747-bratseth/special-tokens-take-2" This reverts commit a2c9cd4bc04f1a3eaa31524b3970b96be5c2eda9, reversing changes made to 8c61a373af0066fbdf1cca354c24b197c7347321.
* Revert "Reapply "Bratseth/special tokens""Jon Bratseth2021-05-058-336/+13
|
* Revert "Merge pull request #17746 from ↵Jon Bratseth2021-05-058-13/+336
| | | | | | | vespa-engine/revert-17738-revert-17737-revert-17736-bratseth/special-tokens" This reverts commit 491856b396d003885e159345fe3f533f0fa35933, reversing changes made to 3720186303f4aef1d185525eaf61092097a64ec9.
* Revert "Revert "Revert "Bratseth/special tokens"""Jon Bratseth2021-05-058-336/+13
|
* Revert "Revert "Bratseth/special tokens""Jon Bratseth2021-05-048-13/+336
|
* Revert "Bratseth/special tokens"Jon Bratseth2021-05-048-336/+13
|
* Avoid config in simple tokenizerJon Bratseth2021-05-041-7/+4
|
* Expose tokens as mapJon Bratseth2021-05-044-12/+17
|
* Wire in (but don't use) SpecialTokensJon Bratseth2021-05-046-18/+40
|
* Move specialtokens to linguisticsJon Bratseth2021-05-044-0/+299
|
* No functional changesJon Bratseth2021-04-141-37/+26
|
* No functional changesJon Bratseth2021-04-1413-128/+84
|
* No functional changesJon Bratseth2021-02-032-1/+20
|
* Add a testJon Bratseth2020-11-121-3/+0
|
* Allow no argument to install_config_definitionsHarald Musum2020-09-121-1/+1
| | | | | Use src/main/resources/configdefinitions as default when no argument is given
* Use full name in config definition file namesHarald Musum2020-09-102-1/+1
|
* handle plugin tokenizer returning tokens with empty original stringArne Juul2020-08-242-1/+55
|
* Minor unification of tests.Henning Baldersheim2020-08-123-25/+36
|
* Update ABI specJon Bratseth2020-06-261-1/+2
|
* Surrogate aware gram splittingJon Bratseth2020-06-252-33/+122
|
* SpareCapacityMaintainer sketchJon Bratseth2020-06-126-66/+35
|
* variables in lambdas must be finalArne Juul2020-04-242-10/+16
|
* Apply suggestions from code reviewArne H Juul2020-04-243-7/+7
| | | Co-Authored-By: Jon Bratseth <bratseth@oath.com>
* add more tracing and debug logging of stemmingArne Juul2020-04-244-1/+25
|
* Add/corect copyright headersJon Bratseth2020-01-032-0/+2
|
* Install more config definition files.Tor Egge2019-11-291-0/+2
|
* Build tensors purely with floatsJon Bratseth2019-04-261-1/+1
|
* Move bound builder double array into double subclassJon Bratseth2019-04-261-1/+1
|
* Allow destructive changes in manually deployed zonesJon Bratseth2019-04-011-1/+1
|
* Nonfunctional changes onlyJon Bratseth2019-01-241-1/+1
|
* Update abi specs for 7.gjoranv2019-01-211-8/+3
|
* Generate html5 javadocgjoranv2019-01-211-7/+7
|
* Remove deprecated method (again)Jon Bratseth2019-01-212-16/+0
|
* Make SimpleLinguistics simple againJon Bratseth2019-01-214-104/+32
| | | | | - Remove SimpleLinguistics config and optional use of Optimaize - Add Optimaize to OpennlpLinguistics; on by default and config to disable
* 6-SNAPSHOT -> 7-SNAPSHOTArnstein Ressem2019-01-211-2/+2
|