Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Require replacements to be applied during tokenization | Jon Bratseth | 2021-06-15 | 3 | -12/+11 |
| | |||||
* | Revert "Merge pull request #17754 from ↵ | Jon Bratseth | 2021-05-05 | 6 | -13/+245 |
| | | | | | | | vespa-engine/revert-17747-bratseth/special-tokens-take-2" This reverts commit a2c9cd4bc04f1a3eaa31524b3970b96be5c2eda9, reversing changes made to 8c61a373af0066fbdf1cca354c24b197c7347321. | ||||
* | Revert "Reapply "Bratseth/special tokens"" | Jon Bratseth | 2021-05-05 | 6 | -245/+13 |
| | |||||
* | Revert "Merge pull request #17746 from ↵ | Jon Bratseth | 2021-05-05 | 6 | -13/+245 |
| | | | | | | | vespa-engine/revert-17738-revert-17737-revert-17736-bratseth/special-tokens" This reverts commit 491856b396d003885e159345fe3f533f0fa35933, reversing changes made to 3720186303f4aef1d185525eaf61092097a64ec9. | ||||
* | Revert "Revert "Revert "Bratseth/special tokens""" | Jon Bratseth | 2021-05-05 | 6 | -245/+13 |
| | |||||
* | Revert "Revert "Bratseth/special tokens"" | Jon Bratseth | 2021-05-04 | 6 | -13/+245 |
| | |||||
* | Revert "Bratseth/special tokens" | Jon Bratseth | 2021-05-04 | 6 | -245/+13 |
| | |||||
* | Avoid config in simple tokenizer | Jon Bratseth | 2021-05-04 | 1 | -7/+4 |
| | |||||
* | Expose tokens as map | Jon Bratseth | 2021-05-04 | 2 | -6/+13 |
| | |||||
* | Wire in (but don't use) SpecialTokens | Jon Bratseth | 2021-05-04 | 6 | -18/+40 |
| | |||||
* | Move specialtokens to linguistics | Jon Bratseth | 2021-05-04 | 2 | -0/+206 |
| | |||||
* | No functional changes | Jon Bratseth | 2021-04-14 | 5 | -109/+68 |
| | |||||
* | No functional changes | Jon Bratseth | 2021-02-03 | 1 | -1/+1 |
| | |||||
* | Add a test | Jon Bratseth | 2020-11-12 | 1 | -3/+0 |
| | |||||
* | Use full name in config definition file names | Harald Musum | 2020-09-10 | 1 | -0/+0 |
| | |||||
* | handle plugin tokenizer returning tokens with empty original string | Arne Juul | 2020-08-24 | 1 | -1/+4 |
| | |||||
* | Surrogate aware gram splitting | Jon Bratseth | 2020-06-25 | 1 | -24/+85 |
| | |||||
* | SpareCapacityMaintainer sketch | Jon Bratseth | 2020-06-12 | 6 | -66/+35 |
| | |||||
* | variables in lambdas must be final | Arne Juul | 2020-04-24 | 2 | -10/+16 |
| | |||||
* | Apply suggestions from code review | Arne H Juul | 2020-04-24 | 3 | -7/+7 |
| | | | Co-Authored-By: Jon Bratseth <bratseth@oath.com> | ||||
* | add more tracing and debug logging of stemming | Arne Juul | 2020-04-24 | 4 | -1/+25 |
| | |||||
* | Add/corect copyright headers | Jon Bratseth | 2020-01-03 | 1 | -0/+1 |
| | |||||
* | Build tensors purely with floats | Jon Bratseth | 2019-04-26 | 1 | -1/+1 |
| | |||||
* | Move bound builder double array into double subclass | Jon Bratseth | 2019-04-26 | 1 | -1/+1 |
| | |||||
* | Allow destructive changes in manually deployed zones | Jon Bratseth | 2019-04-01 | 1 | -1/+1 |
| | |||||
* | Nonfunctional changes only | Jon Bratseth | 2019-01-24 | 1 | -1/+1 |
| | |||||
* | Generate html5 javadoc | gjoranv | 2019-01-21 | 1 | -7/+7 |
| | |||||
* | Remove deprecated method (again) | Jon Bratseth | 2019-01-21 | 2 | -16/+0 |
| | |||||
* | Make SimpleLinguistics simple again | Jon Bratseth | 2019-01-21 | 4 | -104/+32 |
| | | | | | - Remove SimpleLinguistics config and optional use of Optimaize - Add Optimaize to OpennlpLinguistics; on by default and config to disable | ||||
* | Remove deprecated apis in linguistics. | gjoranv | 2019-01-21 | 3 | -41/+0 |
| | |||||
* | Deprecated methods and add OptimaizeDetector | Jon Bratseth | 2018-11-01 | 6 | -0/+125 |
| | |||||
* | Prepare for removal of deprecated members | Jon Bratseth | 2018-10-16 | 3 | -4/+8 |
| | |||||
* | Reduce code duplication | Henning Baldersheim | 2018-10-05 | 2 | -15/+14 |
| | |||||
* | Do not create huge optimaize structures when not necessary. | Henning Baldersheim | 2018-10-05 | 2 | -1/+9 |
| | |||||
* | Add copyright header | Jon Bratseth | 2018-10-01 | 2 | -0/+2 |
| | |||||
* | Defer loading the huge optimaize knowledgepool until you really need it. ↵ | Henning Baldersheim | 2018-09-10 | 1 | -20/+32 |
| | | | | This cuts min memory footprint by 100MB+. | ||||
* | Send global constants | Jon Bratseth | 2018-09-06 | 3 | -1/+4 |
| | |||||
* | Add missing newline at end of file | Bjørn Christian Seime | 2018-07-26 | 1 | -1/+2 |
| | |||||
* | Add config for simple-linguistics | Bjørn Christian Seime | 2018-07-26 | 3 | -8/+45 |
| | | | | Add a config parameter for enabling/disabling optimaize detector | ||||
* | Merge pull request #6452 from jefimm/master | gjoranv | 2018-07-25 | 1 | -2/+52 |
|\ | | | | | use com.optimaize.langdetect for lang detection | ||||
| * | use com.optimaize.langdetect for lang detection | Jefim Matskin | 2018-07-24 | 1 | -2/+52 |
| | | |||||
* | | Export package com.yahoo.language.opennlp. | gjoranv | 2018-07-24 | 1 | -0/+5 |
|/ | |||||
* | Merge pull request #6444 from vespa-engine/bratseth/java-model-inference | Jon Bratseth | 2018-07-23 | 1 | -1/+6 |
|\ | | | | | Bratseth/java model inference | ||||
| * | Java model evaluation WIP | Jon Bratseth | 2018-07-20 | 1 | -1/+6 |
| | | |||||
* | | add opennlp stemmers - revert previous changes | Jefim Matskin | 2018-07-18 | 4 | -118/+153 |
| | | | | | | | | https://github.com/vespa-engine/vespa/issues/6403 | ||||
* | | add lang detection and opennlp stemmers | Jefim Matskin | 2018-07-17 | 1 | -3/+7 |
| | | | | | | | | https://github.com/vespa-engine/vespa/issues/6403 | ||||
* | | add lang detection and opennlp stemmers | Jefim Matskin | 2018-07-17 | 1 | -3/+2 |
| | | | | | | | | https://github.com/vespa-engine/vespa/issues/6403 | ||||
* | | add lang detection and opennlp stemmers | Jefim Matskin | 2018-07-17 | 2 | -6/+114 |
| | | | | | | | | https://github.com/vespa-engine/vespa/issues/6403 | ||||
* | | Fix author tag for Simon | Bjørn Christian Seime | 2018-07-05 | 8 | -8/+8 |
|/ | |||||
* | Merge pull request #6228 from vespa-engine/bratseth/nonfunctional-changes | gjoranv | 2018-06-19 | 3 | -4/+4 |
|\ | | | | | Nonfunctional changes only |