aboutsummaryrefslogtreecommitdiffstats
path: root/indexinglanguage
Commit message (Collapse)AuthorAgeFilesLines
* unify java warnings (use compiler args from parent)Arne H Juul2022-01-061-8/+0
|
* suppress varargs warningArne H Juul2022-01-061-0/+1
|
* Revert "Merge pull request #20578 from ↵Jon Marius Venstad2021-12-201-0/+5
| | | | | | | vespa-engine/revert-20568-jonmv/replace-optimaize-with-lingua" This reverts commit 5476504932cd90eb2dad82dbab633e3ffa2034c3, reversing changes made to 235a78cc4707f78d18c6818a577de1b7507f5e40.
* Revert "Replace optimaize with OpenNLP language detector [run-systemtest]"Jon Marius Venstad2021-12-181-5/+0
|
* Fix dependenciesJon Marius Venstad2021-12-171-0/+5
|
* Update Verizon Media copyright notices.gjoranv2021-10-074-4/+4
|
* Update 2018 copyright notices.gjoranv2021-10-071-1/+1
|
* Update 2017 copyright notices.gjoranv2021-10-07184-184/+184
|
* Encapsulate in a contextJon Bratseth2021-10-012-4/+6
|
* Pass destinationJon Bratseth2021-09-3010-14/+72
| | | | | This allows embedders to switch on it to enable bucket testing and similar.
* encode -> embedJon Bratseth2021-09-2811-62/+53
|
* Set statement output explicitlyJon Bratseth2021-09-265-0/+41
|
* Remove outputType (not needed after all)Jon Bratseth2021-09-192-25/+8
|
* Add 'encode' expressionJon Bratseth2021-09-1919-30/+206
|
* Set output typeJon Bratseth2021-09-172-9/+24
|
* Non-functional changes onlyJon Bratseth2021-09-1754-100/+93
|
* Non-functional changes onlyJon Bratseth2021-09-1753-152/+162
|
* No functional changesJon Bratseth2021-09-1719-12/+32
|
* we want to compare Linguistics objects for equivalenceArne Juul2021-08-041-0/+3
|
* avoid depending on Linguistics equals()Arne Juul2021-08-031-1/+7
|
* do not depend on Linguistics equals()Arne Juul2021-08-031-1/+1
| | | | | | * a Linguistics object typically has no state and does not implement the semantics we want for equals() method. * alternately, we could remove this check.
* don't call accentDrop at all for empty inputArne Juul2021-07-162-2/+28
|
* try to trap spurious failureArne Juul2021-07-132-0/+65
| | | | | | * we have seen spurious failures when verifying output from accent dropping; so far nothing reproducible, so add some extra logging and retry once if it happens (in case it's some kind of race-condition glitch).
* Add a testJon Bratseth2021-07-022-8/+6
|
* No functional changesJon Bratseth2021-06-281-11/+10
|
* Require replacements to be applied during tokenizationJon Bratseth2021-06-152-28/+13
|
* Support literal 'true' and 'false'Jon Bratseth2021-04-309-3/+147
|
* Support to_boolJon Bratseth2021-04-307-2/+152
|
* Avoid changing the input string field value by cloning it before doing changes.Geir Storli2021-01-071-5/+8
| | | | | This is needed as the input can be used by other expressions as well. The same cloning is done in ExactExpression and TokenizeExpression.
* Merge pull request #13709 from ↵Jon Bratseth2020-06-261-2/+2
|\ | | | | | | | | vespa-engine/bratseth/surrogate-aware-gram-splitting Surrogate aware gram splitting
| * Surrogate aware gram splittingJon Bratseth2020-06-251-2/+2
| |
* | Handle an existing spantree in exactmatchJon Bratseth2020-06-242-6/+13
|/ | | | | This may happen if a field which is indexed is used as an input for another field indexed as exact match.
* Disallow unbound tensor dimensions in document fieldsJon Bratseth2020-06-082-0/+3
|
* Remove use of apache commons libraries in indexinglanguageBjørn Christian Seime2020-01-034-11/+7
|
* Preserve array updates with element index matching in indexing docprocTor Brede Vekterli2019-12-044-5/+102
| | | | | | The resulting `MapValueUpdate` would previously be constructed with the wrong type for the index, causing an exception to be thrown and for the update to fail entirely.
* Preserve AssignValueUpdates to struct fields in indexing docprocTor Brede Vekterli2019-12-022-6/+28
| | | | | | | Would previously be rewritten as MapValueUpdates for unknown reasons. This wouldn't actually work, as an exception would be thrown during serialization when sanity checking code figured out that the MapValue update was attempted used for a non-array/wset field data type.
* doc: -> id:Henning Baldersheim2019-08-163-11/+11
|
* doc: -> id:Henning Baldersheim2019-08-166-21/+21
|
* Merge pull request #9969 from vespa-engine/geirst/more-robust-ngram-expressionArne H Juul2019-07-082-0/+20
|\ | | | | Ensure that NGramExpression can be executed multiple times, where spa…
| * Ensure that NGramExpression can be executed multiple times, where span tree ↵Geir Storli2019-07-052-0/+20
| | | | | | | | | | | | | | from first execution is used. This can happen when we have multiple UpdateAdapters (e.g. regular updates + field path updates) and then all scripts are executed per adapter in Expression.execute().
* | Nonfunctional changes onlyJon Bratseth2019-07-031-0/+1
|/
* Assert that TensorRemoveUpdate passes through indexinglanguageLester Solbakken2019-02-153-7/+61
|
* Test that TensorModifyUpdate and TensorAddUpdate pass through unmodified.Geir Storli2019-02-142-2/+62
|
* Pass TensorAddUpdate right through indexing docproc.Geir Storli2019-02-112-1/+6
|
* Pass TensorModifyUpdate right through indexing docproc.Geir Storli2019-02-012-0/+5
|
* Make SimpleLinguistics simple againJon Bratseth2019-01-213-3/+3
| | | | | - Remove SimpleLinguistics config and optional use of Optimaize - Add Optimaize to OpennlpLinguistics; on by default and config to disable
* 6-SNAPSHOT -> 7-SNAPSHOTArnstein Ressem2019-01-211-2/+2
|
* Update to latest ph-javacc-maven-plugin.gjoranv2018-12-061-6/+7
| | | | | | | | | - Update FastCharStream method names to follow changes in generated CharStream class. - Update expected test output due to improvements in generated code. - Improve ability to debug tests by more clearly printing out the diff between expected and actual output.
* Manage ph-javacc-maven-plugin config in parent pom.gjoranv2018-12-061-9/+0
| | | | + Remove unused management for old javacc plugin.
* Deprecated methods and add OptimaizeDetectorJon Bratseth2018-11-013-0/+3
|