summaryrefslogtreecommitdiffstats
path: root/indexinglanguage
Commit message (Collapse)AuthorAgeFilesLines
* Set project version to 8-SNAPSHOTgjoranv2022-06-081-2/+2
|
* Don't embed annotations in osgi bundlesBjørn Christian Seime2022-05-041-0/+6
|
* switch to new-style configArne H Juul2022-04-124-1342/+1108
|
* Remove unnecessary shade-plugingjoranv2022-04-081-22/+0
| | | | - The fat jar is not installed or referred to anywhere.
* Implement create(String value)Henning Baldersheim2022-03-311-3/+2
|
* Add embedder selection argument to indexing languageLester Solbakken2022-03-2111-52/+141
|
* Add function ownerJon Bratseth2022-02-141-1/+0
|
* Type inference where the output type is an arrayJon Bratseth2022-02-095-28/+110
|
* CleanupJon Bratseth2022-02-063-15/+17
|
* Add hash functionJon Bratseth2022-02-044-4/+148
|
* Improve maven-shade-plugin filter, and move to parent pom.gjoranv2022-02-031-11/+0
| | | | - Removes 223 build warnings (out of 562 for building non-test code)
* Revert "Exclude META-INF files and module-info.class from uber jars."Harald Musum2022-02-021-2/+4
|
* Exclude META-INF files and module-info.class from uber jars.gjoranv2022-02-021-4/+2
| | | | - Generate countless warnings for duplicates and breaking encapsulation.
* unify java warnings (use compiler args from parent)Arne H Juul2022-01-061-8/+0
|
* suppress varargs warningArne H Juul2022-01-061-0/+1
|
* Revert "Merge pull request #20578 from ↵Jon Marius Venstad2021-12-201-0/+5
| | | | | | | vespa-engine/revert-20568-jonmv/replace-optimaize-with-lingua" This reverts commit 5476504932cd90eb2dad82dbab633e3ffa2034c3, reversing changes made to 235a78cc4707f78d18c6818a577de1b7507f5e40.
* Revert "Replace optimaize with OpenNLP language detector [run-systemtest]"Jon Marius Venstad2021-12-181-5/+0
|
* Fix dependenciesJon Marius Venstad2021-12-171-0/+5
|
* Update Verizon Media copyright notices.gjoranv2021-10-074-4/+4
|
* Update 2018 copyright notices.gjoranv2021-10-071-1/+1
|
* Update 2017 copyright notices.gjoranv2021-10-07184-184/+184
|
* Encapsulate in a contextJon Bratseth2021-10-012-4/+6
|
* Pass destinationJon Bratseth2021-09-3010-14/+72
| | | | | This allows embedders to switch on it to enable bucket testing and similar.
* encode -> embedJon Bratseth2021-09-2811-62/+53
|
* Set statement output explicitlyJon Bratseth2021-09-265-0/+41
|
* Remove outputType (not needed after all)Jon Bratseth2021-09-192-25/+8
|
* Add 'encode' expressionJon Bratseth2021-09-1919-30/+206
|
* Set output typeJon Bratseth2021-09-172-9/+24
|
* Non-functional changes onlyJon Bratseth2021-09-1754-100/+93
|
* Non-functional changes onlyJon Bratseth2021-09-1753-152/+162
|
* No functional changesJon Bratseth2021-09-1719-12/+32
|
* we want to compare Linguistics objects for equivalenceArne Juul2021-08-041-0/+3
|
* avoid depending on Linguistics equals()Arne Juul2021-08-031-1/+7
|
* do not depend on Linguistics equals()Arne Juul2021-08-031-1/+1
| | | | | | * a Linguistics object typically has no state and does not implement the semantics we want for equals() method. * alternately, we could remove this check.
* don't call accentDrop at all for empty inputArne Juul2021-07-162-2/+28
|
* try to trap spurious failureArne Juul2021-07-132-0/+65
| | | | | | * we have seen spurious failures when verifying output from accent dropping; so far nothing reproducible, so add some extra logging and retry once if it happens (in case it's some kind of race-condition glitch).
* Add a testJon Bratseth2021-07-022-8/+6
|
* No functional changesJon Bratseth2021-06-281-11/+10
|
* Require replacements to be applied during tokenizationJon Bratseth2021-06-152-28/+13
|
* Support literal 'true' and 'false'Jon Bratseth2021-04-309-3/+147
|
* Support to_boolJon Bratseth2021-04-307-2/+152
|
* Avoid changing the input string field value by cloning it before doing changes.Geir Storli2021-01-071-5/+8
| | | | | This is needed as the input can be used by other expressions as well. The same cloning is done in ExactExpression and TokenizeExpression.
* Merge pull request #13709 from ↵Jon Bratseth2020-06-261-2/+2
|\ | | | | | | | | vespa-engine/bratseth/surrogate-aware-gram-splitting Surrogate aware gram splitting
| * Surrogate aware gram splittingJon Bratseth2020-06-251-2/+2
| |
* | Handle an existing spantree in exactmatchJon Bratseth2020-06-242-6/+13
|/ | | | | This may happen if a field which is indexed is used as an input for another field indexed as exact match.
* Disallow unbound tensor dimensions in document fieldsJon Bratseth2020-06-082-0/+3
|
* Remove use of apache commons libraries in indexinglanguageBjørn Christian Seime2020-01-034-11/+7
|
* Preserve array updates with element index matching in indexing docprocTor Brede Vekterli2019-12-044-5/+102
| | | | | | The resulting `MapValueUpdate` would previously be constructed with the wrong type for the index, causing an exception to be thrown and for the update to fail entirely.
* Preserve AssignValueUpdates to struct fields in indexing docprocTor Brede Vekterli2019-12-022-6/+28
| | | | | | | Would previously be rewritten as MapValueUpdates for unknown reasons. This wouldn't actually work, as an exception would be thrown during serialization when sanity checking code figured out that the MapValue update was attempted used for a non-array/wset field data type.
* doc: -> id:Henning Baldersheim2019-08-163-11/+11
|