summaryrefslogtreecommitdiffstats
path: root/linguistics/src/main/java/com/yahoo/language/process/GramSplitter.java
Commit message (Collapse)AuthorAgeFilesLines
* Revert "Merge pull request #29328 from ↵Jon Bratseth2023-11-141-2/+1
| | | | | | | vespa-engine/revert-29314-bratseth/casing-take-2" This reverts commit a72e949533a46d665440a9c72ca2b8fb58f3a9c3, reversing changes made to 944d635d00e165166508ef23399e9ed65a87a9c8.
* Revert "Bratseth/casing take 2"Harald Musum2023-11-131-1/+2
|
* Revert "Revert "Don't lowercase linguistics annotations""Jon Bratseth2023-11-091-2/+1
| | | | This reverts commit 0dfd4fe4c6ddbded490da36e71f27c4b70aa4226.
* Revert "Don't lowercase linguistics annotations"Jon Bratseth2023-11-091-1/+2
|
* Don't lowercase linguistics annotationsJon Bratseth2023-11-091-2/+1
| | | | | | Tokens are already lowercased by our bundled linguistics components. Lowercasing again when annotating precludes plugging in a lingustics component which preserves casing.
* Update copyrightJon Bratseth2023-10-091-1/+1
|
* Always treat each symbol as a separate tokenJon Bratseth2023-05-221-16/+24
|
* Compute code points in whole string only when neededjonmv2022-12-061-5/+3
|
* Update 2017 copyright notices.gjoranv2021-10-071-1/+1
|
* No functional changesJon Bratseth2021-02-031-1/+1
|
* Surrogate aware gram splittingJon Bratseth2020-06-251-24/+85
|
* SpareCapacityMaintainer sketchJon Bratseth2020-06-121-58/+25
|
* Reduce code duplicationHenning Baldersheim2018-10-051-1/+6
|
* Update copyright headersJon Bratseth2017-06-141-1/+1
|
* Revert "Update copyright headers"Jon Bratseth2017-06-141-1/+1
|
* Update copyright headersJon Bratseth2017-06-141-1/+1
|
* Revert "Copyright header"Jon Bratseth2017-06-131-1/+1
|
* Copyright headerJon Bratseth2017-06-131-1/+1
|
* PublishJon Bratseth2016-06-151-0/+222