aboutsummaryrefslogtreecommitdiffstats
path: root/vespajlib/src/main/java/com/yahoo/text
Commit message (Collapse)AuthorAgeFilesLines
* Add utility for writing XML from a DOMjonmv2024-05-231-0/+23
|
* Fix more xxe preventionMorten Tokle2023-12-191-0/+15
|
* Use code point count for truncate as welljonmv2023-10-201-13/+4
|
* Avoid cutting surrogate pairs when tokenisingjonmv2023-10-201-2/+12
|
* Merge pull request #28820 from vespa-engine/jonmv/text-character-validationJon Marius Venstad2023-10-101-4/+3
|\ | | | | Jonmv/text character validation
| * 0xFFFE, 0xFFFF and stand-alone low surrogates are not valid textjonmv2023-10-061-4/+3
| |
* | Update copyrightJon Bratseth2023-10-0930-30/+30
|/
* Handle the exception that will come when codepoints < 'from' < len.Henning Baldersheim2023-09-181-1/+6
| | | | Already done and tested for 'to'.
* - Add utility to do substring extraction by codepoints, instead of java ↵Henning Baldersheim2023-09-151-0/+19
| | | | | | char index. - Test and use it in SubstringExpression in indeing language.
* Add helper `XML.getChildValue()`Bjørn Christian Seime2023-06-151-0/+7
|
* Revert "Merge pull request #25776 from ↵jonmv2023-01-301-11/+12
| | | | | | | vespa-engine/jonmv/revert-private-endpoints" This reverts commit 350b36dd88baef7548c0066b01ea1e328eb78f3f, reversing changes made to 8a006bc9ca202713ec54c7961a9256790c87d10d.
* Revert "Merge pull request #25770 from vespa-engine/jonmv/private-endpoints"jonmv2023-01-271-12/+11
| | | | | This reverts commit a3ae8f5b0ec3a7f2f3c9205289470dbb89e477ff, reversing changes made to 6534f02466a8958513a8b8684cc2a4369fab7666.
* Check private endpoints from same-zone config server insteadjonmv2023-01-271-11/+12
|
* Test direct renderingJon Bratseth2023-01-161-0/+4
|
* Use recommended and most efficient way to create a string from a byte array ↵Henning Baldersheim2023-01-021-10/+9
| | | | outputstream
* Reduce the simple usage of guava where java has caught upHenning Baldersheim2022-11-301-6/+4
|
* Add 'model' config typeJon Bratseth2022-08-311-2/+2
|
* CleanupJon Bratseth2022-08-241-280/+215
|
* Test model id lookupJon Bratseth2022-08-241-6/+3
|
* use namespace unaware parser for bundle validator MERGEOK (#23585)Andreas Eriksen2022-08-051-2/+26
| | | | | * use namespace unaware parser for bundle validator * update abi spec
* Disable external entity processingMartin Polden2022-07-011-1/+6
|
* Disallow xincludeMartin Polden2022-07-011-12/+13
|
* Also allow 0xFFFE and 0xFFFFHenning Baldersheim2022-04-071-0/+1
|
* Add simplification of long if based on tip from @jonmv.Henning Baldersheim2022-04-061-32/+2
|
* Better naming and skip empty else.Henning Baldersheim2022-04-061-5/+4
|
* Skip unnecessary surrogate check if ! isBmpCodePoint, and move high ↵Henning Baldersheim2022-04-061-9/+4
| | | | surrogate check inside isTextCharacter.
* Avoid extra check for isValidCodePoint.Henning Baldersheim2022-04-061-4/+1
|
* Refactor to optimize for hot path.Henning Baldersheim2022-04-061-7/+19
| | | | | No reason to count characters when you already now there can only be one. Nor is there any reason to check for surrogates when there can be none.
* Update abi and split in hot/cold path.Henning Baldersheim2022-04-051-1/+5
|
* Use a primitive to see if that makes the JIT compiler more predictable.Henning Baldersheim2022-04-051-0/+17
|
* Update abi-spec, and some more non-functional changesJon Marius Venstad2022-03-181-2/+2
|
* Defer public API decisionMartin Polden2022-01-282-1/+9
|
* Move SnippetGenerator to vespajlibMartin Polden2022-01-281-0/+34
|
* Cleanup; No functional changesJon Bratseth2022-01-0718-64/+52
|
* Update 2019 Oath copyrights.gjoranv2021-10-272-2/+2
|
* Update 2017 copyright notices.gjoranv2021-10-0726-26/+26
|
* Text.fmt -> Text.formatArne Juul2021-06-291-1/+1
|
* add convenience function wrapping "String.format(Locale.US, ...)"Arne Juul2021-06-291-0/+4
|
* - Add option to provide token map while construction wand/wset/dotproduct.Henning Baldersheim2021-03-271-3/+0
| | | | - Parse to number directly if possible.
* Revert "Remove unused Utf8 methods"Harald Musum2020-10-052-7/+21
|
* Use StandardCharsetsMartin Polden2020-10-051-8/+6
|
* Remove unused Utf8 methodsMartin Polden2020-10-051-13/+1
| | | | In-lined in the tests that used them.
* Remove custom Utf8.toBytes implementationMartin Polden2020-10-051-23/+5
| | | | `String` optimizations have caught up.
* Remove custom Utf8.toString implementationMartin Polden2020-10-021-21/+4
| | | | | | | | | | | | | | | | | | | `String::new` is now faster for both ASCII and Unicode strings: ``` Utf8::toString of ascii string took 132 ms String::new of ascii string took 59 ms Change = -55.30% Utf8::toString of unicode string took 410 ms String::new of unicode string took 280 ms Change = -31.71% ``` There's at least two reasons for this: * Java 9 introduced compact strings, which means that `String` is now backed by a byte array to reduce the memory footprint of ASCII strings. * Detection of Unicode strings may use HotSpot intrinsics.
* utility -> UtilityArne Juul2020-07-151-2/+2
|
* More descriptive message when mistyping flag data fieldHåkon Hallingstad2020-04-161-1/+2
|
* Avoid new dependency on jacksonHåkon Hallingstad2020-04-151-10/+7
|
* Ignore comment fieldsHåkon Hallingstad2020-04-141-1/+1
|
* Fix spelling errorHåkon Hallingstad2020-04-081-1/+1
|
* Validate deserialization of flag dataHåkon Hallingstad2020-04-081-1/+22
|