Up to [cvs.NetBSD.org] / pkgsrc / textproc / icu
Request diff between arbitrary revisions
Keyword substitution: kv
Default branch: MAIN
icu: update to 76.1 again, now that fallout is/can be fixed. bulk-test-icu has same failures as with 75.1 except for webkit-gtk, for which I'll commit a fix next.
icu: downgrade to 75.1 The world is not ready for 76.1 yet: - c++17 requirement (I think) - more libraries need to be explicitly listed
*: recursive bump for icu 76.1 shlib bump
icu: update BUILDLINK_ABI_DEPENDS to 75.1 Ride revbump from earlier that reflected this change.
*: recursive bump for icu 74.1
*: recursive bump for Python 3.11 as new default
icu: reflect solib major bump to 73 Ride PKGREVISION bump already made earlier.
icu: reset revision
massive revision bump after textproc/icu update
icu: reset revision
revbump for textproc/icu update
icu: properly bump the bl3 for version 70 solib major I'll let someone else decide if the recursive revbump needs to be redone or if this can ride the one that took place 8 hours ago.
revbump for textproc/icu
icu: reflect shlib major bump Ride PKGREVISION bump already made earlier.
icu: Build dependencies with an equivalent gcc version Should avoid linker errors such as libicuuc.so: undefined reference to `__cxa_throw_bad_array_new_length@CXXABI_1.3.8'
reset revision
Revbump for icu
icu: bump BUILDLINK_ABI_DEPENDS
Update to 64.1 Changelog: Common Changes Unicode 12: 554 new characters, including 4 new scripts and 61 new emoji characters. CLDR 35 Somali and Javanese data now up to modern level Cebuano, Hausa, Igbo, and Yoruba data now up to basic level 23 additional measurement units Many data additions and corrections in many other languages The following language has been added to ICU: Cebuano This version of ICU does not yet implement the Indic Grapheme Cluster improvements from CLDR 35. New Japanese calendar era from 2019: CLDR and ICU include data for testing that can be enabled. (ICU #12973, CLDR #10750) To enable CLDR new Japanese era placeholder name, set environment variable (and Java system property for ICU4J) ICU_ENABLE_TENTATIVE_ERA=true (This was added in ICU 63). Support added for Gannen year numbering (using 元 for the first year of an era) in the Japanese locale Japanese-calendar full, long, and medium formats. Gannen year support is also automatically added for other non-numeric formats (those containing other kanji characters such as 年) derived from pattern skeletons unless specifically overridden. (ICU #20441, CLDR #11843, CLDR #11819) We are planning for an ICU 64.2 update in 2019-April which will add the new Japanese era with its real name. ICU 64 now uses "rearguard" TZ data. (Recent versions have used "vanguard" data with certain overrides.) (ICU-20398) ICU data filtering: The ICU4C build accepts an optional filter script that specifies a subset of the data to be built, with whitelists and blacklists for locales and for resource bundle paths. (ICU-10923, design doc) See this new documentation page: userguide/icu_data/buildtool.md MessageFormat has new pattern syntax for specifying the style of a date/time argument via a locale-independent skeleton rather than a locale-specific pattern. (ICU-9622) Date/time skeletons use the same "::" prefix as number skeletons. Example MessageFormat pattern string: "We close on {closing,date,::MMMMd} at {closing,time,::jm}." Many formatting APIs can now output a new type of result object which is-a FormattedValue (Java & C++), or convertible to a UFormattedValue (C). These combine the result strings with easy iteration over FieldPosition metadata. ICU4C Specific Changes New C++ class LocaleBuilder for building a Locale from subtags, keywords, and extensions. (ICU-20328) Parallel to the existing ICU4J ULocale.Builder class. For C++ MeasureUnit instances, there are now additional factory methods that return units by value, not by pointer-with-ownership. (ICU-20337) Various Out-Of-Memory (OOM) issues have been fixed. (ticket query)
Update to 62.1 Changelog: Common Changes Unicode 11: 684 new characters, including 7 new scripts, Mtavruli Georgian capital letters, 5 new Han characters, and 66 new emoji characters. CLDR 33.1: Unicode 11 script metadata, collation, Chinese transliteration. Chinese collation stroke order updated from Unicode 7 to Unicode 11 after tooling bug fixes. NumberFormatter A NumberFormatter can now be constructed from a locale-neutral skeleton string (like a DateFormat) (#8610). This is particularly useful in translated messages where placeholder details should not be translated. MessageFormat recognizes the style field as a number skeleton if it is prefixed with "::", as in "Number of files: {num, number, :: round-integer group-min2}." (#13742) New "conversion" functions for getting a NumberFormatter from a DecimalFormat, and a Format from a NumberFormatter. New C API (unicode/unumberformatter.h [permanent API docs link TBD]). (#13597) Currently it supports formatting settings only via a skeleton string. Several still-draft NumberFormatter methods and helper classes have been modified or renamed; the previous versions remain temporarily (as deprecated) for one release, to help with the transition. Break Iterator Rules: "Safe" rules are no longer required for correct break iterator operation. For back compatibility, existing rule sets containing safe rules will continue to work, with the safe rules they contain being ignored. The Break Iterator binary data format has been updated to reflect this change. Line Break: The boundary rules have been updated to reflect the Unicode 11 version of UAX #14. Specifically, the handling of Emoji ZWJ sequences has been improved. ICU4C Specific Changes Under-the-hood overhaul of number parsing. See the design doc for a summary of changes; behavior is mostly compatible with previous versions, but there are some known differences. DecimalFormat now wraps the new NumberFormatter code.
textproc/icu: updated to 61.1 61.1: Common Changes * CLDR 33: - Two additional locales (Odia, Assamese) were brought up to Modern coverage level. - 4 new transforms: fa-fa_FONIPA, ha-ha_NE, nv-nv_FONIPA, vec-vec_FONIPA. - New currency code MRU for Mauritania. - Arabic native vs. ASCII digits. - Data additions & bug fixes. * Many small API additions, improvements, and bug fixes. ICU4C Specific Changes * Added Google double-conversion library for formatting doubles. This is the library used in V8 and a number of other projects for converting doubles to decimals. To avoid name collisions, the library is linked internally under the ICU namespace as icu::double_conversion. Our copy of double_conversion is not intended for public usage. * Re-wrote U8_NEXT macros to eliminate all library function calls.
icu: bump minimum requirement for library major bump
revbump for requiring ICU 59.x
icu: belatedly require at least ICU 59.x 59 was a major bump, ICU 58 will not have the right library name. pointed out by kapper, thanks. ok leot
Remove -std=c++11 from here, it causes breakage in code that uses C++11 extensions or wants C++14.
Add -std=c++11 to CXXFLAGS. icu uses char16_t in a central header file; this type is added by c++11.
Update to 58.1 * Fix regression with upstream patch, https://ssl.icu-project.org/trac/ticket/12827 Changelog: Common Changes CLDR 30.0.2: For details of the many changes in CLDR, see CLDR 30. Some things to note: For some combinations of numbering system (arab, arabext, latn) and/or locale (ar, fa, he), there were changes to the bidirectional control characters used with certain symbols (percent, minus, plus), and changes to number patterns (currency and/or percent, including addition of bidirectional control characters in some cases). New in this release, the bidirectional controls used for such purposes include U+061C ARABIC LETTER MARK (ALM), which requires use of the bidirectional algorithm from Unicode 6.3 or later. The time separator for Norwegian locales (nb, nn) was changed to be ':' throughout. Unicode 9.0: Version 9.0 adds exactly 7,500 characters, for a total of 128,172 characters. These additions include six new scripts, 19 symbols for the new 4K TV standard, and 72 new emoji characters. Draft Emoji 4.0 data Emoji updates for word & line breaking. (#12664 & Unicode 9 update #12526) UBiDiTransform/BidiTransform API for convenient transformation of text between different Bidi layouts. (#11679) MeasureFormat API for measurement unit display names. (#12029) Most COUNT and LIMIT enum constants have been deprecated. (#12420) SpoofChecker: Handling of "whole script confusables" has been removed from ICU, in accordance with its removal from UTS #39 Version 9.0.0 and the removal of the corresponding Unicode data file. (#12549) Greek uppercasing ("el" locale ID) removes most diacritics. (#5456) More robust locale data loading across ICU implementation code. Reduced heap memory usage in DateTimePatternGenerator. (#11782) ICU4C Specific Changes The layout engine code has been removed; the ParagraphLayout is not deprecated and remains (and must now be built on top of HarfBuzz). See http://userguide.icu-project.org/layoutengine (#12708) Windows: Supports & requires Visual Studio 2015.
Update to 57.1 Changelog: Common Changes CLDR 29: For details of the many changes in CLDR, see CLDR 29. Grapheme/word/line breaking for emoji sequences, based on Unicode 9 proposed rules. See the Unicode emoji break proposal and the Unicode Emoji Technical Report Proposed Update describing the new emoji sequences. (#12081). Four new Unicode emoji properties (#11802). DateFormat day period formatting of "noon", "at night", etc. via new pattern characters b & B, and DateTimePatternGenerator support of C for selecting the customary form (#11872). Except: Formatting of "0:00 midnight" has been disabled because it is confusing except for at the end of an interval. RelativeDateTimeFormatter: Simpler formatting API (#12072). More robust CLDR data loading for MeasureFormat (#11986, #12030), RelativeDateTimeFormatter (#12018), and DateIntervalFormat/DateIntervalInfo (#12013). New simple & fast SimpleFormatter class for a trivial subset of MessageFormat as used in CLDR data, e.g., "{0} {1}" (#10896). ICU4C Specific Changes C API support for RelativeDateTimeFormatter (#12072). Clang annotations for intended switch case fallthroughs, can now compile with -Wimplicit-fallthrough (#12166). Internal header files can be compiled by themselves, for simpler alternative build scripts (#12141).
Bump ABI.
Changes 55.1: The features for this release include support of CLDR 27 (with a major cleanup of region locales, among many other improvements), formatting for scientific notation ("1.2 × 10³"), an update to Unicode 7.0 data for spoof-checking, narrow AM/PM markers ("7:45p"), and various performance enhancements. For C/C++, there are new methods for flexible dates ("Nov 10", or "Sept 2015"), named capture groups for regular expressions, formatting of compound units ("3.5 meters per second"), new C wrappers, and independent timezone resource loading. ICU4J has been improved and tested for using ICU4C data and for running on Android.
Revbump after updating libwebp and icu
recursive bump from icu shlib major bump.
Revbump after updating textproc/icu
Changes 51.1: Common Changes ============== CLDR 23: Collation tailorings put native script first; non-Gregorian calendar formats are more consistent; much improved data for Armenian (hy), Georgian (ka), Mongolian (mn), and Welsh (cy); … Time zone data: 2013b Date format/parse now supports CLDR short weekday names ("EEEEEE", "cccccc"). Support DisplayContext for date formatting, locale display names. DateTimePatternGenerator behavior is now much more consistent between C and J. Support new timezone pattern characters in LDML spec: X+, x+, O, OOOO, V, VV, VVV. Updated SpoofChecker for v5 of UTS39. AlphabeticIndex enhancements: New thread-safe ImmutableIndex sub-API Build an index for a custom Collator. Make data-driven for Chinese collations. New API for CLDR script metadata. ICU4C Specific Changes ====================== Support for “dangi” Korean luni-solar calendar (already in ICU4J). Add CompactDecimalFormat (already in ICU4J). Add TerritoryContainment APIs (already in ICU4J). UnicodeString default constructor and destructor now inline. Layout engine now supports 'morx' tables. Fixed some ICU 50 regressions: Affixes set with e.g. DecimalFormat::setPositivePrefix were ignored for parse. UNUM_PARSE_INT_ONLY no longer handled grouping separator. Add ucal_getTimeZoneID. The C++ AlphabeticIndex implementation is now on par with Java, including full support for all Chinese collation tailorings. U8_NEXT() and similar low-level macros now support NUL-terminated UTF-8 strings. New macros like U8_NEXT_OR_FFFD() return U+FFFD for an ill-formed sequence. Conversion: New "good one-way" mapping type, for example for Variation Selector sequences.
Revbump after graphics/jpeg and textproc/icu
increase BUILDLINK_ABI_DEPENDS.icu due to so change from 49 to 50. Also bump PKGREVISION for a few packages using it. The packages I did this for: net/yaz lang/parrot misc/openoffice3 (where I noticed the run-time failure due to missing shared library) www/webkit-gtk sysutils/open-vm-tools inputmethod/ibus-qt I didn't do this recursively or for all packages using icu since I didn't know if they used the shared library directly, some use was optional. The list of packages I didn't touch: devel/devhelp databases/idzebra databases/sqlite3 devel/gnustep-base/ finance/gnucash games/openttd graphics/shotwell lang/mono meta-pkgs/boost misc/calibre misc/libreoffice news/tin textproc/php-intl www/deforaos-surfer www/epiphany www/liferea-current www/midori
Recursive bump from icu shlib major bumped to 49.
recursive bump from textproc/icu shlib major bump.
Changes 4.8: * CLDR 2.0: The CLDR 2.0 release contains numerous improvements and bug fixes approved by the CLDR committee, including much additional data for many languages. * Explicit parent locale support in data imported from CLDR. * MessageFormat and related classes (choice/plural/select) have been reimplemented, with several improvements and some incompatible changes. * Extended PluralFormat pattern syntax supports explicit-value forms and offsets. * Utility APIs in PluralRules (get some/all/unique keyword values) * Time zone API to return a list of available canonical system time zone IDs. * Time zone API to return a region. * Collation: Full implementation & public API for script reordering * Dictionary-type trie * GB18030-2005 update
Bump PKGREVISION from icu shlib major bump.
oops, shlib major changed in last update, adjust dependency, addresses PR pkg/41855 by Robert Elz
Update from version 3.6nb2 to 4.0.1. Pkgsrc changes: o New MASTER_SITE o Adjust PLIST o Remove no-longer-needed patches, since corresponding changes have been adopted upstream o BUILDLINK_ABI_DEPENDS bumped to >=4.0, since a new shared library version is installed o Fixes security vulnerability, ref. below. Dependent pkgsrc packages will have their revisions bumped shortly due to the (possibly/probably) changed ABI. Upstream changes: 4.0.1: ICU4C 4.0.1 is a maintenance release of ICU4J 4.0. The primary changes of this release were: * Updated time zone data to 2008i * Technical preview of string search implementation using Boyer-Moore algorithm (#6286). For detail information, please see the tech note here. * #5691 Conversion: consistent illegal sequences * #6435 Bad @stable ICU4.0 tags * #6597 TestDisplayNamesMeta failure * #6670 Test failure in format/TimeZoneTest/TestShortZoneIDs 4.0: Major changes in ICU 4.0 include the following: * Common Changes o Unicode 5.1 (#5696) o Locale Data: ICU uses and supports data from Common Locale Data Repository (CLDR) 1.6 , which includes many improvements in quality and quantity of data. o add/removeLikelySubtags (#6124) o Charset converter file size improvement (#5987) o Date Interval Formatting (#6157) Note: Calendar type supported by this feature is Gregorian only in this release. o Improved Plural support * ICU4C Specific Changes Additional Calendars + Chinese (#4081) + Coptic/Ethiopic (#4571) * ICU4J Specific Changes o Charset + Graduated from Technology Preview status + ICU2022 Converter (#5791) + HZ Converter (#6128) + SCSU/BOCU-1 Converter (#2147) + Charset Converter Callback (#6144) o Thai Dictionary break iterator (#5385) o JDK TimeZone support (#5975) o Locale Service Provider (#5976) o More convenient formatting of year+month, day+month, and other combinations (#6304) o Simple Duration Formatting (#6303) * ICU4C Security Fixes ICU4C 4.0 resolves the vulnerabilities CVE-2007-4770 and CVE-2007-4771 which were found in earlier versions of ICU. The standard ICU tests verify that these have been corrected, however, the updated versions of the previous tests may be run by applying the following patch to ICU 4.0: r24324. As well, ICU4C and ICU4J 4.0 resolve the issue underlying CVE-2008-1036.
Simply and speed up buildlink3.mk files and processing. This changes the buildlink3.mk files to use an include guard for the recursive include. The use of BUILDLINK_DEPTH, BUILDLINK_DEPENDS, BUILDLINK_PACKAGES and BUILDLINK_ORDER is handled by a single new variable BUILDLINK_TREE. Each buildlink3.mk file adds a pair of enter/exit marker, which can be used to reconstruct the tree and to determine first level includes. Avoiding := for large variables (BUILDLINK_ORDER) speeds up parse time as += has linear complexity. The include guard reduces system time by avoiding reading files over and over again. For complex packages this reduces both %user and %sys time to half of the former time.
Change the format of BUILDLINK_ORDER to contain depth information as well, and add a new helper target and script, "show-buildlink3", that outputs a listing of the buildlink3.mk files included as well as the depth at which they are included. For example, "make show-buildlink3" in fonts/Xft2 displays: zlib fontconfig iconv zlib freetype2 expat freetype2 Xrender renderproto
Track information in a new variable BUILDLINK_ORDER that informs us of the order in which buildlink3.mk files are (recursively) included by a package Makefile.
Aligned the last line of the buildlink3.mk files with the first line, so that they look nicer.
Over 1200 files touched but no revisions bumped :) RECOMMENDED is removed. It becomes ABI_DEPENDS. BUILDLINK_RECOMMENDED.foo becomes BUILDLINK_ABI_DEPENDS.foo. BUILDLINK_DEPENDS.foo becomes BUILDLINK_API_DEPENDS.foo. BUILDLINK_DEPENDS does not change. IGNORE_RECOMMENDED (which defaulted to "no") becomes USE_ABI_DEPENDS which defaults to "yes". Added to obsolete.mk checking for IGNORE_RECOMMENDED. I did not manually go through and fix any aesthetic tab/spacing issues. I have tested the above patch on DragonFly building and packaging subversion and pkglint and their many dependencies. I have also tested USE_ABI_DEPENDS=no on my NetBSD workstation (where I have used IGNORE_RECOMMENDED for a long time). I have been an active user of IGNORE_RECOMMENDED since it was available. As suggested, I removed the documentation sentences suggesting bumping for "security" issues. As discussed on tech-pkg. I will commit to revbump, pkglint, pkg_install, createbuildlink separately. Note that if you use wip, it will fail! I will commit to pkgsrc-wip later (within day).
Recursive revision bump / recommended bump for gettext ABI change.
Update to 3.4: New Features: Major changes in ICU 3.4 include the following: Updates to conform to Unicode 4.1, including new characters properties and values, text segmentation, plus collation updated for Unicode Technical Standard #10 (UCA) and regex updated for Unicode Technical Standard #18. * Updates to conform to the Common Locale Data Repository (CLDR), Version 1.3 for the latest locale data. This includes: * New data to support localization of timezones, United Nations M.49 regions (including continents and regions), mappings from language to script and territory. * Consolidation of inherited data and improved resource aliasing for smaller data footprint * Additional locales, and many other fixes and additions of locale data. * POSIX migration support: direct API support for all POSIX character classes, implemented according to Unicode recommendations
ICU 3.2 includes the latest bug fixes, locale/charset updates, and performance/build/porting enhancements. The following list summarizes the main new features in this release.sion. CLDR 1.2. This is the main new feature in the release. ICU locale data is now completely built from the CLDR 1.2 data, which contains data for 232 locales, covering 72 languages and 108 territories. Many translated names for languages, territories, and scripts have been added, as well as for time zones, calendars, and other named items such as collation. For more information, see http://www.unicode.org/press/pr-cldr1.2.html. Miscellaneous Universal Timescale conversions. ICU now provides mechanisms for quickly and reliably converting between the different binary representations of date/time used on different platforms. Accept-Language. ICU provides a mechanism for matching Accept-Language against a list of locales. DateFormat and Calendar Performance. Object construction performance has been significantly improved. Footprint. The size of executables that statically link to ICU has been reduced. Stdin. The icuio library can now read from stdin. UnicodeSet C API. More uset_* C API were added. i5/OS (os/400). Building ICU has been simplified to allow more configure options to work. POSIX. Default codepage determination has been fixed.
update to icu-3.0 major changes: ICU 3.0 includes the latest bug fixes, locale/charset updates, and performance/build/porting enhancements. - Collation Collation data is in a separate data tree, allowing for easier modularization and maintenance. getFunctionalEquivalent API allows for better caching and UI support. - Unicode 4.0.1 ICU is updated to the latest version of Unicode standard, which had significant property changes. - CLDR 1.1 Updates to CLDR 1.1, with many updates to locale data, and special emphasis on collation data. - Formatting As an aid to migration of traditional C (stdio) and C++ (iostream) formatting, the POSIX-like input/output library, icuio, is officially supported. Significant digits now supported in DecimalFormat, for general use and %g support. - RFC822 time zone format support in DateFormat for compatibility. - Currency formatting/parsing improvements Allows parsing multiple currencies with one formatter, without knowing the currency in advance. Much cleaner design allowing extensibility to other measurement units in the future. - Regular expressions (C) The regular expressions framework now features a C API, instead of just C++. - Locales Locale canonicalization spec defined and implemented. Provides interoperability with POSIX and .NET locale IDs, more RFC 3066 support. - Layout engine Layout engine now supports using different canonically-equivalent Unicode forms of the same text: e.g. a + ´ or á. This is especially important for non-Latin scripts. - Build Environment ICU can now build its data library much faster on most platforms. For a complete list see: http://oss.software.ibm.com/cvs/icu/~checkout~/icu/readme.html?tag=release-3-0
update to 2.8 Lot's of changes and fixes. For example: # Number Formatting ICU4C adds support for formatting and parsing of 64-bit integers. # Text Analysis (Break Iterators) Full conformance with Unicode Consortium UAX 29 and UAX 14 definitions for text boundary positions. Significantly improved performance for reverse direction iteration and isBoundary tests of arbitrary string positions. # StringPrep ICU 2.8 adds APIs and a tool for generic support of StringPrep profiles such as those used in NFS 4. For a complete list see: http://oss.software.ibm.com/icu/download/2.8/index.html
Reorder location and setting of BUILDLINK_PACKAGES to match template buildlink3.mk file in revision 1.101 of bsd.buildlink3.mk.
The BUILDLINK_DEPENDS.<pkg> lines should match between the buildlink3.mk and any pre-existing buildlink2.mk files.
BUILDLINK_DEPENDS.<pkg> lines should _always_ use += to _append_ to the existing value. This is critical to set the correct list of dependencies for a package.
Add buildlink3.mk. (needed to bl3ify lang/mono)