Up to [cvs.NetBSD.org] / pkgsrc / converters / recode
Request diff between arbitrary revisions
Keyword substitution: kv
Default branch: MAIN
*: recursive bump for Python 3.11 as new default
Mass-change BUILD_DEPENDS to TOOL_DEPENDS outside mk/. Almost all uses, if not all of them, are wrong, according to the semantics of BUILD_DEPENDS (packages built for target available for use _by_ tools at build-time) and TOOL_DEPEPNDS (packages built for host available for use _as_ tools at build-time). No change to BUILD_DEPENDS as used correctly inside buildlink3. As proposed on tech-pkg: https://mail-index.netbsd.org/tech-pkg/2023/06/03/msg027632.html
recode: update to 3.7.14. Version 3.7.14 ============== + No user-visible changes; minor clean-ups as part of the Debian packaging process. Version 3.7.13 ============== + Fix request diagnostics with --verbose: avoid output of confusing and possibly incorrect diagnostics. + Fix a file descriptor leak. Version 3.7.12 ============== + Simplify support for ignoring invalid input with iconv, integrated with the --strict and --force mechanisms. + Various documentation improvements. Version 3.7.11 ============== + Re-add support for transliteration with iconv (removed in 3.7). + Add support for ignoring invalid input with iconv. + Fix a bug introduced in 3.7.10 that prevented building the code. Version 3.7.10 ============== + In recode program, only use iconv if needed; add --prefer-iconv option to allow its use in all cases. + Remove support for pre-3.5 request syntax (colon as charset separator). + PO files have been updated from the Translation Project. + Build system fixes and improvements. + Update gnulib to fix a problem building on Android. + Fix confusingly wrong NEWS entry for 3.7.4. Version 3.7.9 ============= + A fix to the CP1252 encoding: U+017E LATIN SMALL LETTER Z WITH CARON is at byte 0x9e, not byte 0x8f. + Minor documentation fixes. Version 3.7.8 ============= + Improvements to the build system. + Fix man page generation, and document that help2man must be built with gettext. + Updates to nl, pt, sv translations (thanks, translators!). Version 3.7.7 ============= + Improvements to the build system. + Updates to nl, pt, sv translations (thanks, translators!). Version 3.7.6 ============= + Improvements to the build system. Version 3.7.5 ============= + Port tests to Python 3. Version 3.7.4 ============= + Fixes to file handling in recode program. + Fix tests on Windows. Version 3.7.3 ============= + No code changes to recode itself; this release features a properly versioned shared library. Version 3.7.2 ============= + No code changes to recode itself; this release includes updates to license headers to guide users to the GPL online, corrects the version of COPYING-LIB shipped with the sources, and updates the message files for various languages. Version 3.7.1 ============= + No code changes to recode itself; this release just updates the version of gnulib to fix a bug in glibc 2.28: (GitHub issue #11 https://github.com/rrthomas/recode/issues/11 Version 3.7 =========== + Converters for BibTeX (from Vincent Danjean) and the ANSEL and ISO 5426 character sets (from Wolfram Schneider) have been added. + The conversion strategies (whether to use pipes, memory or files) are no longer available. Now it is reasonable to assume virtual memory, so files and memory have similar performance characteristics (in particular, the memory method is not limited by physical memory.) Further, tests showed that even for runs on little data, the pipes method has minimal performance impact (none was measured). This is not a surprise, as for one-step recodings, the commonest case, no forking is needed. The command-line options -i, -p and --sequence=STRATEGY are ignored for backwards compatibility. + Recode does not include libiconv anymore, but uses an external iconv library if one was available at installation time. The -x: option to the program, or a new flag to the library recode_new_outer function, inhibits the initialisation and usage of iconv. + The experimental ``tree`` surface is removed. Structured data needs a proper parser, and that doesn't fit the framework of Recode. + Many bug fixes. + Long ago, I renamed GNU recode to Free recode: the permission for using the GNU prefix mandated a level of obedience to the FSF that once went overboard, in my opinion. After that change, I realized that some people read Free as a four letter word! To be peaceful, this version changes the name again, to merely Recode. recode (no capital) still names the executable program specifically, or the distribution archive itself. + make check accepts a LIMIT= option, for limiting tests to one or a few cases. See tests/Makefile.am. + PO files have been updated from the Translation Project. + The test system has been overhauled. Tests now run much faster, and require Python and Cython. + Overhauled build system, now using gnulib for portability. This reduces the amount of code in the Recode tree considerably.
recode: set TEST_TARGET
recode: Drop maintainership.
all: migrate several HOMEPAGEs to https pkglint --only "https instead of http" -r -F With manual adjustments afterwards since pkglint 19.4.4 fixed a few indentations in unrelated lines. This mainly affects projects hosted at SourceForce, as well as freedesktop.org, CTAN and GNU.
converters: align variable assignments pkglint -Wall -F --only aligned --only indent -r No manual corrections.
Drop superfluous PKG_DESTDIR_SUPPORT, "user-destdir" is default these days.
Include lib/error.c in librecode.so, or else librecode.so ends up with an undefined symbol "error". Fix mostly from PR 32253, from Evaldo Gardenali, six years ago. (sigh) PKGREVISION -> 6
Fix build with gcc-4.5+, from Joern Clausen in PR 45286. While here, silence pkglint.
recursive bump from gettext-lib shlib bump.
Replace /usr/pkg/share/locale with $(prefix)/share/locale, my bad. Bump PKGREVISION. Ok'ed by wiz.
Add DESTDIR support.
Whitespace cleanup, courtesy of pkglint. Patch provided by Sergey Svishchev in private mail.
Drop support for LTCONFIG_OVERRIDE. For quite a long time, pkgsrc had actually been ignoring LTCONFIG_OVERRIDE anyway and just using the default LIBTOOL_OVERRIDE to replace libtool scripts in packages. This just formalizes the fact that LTCONFIG_OVERRIDE is not used meaningfully by pkgsrc.
* Honor PKGINFODIR. * List the info files directly in the PLIST.
Recursive revision bump / recommended bump for gettext ABI change.
Needs BROKEN_GETTEXT_DETECTION.
Remove USE_BUILDLINK3 and NO_BUILDLINK; these are no longer used.
Libtool fix for PR pkg/26633, and other issues. Update libtool to 1.5.10 in the process. (More information on tech-pkg.) Bump PKGREVISION and BUILDLINK_DEPENDS of all packages using libtool and installing .la files. Bump PKGREVISION (only) of all packages depending directly on the above via a buildlink3 include.
Convert to buildlink3.
USE_NEW_TEXINFO is unnecessary now.
s/netbsd.org/NetBSD.org/
PKGREVISION bump for libiconv update.
USE_NEW_TEXINFO.
USE_PKGLOCALEDIR
Convert to buildlink2.
Introduce new framework for handling info files generation and installation. Summary of changes: - removal of USE_GTEXINFO - addition of mk/texinfo.mk - inclusion of this file in package Makefiles requiring it - `install-info' substituted by `${INSTALL_INFO}' in PLISTs - tuning of mk/bsd.pkg.mk: removal of USE_GTEXINFO INSTALL_INFO added to PLIST_SUBST `${INSTALL_INFO}' replace `install-info' in target rules print-PLIST target now generate `${INSTALL_INFO}' instead of `install-info' - a couple of new patch files added for a handful of packages - setting of the TEXINFO_OVERRIDE "switch" in packages Makefiles requiring it - devel/cssc marked requiring texinfo 4.0 - a couple of packages Makefiles were tuned with respect of INFO_FILES and makeinfo command usage See -newly added by this commit- section 10.24 of Packages.txt for further information.
Better late than never, update to recode-3.6. Changes since 3.5 include: .* General changes . + The recode manual is now indexed, by charset, by concept, etc. . + Program messages are also available in Greek, Gallicean and Italian. . + Bruno Haible's nice portable iconv library has been integrated. . + RFC 1345 tables and French character names have been updated. . + The Texinfo charset has been refreshed, and made reversible. .* New charsets (most from libiconv) . + Japanese EUC-JP (csEUCPkdFmtJapanese, EUC_JP, Extended_UNIX_Code_Packed_Format_for_Japanese); ISO-2022-JP (csISO2022JP); ISO-2022-JP-1; ISO-2022-JP-2 (csISO2022JP2); JIS_C6220-1969-ro (csISO14JISC6220ro, ISO646-JP, iso-ir-14, jp); JIS_X0201 (csHalfWidthKatakana, JIS0201, JISX0201-1976, JISX0201.1976-0, X0201); JIS_X0208 (csISO87JISX0208, ISO-IR-87, JIS0208, JIS_X0208.1983-0, JIS_X0208.1983-1, JIS_X0208-1990-0, JIS_X0208.1983-1, X0208); JIS_X0212 (csISO159JISX02121990, ISO-IR-159, JIS0212, JIS_X0212.1990-0, JIS_X0212-1990, X0212); SJIS (csShiftJIS, MS_KANJI, SHIFT-JIS). . + Chinese BIG5 (BIG-5, BIG-FIVE, BIGFIVE, CN-BIG5 csBig5); BIG5HKSCS; EUC-CN (CN-GB, csGB2312, EUC_CN, GB2312); EUC-TW (csEUCTW, EUC_TW); GB18030; HZ (HZ-GB-2312); ISO-2022-CN (csISO2022CN); ISO-2022-CN-EXT; GB_1988-80 (cn, csISO57GB1988, ISO646-CN, iso-ir-57); GB_2312-80 (CHINESE, csISO58GB231280, GB2312.1980-0, ISO-IR-58); ISO-IR-165 (CN-GB-ISOIR165). . + Korean JOHAB (CP1361); EUC-KR (csEUCKR, EUC_KR); GBK (CP936); ISO-2022-KR (csISO2022KR); KSC_5601 (CP949, csKSC56011987, ISO-IR-149, KOREAN, KSC5601.1987-0, KS_C_5601-1987, KS_C_5601-1989, KSX1001:1992). . + Vietnamese (independently of libiconv) TCVN; VIQR; VISCII; VNI; VPS. . + Other languages ARMSCII-8; Georgian-Academy; Georgian-PS; WINDOWS-874 (CP874); MuleLao-1; CP1133 (IBM-CP1133); CP1258 (WINDOWS-1258); TIS-620 (ISO-IR-166, TIS620, TIS620.2529-1, TIS620-0, TIS620.2533-0, TIS620.2533-1). . + Apple specifics MacArabic; MacCentralEurope; MacCroatian; MacCyrillic; MacGreek; MacHebrew; MacIceland; MacRomania; MacThai; MacTurkish; MacUkraine . + Unicode JAVA; UCS-2-INTERNAL; UCS-2LE (UnicodeLITTLE); UCS-2-SWAPPED; UCS-4BE; UCS-4-INTERNAL; UCS-4LE; UCS-4-SWAPPED; UTF-16BE; UTF-16LE. . + Others CP932; CP949 (UHC); CP950; CP866 (866, csIBM866, IBM866). ISO-8859-16 (ISO-IR-226, ISO_8859-16:2000). . + Recode internal :libiconv: (:) [so option -x: avoids going through libiconv] .* New aliases (from libiconv) [list to be revised] csASCII (for ANSI_X3.4-1968); csHPRoman8 (for hp-roman8); csISOLatin1 (for ISO-8859-1); csISOLatin2 (for ISO-8859-2); csISOLatin3 (for ISO-8859-3); csISOLatin4 (for ISO-8859-4); csISOLatin5 (for ISO-8859-9); csISOLatin6 and ISO_8859-10:1992 (for ISO-8859-10); csISOLatinArabic (for ISO-8859-6); csISOLatinCyrillic (for ISO-8859-5); csISOLatinGreek (for ISO-8859-7); csISOLatinHebrew (for ISO-8859-8); csKOI8R (for KOI8-R); csPC850Multilingual (for IBM850); csUCS4 (for ISO-10646-UCS-4); csUnicode, csUnicode11, UCS-2BE, UnicodeBIG (for ISO-10646-UCS-2); csUnicode11UTF7 (for UNICODE-1-1-UTF-7); csVISCII and VISCII1.1-1 (for VISCII); ISO-IR-179 (for ISO-8859-13); csMacintosh and MacRoman (for macintosh); TCVN5712-1, TCVN5712-1:1993 and TCVN-5712 (for TCVN). .* New surfaces tree (experimental).
This package uses gettext and libtool, so make the appropriate inclusions and definitions. Also mark as USE_BUILDLINK_ONLY, and change PLIST to refer to the installed po files in ${PKGLOCALEDIR}, not "share".
Update to new COMMENT style: COMMENT var in Makefile instead of pkg/COMMENT.
Update recode to 3.5 User-visible changes are: .* Incompatible changes . + A double dot `..' should now be used instead of a colon `:'. . + Option --force (-f) is needed to pursue recoding despite errors. . + There is no more quoting for special characters within charsets names. . + Auto check (`-a') and popen (`-o') options have been withdrawn. . + Some charsets and aliases were deleted, see `Charsets & aliases' below. .* Extended features . + Program messages are available in localised form for many languages. . + Long character names are available in French, if LANGUAGE is set to `fr'. . + A new request syntax allows for recode chaining, and for surfaces. . + Option --header-file (-h) accepts a language parameter, and Perl is new. . + Full charset listings now show the UCS-2 value for characters. . + Option --known=PAIRS (-k) also accepts octal and hexadecimal numbers. . + Option --list (-l) better sorts charsets and aliases, also fully written. . + Charset `RFC1345' implements mnemonic+ascii+38, and is now reversible. . + HTML is not limited anymore to Latin-1, HTML 4.0 entities are supported. .* New features . + Euro support. . + Updated RFC 1345 set of tables, from Keld Simonsen. . + Some African charsets and transliterated forms. . + Conversions for ISO 10646 and Unicode. . + Combining or explosion of UCS-2 diacriticized characters and ligatures. . + Implementation of surfaces, see `Surfaces & aliases' below. . + Mixed mode for recoding only comments and strings in C sources or PO files. . + A stand-alone recoding library gets installed, often as a shared library. . + Option --find-subsets (-T) lists charsets which are subsets of another. . + The library may generate testing data, and study character frequencies. .* Charsets & aliases . + New ISO 10646 and Unicode charsets . - combined-UCS-2: pseudo-charset. . - count-characters: pseudo-charset. . - dump-with-names: pseudo-charset. . - ISO-10646-UCS-2: aliases are UNICODE-1-1, BMP, rune and u2. . - ISO-10646-UCS-4: aliases are 10646, ISO-10646, UCS-4 and u4. . - UNICODE-1-1-UTF-7: aliases are TF-7 and u7. . - UTF-8: aliases are UTF-2, UTF-FSS, FSS_UTF, TF-8 and u8. . - UTF-16: aliases are Unicode, TF-16 and u6. . + RFC 1345.bis matters . - Deleted charsets dk-us, us-dk (because of &duplicate which `recode' does not handle yet). . - New charsets baltic (alias is iso-ir-179); CP1250 (1250, ms-ee, windows-1250); CP1251 (1251, ms-cyrl, windows-1251); CP1252 (1252, ms-ansi, windows-1252); CP1253 (1253, ms-greek, windows-1253); CP1254 (1254, ms-turk, windows-1254); CP1255 (1255, ms-hebr, windows-1255); CP1256 (1256, ms-arab, windows-1256); CP1257 (1257, WinBaltRim, windows-1257); CWI (CWI-2, cp-hu); EBCDIC-IS-FRISS (friss); GOST_19768-87 with aliases of previous GOST_19768-74; IBM256 (256, CP256, EBCDIC-INT1); IBM875 (875, CP875, EBCDIC-Greek); IBM1004 (1004, CP1004, os2latin1); IBM1047 (1047, CP1047); ISO-8859-13 (ISO_8859-13:1998, iso-baltic, iso-ir-179a, l7, latin7); ISO-8859-14 (ISO_8859-14:1998, iso-celtic, iso-ir-199, l8, latin8); ISO-8859-15 (ISO_8859-15:1998, iso-ir-203, l9, latin9); KOI-7; KOI-8 (GOST_19768-74); KOI8-R; KOI8-RU; KOI8-U; macintosh_ce (macce); mac-is; NeXTSTEP (next) yet previous `recode' had it outside RFC 1345. . - Alias promoted to charset (with previous charset becoming alias) ISO-646.basic (with ISO-646.basic:1983); ISO-646.irv (ISO-646.irv:1983); ISO_5427-ext (ISO_5427:1981); ISO_5428 (ISO_5428:1980); ISO-8859-1 (ISO_8859-1:1987); ISO-8859-2 (ISO_8859-2:1987); ISO-8859-3 (ISO_8859-3:1988); ISO-8859-4 (ISO_8859-4:1988); ISO-8859-5 (ISO_8859-5:1988); ISO-8859-6 (ISO_8859-6:1987); ISO-8859-7 (ISO_8859-7:1987); ISO-8859-8 (ISO_8859-8:1988); ISO-8859-9 (ISO_8859-9:1989); ISO-8859-10 (latin6); NC_NC00-10 (NC_NC00-10:81); sami (latin-lap). . - New aliases 037 (for charset IBM037); 038 (IBM038); 273 (IBM273); 274 (IBM274); 275 (IBM275); 278 (IBM278); 280 (IBM280); 281 (IBM281); 284 (IBM284); 285 (IBM285); 290 (IBM290); 297 (IBM297); 367 (ANSI_X3.4-1968); 420 (IBM420); 423 (IBM423); 424 (IBM424); 500, 500V1 (IBM500); 819 (ISO-8859-1); 864 (IBM864); 868 (IBM868); 870 (IBM870); 871 (IBM871); 880 (IBM880); 891 (IBM891); 903 (IBM903); 905 (IBM905); 912, CP912, IBM912 (ISO-8859-2); 918 (IBM918); 1026 (IBM1026); ECMA-113, ECMA-113:1986 (ECMA-Cyrillic); GOST_19768-74 (KOI8); ISO_8859-N (ISO-8859-N) for N = 1 through 10 and 13 through 15; ISO_8859-10:1993 (ISO-8869-10); iso-ir-170 (INVARIANT); KOI8_L2 (CSN_369103); pclatin2, pcl2 (IBM852); SS636127 (SEN_850200_B). . + New African charsets . - AFRL1-101-BPI_OCIL: aliases are t-francais and t-fra. . - AFRFUL-102-BPI_OCIL: aliases are bambara, bra, ewondo and fulfulde. . - AFRFUL-103-BPI_OCIL: aliases are t-bambara, t-bra, t-ewondo and t-fulfulde. . - AFRLIN-104-BPI_OCIL: aliases are lingala, lin, sango and wolof. . - AFRLIN-105-BPI_OCIL: aliases are t-lingala, t-lin, t-sango and t-wolof. . + Extra miscellaneous charsets . - KEYBCS2, Kamenicky. . - CORK, T1. . - KOI-8_CS2. . + New HTML pseudo-charsets . - HTML_1.1: alias is h1. . - HTML_2.0: aliases are RFC 1866, 1866 and h2. . - HTML-i18n: alias is RFC 2070. . - HTML_3.2: reimplemented; alias is h3. . - HTML_4.0: aliases are h4, HTML and h. . - Deleted aliases: HTF, 8859, ISO 8859, Entities, SGML, WWW, w3. .* Surfaces & aliases . + New MIME encoding surfaces . - Base64: aliases are 64 and b64. . - Quoted-Printable: aliases are qp and Quote-Printable. . + New permutation surfaces . - 21-Permutation: alias is swabytes. . - 4321-Permutation. . + New end of line surfaces . - CR. . - CR-LF: alias is cl. . + New (fully reversible) dump surfaces . - Decimal-1: aliases are d and d1. . - Decimal-2: alias is d2. . - Decimal-4: alias is d4. . - Hexadecimal-1: aliases are x and x1. . - Hexadecimal-2: alias is x2. . - Hexadecimal-4: alias is x4. . - Octal-1: aliases are o and o1. . - Octal-2: alias is o2. . - Octal-4: alias is o4. . + New miscellaneous surfaces. . - data, test7, test8, test15, test16.
Remove unnecessary slash from master site list.
Add a homepage for this package.
Delete unnecessary USE_GTEXINFO - if INFO_FILES is defined, USE_GTEXINFO gets defined automatically.
Catch up with MASTER_SITE_GNU reorganization.
Oops, forgot about MASTER_SITE_SUBDIR.
GNU recode: convert files between character sets and usages.
Initial revision