Up to [cvs.NetBSD.org] / pkgsrc / textproc / split-thai
Request diff between arbitrary revisions
Keyword substitution: kv
Default branch: MAIN
Update to version 2.38 - add force option for 'pthai-message, use switch-to-buffer-other-window - remove 'pthai-cycle-split-mode, just use customize to set - make 'pthai-soundfiles-sort work with 'sort, small cleanups - in 'pthai-mp3-fetch, print download url to *Messages* buffer - fix 'pthai-download-and-play when force fetching of complete word if pieces were previously downloaded - fix borderline cases in 'pthai-spell-merge-singles - simplify options to 'pthai-spell-suggestions and 'pthai-unknown-bounds - change 'pthai-unknown-words-extract to extract words from current buffer into new temp buffer, instead of current and named buffers - modify 'pthai-unknown-words-toggle, add 'pthai-unknown-words-toggle-dictionary and 'pthai-unknown-words-toggle-unknown for specific toggling of undefined or unknown words - comment cleanups and corrections in a few places
Update to version 2.37 - simplify pthai-dictionary-add-word - add reset option to pthai-spell-region - use delete-dups instead of seq-uniq in a few places - other small cleanups
Update to version 2.36 - add pthai-classifier-practice-(region/line/buffer/file), simplify class - clarify comments, a few minor code cleanups in a couple places - rename pthai-dictionary-def-p -> pthai-dictionary-any-p. change semantics of pthai-dictionary-def-p - add &optional force to pthai-classifier-by, pthai-classifier-for - add option for classifiers in pthai-practice-words, add pthai-classifier-practice-region/line/buffer/file - remove optional audio parameter from pthai-practice-region/line/buffer/file
Update to 2.35 - add 'pthai-message for large messages - use standard emacs regexp in pthai-split-region - include playable/unplayable with pthai-soundfiles-counts - small cleanups for pthai-time-loc2th - rename pthai-reverse-lookup -> pthai-lookup-reverse, simplify - include classifiers in dictionary word counts - simplify classifier functions, rename pthai-lookup-classifier -> pthai-classifier-lookup, rename pthai-lookup-classifier-at-point -> pthai-classifier-lookup-at-point, add pthai-classifier-list pthai-classifier-count pthai-classifier-by pthai-classifier-for, remove pthai-lookup-classifier-detail, pthai-lookup-classifier-detail-at-point, and pthai-classifier-info - display skipped words without definitions for pthai-practice-*, small cleanups
Update to 2.34 - remove 'pthai-pkg-wordlist and 'pthai-pkg-sample-dictionary, set as defaults for 'pthai-dictionary-list. remove some now unneeded bootstrap code - slight cleanups for 'pthai-url-lingopolo - simplify 'pthai-dictionary-parse-filelist, require version emacs 27.1 for 'directory-files-recursively - move 'pthai-dictionary-list to accommodate changes
Update to 2.33 - remove 'pthai-verbose-wordloads, cleanups for pthai-bootstrap. ensure 'pthai-dictionary-list doesn't load user dictionaries during setup - for pkgsrc, use -Q flag with emacs initialization of dictionaries
Update to 2.32 - Update pkgsrc Makefile to require emacs version >= 27 for json and 'string-distance. use ALL_ENV for utf8 settings - cleanups for pthai-url-* functions - remove custom variable 'pthai-audio-download-non-dictionary - change custom variable 'pthai-audio-download to 3-state variable from boolean, add 'pthai-mp3-permit-fetch to accommodate change - add 'pthai-fetch-silence to disable messaging during downloads, simplify 'pthai-download-and-play, 'pthai-highlight-say-bounds, 'pthai-mp3-fetch and various 'pthai-audio-* control flags
Update to 2.31 - some clean ups for 'pthai-url-thai2english from last checkin
Update to 2.30 - simplfy 'pthai-rwb slightly - fix 'pthai-url-thai2english after website format changed, uses json now
Update to version 2.29 - add 'pthai-copy-word-at-point 'pthai-copy-string-at-point - remove all external-program word splitting functionality associated with icu and swath - for word splitting, use only internal elisp functions - remove 'pthai-twt-lock, 'pthai-twt-splitter-enable. this results in the 'thai-word-table always being modified when adding or removing words - rename pthai-splitter-max-recurse-word-length to pthai-rwb-max-word-length - remove pthai-temperature-* functions
textproc/split-thai: Fix build with ICU 75
revbump after icu and protobuf updates
Update to 2.28 all changes for pthai.el - add option for looking up, viewing words/missed words in pthai-practice-words - add pthai-dictionary-clear - bind pthai-complete-word to key like alt-tab for ispell - for pthai-practice-* functions, don't require dictionary format - simplify pthai-dictionary-read-file and its *Messages* - update external dictionaries to latest builds
*: recursive bump for icu 74.1
Update to 2.27 all changes for pthai.el - reduce some unnecessary splitting and joining of strings - default to word at point for 'pthai-dictionary-find-word - make 'pthai-dictionary-find-regexp work same whether interactive or not - add prefix option for 'pthai-twt-words - in 'pthai-unknown-words-toggle, display clearer messaging whether words toggled on or off, or no words found - simplify code in a few places
Update to 2.26 all changes for pthai.el - add seconds to pthai-say-time, simplify time parsing - rename 'pthai-parse-hour-minute-second to 'pthai-parse-hms - rename 'pthai-bounds-offset-of-split-string to 'pthai-bounds-offsets - rename 'pthai-randomize to 'pthai-nrandomize, use faster in-place algorithm - remove 'pthai-nth-set, pthai-nth-delete - many changes related to 'thai-word-table including: * rename 'pthai-thai-break-words to 'pthai-twt-split * remove 'pthai-twt-update, change interface for 'pthai-twt-read * add 'pthai-twt-add/remove/clear for manipulating the 'thai-word-table * modification to 'pthai-dictionary will also be applied to 'thai-word-table * simplify pthai-dictionary-read-*, only have clear option for 'pthai-dictionary-read-files * add custom variables pthai-twt-splitter-enable, pthai-twt-lock
Mass-change BUILD_DEPENDS to TOOL_DEPENDS outside mk/. Almost all uses, if not all of them, are wrong, according to the semantics of BUILD_DEPENDS (packages built for target available for use _by_ tools at build-time) and TOOL_DEPEPNDS (packages built for host available for use _as_ tools at build-time). No change to BUILD_DEPENDS as used correctly inside buildlink3. As proposed on tech-pkg: https://mail-index.netbsd.org/tech-pkg/2023/06/03/msg027632.html
Update to 2.25 all changes for pthai.el - consider numbers as being in the dictionary - various fixes to handle when point at thai character and not word - simplify and fix some lookup-at-point functions - add option to pthai-practice-* to practice words without audio - rename 'pthai-fetch-thailanguage to 'pthai-fetch-thai-language - rename 'pthai-dictionary-grep to 'pthai-dictionary-find-word - rename 'pthai-unknown-words-insert to 'pthai-unknown-words-extract - add counting by letter function 'pthai-dictionary-word-counts - many small cleanups
revbump after textproc/icu update
Update to 2.24 all changes for pthai.el - use expand-file-name in a few places - fix pthai-audio-display-definition plumbing - use call-process* for pthai-mp3-play and pthai-split-command - rename pthai-splitter-swath-word-length to pthai-splitter-max-swath-word-length - restore pthai-thai-break-words splitter
Update to 2.23 all changes for pthai.el - move to end of word after *insert *complete-word functions - only toggle highlighting for 'pthai overlays - various bug fixes, cleanups, and simplifications - make end of word bounds detection more consistent - add 'pthai-soundfiles-counts to display downloaded word counts per letter - rename 'pthai-spell-string to pthai-spell-string-at-point, and 'pthai-spell-word to pthai-spell-word-at-point - rename pthai-overlay-off to pthai-hightlight-off
massive revision bump after textproc/icu update
Update to 2.22 - fix memory leaks in st-icu.cc - for st-swath, don't spawn new process for every stdin readline pthai.el: - clean up pthai-unknown* funcs and add toggle to highlight unknown words - add option to use external processes for swath and icu which is faster and gives more split choices - add pause and pthai-no-audio-pause-length custom setting for pausing on words with no audio when attempting audio play - clean-ups for pthai-currency* functions - consolidate text highlighting into pthai-highlight-on/off functions - when reloading pthai.el library, don't clear 'pthai-dictionary
Update to 2.21 all changes for pthai.el - convert between arbitrary currencies with choice-lists - small clean-ups
Update to 2.20 all changes for pthai.el - redo practice routine without having to reset dictionary - add pthai-practice-extract-words and pthai-practice-buffer/file/region
Update to 2.19 all changes for pthai.el - add pthai-practice-words for practicing words in a dict file - add pthai-soundfiles-all/playable/unplayable - fix pthai-rwb to handle numbers not in dictionary - fix parsing for thai strings with numbers and no spaces - various small cleanups
Update to 2.18 all changes for pthai.el - small clean-ups, add completing-read a few places - for pthai-unknowns*, keep word same as found in buffer
Update to 2.17 all changes for pthai.el - remove some unnecessary split methods and related customize settings - simplify and optimize 'pthai-split-all, reducing redundant word splitting - rename pthai-spell-correct-p -> pthai-spell-p, fix check for single char words - clarify few function descriptions - replace 'remove with 'delete in a few places - remove pthai-completing-read-mode and ido prompt customize option - replace pthai-completing-read with pthai-completing-read-full/dynamic, allows for thai word completion to be add to some functions
Update to 2.16 all changes for pthai.el - add pthai-dictionary-read-region/line/buffer functions - rename pthai-dictionary-read -> pthai-dictionary-read-file - rename pthai-dictionary-readall -> pthai-dictionary-read-files - rename pthai-word-count -> pthai-dictionary-word-count - add option to exit spell check at point - add progess meters to some word counting funcs - add pthai-spell-correct-p to spell check without word splitting - add pthai-unknowns/count/insert for extracting/counting unknown words
*: recursive bump for perl 5.36
revbump for textproc/icu update
Update to 2.15 all changes for pthai.el - add pthai-count-words ala count-words - for lookups, indicate when thai word is in dictionary but undefined - customize flag to skip download attempts for words not in dictionary - add lists of consonants, vowels, and numbers and functions to display
revbump for icu and libffi
Update to 2.14 all changes for pthai.el - yet more url-fetching clean-ups
Update to 2.13 all changes for pthai.el - more url-fetching fixes and cleanups
Update to 2.12 all changes for pthai.el - fix fetching of some words with pthai-url-thai-language
Update to 2.11 all changes for pthai.el - simplify some pthai-url-* functions, fix fetching of some words for pthai-url-thai-language
Update to 2.10 all changes for pthai.el - use (interactive "r") for a few functions - add simple pthai-mode with some common keybindings - add pthai-kill-word, pthai-backward-kill-word, pthai-transpose-words & keybindings - put detailed classifier info in own buffer if "long" result - add lookup classifier-at-point - better handling for some misspelled words with single char splits - customize option for completing-read/ido-completing-read prompts - for emacs versions < 27, still indicate word misspelled without giving suggestions instead of erroring - when spelling, don't spam messages and cursor movements, add progress reporter, bug fixes
Update to 2.9 changes mainly for pthai.el - add spell checking capability, similar to ispell, for word, line, buffer, region and related functions - many small clean ups, fix def display when downloading, double writing of files, always fetching some words, others fixes - add grep-like function to search dictionary files - add complete-word function for inserting words - remove interactive args with \nP - count nor-nayn as valid single char - use completing-read instead of ido in a few places
Update to 2.8 all changes for pthai.el - replace parser to extract words from 'thai-word-table using a non-recursive algorithm which mitigates the need to adjust max-specpdl-size - adjust pthai-twt-* functions to use new parser, fix pthai-twt-count - fix some whitespace regexp boundary matching and boundary at end of thaiword - add pthai-dictionary-save function - rename a few functions for better organization
Update to 2.7 all: - added libreoffice thai dictionary words pthai.el only: - fix pthai-twt-* to handle larger dictionary - add more thingatpoint functionality with forward-thaiword/backword-thaiword - use pulse instead of region to highlight words when playing - major code clean ups - split up audio word downloads in subdirectories
Update to 2.6 All changes for pthai.el - highlight thai text when looking up or playing audio - highlight text and move cursor as audio played for words - add customize toggles for message display when playing audio - add customize toggle to prevent downloading audio - add customize toggle to prevent playing audio - fix pthai-say* args to actually force downloads - (thing-at-point 'thaiword) works for thai words not separated by spaces - other clean ups
Update to 2.5 All changes for pthai.el - use buffer instead of file for pthai-twt-read - reread 'thai-word-table as needed when reading dictionary files - clean up some comments
Update to 2.4 All changes for pthai.el module - combine dictionary and wordlist hashes, update related functions - add customize toggles for url-fetchers and splitters - add lingopolo url fetcher
*: recursive bump for perl 5.34
Update to 2.3 All changes for pthai.el module - in pthai-play-thaiword, fix logic when sox/play not found - display duplicate word counts in pthai-unique-word-count - fix splitting when last character is punctuation and last word is english - add pthai-say-*-thai-only function to say thai words only buffer/line/region - add pthai-find-word-regexp to search for words in dictionary/wordlist
revbump for textproc/icu
Update to 2.2 - depend on sox instead mpg123 - in pthai.el, use sox as sole audio player. this simplifies things a bit and allows silences to be stripped from audio files while playing. remove pthai-mp3-* custom variables - minor pthai.el clean ups - remove seq-contain-p from pthai.el, which appears deprecated in later emacs versions - in pthai.el, work around apparent emacs bug when unsplitting and buffer name contains combining characters
Move all SUBST_STAGE to pre-configure
Update to 2.1 - fix initialization of default wordlist in pthai.el - add pthai-unique-word-count to pthai.el
Restore split-thai 2.0 package after moving source from pkgsrc to MASTER_SITE_LOCAL
Remove package. Apparently software without its own separate repository cannot be stored in pkgsrc
Update to version 2.0 - add new emacs module pthai.el, merging all emacs lisp code into it - rename files thaidict* to words* - modify other scripts to use new file names
*: Recursive revbump from textproc/icu-68.1
Update to 1.1 ------------- - in thai-utility.el update thai-word-table-in-p to use lookup-nested-alist, which is much faster and non-recursive
Update to 1.0 - small clean ups - add thai-word-table-in-p and split-thai-word to thai-utility.el
Update to 0.9 - add a perl script st-wordbreak
*: bump PKGREVISION for perl-5.32.
Update to 0.8 - add 'tgrep' perl script for grepping thai words
Update to 0.7 - don't skip strings with numbers [0-9] for st-icu
Update to 0.6 - handle numbers [0-9] better with emacs splitter
Update to 0.5 - clean up some comments and simplify lisp code a bit
Update to 0.4 - always use pkgsrc path for swath for st-swath script - make splitting of numbers a little more consistent for st-emacs & st-icu - add split-thai, split-thai-line, wrapper functions to emacs lisp code
Update to version 0.3 all changes for emacs splitter: - load custom dictionary first because 'thai-word-table is a defvar - add count function and return word counts for a few funcs - add lisp wrappers functions split-thai, split-thai-line which can split thai text in an emacs buffer using 'thai-break-words
Update to version 0.2 - generate the emacs dictionary once at build time, not every time the program is run - clean up the README
Add split-thai 0.1, a set of utilities for splitting Thai UTF8 text by word boundaries