Up to [cvs.NetBSD.org] / pkgsrc / mail / spamprobe
Request diff between arbitrary revisions
Keyword substitution: kv
Default branch: MAIN
mail: Replace RMD160 checksums with BLAKE2s checksums All checksums have been double-checked against existing RMD160 and SHA512 hashes The following distfiles were unfetchable (possibly fetched conditionally?): ./mail/qmail/distinfo netqmail-1.05-TAI-leapsecs.patch
mail: Remove SHA1 hashes for distfiles
Add SHA512 digests for distfiles for mail category Problems found locating distfiles: Package mutt: missing distfile patch-1.5.24.rr.compressed.gz Package p5-Email-Valid: missing distfile Email-Valid-1.198.tar.gz Package pine: missing distfile fancy.patch.gz Package postgrey: missing distfile targrey-0.31-postgrey-1.34.patch Package qmail: missing distfile badrcptto.patch Package qmail: missing distfile outgoingip.patch Package qmail: missing distfile qmail-1.03-realrcptto-2006.12.10.patch Package qmail: missing distfile qmail-smtpd-viruscan-1.3.patch Package thunderbird24: missing distfile enigmail-1.7.2.tar.gz Package thunderbird31: missing distfile enigmail-1.7.2.tar.gz Otherwise, existing SHA1 digests verified and found to be the same on the machine holding the existing distfiles (morden). All existing SHA1 digests retained for now as an audit trail.
Fixes build with giflib-5.1.
Pullup ticket #4285 - requested by htodd mail/spamprobe: build fix Revisions pulled up: - mail/spamprobe/distinfo 1.18 - mail/spamprobe/patches/patch-src_parser_GifParser.cc 1.1 ------------------------------------------------------------------- Module Name: pkgsrc Committed By: htodd Date: Sun Jan 5 00:29:35 UTC 2014 Modified Files: pkgsrc/mail/spamprobe: distinfo Added Files: pkgsrc/mail/spamprobe/patches: patch-src_parser_GifParser.cc Log Message: Fix build for changed DGifOpen in giflib. To generate a diff of this commit: cvs rdiff -u -r1.17 -r1.18 pkgsrc/mail/spamprobe/distinfo cvs rdiff -u -r0 -r1.1 \ pkgsrc/mail/spamprobe/patches/patch-src_parser_GifParser.cc
Fix build for changed DGifOpen in giflib.
Compator function must be const.
Fix template look up.
Fix build with newer GCC
Fix build with png-1.4.x. Addresses PR 43477.
Fix building with db5; revision bump for db4 update
Update to 1.4d (previous 1.4.2 was 1.4b). Take MAINTAINERship. 2006-11-16 Brian Burton <brian@burton-computer.com> * Released as 1.4d * configure.ac: Added ability to selectively disable image processing using --without-gif, --without-jpeg, and/or --without-png. * src/spamprobe/spamprobe.cc (set_headers): Added ability to selectively ignore individual headers using -H-headername. * src/includes/Ptr,Ref,Array.h: Restored missing <cassert> include. * src/parser/PngParser.cc (tokenizeImage): added basic tokens from PNG images. 2006-11-16 Brian Burton <brian@localhost.localdomain> * src/parser/PngParser.cc (PngParser): Stub for PNG parsing using libpng. * src/parser/JpegParser.cc (tokenizeMarker): Preliminary implementation of jpeg parsing using jpeglib. * configure.ac: Auto detect of either libungig or libgif depending on which one is available. 2007-01-04 Brian Burton <brian@burton-computer.com> * Released as 1.4c * spamprobe.1: Modified man page to remove unnecessary informaton and make it more conformant with man page conventions. * src/spamprobe/spamprobe.cc (process_extended_options): added ignore-body option. * src/parser/HeaderPrefixList.cc (HeaderPrefixList::addHeaderPrefix): Forced header prefixes and names to lower case instead of relying on an assert to enforce the restriction. * src/database/FrequencyDBImpl_hash.cc (hash::FrequencyDBImpl_hash): Disabled experimental hash database auto-cleaning. * src/includes/Ref.h: Removed cassert include. * src/spamprobe/spamprobe.cc (process_extended_options): Added whitelist option to allow use of SP as a bayesian white list in conjunction with other filters.
Fix build with gcc4.
Update spamprobe to 1.4b (PKGNAME changed to 1.4.2). Changes in 1.4a: This release fixes a bug in decoding malformed base64 encoded data that manifested itself in some architectures. Also corrects a typo in the online help. Some code cleanup and minor performance tweaks were added as well. Changes in 1.4b: This release fixes pair of bugs related to messages with no lines in their bodies. No new features were added.
Update spamprobe to version 1.4. Submitted by Ossi 'ohub' Herrala <oherrala@ee.oulu.fi> in private mail as discussed on IRCnet #NetBSD. Changes to software: * Spamprobe has a config file! * Added comprehensive command line option help. * Added support for maildir directories to all file based commands. * Added support for auto-purge (-P command line option). * Commands create-db and create-config to greate database and config file. * Added gif parser support using libungif to analyzes image attachments and derive useful information from them. Changes to package: * bdb is not in SUGGESTED_OPTIONS anymore. * added depency to graphics/libungif and gif is in SUGGESTED_OPTIONS.
Update spamprobe to 1.2a. SpamProbe changes: - New email parsing implementation - New hash database, in addition to Berkeley DB (bdb/db4) (=> Hopefully no more "caught runtime exception: get: DB_PAGE_NOTFOUND: Requested page not found" on Linux every other day!) - And much more, check the ChangeLog for accurate information pkgsrc changes: - Added db4 database backend to be optional (not requirement anymore) Patches sent by the maintainer Ossi Herrala <oherrala@ee.oulu.fi> in private mail.
Add RMD160 digests.
Update spamprobe to 1.0a, patch sent via IRC by the maintainer. Changes: * MimeLineReader.cc: 1.0 branch - fixed MBX record header regex * spamprobe.cc (main): Added exec and exec-shared commands. (import_words): modified import command to allow negative values to be specified in the import file. * Applied patches for configure.in and aclocal.m4 contributed by Siggy Brentrup for debian compatibility. * FrequencyDBImpl_pbl.cc: Invokes new WordData methods to allow storing data in big endian format. * WordData.h: Added optional support for storing counts/flags in big endian order for data portability. * MimeLineReader.cc (readMBXFileHeader): UW IMAP MBX file format is now auto detected from the first line of the mailbox file. * spamprobe.cc (process_extended_options): Removed -o imap-mbx option. * spamprobe.cc (process_extended_options): Added -o imap-mbx option to process files as WU-IMAP MBX files rather than mbox files. * MimeLineReader.cc (readLine): Added support for WU-IMAP MBX file format. * spamprobe.cc (process_stream): Added -o tokenized option to allow people to use an external tokenizer with spamprobe. * SpamFilter.cc (scoreToken): Reduced sorting overhead by pre-computing and integer sort value with sorting priorities reflected in the value. This eliminates several calculations inside of the sort routine. * SpamFilter.cc (computeRatio): Capped ratios in calculations to within MIN_PROB and MAX_PROB. Widened that range. This avoids problems with div/0 and makes it easier to sort terms. * spamprobe.cc (dump_words): dump command can now optionally accept a regular expression as an argument and will only dump terms matching the regular expression. (purge_terms): Added purge-terms command to purge from the database all terms matching a regular expression. * spamprobe.cc (main): Fixed bug in command line processing. Thanks to Jem for bug report. * spamprobe.cc (train_on_message): Code simplified. Eliminated redundant recalculation of scores. (train_on_message): Timestamps are now longer updated by train-spam and train-good commands. They are still updated by train command. (main): Fixed assertion if -P option is specified in a read only operation. * spamprobe.cc (main): Added -C command line option to allow users to specify their own min word count. * SpamFilter.cc (SpamFilter): Set default minimum word count back to 5 (was 3). * spamprobe.cc (process_extended_options): Removed "alt-score" from -o options list because it distributes scores poorly. New formula achieves the same end with better accuracy. Added "orig-score" option to allow people to continue using the old formula. Added "honor-xstatus-header" option for people whose mail server uses X-Status: rather than Status: for the deleted flag. (main): Added -l command line option to allow people to set their own spam threshold if they don't like the default value. * SpamFilter.cc (scoreMessage): Added a new scoring formula based on Paul's but taking the nth root of spam and good probabilities to produce more evenly distributed scores. Lowered the spam threshold to 0.6 to keep accuracy about the same as the original formula. Highest score seen for a ham so far in tests is 0.44 so 0.6 seems safe. Made the new formula the default instead of Paul's.
Update spamprobe to 0.9h. ChangeLog: * New manual page * spamprobe.cc (process_stream): Added -o tokenized option to allow people to use an external tokenizer with spamprobe. * SpamFilter.cc (scoreToken): Reduced sorting overhead by pre-computing and integer sort value with sorting priorities reflected in the value. This eliminates several calculations inside of the sort routine. * SpamFilter.cc (computeRatio): Capped ratios in calculations to within MIN_PROB and MAX_PROB. Widened that range. This avoids problems with div/0 and makes it easier to sort terms. * spamprobe.cc (dump_words): dump command can now optionally accept a regular expression as an argument and will only dump terms matching the regular expression. (purge_terms): Added purge-terms command to purge from the database all terms matching a regular expression. Patch submitted by Ossi Herrala <PGP: 0x78CD0337> in private email.
Update spamprobe to 0.9g2. Patch provided by Ossi Herrala <$MAINTAINER> in private mail. CHANGED: * This release moves from Paul's original formula to a slightly modified one that yields more evenly distributed scores. To continue using old formula, use -o orig-score command line option. ADDED: * -C <number> command line option. This tells SpamProbe to assign a default, somewhat neutral, probability to any term that does not have a weighted (good count doubled) count of at least specified number in the database. This prevents terms which have been seen only a few times from having an unreasonable influence on the score of an email containing them. Default count have changed. It is now 5. Old was 3. * Added -o <option_name> command line option to specify alternate way of scoring. Consult README.txt for more info. * Added -l <number> command line option. Changes the spam probability threshold for emails from the default (0.7) to <number>. * Added tokenize command. Prints the tokens found in the file one word per line in human readable format with spam probability, good count, spam count, message count, and word in columns separated by whitespace. IMPROVED: * -H command line option to add more headers to scan. * Improved performance by removing some redundant calculations and reducing the amount of I/O in train-* mode.
Fix typos in the manual page.
Update to spamprobe-0.9e. Too many changes to list, the old pkg was from 2002(!) Update probided by ohub @ #NetBSD/IRCnet, buildlink3 code OK'd by jlam.
Initial import of mail/spamprobe (version 0.8b) Spamprobe is a fast, intelligent, automatic spam detector using Bayesian analysis of terms contained in emails. Works with procmail, maildrop or a similar tool to produce a complete server or client side spam filtering system. Provided by Daniel Farrugia in PR#20286, buildlinkified by me.
Initial revision