Just released SpamCopURI 0.21:
0.21 Wed Aug 4 16:41:31 PDT 2004
- Defaulting config param spamcop_uri_limit to 20. If
the number of URLs in a message exceeds the limit, they
are shuffled and only up to the number specified for the limit
are tested.
- New syntax for specifying the address match for multi.surbl.org:
Instead of '127.0.0.0/2' use '127.0.0.0+2' this is being
done to eliminate any confusion with CIDR syntax.
The old '/' syntax is still supported, but will be deprecated
in a future version.
http://sourceforge.net/projects/spamcopuri/
Package has also been upload to CPAN as well.
--eric
ws.surbl.org now has some additional data sources included in
it: MailSecurity's formerly proprietary SURBL lists, data from
Joe Wein's jwSpamSpy Windows POP mail spam-filtering agent, and
data from Raymond Dijkxhoorn's Prolocation RBL. Thanks for
providing those Joe, MailSecurity and Raymond! Here are some
links to more information about those sources:
http://www.surbl.org/lists.html#wshttp://www.mailsecurity.net.au/http://www.joewein.de/sw/jwSpamSpy/index.htmhttp://www.prolocation.net/
jwSpamSpy is a POP3 spam scanner for Windows. jwSpamSpy looks
like it should have pretty good spam performance since Joe's
URI domain blacklist has a lot of overlap with some of the
ws.surbl.org data plus many other quickly added entries as
described at:
http://www.joewein.de/sw/spam-bl.htm
Those new data sources join entries in ws from three SpamAssassin
rulesets: Bill Stearns' sa-blacklist, BigEvil.cf from Chris
Santerre and his SARE cohorts, and Paul Barbeau's MidEvil.cf.
Other manual lists are also included in WS, most of which are
being processed and hand-checked by Chris Santerre.
ws now has about 30k records, and here are some statistics
from Raymond's mail system:
SpamAssassin tag hits: (top 100)
#1 64675 BAYES_99
#2 54198 HTML_MESSAGE
#3 44694 RCVD_IN_SBL+XBL
#4 43427 RCVD_IN_BL_SPAMCOP_NET
#5 37191 RCVD_IN_SORBS
#6 35888 WS_URI_RBL
#7 33729 SPAMCOP_URI_RBL
#8 33089 MIME_HTML_ONLY
#9 32485 OUTBLAZE_URI_RBL
#10 26678 RCVD_IN_DSBL
#11 19665 RCVD_IN_AHBL
#12 19662 RCVD_IN_DYNABLOCK
#13 19146 CLICK_BELOW
#14 18374 ABUSEBUTLER_URI_RBL
WS, SC and OB are detecting roughly the same numbers of spams,
though they may be somewhat different spams.
Thanks to everyone who is providing data, checking data or
otherwise helping with the SURBL project!
Please remember to report false positives in SURBLs back
to us on the SURBL discussion list or whitelist at surbl dot org
http://lists.surbl.org/mailman/listinfo/discuss
Jeff C.
--
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/
Just released SpamCopURI 0.20. Biggest change is support for multi.surbl.org.
Let me know if you see anything strange. See the change notes below
for what you need to do for your config.
0.20 Sat Jul 31 22:02:20 PDT 2004
- adding max url config param to limit number of URLs checked
in an email. Usage (place into .cf file):
spamcop_uri_limit 50
Default is unlimited.
- adding support for multi.surbl.org / bitmasked results.
query results are cached on a per msg basis to prevent additional
lookups.
Modify your configuration to look like the following for sc.surbl.org:
uri SPAMCOP_URI_RBL eval:check_spamcop_uri_rbl('multi.surbl.org','127.0.0.0/2')
describe SPAMCOP_URI_RBL URI's domain appears in spamcop database at sc.surbl.org
tflags SPAMCOP_URI_RBL net
ws.surbl.org would look like this:
uri WS_URI_RBL eval:check_spamcop_uri_rbl('multi.surbl.org','127.0.0.0/4')
describe WS_URI_RBL URI's domain appears in ws database at ws.surbl.org
tflags WS_URI_RBL net
- Removed configuration params: spamcop_uri_src
and spamcop_uri_path since
these should never be used anymore.
- added cleanup for hosts that come in with a dot in front of
of the host (e.g. http://.spammy-site.org)
http://sourceforge.net/projects/spamcopuri/
--eric
We've updated the SURBL Lists document to add information about
how to contact MailSecurity with reports about false positives in
ph, the anti-phishing list included (only) in multi.surbl.org:
http://www.surbl.org/lists.html#ph
"Please report false positives on the ph list to: postmaster
at corp.mailsecurity.net.au. Be sure to include supporting
reasons explaining why the domain should not be listed."
Jeff C.
--
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/
We've updated the SURBL Lists document to add information about
how to contact OutBlaze with reports about false positives in
ob.surbl.org:
http://www.surbl.org/lists.html
"Please report false positives found in the ob list to
postmaster at outblaze dot com. Be sure to include a
supporting email explaining why the domain should not
be listed."
Jeff C.
--
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/
We'd like to welcome and thank another new public name server
administered by:
Lindsay Snider of pa.net
Without the help of all of our public nameservers and their
administrators, SURBLs would not be possible.
Thanks all!
Jeff C.
--
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/
We'd like to welcome and thank a new public name server
administered by:
Mike Atkinson of kconline.com
Without the help of all of our public nameservers and their
administrators, SURBLs would not be possible.
Thanks Mike, kconline and all!
If anyone else would like to provide DNS for SURBLs to the
Internet community, please contact us at rsync at surbl.org. :-)
Jeff C.
--
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/
We are pleased to announce the availability of three new SURBL
lists:
ob.surbl.org - OutBlaze spamvertised sites
ab.surbl.org - AbuseButler spamvertised sites
multi.surbl.org - Combined SURBL list
ob is a large list of about 20k spamvertised sites kindly
provided by OutBlaze and based on data found in their spam traps.
ob has a strong spam detection rate of around 70% and a low
false positive rate around 0.1%. Only domains that have been
registered within the past 90 days are included in ob.surbl.org
and this "newness" requirement is probably one of the reasons
for the low FP rate, given how quickly spammers use and must
discard domains for their web sites.
ab is a smaller list of the top 425 or so spamvertised sites
over the past 7 days kindly provided by AbuseButler:
http://spamvertised.abusebutler.com/
AbuseButler's data sources include SpamCop and native reports.
In general philosophy and data processing styles, ab.surbl.org
is similar to my own sc.surbl.org which is also based on SpamCop
data, and the results of both lists are similar, but not identical.
multi is a bitmask-combined version of all of the other lists
plus an anti-phishing list provided by Mail Security:
http://www.mailsecurity.net.au/
The latter is identified as "ph" and it is not available as
a separate list as its size probably does not justify the
resources of its own zone file, etc. However the data in ph
is important since it represents sites likely to be criminally
phishing for personal and financial information.
Because list membership in multi.surbl.org is encoded in a
bitmasked fashion, results from multi need to be decoded into
their constituent lists by programs such as urirhssub in
SpamAssassin 3.X. Support for this decoding may be back-ported
into the SpamAssassin 2.63 program for using SURBLs, SpamCopURI.
We expect that multi.surbl.org will become the list of choice
for SURBL use going forward, due to the convenience of getting
all list data in a single list, only needing to cache a single
zone file, etc.
More information about the all SURBL lists can be found at:
http://www.surbl.org/lists.html
Please feel free to ask questions or leave comments about
SURBLs on our discussion list at:
http://lists.surbl.org/mailman/listinfo/discuss
Please let us know about any false positives at:
whitelist at surbl dot org
Cheers,
Jeff C.
--
Jeff Chan
mailto:jeffc@surbl.org
http://www.surbl.org/