[SURBL-Discuss] Whitelist Please
Justin Mason
jm at jmason.org
Thu Sep 9 09:00:38 CEST 2004
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Jeff Chan writes:
> On Wednesday, September 8, 2004, 8:02:33 PM, Justin Mason wrote:
> > Jeff Chan writes:
>
> >> Here's a definition (note there is no H in the name):
> >>
> >> http://www.stopspam.org/usenet/mmf/breidbart.html
> >>
> >> "The BI is a measure of how spammy a spammed news article is. It
> >> is the sum of the square root of the number of groups each copy
> >> of a spam article is posted to. So if you post 10 copies of an
> >> article, each cross-posted to 4 groups, the BI is 20. Other ways
> >> of reaching the BI=20 mark (a threshold used by some cancellers)
> >> are to post 20 copies, each to just one group, 4 copies to 25
> >> groups each, or 8 articles to 6 groups each and one more to just
> >> one group. (for BI=20.6)"
>
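The arithmetic in the quoted definition can be checked with a short sketch.
The function name breidbart_index and the list-of-group-counts input are
illustrative assumptions, not part of the quoted page; the code only applies
the stated rule (sum, over every posted copy, of the square root of the number
of groups that copy went to).

```python
import math


def breidbart_index(group_counts):
    """Sum of sqrt(groups) over every posted copy of an article.

    group_counts: one entry per copy, giving how many newsgroups
    that copy was cross-posted to (hypothetical input format).
    """
    return sum(math.sqrt(n) for n in group_counts)


# The four cases from the quoted definition.
ten_by_four = breidbart_index([4] * 10)        # 10 copies, 4 groups each
twenty_by_one = breidbart_index([1] * 20)      # 20 copies, 1 group each
four_by_25 = breidbart_index([25] * 4)         # 4 copies, 25 groups each
mixed = breidbart_index([6] * 8 + [1])         # 8 copies to 6 groups, 1 to 1

print(ten_by_four, twenty_by_one, four_by_25, round(mixed, 1))
```

The first three cases land on the BI=20 mark, and the mixed case rounds to
20.6, matching the figures in the definition.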
> > As a matter of interest -- and I should just ask Seth Breidbart ;) -- does
> > this deal with hashbusters? i.e., if a message is 80% hashbuster strings,
> > and 20% payload, it's not so easy to automate BI calculation. (cf. DCC,
> > Pyzor, Razor, AOL's paper at CEAS, et al.)
>
> > - --j.
>
> I'm not sure I understand the question. It seems to me that
> BI is a calculation based on counts of crossposting per message
> and does not consider content.
>
> I guess you're saying that detection of multiple postings could
> be thrown off by hash busting, when the crossposting is done
> by posting to different newsgroups individually and not overtly
> listed in the headers.
yep.
- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS
iD8DBQFBP/+WQTcbUG5Y7woRAnCFAJ9t+A64Cr6d/1Womc/4SbiCuIxLWgCdFeb5
D0YJgisYA8EGMDNKIeLgDeQ=
=+Y3X
-----END PGP SIGNATURE-----
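The point agreed on in the exchange above (that hash busting can throw off
detection of multiple postings when each copy is posted individually) can be
illustrated with a rough sketch. Everything here is an assumption made for
illustration: the thread does not say how cancellers identify copies, so the
sketch simply groups posts by an exact SHA-1 digest of the body and computes a
BI per group. The function name bi_by_digest, the sample payload, and the
made-up hashbuster tokens are all hypothetical.

```python
import hashlib
import math
from collections import defaultdict


def bi_by_digest(posts):
    """Group posts by exact body digest and compute a BI per group.

    posts: iterable of (body, number_of_groups) pairs (assumed format).
    Exact-digest matching is an assumption, not a method from the thread.
    """
    totals = defaultdict(float)
    for body, n_groups in posts:
        digest = hashlib.sha1(body.encode("utf-8")).hexdigest()
        totals[digest] += math.sqrt(n_groups)
    return dict(totals)


payload = "Buy now at example.invalid"

# 20 byte-identical copies, each posted to a single group.
identical = [(payload, 1)] * 20

# The same payload, but each copy carries a different junk token.
busted = [(f"{payload}\nxq{i:04d}zt", 1) for i in range(20)]

same = bi_by_digest(identical)
varied = bi_by_digest(busted)

print(len(same), max(same.values()))      # one group holding all 20 copies
print(len(varied), max(varied.values()))  # 20 groups of one copy each
```

With identical bodies the copies collapse into one group that reaches BI=20.
With a varying token in each copy, exact matching sees 20 unrelated articles
with a BI of 1 apiece, so the count the BI depends on is never assembled.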