Re: [SURBL-Discuss] Whitelist Please

9 Sep 2004


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Jeff Chan writes:
...
On Wednesday, September 8, 2004, 8:02:33 PM, Justin Mason wrote:
...
Jeff Chan writes:
...
...
Here's a definition (note there is no H in the name):
http://www.stopspam.org/usenet/mmf/breidbart.html
"The BI is a measure of how spammy a spammed news article is. It
is the sum of the square root of the number of groups each copy
of a spam article is posted to. So if you post 10 copies of an
article, each cross-posted to 4 groups, the BI is 20. Other ways
of reaching the BI=20 mark (a threshhold used by some cancellers)
is to post 20 copies, each to just one group, 4 copies to 25
groups each, or 8 articles to 6 groups each and one more to just
one group. (for BI=20.6)"
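
The arithmetic in that definition can be checked with a few lines of Python. This is only a sketch of the formula as quoted above (sum over copies of the square root of the number of groups each copy is posted to); the function name `breidbart_index` and the list-of-group-counts input are my own choices for illustration, not part of any canceller's code.

```python
import math


def breidbart_index(groups_per_copy):
    """Return the BI for a list holding the group count of each posted copy.

    Hypothetical helper: implements only the quoted definition, i.e. the
    sum of sqrt(number of groups) over every copy of the article.
    """
    return sum(math.sqrt(n) for n in groups_per_copy)


# The four scenarios from the quoted definition.
scenarios = {
    "10 copies, 4 groups each": [4] * 10,
    "20 copies, 1 group each": [1] * 20,
    "4 copies, 25 groups each": [25] * 4,
    "8 copies to 6 groups, plus 1 copy to 1 group": [6] * 8 + [1],
}

for label, copies in scenarios.items():
    print(f"{label}: BI = {breidbart_index(copies):.1f}")
```

The first three scenarios come out at 20.0 and the last at 20.6, matching the figures in the quote.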
...
As a matter of interest -- and I should just ask Seth Breidbart ;) -- does
this deal with hashbusters?  I.e., if a message is 80% hashbuster strings
and 20% payload, it's not so easy to automate BI calculation.  (cf. DCC,
Pyzor, Razor, AOL's paper at CEAS, et al.)
...

--j.

I'm not sure I understand the question.  It seems to me that
BI is a calculation based on counts of crossposting per message
and does not consider content.
I guess you're saying that detection of multiple postings could
be thrown off by hash busting, when the crossposting is done
by posting to different newsgroups individually rather than by
listing them overtly in the headers.
yep.
- --j.
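
To make the point above concrete, here is a rough sketch of how hash busting could defeat a naive automated BI count. It assumes copies are recognised by an exact SHA-1 of the body, which is purely an illustrative assumption and not how DCC, Pyzor or Razor work; the helper names, the payload text and the 40-character junk length are all hypothetical.

```python
import hashlib
import math
import random
import string
from collections import defaultdict


def bi_by_exact_hash(postings):
    """Group postings by SHA-1 of the body and return {digest: BI}.

    postings is a list of (body, number_of_groups) pairs. Exact-hash
    grouping is an assumption made for this sketch only.
    """
    scores = defaultdict(float)
    for body, n_groups in postings:
        digest = hashlib.sha1(body.encode("utf-8")).hexdigest()
        scores[digest] += math.sqrt(n_groups)
    return dict(scores)


def add_hashbuster(payload, rng, length=40):
    """Append a random lowercase string so each copy hashes differently."""
    junk = "".join(rng.choice(string.ascii_lowercase) for _ in range(length))
    return payload + "\n" + junk


rng = random.Random(0)  # fixed seed so the run is repeatable
payload = "Buy now at example.invalid"

# 20 identical copies, each posted separately to one group.
identical = [(payload, 1)] * 20
# The same 20 postings, each with its own hashbuster string appended.
busted = [(add_hashbuster(payload, rng), 1) for _ in range(20)]

print(max(bi_by_exact_hash(identical).values()))
print(len(bi_by_exact_hash(busted)), max(bi_by_exact_hash(busted).values()))
```

With identical bodies all 20 postings fall into one bucket and reach BI=20. With hashbusters each posting lands in its own bucket with BI=1, so nothing crosses the threshold unless the grouping step is fuzzy enough to ignore the junk.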
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFBP/+WQTcbUG5Y7woRAnCFAJ9t+A64Cr6d/1Womc/4SbiCuIxLWgCdFeb5
D0YJgisYA8EGMDNKIeLgDeQ=
=+Y3X
-----END PGP SIGNATURE-----
