[SURBL-Discuss] Ham corpora needed

Raymond Dijkxhoorn raymond at prolocation.net
Mon Sep 6 03:14:11 CEST 2004


Hi Ryan,

>  1.091   3.4773   0.0000    1.000   1.00    4.00  URIBL_OB_SURBL
>  5.258   9.0622   3.5181    0.720   0.29    3.00  URIBL_WS_SURBL
>  0.000   0.0000   0.0000    0.500   0.14    5.00  URIBL_AB_SURBL
>  0.000   0.0000   0.0000    0.500   0.14    2.00  URIBL_PH_SURBL
>  0.000   0.0000   0.0000    0.500   0.14    4.00  URIBL_SC_SURBL
>  0.265   0.3688   0.2169    0.630   0.00    1.00  URIBL_PJ_SURBL
>
> I don't have time to go through the results right now, but feel free:
>
> Ham that hit any URIBL rule: http://ry.ca/geturi/pc-ham-uribl.log (14K)
> Full ham log: http://ry.ca/geturi/pc-ham.log (340K)
> Full spam log: http://ry.ca/geturi/pc-spam.log (159K)

There were 9 'hits' on the PJ list, and all 9 were from the exact same
domain (partner2profit.com). I have whitelisted that one now; it was in
WS. Besides that one, there was not a single FP in that set for PJ. Next! :)
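
For anyone who wants to repeat that per-domain tally on another list, here
is a minimal sketch. It assumes each log line has the hypothetical layout
'message_path RULE domain', which is NOT necessarily the real format of
pc-ham-uribl.log. The function name and sample lines are made up for
illustration, so adjust the parsing to match the actual file.

```python
from collections import Counter


def tally_domains(lines, rule):
    """Count hits per domain for a single rule.

    Assumption: each line looks like 'message_path RULE domain'.
    This layout is hypothetical, not the real log format.
    """
    counts = Counter()
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        _path, hit_rule, domain = parts
        if hit_rule == rule:
            # Domain names are case-insensitive, so normalise before counting.
            counts[domain.lower()] += 1
    return counts


# Made-up sample lines in the assumed layout.
sample = [
    "ham/0001 URIBL_PJ_SURBL partner2profit.com",
    "ham/0002 URIBL_PJ_SURBL Partner2Profit.com",
    "ham/0003 URIBL_WS_SURBL example.com",
]

for domain, hits in tally_domains(sample, "URIBL_PJ_SURBL").most_common():
    print(hits, domain)
```

If one domain accounts for all the hits on a list, as happened here, it is
a single whitelist candidate to check rather than many separate FPs.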

> What you want to do is go through pc-ham-uribl.log, and check each
> message mentioned in the log in the SA public corpus to see if you have
> any FP candidates or not.

If someone has a couple of minutes, please look up the ones that should be
removed from WS.

Thanks!
Raymond.

