[SURBL-Discuss] mbox parser
Jose Marcio Martins da Cruz
Jose-Marcio.Martins at ensmp.fr
Tue May 18 13:48:11 CEST 2004
Yes, but you can be very careful with extracted URIs. Many spams come
with poisonned URIs. Some of them are good sites (I even found
unesco.org in a porn spam). Some of them come with hundreds of random
generated URIs and others come with nonexistent URIs, usually on the
unsubscribe web page.
Justin Mason wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
>
> SpamAssassin 3.0.0's "mass-check --loguris" works quite well ;)
>
> - --j.
>
> Jeff Chan writes:
>
>>On Monday, May 17, 2004, 5:07:32 PM, David Coulson wrote:
>>
>>>I've got a decent mailbox containing a variety of spam e-mail. Is there
>>>a nice little Perl script out there which will spit out the URLs so I
>>>can submit them to Bill's list?
>>
>>Hi David,
>>Someone else asked about this recently, saying he could not find
>>a good message body URI parser. Presumably the reason is that
>>it's a little more complicated than it may seem at first, given
>>the need to decode MIME, weird cases, etc. I suggested starting
>>with some of the code form SpamCopURI or urirhsbl from the SA 3.0
>>URIBL module.
>>
>>Jeff C.
>>
>>_______________________________________________
>>Discuss mailing list
>>Discuss at lists.surbl.org
>>http://lists.surbl.org/mailman/listinfo/discuss
>>
>>
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.2.4 (GNU/Linux)
> Comment: Exmh CVS
>
> iD8DBQFAqdWBQTcbUG5Y7woRAi3FAKC/MOpFwDjuCWrzpGESCr4NbQy4wACfeTcX
> PmBsVtbPuxa56CTiN8BhrSI=
> =dJjR
> -----END PGP SIGNATURE-----
>
> _______________________________________________
> Discuss mailing list
> Discuss at lists.surbl.org
> http://lists.surbl.org/mailman/listinfo/discuss
>
>
--
---------------------------------------------------------------
Jose Marcio MARTINS DA CRUZ Tel. :(33) 01.40.51.93.41
Ecole des Mines de Paris http://j-chkmail.ensmp.fr
60, bd Saint Michel http://www.ensmp.fr/~martins
75272 - PARIS CEDEX 06 mailto:Jose-Marcio.Martins at ensmp.fr
More information about the Discuss
mailing list