We are now removing any subdomains (third or greater level domains or host names) from generic TLDs:
using a "new"-style regular expression:
s/^([^.]*.)+([^.]*).(com|net|org|edu|mil|biz|info|int|arpa|name|museum|coop|aero|pro)$/\2.\3/
It seems to do the right thing, both on test cases and the actual data, so it's now live on all the SURBL lists. If anyone sees any problems with this regex, please let me know.
Bill's domains from sa-blacklist are already in the correct form :-) and have no subdomains on these gTLD domains going into ws.surbl.org. I added it also to sc.surbl.org and be.surbl.org which did get rid of a few errant records. Subdomains are now properly removed in be and sc, as they should have been.
This may result in slightly better matching on both be and sc since the clients are supposed to be doing similar things with domains found in message URIs.
Jeff C.