jducoeur: (Default)
jducoeur ([personal profile] jducoeur) wrote2008-10-31 12:17 pm
Entry tags:

Desired feature of the day:

A language flag for Gmail, that says, "Just mark everything in any Asian language as spam". My new crossert.com domain (which I'll be using for my consulting work) is immediately getting hit by a bunch of spam, all of it apparently in Korean.

Really, it would be a fine first-level filter. I don't believe I've ever received a single legitimate email that contained more than a few characters in any Asian language (none of which I read), and lots of spam that did, so as spam-filters go, it would be fine bang for the buck...

[identity profile] eclecticmagpie.livejournal.com 2008-10-31 04:38 pm (UTC)(link)
Interesting. I get a ton of foreign language spam, but only on one account -- the one I use for my tea dances. It was listed in plaintext on a folkdance directory and on the website of the church where the dances are held. My other accounts don't get this stuff in this kind of quantity. Who have you given your new address to?

[identity profile] dkapell.livejournal.com 2008-10-31 04:56 pm (UTC)(link)
before we swapped over to Google Apps for domains, [livejournal.com profile] knobhdy had this same issue with russian and japanese spam, which she solved by snagging the most common characters, and adding them to a filter which dumped them directly in the spam folder. It would be like filtering everything that contains an 'e' over to spam, it worked really nicely.

[identity profile] leanne-opaskar.livejournal.com 2008-10-31 05:00 pm (UTC)(link)
Wouldn't work for me. (: I have a subscription to the Japan Times online edition. (;

But as a flag, yes, that'd be useful for other people, I'm sure.

[identity profile] meranthi.livejournal.com 2008-10-31 06:16 pm (UTC)(link)
Well, most filters will allow you to set "Not this address" so it should still work. :)

[identity profile] be-well-lowell.livejournal.com 2008-10-31 06:02 pm (UTC)(link)
Spamassassin more or less does this for me. Bayesian filters tend to match undesired languages pretty quickly, too.