Category Archives: Tech

Anti-Bayesian spam

If you get spam, you’ll probably have noticed the recent trend towards spam full of random words. This is intended to defeat Bayesian anti-spam filters which mark emails as spam if they have words in them which are found in other spam. And it does generally seem to get through SpamAssassin which is the filter I’m using on my mail server.

The reason it gets by is that the random words e.g. “storm antiquated biaxial genevieve askew evensong compressor foothill ludwig eyeglass irwin delano narcissist calumny messrs dan begin oratorical depict platitude“, are not in themselves spam-like. However, real non-spam email never has strings of words like that. Of the anti-Bayesian spam I’ve seen, none actually looks like real mail; there’s no grammar and too many consectutive long words (or maybe the people who send me email have small vocabularies?). There are too few occurances of “the”, “it”, “and” etc.

The next step towards defeating this spam could be to perform basic lexical analysis of the content, to see whether it looks like real text. There are a number of problems with this though:

  • Non-English languages may be much harder to handle.
  • The spammers may start including random sections of real text
  • It’s yet another load on the spam filters

The blacklists have taken a battering over the past few months, which various viruses being targetted at bringing them down, but they’re still one of the best weapons we have to stop spam.

Most of my spam arrives through Demon as my mail server blocks blacklisted hosts. I use fetchmail to pull my Demon mail onto my server, where it’s passed through SpamAssassin . SpamAssassin marks the stuff that it thinks is spam, which is then dumped into an IMAP folder for later checking. With Demon finally annoucing that they’re putting in some spam filtering, the level of spam I get should drop off even more.

Update: Just after posting this I got my first piece of spam marked with the Habeas mark which is a short verse of trademark poetry used to indicate that a sender is trustworthy. I’ve reported the spam to Habeas; the idea being that illegal use of their trademark means that they can sue the spammer. We’ll see…

From Outlook to Apple Address Book

I’ve been trying to consolidate the address books on my various machines (Windows PC running Outlook 2002, Apple Powerbook running Address Book, and Sony Clie UX50). Being the nice program that it is, Outlook 2002 will let you export contacts as vCards. Great! Address Book will read those. However, Microsoft don’t want to be seen making it easy for people to migrate away from their programs so they only let you export to vCards one contact at a time. I could learn VBA and write a script to export all my contacts, but, frankly, I have better things to do with my time. Fortunately, someone else has encountered much the same problem, though they were creating vCards for import into Evolution So, if you want to export multiple vCards, for importing into another contact manager, I can recommend Outport which does exactly that (and also handles all of the other data stored in Outlook).

I’ve now consolidated everything onto the Mac, and using Missing Sync I can sync that with my Clie, and my Sony Ericsson T68i.
If only I hadn’t somehow managed to delete all the birthday dates from Outlook’s calendar…