bcholmes: (Default)
BC Holmes ([personal profile] bcholmes) wrote2004-09-20 10:00 am

Changing the World With Metrics

Since last year, I've been pestering the e-mail guy at work to install Spam-filtering software, since I get inundated with the stuff.

It's pretty bad. My bcholmes.org domain kicks out the most obvious spam, and I probably get about 5 to 10 pieces of spam a day. But work is just ridiculous. It doesn't help that we use the worst of all possible e-mail clients, so there's only so much I can do about, say, turning off images and not running scripts in e-mail.

Some time in May, I started tracking how much spam I get. It was over 120 pieces of spam a day. I kept tracking this in a graph, and would periodically print out the graph and drop in on my boss' desk.

I had the active e-mail address that received the most spam in our office (I figure because I once posted some questions to a Java newsgroup -- my e-mail was harvested from there). There were four inactive addresses that received more spam than me, but I was top of the list for active e-mail addresses.

Some time in August, we installed spam-filtering software, and now I get around one spam a day.

ext_28663: (Default)

[identity profile] bcholmes.livejournal.com 2004-09-20 04:32 pm (UTC)(link)
So this was the first time I'd heard of Bayesian analysis as a method of Spam filtering. What I found interesting was that when I Googled "Bayesian", Google gave me an ad to work for Google.