POPfileAbout POPfile...

2005-09-10: Efficiency

After a first quick training phase, I can report the actual efficiency of this software tool. Strictly no problem here: 97,89% of successful identification of SPAM with a training on only a little less than 200 messages (655 good words, 14564 spam words).

The result is exactly what was expected and satisfying. Good for POPfile. It can be recommended on the basis of the detailed evaluation you'll find on the right (pros/cons).

2005-09-03: First filtering

As should be expected the first filtering/classification left all messages as unclassified, but after the first reclassification using the web interface, it corrected itself nicely.

For the moment, the only inconvenience seems to be the lack of a presentation of messages sorted by probability of being in the category they were assigned to (it's a neat feature of K9 that saves a lot of time in everyday management of the mail).

2005-09-02: Installation

The installation of POPfile under Windows is refreshingly simple. I feared that the Linux origin may be seen in a rough installation process. Nothing could be more wrong. It's zip. I appreciated the full recognition of all my email programs (and their automagic configuration):

POPfileBayesian filtering of SPAMs

For a long time now, I have been advising individual users to protect themselves with a software based on Bayesian statistical techniques. My prefered PC-Windows solution for this still goes to K9 from keir.net.

But the program at the origin of K9, POPfile, now has a specific Windows version. I am going to evaluate it in order to decide if it should go to my list of prefered downloads.

Some later news will be shown here.

Not far from POPfile...

POPfilePOPfile: pros and cons

While evaluating POPfile, here are the positive and negative sides I could detect. I let you know:

Pros:

  • Fully automated installation
  • Good filtering efficiency.
  • Able to handle more than SPAM (as many categories as you'd like)
  • Works with Windows from 98 to XP and with Linux.

Cons:

  • The presentation of messages (in order to reclassify them to avoid/correct errors) does not help in concentrating on the near-the-limit probabilities, despite that it would help management of large quantities of email (as in my case).
  • Sometimes, the User Interface (UI) of the Windows version will seem to stop working. The user manual says this is only temporary and happens when the software is classifying email. My experience shows it leads to restarting POPfile instead.

Download POPfile.

Paul Graham's plan

A (rather long) paper from Paul Graham about why fight against SPAM will necesarily use content evaluation and preferably Bayesian techniques.

I say the same thing for quite some time, but Paul is much clearer and argumented than I could be.

A plan for SPAM

Some older news archived at SpamAnti.net

SpamAnti.net

2010

July
June
May
April
February
January

2009

October
September
August
July
June
May
April
February
January

2008

December
November
October
September
August
July
June
May
April
March
February
January

2007

December
November
October
September
August
June
May
April
March
February

2006

December
November
October
September
August
July
June
May
April
March
February
January

2005

December
November
October
September
August
July
June
May
April
March
February
January

2004

December
November
October
September
August
July
June
May

2003

October
July
March
January

2002

December
November
October
September
August
July
June



http://www.SpamAnti.net/
Copyright (C) 1996-2008 - Yves Roumazeilles (all rights reserved)

Last update: 23-nov-08