Open Discussion → My POPFile Stats

My POPFile Stats

Hello.

I've been using POPFile since 2004. I use 2 buckets, despite knowing that having multiple buckets leads to more accurate filtering. My logic was that I use POPFile for one reason - to differentiate between legitimate and illegitimate messages - and therefore wanted to train it to walk a fine line. I use Pegasus Mail with a domain that is 10+ years old, a personal e-mail address that gets way too much spam and a "spam" e-mail address (for newsletters and crap) that I am positive gets sold to just about everybody. For about a third of the past three years I have been using POPFile with "catch-all" (from a dozen or so domains) being sent to my "spam" e-mail.

In May of 2007 I reset my statistics because I had reached what I considered a fully functional filtering state. That is, to say, I hit 99% accuracy.

I meant to post my stats in this past May but moved cross country. Here they are. (ignore the "last reset" date, I just reinstalled POPFile and this database on a new computer)

http://www.redlinewhiteline.com/images/popfile.png

I find that my database has no trouble keeping up with the continual evolution of spam tactics. Every couple of weeks they try a new one that sneaks past POPFile's filtering and all I need do is flag one message as spam and I am "in the clear" again. One advantage of using POPFile with Pegasus is that by utilizing Pegasus's built-in whitelist it is extremely easy to catch false positives - it has been a good long while since a 'personal' e-mail skated by as 'spam'.

I realize most of this info is probably utterly useless, but I thought I would post about my setup.

Thanks for making such an awesome program.

Bryce

  • Message #1353

    I realize most of this info is probably utterly useless, but I thought I would post about my setup.

    It is not useless, it makes a nice change from getting complaints!

    On my system I have one spam bucket plus six others for good mail and my accuracy is similar to yours:
    104,119 msgs, 460 errors, 99.56% accuracy

    Some users have configured POPFile to send some basic statistics (the number of buckets, the number of messages that POPFile has classified and the number of classification errors) and these reports are summarised on the POPFile Real Time Statistics page.

    Brian

  • Message #1355

    I have been using POPfile since 8/2004 and love it. I reset my stats at the end of February in 2006. Since then POPFile has processed 921,453 emails into 80-100 buckets with 99.62% accuracy. 97.75% of those emails were spam.
    POPFile started on a windows xp box and then switched to a Liunx box w/Thunderbird sometime in late 04 or early 05. Right now it resides on an Ubuntu 9.04 box with Thunderbird for an email client.

    Thanks for the outstanding application

  • Message #1365

    I have 3 buckets (spam, inbox and dm) and I've received 7,251 messages since Feb 27, 2010.
    Meanwhile, POPFile has sorted 53 messages into wrong buckets and it's accuracy is 99.26%.

    Naoki

  • Message #1381

    I'd like to contribute with my stats. I have 7 buckets and since 4th March (when I reset the stats after a month of training over 547 emails with 32 errors) I received 4565 emails and got 111 errors with an accuracy of 97.56%. The overall statistics including the initial training is 97.20% (from 2 February 2010). In the last 30 days I received 929 emails with 24 errors with a rate of 97.42%.

    Ciao

    Paolo