Help → magnet bug
I have a magnet ".gov" in the from field. However, POPfile sent spam from ".gov.ve" also into my inbox. I thought ".gov" and ".gov.ve" has significant diferences.
-
Message #1311
The MAGNETS page in POPFile's User Interface (UI) has some examples showing how magnets work.
If you create a "From" magnet using John this will match all cases where the text John appears in the "From" field. If you create a "Subject" magnet using hello it will match all messages with the text hello in the "Subject".
This explains why your .gov magnet is also matching messages with .gov.ve addresses.
ticket:76 notes that the current magnet detection code could be improved by ensuring that a magnet for the email address test@example.com does not match the address nottest@example.com. I think this proposed change could also ensure addresses ending in .gov.ve do not match a magnet for .gov (i.e. the magnet means match only addresses ending in .gov)
I am not familiar with this code so I don't know how much work would be involved in making these changes.
Sometimes it is better to create a filter rule in the email client instead of a POPFile magnet.
We recommend you use magnets carefully. If magnets are used, POPFile cannot not learn enough from your e-mail messages. For more information, please see:
http://getpopfile.org/docs/glossary:amagnet
http://getpopfile.org/docs/faq:whitelists
POPFile needs some time to learn from your messages and achieve good accuracy. Please give it enough time and enough information. For more information about the learning process, please see:
http://getpopfile.org/docs/faq:whengood
I use POPFile with 7 buckets (one spam bucket and 6 buckets for good email) and have never used magnets. My POPFile accuracy is usually around 99.67%.
Brian
brian05/19/10 21:36:12 -
-
Message #1356
In my business, I need to ensure zero false positives for spam, so I use magnets - heavily. It works well for me because my e-mail domain is mine and I give different e-mail addresses to different senders. It seems that most of the time spam occurs when someone sells my address or gets hacked. I mean how hard is it for a major software vendor to keep their customer database secure? Perhaps they don't care, because they'll never get caught, but when I get spam at <BigSoft?>info@<mydomain>.com, I pretty much know who leaked it.
Using this scheme, I can easily kill the address. If I still like the company, I edit my profile with them to give them a fresh address. I have an email client filter rule that immediately deletes all messages that have been "magnetted" with the spam-address magnet. For all others, I let the Bayes classifier do its thing and it does it well, even though it never sees the spam-address messages. I know the POPFile team discourages magnets, but that rubric is not necessarily for those who filter spam based on the "To:" header.
[Since my e-mail client does not show the "To:" header in its list of incoming mail (why should it, since it expects the reader to know their own e-mail address), I have hacked the Bayes.PM file to modify the Subject line when it suspects spam and adds the "To:" header to the Subject line so I can see what e-mail address the spam is coming in on.]
I guess I could hack bayes.pm to classify everything and THEN apply the magnet, but the classifier works well enough (scary well) with just the legitimate mail, plus the spam that has not been "magnetted" away, plus spam on the addresses I do not want to abandon.
However, the one thing I would really like to change in the magnet logic (either have someone else do it because I make a convincing case, or have me learn a little more about how the code is organized - Perl is not my first language - and submit that for approval) so that magnets can be exact matches or substrings, instead of just substring matching. For example, for the past 8 years or so, I have been getting spam at on@<mydomain>.com and I would like to kill it with a magnet, but since I have given out evolution@<mydomain>.com and creation@<mydomain>.com as my e-mail address and want to continue to use those, I cannot use a magnet to kill on@<mydomain>.com without killing all other mail that has a "To:" header that has on@<mydomain>.com as a substring.
PApopfile08/29/10 20:17:47
-
-
Message #1363
I've fixed this bug: ticket:76 [3666]
The change will be included in the next version of POPFile.
Naoki
amatubu09/09/10 19:02:15