Mark as Spam - Apple Mail

pmahnke · November 17, 2006, 1:14am

I am a little confused.

From reading the documentation, it seems that I don’t want to have too much spam in the ‘training’ corpus, under 1000 messages. Sadly I can get there in about 3 days.

However, how do you ‘Mark as Spam’ without using the ‘Train as Spam’ rule?

Or am I missing something?

Do you just have to keep clearing the corpus? That doesn’t seem right.

Thanks,

Peter

Michael_Tsai · November 17, 2006, 6:37am

The corpus contains the messages that you’ve trained SpamSieve with (using the “Train as Spam” and “Train as Good” commands), as well as the messages that SpamSieve has trained itself with using auto-training. You can see the corpus numbers in the Statistics window. There you will also see some numbers for “Filtered Mail.” These are the incoming messages that SpamSieve has processed (decided whether they were good or spam), but it was probably only trained with some of them. I think this difference may be the key to your misunderstanding.

So, for the initial training, you are supposed to use 1,000 or fewer messages, 65% of them spam. After that, you don’t need to worry about the corpus size or ratio. You only need to use the training commands if SpamSieve puts a good message in the spam folder or a spam message in your inbox.

No.