SPAM Queue
Overview
The SPAM Queue options let you change some of the default behaviors of the SPAM Queue.
They also let you tune the AI Filter.
General
Delete entries in spam queue after x days
Instructs the emailAI Pro engine to delete an entry from the SPAM Queue once the specified number of days has passed since the email was originally received.
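The retention rule above can be sketched as follows. This is only an illustration, not the emailAI Pro implementation: the function name, the "received" field, and the choice to keep an entry that is exactly at the cutoff are all assumptions.

```python
from datetime import datetime, timedelta

def purge_expired(queue, max_age_days, now):
    """Return the entries that are still inside the retention window.

    `queue` is a list of dicts with a hypothetical "received" datetime field.
    """
    cutoff = now - timedelta(days=max_age_days)
    # Age is measured from the date the email was received,
    # not from the date it entered the queue.
    return [entry for entry in queue if entry["received"] >= cutoff]

now = datetime(2024, 1, 31)
queue = [
    {"id": 1, "received": datetime(2024, 1, 1)},   # 30 days old
    {"id": 2, "received": datetime(2024, 1, 25)},  # 6 days old
]
kept = purge_expired(queue, max_age_days=14, now=now)
```

With a 14-day setting, only the second entry remains in the queue.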
Search through spam queue if a new White List entry is made
When this option is enabled and an entry is added to the White List, either manually or automatically, the emailAI Pro engine searches the SPAM Queue for emails whose outcome the new entry would have changed. Any email in the SPAM Queue that now validates against the White List is delivered.
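A rough sketch of such a rescan is shown below. The matching rule (an exact address, or an entry starting with "@" that matches a whole domain) and the "sender" field are assumptions made for illustration; the actual White List validation in emailAI Pro may differ.

```python
def release_whitelisted(spam_queue, new_entry):
    """Split the queue into (delivered, still_queued) for one new White List entry."""
    entry = new_entry.lower()
    delivered, still_queued = [], []
    for message in spam_queue:
        sender = message["sender"].lower()
        # Assumed rule: exact address match, or "@domain" suffix match.
        if sender == entry or (entry.startswith("@") and sender.endswith(entry)):
            delivered.append(message)
        else:
            still_queued.append(message)
    return delivered, still_queued

spam_queue = [
    {"id": 1, "sender": "alice@example.com"},
    {"id": 2, "sender": "offers@bulk.test"},
]
delivered, still_queued = release_whitelisted(spam_queue, "alice@example.com")
```

Here the message from alice@example.com is released for delivery and the other message stays in the queue.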
AI Filter Options
Good Token Weight
The weight given to a word when it appears in a delivered email. Words in spam emails are given a weight of 1, so setting this value to 2 ranks words in good emails 2 to 1 against words in spam emails.
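To show what the weight does, here is a minimal sketch of a per-word score in the style of Graham-type Bayesian filters. The formula, the function name, and the parameter names are assumptions for illustration; the emailAI Pro engine may compute its scores differently.

```python
def token_score(good_count, bad_count, n_good, n_bad, good_weight=2):
    """Spam score of one word, between 0 (good) and 1 (spam).

    Assumes n_good and n_bad are positive and the word was seen at least once.
    """
    weighted_good = good_weight * good_count  # occurrences in delivered email
    weighted_bad = 1 * bad_count              # spam occurrences always weigh 1
    good_ratio = min(1.0, weighted_good / n_good)
    bad_ratio = min(1.0, weighted_bad / n_bad)
    return bad_ratio / (good_ratio + bad_ratio)

# A word seen 5 times in 100 good emails and 10 times in 100 spam emails.
unweighted = token_score(5, 10, 100, 100, good_weight=1)
weighted = token_score(5, 10, 100, 100, good_weight=2)
```

Raising the weight from 1 to 2 pulls this word's score down from about 0.67 to 0.5, which biases the filter against false positives.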
Min Token Count
The minimum number of words an email must contain before it can be classified as good or bad. For example, if an email contained just the word viagra and the minimum count was 2, the email would not be considered spam. The default value for this option is 0.
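The check above might look like the sketch below. The tokenizer pattern and the decision to count every word, rather than only distinct words, are assumptions.

```python
import re

def has_enough_tokens(body, min_token_count=0):
    """Return True when the email has enough words to be classified at all."""
    tokens = re.findall(r"[A-Za-z0-9'$-]+", body)
    return len(tokens) >= min_token_count

one_word = has_enough_tokens("viagra", min_token_count=2)
default_setting = has_enough_tokens("viagra")
```

With a minimum of 2 the one-word email is too short to classify, so it would not be treated as spam; with the default of 0 every email is classified.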
Min Count for Inclusion
The minimum number of times a word must appear across all emails to be included in the overall filter. For example, if 100 emails are used to build the filters and the word viagra showed up only 2 times, the word would be ignored by the filters. The default for this is 5.
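As an illustration, the inclusion rule could be applied while building the filter as shown below. The function name and the use of simple per-word counters are assumptions, and the sketch counts good and spam occurrences together, as the description above suggests.

```python
from collections import Counter

def tokens_for_filter(good_counts, bad_counts, min_count=5):
    """Return the words seen often enough to be included in the filter."""
    totals = Counter(good_counts) + Counter(bad_counts)
    return {token for token, seen in totals.items() if seen >= min_count}

good_counts = {"meeting": 4}
bad_counts = {"viagra": 2, "meeting": 1, "free": 7}
included = tokens_for_filter(good_counts, bad_counts)
```

The word viagra appears only 2 times in total, so it is left out, while meeting (5 times) and free (7 times) are included.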
Min Score
The minimum score a word can have. The default for this is 0.011.
Max Score
The maximum score a word can have. The default for this is 0.99.
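Taken together, Min Score and Max Score act as a clamp on each word's score. In filters of this kind the clamp keeps any single word from driving the combined result to exactly 0 or 1. A minimal sketch, with a hypothetical function name and the documented defaults:

```python
def clamp_score(raw_score, min_score=0.011, max_score=0.99):
    """Keep a word's score inside the configured [min_score, max_score] range."""
    return max(min_score, min(max_score, raw_score))

never_in_spam = clamp_score(0.0)  # raised to the minimum score
only_in_spam = clamp_score(1.0)   # lowered to the maximum score
ordinary = clamp_score(0.5)       # already in range, left unchanged
```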
Likely Spam Score
The score given to a word that is considered likely to be spam. The default for this is 0.9998.
Certain Spam Score
The score given to a word that is known to be spam, as determined by the Certain Spam Count. The default for this is 0.9999.
Certain Spam Count
The number of times a word has to appear in scanned spam emails to be treated as a typical spam word. For example, if 100 emails are scanned when building the filter and the word viagra showed up in spam emails 11 times, it would be classed as a certain spam word. The default for this is 10.
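One way the Likely Spam Score, Certain Spam Score, and Certain Spam Count could fit together is sketched below. It assumes, as in Graham-style filters, that these special scores apply only to words that have been seen in spam and never in good email, and that a word reaching the count (rather than exceeding it) qualifies as certain. Both points are assumptions, not confirmed behavior.

```python
def spam_only_score(good_count, bad_count, certain_spam_count=10,
                    likely_spam_score=0.9998, certain_spam_score=0.9999):
    """Score a word that has appeared only in spam emails.

    Returns None when the word was also seen in good email (or never seen),
    in which case the normal score calculation would apply instead.
    """
    if good_count > 0 or bad_count == 0:
        return None
    if bad_count >= certain_spam_count:
        return certain_spam_score
    return likely_spam_score

certain = spam_only_score(good_count=0, bad_count=11)
likely = spam_only_score(good_count=0, bad_count=3)
mixed = spam_only_score(good_count=2, bad_count=11)
```

A word seen 11 times, all in spam, gets the Certain Spam Score; one seen only 3 times gets the Likely Spam Score.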
Interesting Word Count
The number of words in an email used to gauge whether it is spam or not. The default for this is 15. Because only the most interesting words are used, long emails that are padded to trick filters don't fool the system. For example, if an email contains the word viagra followed by a paragraph about Humpty Dumpty that is meant to trick the filter, the filter looks only at the 15 most interesting words in the email, and viagra would be one of them.
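The effect of limiting the word count can be sketched as follows. The sketch assumes that "most interesting" means farthest from a neutral score of 0.5 and that the scores are combined with the naive Bayes formula used in Graham-style filters; the function names and the example scores are hypothetical.

```python
from math import prod

def interesting_scores(scores, count=15):
    """Pick the `count` scores that lie farthest from the neutral value 0.5."""
    return sorted(scores, key=lambda p: abs(p - 0.5), reverse=True)[:count]

def combined_probability(scores):
    """Combine per-word scores into one spam probability."""
    spam = prod(scores)
    good = prod(1.0 - p for p in scores)
    return spam / (spam + good)

# One strong spam word followed by 40 mildly innocent filler words.
scores = [0.99] + [0.45] * 40
all_words = combined_probability(scores)
top_words = combined_probability(interesting_scores(scores, count=15))
```

Scoring all 41 words lets the filler drag the result far below 0.5, while scoring only the 15 most interesting words keeps it well above 0.5.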
More Information
For more information about how this filter works, see Paul Graham's article.