SPAM Queue

Overview

The SPAM Queue options let you modify some of the default behaviors of the SPAM Queue.

The SPAM Queue options also allow tweaking of the AI Filter.

General

Delete entries in spam queue after x days

Instructs the emailAI Pro engine to delete an entry from the SPAM Queue once the specified number of days has passed since the email was originally received.
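
As a rough illustration of this retention rule, the sketch below drops queue entries whose received date is older than the configured number of days. The `QueueEntry` type and the `purge_expired` function are hypothetical names invented for this example. They are not part of emailAI Pro, and the engine's exact cutoff handling may differ.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class QueueEntry:
    # Hypothetical record for one email held in the SPAM Queue.
    sender: str
    received: datetime


def purge_expired(queue, max_age_days, now):
    """Return only the entries received within the last max_age_days days."""
    cutoff = now - timedelta(days=max_age_days)
    return [entry for entry in queue if entry.received >= cutoff]


now = datetime(2024, 1, 31)
queue = [
    QueueEntry("old@example.com", datetime(2024, 1, 1)),
    QueueEntry("recent@example.com", datetime(2024, 1, 29)),
]
kept = purge_expired(queue, max_age_days=7, now=now)
```

Age is measured from the date the email was received, not from the date it entered the queue, which matches the description above.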

Search through spam queue if a new White List entry is made

When this option is enabled and an entry is added to the White List, either manually or automatically, the emailAI Pro engine searches through the SPAM Queue to see whether the new entry would have affected the outcome of an email that ended up in the SPAM Queue. If the engine finds an email in the SPAM Queue that now successfully validates against the White List, that email is delivered.
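
The re-check can be pictured with the minimal sketch below. It assumes White List entries are sender addresses compared without regard to case; the real White List may match on other criteria. The `release_whitelisted` function and the dictionary layout are hypothetical.

```python
def release_whitelisted(spam_queue, white_list):
    """Split the queue into (delivered, still_queued) by sender address.

    Assumption: a White List entry is a plain sender address and the
    comparison is case-insensitive.
    """
    allowed = {address.lower() for address in white_list}
    delivered = [m for m in spam_queue if m["sender"].lower() in allowed]
    still_queued = [m for m in spam_queue if m["sender"].lower() not in allowed]
    return delivered, still_queued


spam_queue = [
    {"sender": "alice@example.com", "subject": "Lunch?"},
    {"sender": "promo@example.net", "subject": "Cheap pills"},
]
delivered, still_queued = release_whitelisted(spam_queue, ["Alice@example.com"])
```

In this example the message from alice@example.com is released for delivery and the other message stays in the queue.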

AI Filter Options

Good Token Weight

The weight given to a word when it appears in a delivered (good) email. Words in spam emails are always given a weight of 1, so setting this value to 2 makes each occurrence of a word in a good email count twice as much as an occurrence in a spam email.
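
To show the effect of the weight, the sketch below computes a per-word spam score in the style of the Bayesian filter described in Paul Graham's article. It is an assumption that emailAI Pro uses this exact formula. The function and parameter names are illustrative only.

```python
def word_spam_score(good_count, bad_count, n_good, n_bad, good_token_weight=2):
    """Score one word; values near 1.0 mean the word points to spam.

    good_count / bad_count: occurrences in good and spam emails.
    n_good / n_bad: number of good and spam emails used for training.
    Assumes the word was seen at least once, so the divisor is not zero.
    """
    g = good_count * good_token_weight  # good occurrences are weighted
    b = bad_count                       # spam occurrences always weigh 1
    good_factor = min(1.0, g / n_good)
    bad_factor = min(1.0, b / n_bad)
    return bad_factor / (good_factor + bad_factor)


# A word seen 10 times in 100 good emails and 10 times in 100 spam emails.
unweighted = word_spam_score(10, 10, 100, 100, good_token_weight=1)
weighted = word_spam_score(10, 10, 100, 100, good_token_weight=2)
```

With a weight of 1 the word is neutral (a score of 0.5). With a weight of 2 the score drops to about 0.33, which biases the filter toward delivering email and so reduces false positives.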

Min Token Count

The minimum number of words an email must contain before it can be classified as good or bad. For example, if an email contained just the word viagra and the minimum count was 2, the email would not be considered spam. The default value for this option is 0.
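
A minimal sketch of this check follows. The tokenizer is a simple stand-in and the `has_enough_tokens` name is hypothetical; the way emailAI Pro splits an email into words is not documented here.

```python
import re


def has_enough_tokens(body, min_token_count=0):
    """Return True when the email has enough words to be scored at all."""
    # Stand-in tokenizer: runs of letters, digits, apostrophes, $ and -.
    tokens = re.findall(r"[A-Za-z0-9'$-]+", body)
    return len(tokens) >= min_token_count


one_word_blocked = has_enough_tokens("viagra", min_token_count=2)
one_word_default = has_enough_tokens("viagra", min_token_count=0)
```

With a minimum of 2 the one-word email is skipped rather than scored, while the default of 0 lets every email through to the filter.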

Min Count for Inclusion

The minimum number of times a word must appear across all emails to be included in the overall filter. For example, if 100 emails are used to build the filters and the word viagra shows up only 2 times, the word would be ignored by the filters. The default value for this is 5.
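
The inclusion rule can be sketched as below, assuming raw occurrence counts from good and spam emails are simply added together. Whether the engine applies the Good Token Weight before this comparison is not stated, so that detail is left out. The `words_for_filter` name is hypothetical.

```python
from collections import Counter


def words_for_filter(good_counts, bad_counts, min_count_for_inclusion=5):
    """Return the words seen often enough to be kept in the filter."""
    totals = Counter(good_counts) + Counter(bad_counts)
    return {word for word, n in totals.items() if n >= min_count_for_inclusion}


good_counts = {"meeting": 4, "invoice": 1}
bad_counts = {"meeting": 1, "viagra": 2}
included = words_for_filter(good_counts, bad_counts)
```

Here "meeting" is kept because it appears 5 times in total, while "viagra" (2 times) and "invoice" (1 time) are ignored until more training data is available.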

Min Score

The minimum score a word can have. The default for this is 0.011.

Max Score

The maximum score a word can have. The default for this is 0.99.
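
Taken together, Min Score and Max Score act as a clamp on each word's score. The sketch below shows the idea; `clamp_score` is a hypothetical helper, not an emailAI Pro function.

```python
def clamp_score(raw_score, min_score=0.011, max_score=0.99):
    """Keep a word's score inside the configured [min_score, max_score] range."""
    return max(min_score, min(max_score, raw_score))


never_in_spam = clamp_score(0.0)   # raised to the Min Score
always_in_spam = clamp_score(1.0)  # lowered to the Max Score
in_between = clamp_score(0.4)      # already in range, left unchanged
```

Keeping scores away from exactly 0 and 1 matters when word scores are multiplied together, because a single 0 or 1 would otherwise decide the result on its own.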

Likely Spam Score

The score given to a word that is considered likely to be spam. The default for this is 0.9998.

Certain Spam Score

The score given to a word that is known to be spam, as determined by the Certain Spam Count. The default for this is 0.9999.

Certain Spam Count

The number of times a word has to appear in scanned spam emails before it is treated as a typical spam word. For example, if 100 emails are scanned when building the filter and the word viagra shows up in spam emails 11 times, it would be classed as a certain spam word. The default for this is 10.
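
One way the Likely Spam Score, Certain Spam Score and Certain Spam Count could fit together is sketched below. It assumes these special scores apply to words that have only ever been seen in spam, and that a word must appear more than Certain Spam Count times to qualify as certain. Both points are assumptions based on the example above, and `spam_only_score` is a hypothetical name.

```python
def spam_only_score(bad_count,
                    certain_spam_count=10,
                    likely_spam_score=0.9998,
                    certain_spam_score=0.9999):
    """Score a word that has appeared in spam emails only (assumption)."""
    if bad_count > certain_spam_count:
        return certain_spam_score
    return likely_spam_score


seen_11_times = spam_only_score(11)
seen_3_times = spam_only_score(3)
```

A word seen 11 times in spam gets the Certain Spam Score, while a word seen only 3 times gets the slightly lower Likely Spam Score.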

Interesting Word Count

The number of words in an email used to gauge whether it is spam. The default for this is 15. Because only the most interesting words are used, long emails padded with innocent text to trick filters do not fool the system. For example, if an email contains the word viagra followed by a paragraph about Humpty Dumpty meant to trick the filter, the filter still looks only at the 15 most interesting words in the email, and viagra would be one of them.
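
The sketch below illustrates why limiting the word count resists padding. It assumes "interesting" means farthest from the neutral score of 0.5 and that scores are combined with the naive Bayes product used in Paul Graham's article. Both are assumptions about emailAI Pro, and the function name and sample scores are made up for this example.

```python
from math import prod


def spam_probability(word_scores, interesting_word_count=15):
    """Combine the most interesting word scores into one spam probability."""
    # Most interesting = largest distance from the neutral score 0.5.
    ranked = sorted(word_scores, key=lambda p: abs(p - 0.5), reverse=True)
    interesting = ranked[:interesting_word_count]
    spam = prod(interesting)
    good = prod(1.0 - p for p in interesting)
    return spam / (spam + good)


# One strong spam word hidden among 40 mildly innocent filler words.
scores = [0.9999] + [0.45] * 40
limited = spam_probability(scores, interesting_word_count=15)
unlimited = spam_probability(scores, interesting_word_count=len(scores))
```

With the limit of 15 the strong spam word dominates and the result stays above 0.99. If every word were counted, the filler would drag the result down noticeably, which is exactly what the padding is meant to achieve.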

More Information

For more information about how this filter works, see Paul Graham's article.