Abstract
Spam emails are causing major resource wastage by unnecessarily flooding the network links. Though many antispam solutions have been implemented, the Bayesian spam score approach looks quite promising. A proposal for spam detection algorithm is presented and its implementation using Java is discussed, along with its performance test results on two independent spam corpuses - Ling-spam and Enron-spam. We use the Bayesian calculation for single keyword sets and multiple keywords sets, along with its keyword contexts to improve the spam detection and thus to get good accuracy.
| Original language | English |
|---|---|
| Title of host publication | 2009 International Conference on Computer Engineering and Technology |
| Pages | 326-330 |
| Number of pages | 5 |
| Volume | 2 |
| DOIs | |
| Publication status | Published - 2 Feb 2009 |
| Externally published | Yes |
Keywords
- Bayesian approach
- Context matching
- Keyword sets
- Spam corpus
- Spam mails