Abstract
Spam emails are causing major resource wastage by unnecessarily flooding the network links. Though many antispam solutions have been implemented, the Bayesian spam score approach looks quite promising. A proposal for spam detection algorithm is presented and its implementation using Java is discussed, along with its performance test results on two independent spam corpuses - Ling-spam and Enron-spam. We use the Bayesian calculation for single keyword sets and multiple keywords sets, along with its keyword contexts to improve the spam detection and thus to get good accuracy.
Original language | English |
---|---|
Title of host publication | 2009 International Conference on Computer Engineering and Technology |
Pages | 326-330 |
Number of pages | 5 |
Volume | 2 |
DOIs | |
Publication status | Published - 2 Feb 2009 |
Externally published | Yes |
Keywords
- Bayesian approach
- Context matching
- Keyword sets
- Spam corpus
- Spam mails