Improved bayesian anti-spam filter - Implementation and analysis on independent spam corpuses

Biju Issac*, Wendy Utra Jap, Jofry Hadi Sutanto

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Citations (Scopus)

Abstract

Spam emails are causing major resource wastage by unnecessarily flooding the network links. Though many antispam solutions have been implemented, the Bayesian spam score approach looks quite promising. A proposal for spam detection algorithm is presented and its implementation using Java is discussed, along with its performance test results on two independent spam corpuses - Ling-spam and Enron-spam. We use the Bayesian calculation for single keyword sets and multiple keywords sets, along with its keyword contexts to improve the spam detection and thus to get good accuracy.

Original languageEnglish
Title of host publication2009 International Conference on Computer Engineering and Technology
Pages326-330
Number of pages5
Volume2
DOIs
Publication statusPublished - 2 Feb 2009
Externally publishedYes

Keywords

  • Bayesian approach
  • Context matching
  • Keyword sets
  • Spam corpus
  • Spam mails

Fingerprint

Dive into the research topics of 'Improved bayesian anti-spam filter - Implementation and analysis on independent spam corpuses'. Together they form a unique fingerprint.

Cite this