Machine Learning Approaches for Modeling Spammer Behavior

Computer Science – Information Retrieval

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

12 pages, 3 figures, 5 tables, Submitted to AIRS 2010

Scientific paper

Spam is commonly known as unsolicited or unwanted email messages in the Internet causing potential threat to Internet Security. Users spend a valuable amount of time deleting spam emails. More importantly, ever increasing spam emails occupy server storage space and consume network bandwidth. Keyword-based spam email filtering strategies will eventually be less successful to model spammer behavior as the spammer constantly changes their tricks to circumvent these filters. The evasive tactics that the spammer uses are patterns and these patterns can be modeled to combat spam. This paper investigates the possibilities of modeling spammer behavioral patterns by well-known classification algorithms such as Na\"ive Bayesian classifier (Na\"ive Bayes), Decision Tree Induction (DTI) and Support Vector Machines (SVMs). Preliminary experimental results demonstrate a promising detection rate of around 92%, which is considerably an enhancement of performance compared to similar spammer behavior modeling research.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Machine Learning Approaches for Modeling Spammer Behavior does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Machine Learning Approaches for Modeling Spammer Behavior, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Machine Learning Approaches for Modeling Spammer Behavior will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-525209

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.