Login   Register  
PHP Classes
elePHPant
Icontem

Bayesian Spam Filter: Detect spam in text using Bayesian techniques

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in del.icio.us Bookmark in del.icio.us

  Author Author  
Picture of Cesar D. Rodas
Name: Cesar D. Rodas is available for providing paid consulting. Contact Cesar D. Rodas .
Classes: 37 packages by
Country: Paraguay Paraguay
Age: 27
All time rank: 71 in Paraguay Paraguay
Week rank: 35 Down1 in Paraguay Paraguay Equal
Innovation award
Innovation award
Nominee: 24x

Winner: 5x


  Detailed description   Download Download .zip .tar.gz   Install with Composer Install with Composer  
This class can be used to detect spam in text messages using Bayesian techniques.

It analyzes the text words in terms of n-grams in a way that is idiom independent.

It can be trained to progressively distinguish what is spam and what is not spam by detecting patterns in training samples. Training data is stored in a MySQL database.

  Classes of Cesar D. Rodas  >  Bayesian Spam Filter  >  Download Download .zip .tar.gz  >  Support forum Support forum (6)  >  Blog Blog  >  RSS 1.0 feed RSS 2.0 feed Latest changes  
Name: Bayesian Spam Filter
Base name: bayesianspamfilter
Description: Detect spam in text using Bayesian techniques
Version: 1.1
PHP version: 4.0
License: BSD License
All time users: 4253 users
All time rank: 635
Week users: 5 users
Week rank: 298 Up
 
  Groups   Rate classes User ratings   Trackback  
  Applications   Related pages   Files Files  

  Groups  
Group folder image Text processing Manipulating and validating text data View top rated classes
Group folder image Artificial intelligence Automation of tasks using human-like intelligence View top rated classes


  Innovation Award  
PHP Programming Innovation award nominee
December 2007
Number 3
Determining whether a message is spam or not, is a subjective task.

Using Bayesian techniques it is possible to make a computer program learn what can be spam and what may not be spam.

A program may be trained using sample messages and telling it which are spam. Once the program is well trained, it may start guessing what is spam all by itself.

This package implements solution written completely in PHP that uses Bayesian techniques to detect spam text messages.

Manuel Lemos

  User ratings  
RatingsUtility Consistency Documentation Examples Tests Videos Overall Rank
All time: Good (83%) Sufficient (70%) Sufficient (66%) Sufficient (75%) Sufficient (75%) - Sufficient (70%) 185
Month: Not yet rated by the users

  Pages that reference this package  
Filtro Bayesiano in PHP
Segnalo una interessante implementazione dei filtri bayesiani in php + mysql...
GSoC - WP - Category Suggester - [Proposal]
This project is an idea that I got while I was reading the N-gram text Categorization...
PHP Spam detection project
While I was googling-up for know how the LibTextCat works internally, I found the paper that had changed my life, N-Gram-Based Text Categorization...

Latest pages that reference packages Latest pages that reference packages


  Applications that use this package  
No pages of applications that use this class were specified.
Add link image If you know an application of this package, send a message to the author to add a link here.

  Related pages  
PDF Documentation
PDF Documentation

  Files folder image Files  
File Role Description
Files folder imageexample (5 files)
Plain text file ngram.php Class N-gram class
Plain text file spam.php Class Spam detection class
Accessible without login HTML file spamdetection.html Doc. General information about this class
Plain text file trainer.php Class Spam trainer class

  Files folder image Files  /  example  
File Role Description
  Accessible without login Plain text file test.php Test Testing script
  Accessible without login Plain text file base.sql Data Example of Spams and mysql table to save the knowledge
  Accessible without login Plain text file config.php Conf. Example
  Accessible without login Plain text file example.php Example Classification example
  Accessible without login Plain text file example_trainer.php Example Example of how to traine the spam database

Install with Composer Install with Composer - Download Download all files: bayesianspamfilter.tar.gz bayesianspamfilter.zip
NOTICE: if you are using a download manager program like 'GetRight', please Login before trying to download this archive.