PHP Classes
Icontem

Class: Text Cat


Name: Text Cat
Base name: libtextcat
Description: Guess the language of a given text
Version: -
Required PHP version: 3.0
License: Public Domain
All time users: 660 users
All time rank: 3061
Week users: 2 users
Week rank: 2561
 

Author

Name: Cesar D. Rodas (available for paid consulting)
Published packages: 34
Country: Paraguay
Home page: http://cesarodas.com/
Age: 21
All time rank: 15
Week rank: 9

Innovation Award

PHP Programming Innovation award nominee
June 2006
Number 5
A text can be written in many different languages. Without prior knowledge of the language in which a text is written, it is hard for a person to guess it and then choose an appropriate translation tool.

This class can be used to guess the language of a text. It takes prebuilt data files that assign different weights to the presence of certain characters that are more strongly associated with a given language.

This way the class can give a good idea of the languages in which a given text is most likely written.

Manuel Lemos

Groups

Text processing: Manipulating and validating text data

Detailed description

This class can be used to guess the language of a given text.

The class reads data files that contain ranking information about characters that are most likely to be found in texts of several languages.

The text being analyzed is converted to Unicode to be compared with the language character ranking data.

The class returns an array of languages sorted by ranking.

It currently supports the following languages: Arabic, Belarusian, Chinese, Czech, Danish, Dutch, English, Esperanto, French, German, Greek, Hebrew, Italian, Japanese, Russian, and Spanish.
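
The ranking comparison described above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the class's PHP API: the names build_profile, distance, guess_language, SAMPLES, PROFILES and MAX_RANKS are hypothetical, and the scoring rule (summing rank differences of character n-grams, with a fixed penalty for unseen ones) is an assumption about how such rankings are typically compared. The toy profiles are built from one sentence each rather than from the .lm data files.

```python
from collections import Counter

MAX_RANKS = 300  # assumed profile size; also the penalty for an unseen n-gram


def build_profile(text, max_n=3):
    """Map each character n-gram (1..max_n) to its frequency rank, 0 = most common."""
    counts = Counter()
    for word in text.lower().split():  # Python str is already Unicode
        padded = f"_{word}_"  # mark word boundaries
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Most frequent first; ties broken alphabetically so ranks are deterministic.
    ranked = sorted(counts, key=lambda g: (-counts[g], g))[:MAX_RANKS]
    return {gram: rank for rank, gram in enumerate(ranked)}


def distance(text_profile, lang_profile):
    """Sum of rank differences; n-grams absent from the language cost MAX_RANKS."""
    return sum(
        abs(rank - lang_profile[gram]) if gram in lang_profile else MAX_RANKS
        for gram, rank in text_profile.items()
    )


def guess_language(text, lang_profiles):
    """Return language names ordered from most to least likely."""
    text_profile = build_profile(text)
    scores = {lang: distance(text_profile, prof) for lang, prof in lang_profiles.items()}
    return sorted(scores, key=scores.get)


# Toy training sentences standing in for the prebuilt data files (assumption).
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog and the cat "
               "sleeps in the house while the rain falls on the street",
    "russian": "быстрая коричневая лиса прыгает через ленивую собаку а кошка "
               "спит в доме пока дождь идет по улице",
}
PROFILES = {lang: build_profile(sample) for lang, sample in SAMPLES.items()}

print(guess_language("the dog sleeps in the house", PROFILES))  # ['english', 'russian']
print(guess_language("кошка спит в доме", PROFILES))            # ['russian', 'english']
```

A lower distance means a closer match, so the first element of the returned list is the best guess. In real use the profiles would be loaded from the data files shipped with the package; their exact format is not described on this page, so the sketch does not attempt to parse them.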

Have a lot of fun with this!

Freshmeat project

Project record: libtextcat
Popularity score: 61.02
Vitality score: 1.48

User ratings

There are not enough user ratings to display for this class.

Trackback links

Link: PHP: Guess the language of a given text
Description: There exists an open project called LibTextCat. I’ve used this class in many projects with great results. What this project does is receive a text as a parameter and return the language in which the text is written...

Applications that use this class

No application links were specified for this class.
If you know an application of this package, send a message to the author to add a link here.

Files

File                    Role     Description
arabic.lm               Data     arabic
belarus.lm              Data     belarus
chinese.lm              Data     chinese
czech.lm                Data     czech
danish.lm               Data     danish
dutch.lm                Data     dutch
english.lm              Data     english
esperanto.lm            Data     esperanto
french.lm               Data     french
german.lm               Data     german
greek.lm                Data     greek
hebrew.lm               Data     hebrew
italian.lm              Data     italian
japanese.lm             Data     japanese
russian.lm              Data     russian
saddorlibtextcat.php    Class    This is the main class
spanish.lm              Data     spanish
test.php                Example  test
Download all files: libtextcat.tar.gz libtextcat.zip

 

For more information, send a message to info at phpclasses dot org.
Copyright (c) Icontem 1999-2009 PHP Classes - PHP Class Scripts