PHP Classes
Icontem

Class: Fast Chinese Word Segmentation


Name: Fast Chinese Word Segmentation
Base name: fcws
Description: Segment Chinese text using the RMM approach
Version: -
Required PHP version: -
License: Free for non-commercial use
All time users: 457 users
All time rank: 3866
Week users: 0 users
Week rank: 3787
Country specific: This package is mainly intended for applications used in China.
 

Screenshots

Example
File            Role    Description
screenshot.png  Screen  Example

Author

Name: Wudi
Published packages: 5
Country: China
Home page: http://www.wudilabs.org/
Age: 21
All time rank: 982
Week rank: 959

Innovation Award

PHP Programming Innovation award nominee
July 2005
Number 9
Chinese is becoming more and more relevant on the Internet due to the growth of the Chinese economy. This growth is making it possible for many Chinese-speaking people to become Internet users.

Chinese text is written as a continuous sequence of characters, with no spaces marking word boundaries. Some encodings also include ASCII characters, so words from other languages can be mixed into Chinese documents.

This class provides a way to segment Chinese text without breaking English words that may be mixed in with the Chinese characters.

Manuel Lemos

Groups

Text processing: Manipulating and validating text data

Detailed description

This class can segment Chinese text.

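It uses the RMM (reverse maximum match) approach: the text is scanned from the end, and at each step the longest dictionary word ending at the current position is taken. Because the matching relies only on the dictionary, it can make some segmentation mistakes that cannot be entirely avoided.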
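
To illustrate the idea, here is a minimal sketch of reverse maximum match in PHP. It is not the code of this class: the function name `rmm_segment`, the `$maxLen` parameter, the dictionary format (words as array keys) and the use of UTF-8 input are all assumptions made for the example.

```php
<?php
// Hypothetical helper, NOT the fcws class API: reverse maximum match.
// Assumes UTF-8 input and a dictionary given as an array whose keys are words.
function rmm_segment(string $text, array $dict, int $maxLen = 4): array
{
    // Split the text into individual Unicode characters.
    $chars = preg_split('//u', $text, -1, PREG_SPLIT_NO_EMPTY);
    if ($chars === false) {
        return []; // invalid UTF-8: give up in this sketch
    }

    $words = [];
    $end = count($chars); // exclusive end of the part not yet segmented

    while ($end > 0) {
        // Try the longest candidate ending at $end first, then shrink it.
        $len = min($maxLen, $end);
        while ($len > 1) {
            $candidate = implode('', array_slice($chars, $end - $len, $len));
            if (isset($dict[$candidate])) {
                break;
            }
            $len--;
        }
        // If nothing matched, $len is 1 and a single character is emitted.
        $words[] = implode('', array_slice($chars, $end - $len, $len));
        $end -= $len;
    }

    // Words were collected right to left, so restore reading order.
    return array_reverse($words);
}

$dict = array_fill_keys(['研究', '研究生', '生命', '起源'], true);
print_r(rmm_segment('研究生命的起源', $dict));
// 研究 / 生命 / 的 / 起源
```

With this small dictionary, a forward scan would greedily take 研究生 first and leave 命 stranded, while the reverse scan gives the intended split. Other inputs can still be segmented wrongly, since the choice depends only on which words are in the dictionary.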

It handles English text, but only in a very simple way.

User ratings

Not yet rated by the users

Applications that use this class

No application links were specified for this class.

Related links

Link                Description
Default dictionary  The default dictionary for this class (mediafire.com)
Default dictionary  The default dictionary for this class (box.net)

Files

File                   Role     Description
cwordseg_fast.lib.php  Class    Class
Readme_CN.htm          Doc.     Readme (Chinese)
Readme_EN.htm          Doc.     Readme (English)
test.php               Example  Test

Download all files: fcws.tar.gz fcws.zip

 

For more information send a message to :
info at phpclasses dot org.
Copyright (c) Icontem 1999-2010 PHP Classes - PHP Class Scripts