Innovation Award
 July 2005
Number 9 |
Chinese is a language that is becoming more and more relevant on the Internet due to the growth of the Chinese economy. This growth is making it possible for many Chinese speaking people becoming Internet users.
The Chinese language words are actually individual symbols. Certain encodings may include ASCII characters allowing for words in other languages to be mixed in Chinese documents.
This class provides a solution to break a Chinese text in a way that it avoids breaking English words that may be mixed with Chinese symbols.
Manuel Lemos |
This class can segment Chinese text.
It uses the RMM (reverse maximum match) approach. Therefore it may commit some mistakes that cannot be avoided with perfection.
It handles English but in a very simple way.
| Not yet rated by the users |
No application links were specified for this class.

If you know an application of this package, send a message to the
author to add a link here.