This package can be use to perform several types of manipulation operations on UTF-8 encoded and unencoded strings.
There are several classes:
a) One to compute the length and extract parts of a string encoded in UTF-8. It uses PCRE extension functions, so it does not rely on multi-byte string manipulation extension.
b) Another class to perform white space normalization functions like: mapping any line break sequences to simple line breaks, reduce multiple space or line break sequences to only one, convert tabs to spaces and vice-versa, etc..
c) Another to perform letter case normalization functions like: convert strings to camel case words and vice-versa, or words separated by underscore characters, etc.. |
|