Lingua::TH::Segmentation - an object-oriented interface of TH word segmentation
use Lingua::TH::Segmentation; #create object $sg=Lingua::TH::Segmentation->new(); # insert separator to $thai_string $result=$sg->separate($thai_string,$separator); # split $thai_string to array include all spacing @result=$sg->cut($thai_string); # split $thai_string to array exclude spacing @result=$sg->cut_no_space($thai_string); # split $thai_string to array as of the original library @result=$sg->cut_raw($thai_string);
TH language is known to be a "word-sticked language", all words in a sentence are next to each other with out spacing. It is hard for programmers to solve problems, such as translating or searching, on this kind of language.
The module is a object-oriented interface of TH word segmentation library (http://thaiwordseg.sourceforge.net).
None by default.
http://thaiwordseg.sourceforge.net
Komtanoo Pinpimai(romerun@romerun.com)
Perl License.
To install Lingua::TH::Segmentation, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Lingua::TH::Segmentation
CPAN shell
perl -MCPAN -e shell install Lingua::TH::Segmentation
For more information on module installation, please visit the detailed CPAN module installation guide.