Hailo::Role::Tokenizer - A role representing a Hailo tokenizer
new
This is the constructor. It takes no arguments.
make_tokens
Takes a line of input and returns an array reference of tokens. A token is an array reference containing two elements: a spacing attribute and the token text. The spacing attribute is an integer which will be stored along with the token text in the database. The following values are currently being used:
0
1
2
3
make_output
Takes an array reference of tokens and returns a line of output. A token is an array reference as described in make_tokens. The tokens will be joined together into a sentence according to the whitespace attributes associated with the tokens, as well as any formatting provided by the tokenizer implementation.
Hinrik Örn Sigurðsson, hinrik.sig@gmail.com
Ævar Arnfjörð Bjarmason <avar@cpan.org>
Copyright 2010 Hinrik Örn Sigurðsson and Ævar Arnfjörð Bjarmason <avar@cpan.org>
This program is free software, you can redistribute it and/or modify it under the same terms as Perl itself.
To install Hailo, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Hailo
CPAN shell
perl -MCPAN -e shell install Hailo
For more information on module installation, please visit the detailed CPAN module installation guide.