AI::Categorizer::Collection::Files - One document per file
my $c = new AI::Categorizer::Collection::Files (path => '/tmp/docs/training', category_file => '/tmp/docs/cats.txt'); print "Total number of docs: ", $c->count_documents, "\n"; while (my $document = $c->next) { ... } $c->rewind; # For further operations
This implements a Collection class in which each document exists as a single file on a filesystem. The documents can exist in a single directory, or in several directories.
This is a subclass of the abstract AI::Categorizer::Collection class, so any methods mentioned in its documentation are available here.
Creates a new Collection object and returns it. In addition to the parameters accepted by the superclass, the following parameters are accepted:
Indicates a location on disk where the documents can be found. The path may be specified as a string giving the name of a directory, or as a reference to an array of such strings if the documents are located in more than one directory.
Indicates whether subdirectories of the directory (or directories) in the path parameter should be descended into. If set to a true value, they will be descended into. If false, they will be ignored. The default is false.
path
Ken Williams, ken@mathforum.org
Copyright 2002-2003 Ken Williams. All rights reserved.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
AI::Categorizer::Collection(3)
To install AI::Categorizer, copy and paste the appropriate command in to your terminal.
cpanm
cpanm AI::Categorizer
CPAN shell
perl -MCPAN -e shell install AI::Categorizer
For more information on module installation, please visit the detailed CPAN module installation guide.