SWISH::Filters::ppt2txt - convert PowerPoint docs to text using catppt
This is a plug-in module that uses the catppt program to convert MS PowerPoint documents to text for indexing by Swish-e. catppt is part of the catdoc package and can be downloaded from:
http://www.45.free.net/~vitus/software/catdoc/
The program catppt must be installed and in your PATH.
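One way to verify this requirement before running an index is sketched below. It is only an illustration, not part of the filter itself: the helper name find_in_path is hypothetical, and the check uses nothing beyond the core File::Spec module.

```perl
use strict;
use warnings;
use File::Spec;

# Hypothetical helper (not part of SWISH::Filter): return the full path of
# the first executable file named $prog found in PATH, or undef if none.
sub find_in_path {
    my ($prog) = @_;
    for my $dir ( File::Spec->path ) {
        my $candidate = File::Spec->catfile( $dir, $prog );
        return $candidate if -f $candidate && -x _;
    }
    return;
}

my $catppt = find_in_path('catppt');
print defined $catppt
    ? "catppt found at $catppt\n"
    : "catppt not found in PATH; install the catdoc package first\n";
```

If the program is not found, either install catdoc or add the directory that contains catppt to PATH before indexing.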
This filter does not specify input or output character encodings.
A minor optimization during spidering (i.e., when documents are in memory instead of on disk) would be to use an open2() call to let catdoc read from stdin instead of from a file.
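The open2() idea can be sketched roughly as follows, using the core IPC::Open2 module. This is a minimal sketch and not the filter's actual code: the helper name filter_via_stdin is hypothetical, and the example pipes data through cat as a stand-in, because this document does not confirm that the converter accepts input on stdin.

```perl
use strict;
use warnings;
use IPC::Open2;

# Hypothetical helper: send an in-memory document to an external command
# on its stdin and return everything the command writes to stdout.
# Caveat: all input is written before any output is read, so a very large
# document could block if the child fills its output pipe first.
sub filter_via_stdin {
    my ( $cmd, $data ) = @_;
    my $pid = open2( my $from_child, my $to_child, @$cmd );
    binmode $to_child;               # document content is binary
    print {$to_child} $data;
    close $to_child;                 # signal end of input to the child
    my $text = do { local $/; <$from_child> };
    close $from_child;
    waitpid $pid, 0;                 # reap the child process
    return $text;
}

# Stand-in command (assumption): 'cat' simply echoes its input.
print filter_via_stdin( ['cat'], "slide text\n" );
```

Replacing the stand-in with the real converter would only make sense after confirming that it reads from stdin; otherwise the existing temporary-file approach remains necessary.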
Peter Karman perl@peknet.com
SWISH::Filter