The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Text::Perfide::BookSync - Synchronize books in plain text format.

VERSION

Version 0.01_09

SYNOPSIS

Text::Perfide::BookSync performs a structural alignment at section level of books in plain text format. The books have to be previously annotated by Text::Perfide::BookCleaner.

EXPORT

SUBROUTINES/METHODS

htmlmatrix

Generates an HTML file containing a matrix showing the matches between sections of two books.

marksync

Given two files FILEL and FILER, creates new versions of these files (FILEL.sync and FILER.sync) with synchronization tags <sync id="x"> marking the points where the texts synchronize.

splitchunks

Given two files FILEL and FILER, splits them by their synchronization points, storing each chunk in a file, where each FILEL.cXX matches FILER.cXX.

calchunks

Calculates chunks for a given pair of files. A chunk is a set of consecutive sections, which are grouped in order to match the corresponding chunk.

populate

From a given file in which sections have been delimited with Text::Perfide::BookCleaner, creates and returns a list containing information about the sections of this file: id, start offset and end offset.

moreinfosecs

Calculates metrics on each pair of sections (length in words, ...)

moreinfochunks

Calculates metrics on each pair of chunks (length in words, ...)

load_localrc

AUTHOR

Andre Santos, <andrefs at cpan.org> Jose Joao Almeida, <jj at di.uminho.pt>

BUGS

Please report any bugs or feature requests to bug-text-perfide-booksync at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Text-Perfide-BookSync. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

ACKNOWLEDGEMENTS

LICENSE AND COPYRIGHT

Copyright 2011 Project Natura.

This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.

See http://dev.perl.org/licenses/ for more information.