WebService::CIA::Parser - Parse pages from the CIA World Factbook
use WebService::CIA::Parser; my $parser = WebService::CIA::Parser->new; my $data = $parser->parse($string);
WebService::CIA::Parser takes a string of HTML and parses it. It will only give sensible output if the string is the HTML for a page whose URL matches https://www.cia.gov/library/publications/the-world-factbook/print/[a-z]{2}\.html
https://www.cia.gov/library/publications/the-world-factbook/print/[a-z]{2}\.html
This parsing is somewhat fragile, since it assumes a certain page structure. It'll work just as long as the CIA don't choose to alter their pages.
new
Creates a new WebService::CIA::Parser object. It takes no arguments.
parse($html)
Parses a string of HTML take from the CIA World Factbook. It takes a single string as its argument and returns a hashref of fields and values.
The values are stripped of all HTML. <br> tags are replaced by newlines.
<br>
It also creates four extra fields: "URL", "URL - Print", "URL - Flag", and "URL - Map" which are the URLs of the country's Factbook page, the printable version of that page, a GIF map of the country, and a GIF flag of the country respectively.
use WebService::CIA::Parser; use LWP::Simple qw(get); $html = get( "https://www.cia.gov/library/publications/the-world-factbook/print/uk.html" ); $parser = WebService::CIA::Parser->new; $data = $parser->parse($html); print $data->{"Population"};
Ian Malpass (ian-cpan@indecorous.com)
Copyright 2003-2007, Ian Malpass
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
The CIA World Factbook's copyright information page (https://www.cia.gov/library/publications/the-world-factbook/docs/contributor_copyright.html) states:
The Factbook is in the public domain. Accordingly, it may be copied freely without permission of the Central Intelligence Agency (CIA).
WebService::CIA
To install WebService::CIA, copy and paste the appropriate command in to your terminal.
cpanm
cpanm WebService::CIA
CPAN shell
perl -MCPAN -e shell install WebService::CIA
For more information on module installation, please visit the detailed CPAN module installation guide.