May 26, 2018

Yet another HTML tag parser by pure Perl implementation

HTML::TagParser is a pure Perl implementation for parsing HTML files. It provides DOM-like methods for looking up elements. It is not strict about XHTML syntax, because many HTML pages are not strict either: many pages use <br> elements instead of <br/> and contain <p> elements that are never closed.
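As a rough illustration of the DOM-like methods, here is a minimal sketch. It assumes the module is installed from CPAN as HTML::TagParser and uses the DOM-style method names from its documentation (getElementsByTagName, getAttribute, innerText). The HTML string is invented sample data, deliberately containing a bare <br> and an unclosed <p>.

```perl
use strict;
use warnings;
use HTML::TagParser;    # assumption: installed from CPAN

# Invented sample input: loose HTML with <br> and an unclosed <p>.
my $source = join "\n",
    '<html><body>',
    '<p>First paragraph<br>',
    '<a href="/one">First</a>',
    '<a href="/two">Second</a>',
    '</body></html>';

# Build an empty parser, then feed it the HTML source as a string.
my $html = HTML::TagParser->new();
$html->parse($source);

# In list context this returns every matching element.
my @links = $html->getElementsByTagName("a");

foreach my $elem (@links) {
    my $href = $elem->getAttribute("href");
    my $text = $elem->innerText;
    print "$text => $href\n";
}
```

The same constructor is also documented as accepting a URL or a file name, so the explicit parse() call is only one of several ways to load a document.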

This module detects a document's character set on its own by reading the document's meta element.

The parsed document is then converted to UTF-8, which is this class's fixed internal encoding.
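The idea of reading the charset from the meta element and normalizing to UTF-8 can be sketched with Perl's core Encode module. This is not the module's actual implementation: detect_charset and to_utf8 are hypothetical helper names, the regular expression is a simplification, and falling back to UTF-8 when no charset is declared is an assumption made for this example.

```perl
use strict;
use warnings;
use Encode qw(decode encode find_encoding);

# Hypothetical helper: pull a charset name out of a <meta> element.
# Covers <meta charset="..."> and content="text/html; charset=...".
sub detect_charset {
    my ($bytes) = @_;
    if ( $bytes =~ /<meta\b[^>]*?\bcharset\s*=\s*["']?([\w.:-]+)/i ) {
        return $1;
    }
    return;
}

# Hypothetical helper: return the document as UTF-8 encoded bytes.
sub to_utf8 {
    my ($bytes) = @_;
    my $charset = detect_charset($bytes);

    # Assumption: treat a missing or unknown charset as UTF-8.
    my $enc = ( defined $charset && find_encoding($charset) )
        || find_encoding('UTF-8');

    my $chars = decode( $enc->name, $bytes );    # bytes -> characters
    return encode( 'UTF-8', $chars );            # characters -> UTF-8 bytes
}

# Invented sample: an ISO-8859-1 page where "\xE9" is e-acute.
my $latin1 = '<html><head><meta http-equiv="Content-Type" '
    . 'content="text/html; charset=ISO-8859-1"></head>'
    . "<body><p>caf\xE9</body></html>";

my $utf8 = to_utf8($latin1);
```

After the conversion the single byte 0xE9 becomes the two-byte UTF-8 sequence 0xC3 0xA9, so any later processing can assume one encoding regardless of what the page declared.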

