FreeBSD.software

py311-html5-parser

0.4.12_4
www

Fast implementation of the HTML 5 parsing spec for Python

A fast implementation of the HTML 5 parsing spec for Python. Parsing is done in C using a variant of the gumbo parser. The gumbo parse tree is then transformed into an lxml tree, also in C, yielding parse times that can be a thirtieth of those of html5lib, a speedup of up to 30x. This differs, for instance, from the gumbo Python bindings, where the initial parsing is done in C but the transformation into the final tree is done in Python.
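As a rough illustration of the workflow described above, the sketch below parses a string with the package's `parse` function and reads a value out of the resulting lxml tree. It assumes the package is installed and importable as `html5_parser`; the helper name `extract_title` is hypothetical and used only for this example.

```python
def extract_title(html):
    """Return the text of the first <title> element, or None if absent."""
    # Imported lazily so this module still loads where the package is missing.
    from html5_parser import parse

    # By default, parse() returns the root <html> element of an lxml tree.
    root = parse(html)

    # The result is an ordinary lxml element, so the usual lxml API applies.
    title = root.find(".//title")
    return title.text if title is not None else None


if __name__ == "__main__":
    # Unclosed tags are handled by the HTML 5 parsing rules.
    print(extract_title("<title>Example</title><p>Unclosed paragraph"))
```

Because the tree is built by lxml, existing code that uses `find`, `iter`, or XPath on lxml elements should work on the parsed result without changes.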

$ pkg install py311-html5-parser
html5-parser.readthedocs.io
Origin: www/py-html5-parser
Size: 544KiB
License: APACHE20
Maintainer: madpilot@FreeBSD.org
Dependencies: 2 packages
Required by: 2 packages
